The present disclosure relates generally to determining information regarding breasts and breast tissue, and more specifically, to exemplary embodiments of exemplary systems, methods and computer-accessible medium for determining breast cancer response using a convolutional neural network.
Breast cancer is one of the most ubiquitous malignancies afflicting women worldwide, and is the second most common cause of cancer deaths among women in the United States. (See, e.g., Reference 37). Not all breast cancers are the same, with a wide spectrum of intrinsic biologic diversity seen across multiple subtypes indicating variable biologic behavior and treatment options. (See, e.g., Reference 38). If a patient meets the criteria of estrogen receptor positive (“ER+”), human epidermal growth factor receptor-2 negative (“HER2−”), and node-negative, adjuvant chemotherapy may not be indicated, as the risk of recurrence is comparable to the harm from toxicity. (See, e.g., References 39 and 40). These patients can receive surgery, endocrine therapy, or radiation. (See, e.g., Reference 40).
Oncotype Dx (Genomic Health, Redwood City, Calif.) is a validated 21-gene reverse transcriptase polymerase chain reaction (“RT-PCR”) assay involved in tumor cell proliferation and hormonal response, which provides a recurrence score (“RS”) to quantitatively predict outcomes in patients who meet the criteria of ER+/HER2−/node negative invasive breast carcinoma. (See, e.g., References 41-43). In 2016 the updated guidelines of the American Society of Clinical Oncology (“ASCO”) recommended use of this RS in ER+/HER2−/node negative breast cancer to help determine the utility of adjuvant systemic chemotherapy. (See, e.g., Reference 39).
Although effective, genetic analysis such as Oncotype Dx is invasive and expensive, which has motivated the investigation of imaging analysis to determine tumor heterogeneity. Magnetic resonance imaging (“MRI”) is a common modality used in the diagnosis of breast cancer given its high soft-tissue contrast and sensitivity. (See, e.g., Reference 44). In recent years there have been investigations into quantitative analysis of specific extracted imaging features termed “radiomics.” Further correlation of these quantitative imaging features to molecular gene expression defines “radiogenomics.” (See, e.g., Reference 45).
The field of radiomics and radiogenomics has developed largely due to the contribution of machine-learning procedures utilizing the extraction of pertinent imaging features and correlating with clinical data. More recently, due to advances in the computer hardware technology, a subset of machine learning utilizing a type of artificial neural network called convolutional neural networks (“CNNs”) has begun to proliferate for medical imaging analysis. In contrast to traditional procedures that utilize hand-crafted features based on human extracted patterns, neural networks facilitate the computer to automatically construct predictive statistical models, tailored to solve a specific problem subset. (See, e.g., Reference 46). The laborious task of human engineers inputting specific patterns to be recognized could be replaced by inputting curated data and facilitating the technology to self-optimize and discriminate through increasingly complex layers.
Neoadjuvant chemotherapy (“NAC”) has become a widely used treatment approach in the management of breast cancer. In addition to the established benefits of increasing rates of operability and breast-conservation for locally-advanced tumors, NAC facilitates the assessment of the clinical efficacy of novel systemic combinations and targeted therapies in vivo within a treatment-naïve patient population. (See, e.g., Reference 1).
Several large randomized neoadjuvant trials have demonstrated pathological complete response (“pCR”) to be a potential surrogate marker for clinical efficacy as there can be a significant correlation between patients who achieved a pCR and improved disease-free and overall survival. (See, e.g., References 2 and 3). This association varies among subtypes, with the strongest diagnostic accuracy seen in HER2 positive and triple-negative breast cancer. (See, e.g., References 4 and 5). While systemic treatments delivered in the adjuvant setting needs many years of follow-up to validate a clinical benefit, pCR serves as an attractive surrogate end point for improved long-term outcome after only several weeks of neoadjuvant therapy. (See, e.g., References 6 and 7).
Axillary lymph node pCR has been shown to be a dominant prognostic factor in long-term outcome across all breast cancer subtypes. A large prospective study, including 403 patients with proven axillary lymph node metastases who underwent NAC followed by sentinel lymph node biopsy (“SLND”) or ALND showed 22% achieved axillary pCR, of which 69% achieved pCR of the primary tumor. The overall survival (“OS”) in patients who achieved axillary pCR was significantly higher compared with those with axillary residual disease (93% [95% confidence interval [CI] 87.5-98.5] vs. 72% [95% CI 66.5-77.5], P<0.0001). In patients who achieved axillary pCR, there was no significant difference in recurrence-free survival (“RFS”) or OS in those who had residual primary disease versus achieved primary tumor pCR. Although limited by a small sample size, these findings suggest residual primary tumor in the setting of axillary pCR does not infer a worse prognosis, possibly secondary to a difference in metastatic potential of the tissues of the axilla compared to the breast. (See, e.g., Reference 69). The prognostic value of axillary lymph node status accurately assesses the treatment response critical in the management of breast cancer.
Advances in genomics have demonstrated breast cancer to be a disease with a spectrum of biologically relevant molecular subtypes. This significant disease heterogeneity poses a major challenge in the development of novel treatments. Targeted therapies may only be effective in a small subset of breast cancers, which has contributed to the difficulty establishing a therapeutic benefit in a large, heterogeneous, clinical trial. (See, e.g., References 8 and 9). There can be potential for significant clinical benefit in streamlining the testing of novel NAC with early response assessment and prediction. This can be the goal of the ongoing adaptive neoadjuvant I-SPY 2 (Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging and Molecular Analysis 2) trials, which have already “graduated” neratinib in HER2 positive disease and veliparib-carboplatin in triple-negative disease. (See, e.g., References 10 and 11). Timely identification of responders to therapy can reduce the time, cost, and patient numbers needed to identify new beneficial therapies. Furthermore, early identification of non-responders can be beneficial in minimizing the potential toxicity of ineffective treatments and the delay of further exploration into potential alternative preoperative therapy. (See, e.g., Reference 12).
Quantitative MRI has emerged as a powerful imaging modality in the neoadjuvant treatment response assessment and identification of potential imaging-based biomarkers, with successful incorporation into the clinical trial setting. (See, e.g., References 13-16). Recent approaches have correlated changes in specific morphologic and kinetic parameters between a baseline and interval MRI after the initiation of chemotherapy, as early as after the first cycle, to predict treatment response and pCR. Further integration of clinically-relevant mathematical models to account for biologic features of tumor growth and treatment response have enhanced the predictive accuracy of these methods. (See, e.g., References 17 and 18). The vast majority of current models in early-response assessment depend on interval imaging after the initiation of therapy, without the ability to successfully determine a priori treatment response or pCR, prior to the initiation of treatment given the challenges of tumor heterogeneity. (See, e.g., References 13-18).
Deep learning through CNNs has demonstrated strong performance in various image classification tasks in recent years with a growing number of applications. (See, e.g., Reference 19). Deep learning methods facilitate a machine to extract high-level information from raw input images using several non-linear modules to amplify important features for image discrimination and classification. Machine learning can be further supervised using adjustable parameters to intricately correlate specific inputs and outputs.
While radiographic complete response (“rCR”) of the primary tumor can be determined objectively by the lack of residual enhancement, axillary rCR can be challenging given variability of normal lymph node morphology and enhancement pattern. MRI before and after NAC in correlation with pathologic evaluation was examined in 128 patients with breast cancer and demonstrated axillary rCR to only achieve a negative predictive value (“NPV”) of 66.7% and a positive predictive value (“PPV”) of 65.6%. (See, e.g., Reference 71). An additional prospective examination correlating MRI before and after NAC with axillary biopsy results in 43 patients with breast cancer showed that pre-NAC MRI was significantly associated with pathology (P=0.014) with a false-positive rate, false-negative rate, sensitivity, and specificity of 50, 3, 97, and 50%, respectively. However, post-NAC MRI was not predictive of surgical pathologic findings (P=0.342), with a false-positive rate, false-negative rate, sensitivity, and specificity of 38, 46, 55, and 63%, respectively. (See, e.g., Reference 79). The results of these studies demonstrate the challenges of accurately assessing the axilla in the post-NAC setting.
Management of breast cancer, specifically in axillary metastasis, has shown a continuous trend toward less invasive therapy, with initially accepted ALND largely replaced by SLND. The National Surgical Adjuvant Breast and Bowel Project (“NSABP”) B-32 trial showed sentinel lymph node identification to have a success rate of 96.2% with a false-negative rate of 6.7%. (See, e.g., Reference 80). However, initially, there was a concern for an accurate pathologic analysis of SLND after NAC. A large meta-analysis, including 10 studies with a total of 449 patients with clinically node-negative disease who underwent SLND after NAC, showed a pooled identification rate of 94.3% with a false-negative rate (“FNR”) of 7.4%, which can be comparable to the standard accepted identification rates of 88-97% and FNRs of 5-12%. (See, e.g., Reference 81).
However, SLND after NAC in node-positive breast cancer remains a point of controversy. Fibrosis of the axilla after chemotherapy alters lymphatic drainage and increases difficulty of surgical dissection. A large, prospective study that included 525 patients with cN1 disease who underwent SLND followed by ALND showed approximately 41% achieved pCR with a FNR as high as 12.6%. Although the FNR was higher than the pre-specified threshold of 10%, the rate was lowered with application of dual-agent mapping procedure and evaluation of three or more lymph nodes. (See, e.g., Reference 78). A better evaluation of an axillary treatment response using noninvasive imaging procedures has a potential to significantly impact breast cancer patients, particularly in node-positive disease.
Thus, it may be beneficial to provide an exemplary system method and computer-accessible medium for determining breast cancer response using a convolutional neural network which can overcome at least some of the deficiencies described herein above.
An exemplary system, method and computer-accessible medium for determining a breast cancer response(s) for a patient(s) can include, for example, receiving an image(s) of an internal portion(s) of a breast of the patient(s), and determining the breast cancer response(s) by applying a neural network(s) to the image(s). The breast cancer response(s) can be a response to at least one chemotherapy treatment. The breast cancer response(s) can include an Oncotype DX recurrence score. The breast cancer response(s) can be a neoadjuvant axillary response. The image(s) can be a magnetic resonance image(s) (MRI). The MRI(s) can include a dynamic contrast enhanced MRI(s).
In some exemplary embodiments of the present disclosure, the neural network can include a convolutional neural network (CNN). The CNN can include a plurality of layers. The layers can include (i) a plurality of combined convolutional and rectified linear unit (ReLu) layers, (ii) a plurality of max pooling layers, (iii) a combined fully connected and ReLu layer(s), and (iv) a dropout layer(s). The combined convolutional and rectified linear unit (ReLu) layers can include at least ten combined convolutional and rectified linear unit (ReLu) layers, and the max pooling layers can include at least four max pooling layers. Two of the at least ten combined convolutional and rectified linear unit (ReLu) layers can have 64×64×64 feature channels, two of the at least ten combined convolutional and rectified linear unit (ReLu) layers can have 32×32×128 feature channels, three of the at least ten combined convolutional and rectified linear unit (ReLu) layers can have 16×16×128 feature channels, and three of the at least ten combined convolutional and rectified linear unit (ReLu) layers can have 8×8×512 feature channels.
In certain exemplary embodiments of the present disclosure, a score(s) can be determined based on the image(s) using the neural network(s). The breast cancer response(s) can be determined based on the score. The breast cancer response(s) can be determined based on the score being above 0.5. The image can be normalized by, for example, subtracting a mean for a plurality of images of further internal portions of further breasts, and dividing by a standard deviation for the image(s). The image(s) can be translated, rotated, scaled, and sheared.
These and other objects, features and advantages of the exemplary embodiments of the present disclosure will become apparent upon reading the following detailed description of the exemplary embodiments of the present disclosure, when taken in conjunction with the appended claims.
Further objects, features and advantages of the present disclosure will become apparent from the following detailed description taken in conjunction with the accompanying Figures showing illustrative embodiments of the present disclosure, in which:
Throughout the drawings, the same reference numerals and characters, unless otherwise stated, are used to denote like features, elements, components or portions of the illustrated embodiments. Moreover, while the present disclosure will now be described in detail with reference to the figures, it is done so in connection with the illustrative embodiments and is not limited by the particular embodiments illustrated in the figures and the appended claims.
The exemplary system, method, and computer-accessible medium, according to an exemplary embodiment of the present disclosure, can include an exemplary determination breast cancer response using various exemplary imaging modalities. For example, the exemplary system, method, and computer-accessible medium according to an exemplary embodiment of the present disclosure is described herein using mammographic images and/or optical coherence tomography (“OCT”) images. However, the exemplary system, method, and computer-accessible medium according to an exemplary embodiment of the present disclosure can also be used on other suitable imaging modalities, including, but not limited to, magnetic resonance imaging, positron emission tomography, ultrasound, and computed tomography.
A retrospective review of identified 141 patients with the diagnosis of breast cancer between Jan. 1, 2005 and Jun. 1, 2016. All patients met the following criteria: (i) underwent a staging breast MRI prior to the initiation of therapy; (ii) received adriamycin-based and/or taxane-based neoadjuvant chemotherapy with additional HER2 directed therapy (e.g., trastuzumab/pertuzumab) in patients with HER2 positive tumor; and (iii) successfully underwent surgical resection of their primary breast tumor with appropriate lymph node sampling.
Data on tumor pathologic characteristics were obtained from the original pathology reports of the core biopsy specimen. Breast tumor subtype was determined based on immunohistochemical (“IHC”) staining of the ER and progesterone receptor (“PR”) interpreted according to the American Society of Clinical Oncology and College of American Pathologists Guidelines. Tumors were considered receptor positive if either ER or PR demonstrated greater than about 1% positive staining. (See, e.g., Reference 21). Tumors were considered HER2 positive if they were 3+ by immunohistochemistry or demonstrated gene amplification with a ratio of HER2/CEP17>2 by in situ hybridization. (See, e.g., Reference 22). Breast tumor subtypes were defined as follows: (i) Luminal A (e.g., ER/PR positive, HER2 negative); (ii) luminal B (e.g., ER/PR positive, HER2 positive); (iii) HER2 positive (e.g., ER/PR negative, HER2 positive); and (iv) triple negative or basal-like (e.g., ER/PR and HER2 negative). Clinical and pathologic staging was determined based on the American Joint Committee on Cancer TNM Staging Manual, 7th edition. Patients were classified into 3 groups based on their NAC response confirmed on final surgical pathology: Pathologic complete response (group 1), partial response (group 2) and no response/progression (group 3). pCR was defined as no residual invasive disease in the breast or lymph nodes on surgical pathology specimens (ypTO/Tis ypNO).
An exemplary MRI procedure was performed on a 1.5-T or 3.0-T commercially available system using an eight-channel breast array coil. A bilateral sagittal T1-weighted fat-suppressed fast spoiled gradient-echo sequence (17/2.4; flip angle, 35°; bandwidth, 31-25 Hz) was then performed before and after a rapid bolus injection (gadobenate dimeglumine/Multihance; Bracco Imaging; 0.1 mmol/kg) delivered through an IV catheter. Image acquisition started after contrast material injection, and was obtained consecutively with each acquisition time of 120 seconds. Section thickness was 2-3 mm using a matrix of 256×192 and a field of view of 18-22 cm. Frequency was in the antero-posterior direction.
As shown in the images of
As shown in the exemplary diagram of
The exemplary data was divided into a validation set, which included 80% of the data, and a test set, which included 20% of the data. The validation test set was then divided into 5 folds, and 5 fold cross validation was performed. Training from scratch without pretrained weights was performed over 100 epochs using adam optimizer with nesterov momentum at an initial learning rate of 0.002. Each of the 5 models was tested against the 20% hold out data to obtain sensitivity, specificity and accuracy. Receiver operator curves were also calculated for each of the 5 models.
A total of 141 patients met the criteria for inclusion in this study. Three class neo-adjuvant prediction model was evaluated for the three patient groups. The breakdown of tumor response and molecular subtype is shown in Table 1 below. Group 1 included 46 patients with pathologic complete response. Group 2 included 57 patients with partial response. Group 3 included 38 patients with no response to progression on chemotherapy. The molecular subtype based on IHC staining included: (i) 61 luminal A; (ii) 39 luminal B; (iii) 16 HER2 positive; and (iv) 25 triple negative or basal-like.
The rate of pCR is shown in Table 2 below, demonstrating: (i) 18% (11/61) of the luminal A, (ii) 46% (18/39) of the luminal B group, (iii) 50% (8/16) of the HER2 positive group, (iv) and 36% (9/25) of the triple negative group achieved pCR. Combined luminal B, HER2 positive, and triple-negative tumors had a significantly higher rate of pCR compared to luminal A, with a rate of 44% (35/80) versus 18% (11/61) respectively (p=0.002).
The rate of no response/progression of disease is shown in Table 3 below, demonstrating: (i) 43% (26/61) of the luminal A group, (ii) 10% (4/39) of the luminal B group, (iii) 13% (2/16) of the HER2 positive group, and (iv) 24% (6/25) of the triple negative group showed no treatment response or progression of disease. Luminal A tumors had a significantly higher rate of no response/progression compared to the other three groups, with a rate of 43% (26/61) versus 15% (12/80) respectively (p=0.0005).
The confusion matrix, shown in table 4 below, shows the exemplary CNN predicted class of the hold out test data versus the true class of the hold out test data. The values represent the average number of slices over the five folds of cross validation plus or minus the standard deviation. A final softmax score threshold of 0.5 was used for classification. The exemplary CNN achieved an overall mean accuracy of 88% (95% CI, f 0.6%) in three class prediction of NAC treatment response on a five-fold validation accuracy test.
Prior to initiation of therapy, the exemplary CNN procedure achieved an overall accuracy of 88% in predicting NAC response in patients with locally advanced breast cancer. The exemplary results demonstrate that the exemplary system, method, and computer-accessible medium can utilize a CNN to predict NAC response prior to initiation of therapy. This represents an improved approach to early treatment response assessment based on a baseline breast MRI obtained prior to the initiation of treatment, and significantly improves on current prediction methods that rely on interval imaging after the initiation of therapy.
Although there has been significant progress in MRI to assess therapy response, the vast majority of studies thus far depend on interval imaging after initiation of therapy. Quantitative imaging procedures have become an active area of research given the limitations of qualitative tumor response assessment using the Response Evaluation Criteria in Solid Tumors (“RECIST”). (See, e.g., Reference 27). Quantitative methods of response assessment have examined changes in kinetic parameters (e.g., volume transfer constant Ktrans, exchange rate constant kept) in dynamic contrast-enhanced MRI (“DCE-MRI”), (see, e.g., References 28-30) as well as morphologic changes (e.g., three-dimensional volume, signal enhancement ratio, tissue cellularity) using DCE-MRI, and diffusion-weighted MRI (“DW-MRI”) with predictive value after one or more cycles of therapy. (See, e.g., References 14, 15 and 31). The limitations of these methods include the often delayed morphologic-based changes that occur despite treatment-induced biologic response that may not be reflected by imaging performed during or shortly after completion of therapy. By incorporating a mechanically coupled reaction-diffusion model using patient-specific imaging data to drive a biomechanical model of tumor growth, improved prediction of therapy response as compared to prior procedures can be achieved, resulting in a sensitivity and specificity of 92% and 84%, respectively. (See, e.g., Reference 32). While significant advances in response assessment have been shown, the previously described studies all rely on interval imaging after initiation of therapy.
Currently available clinical and pathologic data shows luminal B, HER2-positive, and triple-negative breast cancer responds best to NAC. A large meta-analysis of thirty studies including 11,695 patients investigating pCR after NAC showed average rates of pCR were 8.3% in luminal A, 18.7% in luminal B, 38.9% in HER2 positive, and 31.1% in triple-negative breast cancer subtypes. (See, e.g., Reference 33). Similarly, HER2-positive, and triple-negative tumors achieved significantly higher pCR, compared to the luminal A subtype using the exemplary CNN. While this information can be helpful, it cannot be used solely to predict who can respond to NAC given over half of these patients do not have pCR.
An exemplary MRI procedure was performed on a 1.5 T or 3.0 T commercially available system using an eight-channel breast array coil. The imaging sequences included a triplane localizing sequence followed by a sagittal fat-suppressed T2-weighted sequence (e.g., repetition time/echo time (“TR/TE”), 4000-7000/85; section thickness, 3 mm; matrix, 256×192; field of view (“FOV”), 18-22 cm; no gap). A bilateral sagittal T1-weighted fat-suppressed fast spoiled gradient-echo sequence (e.g., 17/2.4; flip angle, 35°; bandwidth, 31-25 Hz) was then performed before, and three times after, a rapid bolus injection (e.g., gadobenate dimeglumine/Multihance; Bracco Imaging, Princeton, N.J.; 0.1 mmol/kg) delivered through an IV catheter. Image acquisition started after contrast material injection and was obtained consecutively with each acquisition time of 120 seconds. Section thickness was 2-3 mm using a matrix of 256×192 and an FOV of 18-22 cm. Frequency was in the anteroposterior direction. After the examination, post-processing was performed including subtraction of the unenhanced images from the first contrast-enhanced images on a pixel-by-pixel basis and reformation of sagittal images to axial images.
Each tumor specimen was transmitted to Genomic Health as standard of care and the Oncotype Dx RS was determined ranging from 0-100. Patients were classified into three groups based on the risk of recurrence 10 years after treatment: (i) low risk (group 1, RS <18), (ii) intermediate risk (group 2, RS 18-30), and (iii) high risk (group 3, RS >30).
Exemplary Image Preprocessing.
For all patients, breast tumor regions were manually annotated by a board-certified radiologist using a region-of-interest (“ROI”) drawn in 3DSlicer (see, e.g., Reference 46), based on first post-contrast DCE-MRI images. For 134 tumors, 1649 volumetric slices (e.g., mean 12.3 slices per tumor) in 32×32 voxel resolution were evaluated from the segmented tumor data. The intensity values at each pixel of the image were normalized by subtracting the mean intensity value of the image and dividing by the SD for each image.
Exemplary Neural Network Architecture.
The exemplary CNN can be structured as a sequential set of convolution filters applied to the original image, followed by activation functions. The exemplary filters can apply learnable functions that can be trained with each new batch of input images. The filter weights can be updated by minimizing the cost function, which can compare the predicted output with ground truth training labels (e.g., an Oncotype Dx group). The L2 regularization, which can add a “squared magnitude” of a coefficient as a penalty term to the loss function, was used to discourage parameters of this learnable filter from becoming too large, and to prevent overfitting of the model to the training data. In the exemplary network, L2-norm (e.g., least squares error (“LSE”) was used on the fully connected layer. The exemplary L2-norm can minimize the sum of the square of the differences (S) between the target value (Yi) and the estimated values (f(xi), resulting in, for example:
The exemplary activation function following convolutional filtering can introduce nonlinearities that can create a hierarchy of layers. This exemplary layered hierarchy can be used to facilitate depth in a network. Hierarchical depth in the network can facilitate filters to represent more complex features. The optimization of the network can include proper scaling of the input data and the learning rate step size. A proper preprocessing normalization of the data can be used to facilitate network convergence.
As shown in the exemplary diagram of
As one example, for each breast tumor, a final softmax score threshold of 0.5 was used for classification. The softmax score, also known as softmax function, is a normalized exponential function. It can be a generalization of the logistic function that “squashes” a K-dimensional vector of arbitrary real values to a K-dimensional vector of real values, where each entry can be in the range (0, 1), and all the entries add up to 1. The softmax score provides the probability for each class label. The probability of each class can sum to 1 as dictated by the normalization constraint.
Two sets of experiments were performed, one three-class model to train the exemplary CNN model to predict low, moderate, or high Oncotype Dx RS and the second to predict two-class low vs. (e.g., moderate+high) Oncotype Dx RS. Five-fold cross-validation was performed with 80% of the data used as training and 20% used for testing purposes. In the three-class model, three different sensitivity and specificity metrics are provided, one for each class. The performance metrics can be calculated from the test dataset reserved for performance characterization upon which the training model was never exposed to. Training was implemented using the Adam optimizer, a procedure for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. (See, e.g., References 51 and 52). Parameters were initialized using a suitable heuristic. (See, e.g., Reference 53). To account for training dynamics, the learning rate can be annealed whenever training loss plateaus.
An exemplary statistical analysis was performed. Age was calculated at the time of diagnosis. Descriptive statistics were used to summarize clinical, imaging, and pathologic parameters. Classification performance was evaluated using a multiclass receiver operating characteristics (“ROC”) analysis. This included generating ROC plots for each group versus the other two combined groups. For each of these two-class classifications, the sensitivity and specificity was reported.
The tumor grade was 17.9% low grade (24/134), 65.7% intermediate grade (88/134), and 16.4% high grade (22-134). Axillary lymph node status was 92.5% negative (124/134) and 7.5% positive (10/134). Based on the American Joint Committee on Cancer, TNM classifications were as follows: T1 (73.8%, 99/134), T2 (25.4%, 34/134), T3 (0.7%, 1/134), T4 (0%); NO (92.5%, 124/134), N1 (7.5%, 10/134), N2 (0%), N3 (0%); M0 (100%, 134/134), M1 (0%). Most (97%, 130/134) of the patients had unifocal disease. Four patients had multifocal disease. Three out of four patients had one additional tumor. One out of four patients had two additional tumors. No contralateral tumors were present. Only the primary tumor that underwent Oncotype Dx evaluation was matched for MRI image analysis. The additional tumors did not undergo Oncotype Dx evaluation. Breast MRI was performed on 1.5 T in 61.2% (82/134) of the patients and on 3.0 T in 38.8% (52/134) of the patients. 11.2% (15/134) of the tumors demonstrated non-mass enhancement. 88.8% (119/134) of the tumors demonstrated mass enhancement.
The median Oncotype Dx score was 16 (range, 1-75). Patients were classified into three groups based on the risk of recurrence 10 years after treatment: low risk (group 1, RS <18), intermediate risk (group 2, RS of 18-30), and high risk (group 3, RS >30). The low-risk group consisted of 77 patients. The intermediate-risk group consisted of 40 patients. The high-risk group consisted of 17 patients.
A total of 134 breast cancer cases with Oncotype Dx recurrence scores were included. For each breast tumor, a final softmax score threshold of 0.5 was used for classification. The exemplary CNN was trained for a total of 200 epochs (e.g., batch size of 32) before convergence. Based on this, mean 5-fold validation accuracy was calculated. Initially, a three-class prediction model was utilized, classifying results into a low-risk group, intermediate-risk group, and high-risk group. The exemplary CNN achieved an overall accuracy of 81% (e.g., 95% confidence interval [CI]+4%). Subsequently, a two-class Oncotype Dx prediction model was evaluated in two groups consisting of 77 and 57 patients (e.g., group 1 vs. groups 2 and 3). The exemplary CNN achieved an overall accuracy of 84% (95% CI±5%) in two-class prediction.
The exemplary ROC plot is shown in the graphs of
The exemplary CNN achieved an overall accuracy of 84% in predicting patents with low Oncotype Dx RS compared to patients with intermediate/high Oncotype Dx RS. The exemplary results indicate the likelihood of utilizing the CNN procedure to predict Oncotype Dx RS.
An exemplary analysis was performed 127 locally advanced breast cancer patients who: (i) underwent breast MRI before the initiation of NAC, (ii) successfully completed Adriamycin/Taxane-based NAC, and (iii) underwent surgery, including sentinel lymph node evaluation/axillary lymph node dissection with available final surgical pathology data. Data on tumor pathologic characteristics were obtained from the original pathology reports of the core biopsy specimen. Breast tumor receptors were determined based on IHC staining of the ER and PR interpreted according to the American Society of Clinical Oncology and College of American Pathologists Guidelines. Tumors were considered receptor positive if either ER or PR demonstrated ≥1% positive staining. (See, e.g., Reference 73). Tumors were considered HER2-positive if they were 3+ by immunohistochemistry or demonstrated gene amplification with a ratio of HER2/CEP17≥2 by in situ hybridization. (See, e.g., Reference 81).
Clinical and pathologic staging was determined based on the American Joint Committee on Cancer TNM Staging Manual, 7th edition. All patients included have biopsy-proven lymph node metastasis before NAC. After NAC, patients were classified into two groups based on their NAC response confirmed on final surgical pathology: pCR of the axilla (group 1), and non-pCR of the axilla (group 2).
Images from all cases were normalized for signal intensity. An exemplary normalization of an image included, for example, subtracting the mean and dividing by the standard deviation for each image. Mean and standard deviation of gray levels were calculated across all data and applied pixel-wise to each individual image. To limit overfitting, data augmentation was performed in the form of translation, rotation, scaling, and shear of the original images was applied to aid in the training of a spatially invariant model.
The cases were randomly separated into a training set, which included 80% of the cases, and a test set, which included 20% of the cases. The training data set was split into five class balanced folds for cross validated training. For each breast MRI, a tumor was identified on first set of T1 post-contrast dynamic images and underwent 3D segmentation using an open source software platform 3D Slicer. A total of 2811 slices from the 127 tumors were extracted with a threshold of 75 voxels per slice. From each slice that contained segmented tumor data, a patch of 64×64 pixels was extracted that completely contained the segmented tumor and was used for analysis.
The exemplary system, method, and computer-accessible medium, according to an exemplary embodiment of the present disclosure, can utilize the exemplary CNN shown in
The exemplary CNN was optimized with nadam (see, e.g., Reference 76), an adaptive moment estimation optimizer that utilizes nesterov momentum. The exemplary CNN was independently trained using k-fold cross validation. For each breast tumor, the maximum SoftMax score calculated by the exemplary CNN was used to predict pathologic response of the axilla. Code was implemented in open source software Keras with TensorFlow on a Linux workstation with NVIDIA GTX 1070 Pascal GPU.
The trained CNN was used to predict classes on the earlier withheld testing dataset. Overall diagnostic performance in the form of sensitivity, specificity, and accuracy was reported with 95% confidence intervals. ROC curves are plotted as a function of different threshold criteria, as well as area under the ROC curve (“AUC”).
Table 5 below indicates patient demographics and tumor characteristics. Patient population median age was 50 (range 23-82) years. The most frequent histologic tumor type was invasive ductal carcinoma 86.6% (100/127). The median size of the tumor was 3.2 (range 0.9-9.5) cm. Most of the tumor was either intermediate or high grade (96%, 122/127). Lymphovascular invasion was present in 33.9% (43/127) of the cases. Receptor status of tumors was: ER+, HER2−, 59 (46.5%), ER+, HER2+, 21 (16.5%), ER−, HER2+, 14 (11%), and ER−, HER2−, 33 (26%).
On final surgical pathology, 49 patients (38.6%, 49/127) achieved pCR of the axilla and 78 patients (61.4%, 78/127) did not with residual metastasis detected. Table 5 shows patient demographics and tumor characteristics stratified by the pCR of axilla and non-pCR of the axilla. Two class neoadjuvant prediction model of the axilla was evaluated for the two patient groups. Group 1 included of 49 patients with pCR of the axilla. Group 2 included of 78 patients with non-pCR of the axilla.
After 3D segmentation, a total of 2811 slices (e.g., min 2, max 41, median 18, and average 22) from the 127 tumors were extracted. A class-balanced separation of data allocated 80% for training and 20% for testing. The training data was split into five class balanced folds and independently trained five times. The following results report the average value over 95% confidence intervals of diagnostic performance of the model against training data. A final SoftMax score threshold of 0.5 was used for classification. The exemplary CNN achieved an overall accuracy of 83% (95% CI±5) with sensitivity of 93% (95% CI±6) and specificity of 77% (95% CI±4).
Early prediction of axillary treatment response is beneficial in the management of locally advanced breast cancer with the potential to avoid the morbidity of ALND and create novel NAC combinations in non-responders. The nodal pCR rate was 38.5%, comparable to the 41.1% ACOSOG Z1071 overall axillary pCR rate. (See, e.g., Reference 78). Before the initiation of therapy, the exemplary CNN procedure achieved an overall accuracy of 83% in predicting NAC response in patients with node-positive breast cancer. Thus, the exemplary system, method, and computer-accessible medium can significantly improve on currently available prediction models, which depend on clinicopathologic information and post-NAC imaging analysis.
Overfitting can be an intrinsic limitation to CNN when using a relatively small dataset. In order to overcome this issue, over-fitting was minimized by application of suitable methods including, but not limited to, 50% dropout, data augmentation, and L2 regularization. Lastly, CNN is a type of artificial neural network, most recently developed due to advances in computer hardware technology. In contrast to traditional procedures, which utilize handcrafted tumor features based on human extracted patterns, neural networks facilitate the computer to automatically construct predictive statistical models, tailored to solve a specific problem subset. The laborious task of human engineers inputting specific patterns to be recognized can be replaced by inputting curated data and facilitating the technology to self-optimize and discriminate through increasingly complex layers. (See, e.g., Reference 72). Because training a CNN can be an end-to-end process, it does not clearly reveal the reasoning behind the final result in a deterministic manner. This can be an ongoing area of research to improve human understanding and intuition behind the predictions of a neural network.
The exemplary system, method, and computer-accessible medium, according to an exemplary embodiment of the present disclosure, can utilize an exemplary CNN to accurately predict axillary treatment response in node positive breast cancer using a baseline MRI tumor dataset. In non-responders, the exemplary system, method, and computer-accessible medium according to an exemplary embodiment of the present disclosure can impact clinical management to direct individualized treatment, minimize toxicity from ineffective agents, and explore novel neoadjuvant therapies. The exemplary CNN can further impact management of NAC responders, with the potential to avoid the morbidity of ALND and even SLNB.
As shown in
Further, the exemplary processing arrangement 1505 can be provided with or include an input/output ports 1535, which can include, for example a wired network, a wireless network, the internet, an intranet, a data collection probe, a sensor, etc. As shown in
The foregoing merely illustrates the principles of the disclosure. Various modifications and alterations to the described embodiments will be apparent to those skilled in the art in view of the teachings herein. It will thus be appreciated that those skilled in the art will be able to devise numerous systems, arrangements, and procedures which, although not explicitly shown or described herein, embody the principles of the disclosure and can be thus within the spirit and scope of the disclosure. Various different exemplary embodiments can be used together with one another, as well as interchangeably therewith, as should be understood by those having ordinary skill in the art. In addition, certain terms used in the present disclosure, including the specification, drawings and claims thereof, can be used synonymously in certain instances, including, but not limited to, for example, data and information. It should be understood that, while these words, and/or other words that can be synonymous to one another, can be used synonymously herein, that there can be instances when such words can be intended to not be used synonymously. Further, to the extent that the prior art knowledge has not been explicitly incorporated by reference herein above, it is explicitly incorporated herein in its entirety. All publications referenced are incorporated herein by reference in their entireties.
The following references are hereby incorporated by reference in their entireties:
This application relates to and claims priority from U.S. Patent Application No. 62/589,924, filed on Nov. 22, 2017, the entire disclosure of which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/062319 | 11/21/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62589924 | Nov 2017 | US |