Resistance to anti-estrogens is one of the major challenges in the treatment of breast cancer. For more than 25 years, the golden standard for the endocrine treatment of all stages of estrogen receptor-positive breast cancer has been tamoxifen (Jordan, 2003, Nat. Rev. Drug Discov., 2:205-13; Osborne, 1998, N. Engl. J Med. 339:1609-18). However, in the advanced setting when metastasis is detected approximately half of the patients with estrogen receptor-α (ER-α) -positive breast tumors will not respond to endocrine treatment, whereas response rates in patients with ER-α-negative primary tumors are very low. Therefore additional biomarkers are needed to identify patients who will not respond and to select patients for various tailored treatments.
In the past 20 years a large number of cell biological factors, other than steroid receptors, has been reported that identify those patients who will benefit from endocrine therapy or fail to respond (for review see Klijn et al.:, 2002, Ingle WRMaJN (ed): Endocrine Therapy in Breast Cancer. New York, Marcel Dekker). Few of these, however, appeared valuable and useful in daily clinical practice. In these individual studies only a limited number of factors have been evaluated simultaneously. Breast cancer is known as a heterogeneous and multifactorial disease, with accumulation of (epi)genetic alterations leading to transformation of normal cells into cancer cells. With the advent of high-throughput quantification of gene-expression, simultaneous assessment of thousands of genes is now possible in a single experiment (Brown et al., 1999, Nat. Genet. 21:33-7; Holloway, et al., 2002, Gynecol. Oncol., 87:8-16, 2002). Gene-expression profiling provides a strategy for discovering gene-expression characteristics that may be useful to predict clinical outcome.
Using microarray expression profiling, gene signatures, marker genes, and methods were developed for predicting response or resistance to anti-estrogen, for example, tamoxiphen therapy and predicting outcome for recurring breast cancer patients. Using a gene profile described herein, analysis of a patient's primary breast tumor against the gene profile is predictive of patient response to anti-estrogen, for example, tamoxiphen therapy, for example, tamoxifen therapy for the treatment of recurring breast cancer.
Useful gene signatures for predicting outcome (response or resistance) and progression free survival in recurring breast cancer treated with anti-estrogen, for example, tamoxiphen therapy and include the genes of the 81-gene signature and the 44-gene signature shown in
Gene expression levels can be determined using various known methods including nucleic acid hybridization in microarrays, nucleic acid amplification methods such as quantitative polymerase chain reaction (qPCR), and immunoassay of proteins expressed by the genes of the predictive gene profile. Expression levels and expression level ratios of two or more genes of the predictive gene profile can be determined, for example, using real-time quantitative reverse-transcriptase PCR (qRT-PCR).
The gene signatures of the invention are useful in assays to predict response and/or outcome of anti-estrogen, for example, tamoxiphen therapy for recurring breast cancer. In one embodiment, gene expression is analyzed in a primary breast tumor tissue sample and compared to the expressed gene signature determined from retrospective patient data as described in the Examples below. Sample expression data can be analyzed against a classification algorithm determined from a “training” set of data as described in the Examples below.
In another embodiment, a gene expression ratio of two or more genes, or a threshold expression level of one or more predictive genes is analyzed. In a preferred embodiment, expression of at least one upregulated gene and at least one down regulated gene is analyzed. A ratio of the expression of the upregulated gene to that of the down regulated gene is calculated, where the ratio is predictive of response and/or outcome of anti-estrogen, for example, tamoxiphen therapy for treating recurring breast cancer. The predictive ratio or ratios may be stored in a database for comparison to the test data.
The invention includes diagnostic systems and methods such as arrays containing one or more probes to detect expression of one or more genes of the predictive profile. Preferably, the assay system contains at least one of the genes of the 81-gene signature or of the 44-gene signature shown in
The gene signatures of the invention are also useful for identifying lead compounds useful in the treatment of estrogen-dependent recurring breast cancer. Primary estrogen-dependent breast tumor tissue can be contacted with the potential therapeutic drug, and the expression of one or more genes of the gene signature analyzed and compared with an untreated control.
These and other features of the invention are described more fully below.
Gene Signature as used herein, refers to a profile of gene expression that correlates with a therapeutic outcome, for example as shown in the heat map of
Cluster I and Cluster II gene profiles are shown in
Differential expression, as used herein, refers to gene expression in primary breast tumor tissues that differs with a patient's outcome in treatment of recurring breast cancer with anti-estrogen therapy.
Objective response as used herein includes complete remission and partial remission.
Outcome as used herein, refers to Response (complete or partial) or Resistance (progressive disease or stable disease less than 6 months).
Recurring disease or recurring breast cancer is used herein to mean cancer that develops after the primary breast cancer has been removed, for example metastatic breast cancer that occurs after a primar tumor has been excised.
Stable disease refers to patients with no change in disease status, as well as those with no evident tumor reduction of at least 50% or more and those with tumor progression. Patients with stable disease are divided into those with no change (stable disease) for six months or longer, and those with no change (stable disease) for less than six months.
Tumor progression or Progressive Disease, as used herein, is meant to describe growth of about 25% or more tumor mass, or one or more new lesions within a three-month period.
Because patients with stable disease for 6 months or more exhibited a PFS similar to patients with partial remission, these patients were classified as responders to tamoxifen as described in the manual for clinical research and treatment in breast cancer of the European Organization for Research and Treatment of Cancer. European Organization for Research and Treatment of Cancer Breast Cancer Cooperative Group. Manual for clinical research and treatment in breast cancer, Almere, The Netherlands: Excerpta Medica; 2000. p. 116-7.
Clinical benefit was defined in the studies described herein as objective response, including complete and partial remission, and stable disease for six months or more, as described in Ravdin et al., 1992, J. Clin. Oncol. 10:1284-91; Foekens et al., 1994, Br. J Cancer, 70:1217-23; and Robertson et al., 1997, Eur. J Cancer, 33:17749.
Only patients with measurable disease were evaluated in these studies, and selected patients with no change (stable disease), had received tamoxifen at least for a period of 6 months.
Gene expression profiling of retrospective breast cancer tumor tissue using high density cDNA arrays was used herein to generate differential gene expression patterns correlated with patient response and outcome data for treatment of recurrent breast cancer with anti-estrogen, for example, tamoxiphen therapy. Using tumor RNA obtained from a training set of 46 tumors comprising primary tumors from 25 patients exhibiting progressive disease after anti-estrogen, for example, tamoxiphen therapy for recurring breast cancer and primary tumors from 21 patients exhibiting objective response to anti-estrogen, for example, tamoxiphen therapy for recurring breast cancer, differentially expressed genes/ests were identified. Using microarray data analysis tools, (BRB Array Tools), under a significance level of 0.05, a total of 569 and 449 genes were identified as differentially expressed and correlated with progressive disease and objective response, respectively.
81-gene Signature
The overlap of these differentially expressed genes identified an initial signature set of 81 differentially expressed genes having a pattern correlated with progressive disease or objective response for anti-estrogen, for example, tamoxiphen therapy treatment of recurring breast cancer. These 81 genes were classified and subjected to cluster analysis. The results are shown in the heat map of
This 81-gene signature was used to correctly classify retrospective patient samples as having a gene expression pattern correlated with progressive disease or with objective response to anti-estrogen, for example, tamoxiphen therapy in the treatment of metastatic (recurring) breast cancer. 21 of 25 patients with progressive disease and 19 of 21 patients with objective response were correctly classified by this 81-gene signature, as discussed in the Examples below.
44-gene Signature
With further analysis and rank ordering of genes on the basis of significance level, followed by a step-up calculation of correlation coefficient of expression, a supervised learning approach was used to reduce the original 81-gene signature to a smaller 44-gene predictive signature having similar accuracy.
Using a validation set of 66 tumors, the 44-gene signature correctly classified 27 of 35 patients with progressive disease and 15 of 31 patients with objective response. Univariate analysis showed the response predictions by the 44-gene signature to be superior to predictions based on the analysis of traditional factors such as menopausal status, disease-free interval, first dominant site of relapse, estrogen and progesterone receptor status.
Univariate and multivariate analysis showed the 44-gene signature to be predictive of progression free survival, e.g., the time until tumor progression was seen.
Expression levels of mRNA from individual genes in the 81-gene signature were measured by quantitative PCR as disclosed in the Examples below. The qPCR data was correlated with the mircroarray data. Of eight tested genes (CASP2, DLX2, USP9X, CHD6, MST4, RABEP, SIAH2, and TNC), Spearman rank correlations were positive for all except for MST4.
The genes in the 81-gene predictive signature contained 15 ESTs and 66 known genes. See the listing of genes in
Seventeen genes were regulated by or associated with estrogen (receptor) action, with 9 genes upregulated (LOC51186; TSC22; TEMP3; SPARC; GABARAPL1; CFP1; LDHA; ENO2; Hs. 99743) and 8 genes downregulated (TXN2; CDC42BP4; HLA-C; PSME1; Hs. 437986; SIAH2; UGCG; FMNL) in the primary tumors of tamoxifen-resistant patients, as shown in
Six genes associated with the extracellular matrix (TIMP 3, FN1, LOX, COL1A1, SPARC, AND TNC) and were overexpressed in patients with tamoxifen-resistant disease. Another cluster of seven genes were associated with apoptosis (IL4R, LDHA, MSP2K4, NPM1, SIAH2, CASP2, and TXN2), while two genes were related to anti-apoptosis activities (AP15, BNIP3). Four apoptosis genes were upregulated (AP15, NPM1, LDHA, BNIP3), while the other 5 were downregulated in primary tumors of patients with tamoxifen-resistant disease. A cluster of 4 genes linked to the immune system was downregulated (FCGRT, PSME-1, HLA-C, and NFATC3).
The 81-gene signature contains a significant number of genes located on chromosome 17, and particularly localized to cytoband 17q21-q22. For example, 5 of 66 (6.5%) informative genes (APPBP2, COL1A1, EZH1, KIAA0563, and FMNL) are localized to this cytoband, as compared with 199 of 12771 known genes (1.3%) for the entire microarray.
Breast cancer tissue samples useful for diagnostic assay can be obtained from primary tumor tissue, for example, biopsy tissue. In some instances, RNA may be obtained from the sample and used directly for analysis of expression. In general, RNA extracted from the tissue will be amplified, e.g., by polymerase chain reaction. For protein analysis, tissue can be paraffin-embedded and sectioned, for example, for immunohistochemstry and in situ hybridization analyses.
In one embodiment, primary breast tumor tissue is analyzed for MRNA transcripts, for example, by hybridizing to cDNA probes. In another embodiment, the tissue is analyzed for protein, for example by immunoassay, for example, immunohisto chemistry. Individual genes of the 81-gene signature are known. NCBI Accession Numbers provided in
Gene expression in the primary tumor tissue sample is compared with the expression pattern of one or more marker genes identified from the 81-gene signature or from genes identified from cluster analysis and association with the genes of the 81-gene signature, as disclosed in the Examples below.
A nucleic acid marker as used herein is a nucleic acid molecule that, by its expression pattern in primary breast tumor tissue, alone or in combination with or compared with the expression patterns of one or more additional nucleic acid molecule, correlates with response or resistance to anti-estrogen, for example, tamoxiphen therapy for recurring breast cancer, or with outcome, such as progressive disease, stable disease, or progression-free survival.
Hybridization methods useful to analyze gene expression are well known. Nucleic acid molecules in the tumor tissue, for example MRNA, can hybridize under stringent hybridization conditions with a complementary nucleic acid probe. The nucleic acid hybridization probe need not be a full-length molecule, but can be a fragment or portion of the a fragment of the full-length cDNA, a variant thereof, a SNP, or iRNA. The probe can also be degenerate, or otherwise contain modifications such as nucleic acid additions, deletions, and substitutions. What is required is that the probe retain its ability to bind or hybridize with the sample nucleic acid molecule, in order to recognize the expressed product in the sample.
Marker gene expression can be analyzed by known assay methods, including mehtods for detecting expressed nucleic acid molecules, such as RNA and encoded polypeptides. Nucleic acid probes and polypeptide binding ligands useful in such methods, can be prepared by conventional methods or obtained commercially. Detection of expression can be direct or indirect, using know labels and detection methods.
For analysis of nucleic acid molecules, standard methods, for example, microarray technology and qRT-PCR can be used to identify patterns of nucleic acid expression in the sample tissue. Methods of microarray technology, including DNA chip technology, gene chip technology, solid phase nucleic acid array technology, multiplex PCR, nucleic-acid spotted fluidity cards, and the like, are known, and may be used to determine the expression patterns of nucleic acid molecules in a patient's tumor sample. In one embodiment, array of identified nucleic acid probes is provided on a substrate. In a preferred embodiment, the expression of signature genes is assayed by qPCR techniques.
For analysis of expressed polypeptides, known binding assay methods, such as immunoassay methods can be used. Examples include imunohitochemistry, ELISA, radioimmunoassay, BIACore, and the like.
The invention is described herein with reference to the following examples that serve to illustrate the embodiments of the invention, and are not intended to limit the scope of the invention in any way.
The Examples below describe studies undertaken to determine the measurable effect of anti-estrogen, for example, tamoxiphen therapy for breast cancer on tumor size and on time until tumor progression (progression free survival). The analysis was performed on 112 estrogen receptor positive primary breast cancer samples from patients who developed advanced disease that showed the most pronounced types of response (objective response versus progressive disease from the start of treatment). In addition, these studies describe underlying gene (signaling) pathways that provide novel potential targets for therapeutic intervention.
Patients and Treatment The study design was approved by the medical ethical committee of the Erasmus MC Rotterdam, the Netherlands (EC 02.953). To evaluate the predictive value of gene-expression profiling in relation to tamoxifen treatment in patients with recurrent breast cancer, 112 fresh frozen ER-α-positive (>=10 fmol/mg of protein) primary breast tumor tissue specimens of patients with primary operable (invasive) breast cancer diagnosed between 1981 and 1992 were included. The median age at time of primary surgery (breast conserving lumpectomy, 33 patients; modified mastectomy, 79 patients) was 60 years (range, 32-89 years).
In this retrospective study, all patients were selected for disease recurrence (14 local or regional relapse, 86 distant metastasis) that was treated with tamoxifen (40 mg daily) as first-line treatment. At the start of tamoxifen treatment, the median age was 63 years (range 33-90 years), and 27 patients (24%) were premenopausal. None of the patients had received endocrine (neo)adjuvant systemic therapy nor were exposed to any hormonal treatment at an earlier stage, i.e. hormo-naïve. Eighteen patients (16%) received adjuvant chemotherapy. Of these patients, 7 were postmenopausal whereas 11 were premenopausal at time of surgery. At start of tamoxifen monotherapy 8 patients were still pre-menopausal, whereas 3 patients changed to the post-menopausal status before recurrence. Two of these three patients showed objective response to tamoxifen. Therefore, chemocastration as prior endocrine therapy could not have had a significant impact on the results.
The median follow-up of patients alive was 94 months (range, 21-165 months) from primary surgery, and 53 months (range, 2-131) from the start of tamoxifen treatment. Tumor progression after tamoxifen occurred in 103 (92%) of the patients. During follow-up, 94 patients (84%) died. After tumor progression on first-line tamoxifen treatment 69 patients were treated with one or more additional endocrine agents, while 64 patients were subsequently treated with one or more regimens of chemotherapy such as cyclophosphamide methotrexate 5-Fluorouracil (CMF) or 5-fluorouracil, adriamycin, cyclophosphamide (FAC) after the occurrence of hormonal resistance.
Criteria for follow-up, type of response, response to therapy was defined by standard UICC criteria (Hayward, et al., 1977, Cancer, 39:1289-94), and for progression free survival Were described previously (Foekens, et al., 2001, Cancer Res., 61:5407-14). Complete and partial response (CR and PR) was observed in 12 and 40 patients, respectively, resulting in 52 patients with an objective response (OR); progressive disease (PD) within 3-6 months from start of treatment was observed in 60 patients. Median progression free survival-time of objective response was 17 months, whereas the median progression free survival-time of patients with progressive disease was 3 months.
Total RNA was isolated from 30 μm frozen sections (approximately 20-50 mg tumor tissue) with RNABee (Campro Scientific). The percentage of tumor cells was determined in two Haematoxylin eosin stained frozen 5 μm sections that were cut before and after sectioning for RNA isolation. The tumor samples had a median tumor content of 65%. A T7dT oligo primer was used to synthesize double-stranded cDNA from 3 μg total RNA and subsequently to generate aRNA by in vitro transcription with T7 RNA polymerase (T7 MEGAscript™ High Yield Transcription kit, Ambion Ltd., Huntingdon, UK). Two micrograms of aRNA was labeled with Cy3 or Cy5 (CyDye, Amersham Biosciences) in a reverse transcription reaction. The labeled cDNA probes were purified using Qiagen PCR clean up columns (Qiagen, Westburg BV, Leusden, The Netherlands).
Similar to the Stanford protocol, a cell line pool of 13 cell lines derived from different tissue origins was used as reference for all microarray hybridizations (details are available at MIAMExpress (http://www.ebi.ac.uk/miamexpress/). Probes of the cell line pool were always labeled with Cy5.
Total RNA isolated for the microarray analysis was used to verify the quantity of specific messengers by real-time PCR. The RNA was reverse-transcribed and real-time PCR products were generated in 35 cycles from 15 ng cDNA in an ABI Prism 7700 apparatus (Applied Biosystems, Foster City, USA) in a mixture containing SYBR-green (Applied Biosystems, Stratagene) and 330 nM primers for differentially expressed genes (i.e. CASP2, DLX2, EZH1, CHD6, MST4, RABEP, SIAH2, and TNC). SYBR-green fluorescent signals were used to generate Cycle threshold (Ct) values from which MRNA ratios were calculated when normalized against the average of three housekeeping genes, i.e. hypoxanthine-guanine phospho-ribosyltransferase (HPRT), porphobilinogen deaminase (PBGD), and β-2-microglobulin (B2M) (Martens, et al., 2003, Thromb. Haemost., 89:393-404).
cDNA Microarrays: Preparation, Hybridization, and Data Acquisition
Microarray slides were manufactured at the Central Microarray Facility at the Netherlands Cancer Institute (Weige, et al., 2003, Proc. Natl. Acad. Sci. U.S.A., 100:15901-5). Sequence-verified clones obtained from Research Genetics (Huntsville, Ala.) were spotted with a complexity of 19,200 spots per glass slide using the Microgrid II arrayer (Biorobotic, Cambridge, U.K.) The gene ID list can be found at http://microarrays.nki.nl. Labeled cDNA probes were heated at 95° C. for 2 minutes and added to preheated hybridization buffer (Slide hybrization buffer 1, Ambion). The probe mixture was hybridized to cDNA microarrays for 16 hours at 45° C.
Fluorescent images of microarrays were obtained by using the GeneTAC™ LS II microarray scanner (Genomic Solutions; Perkin Elmer). IMAGENE v5.5 (Biodiscovery, Marina Del Rey, Calif.) was used to quantify and correct Cy3 and Cy5 intensities for background noise. Spot quality was assessed with the flagging tool of IMAGENE, in this study set at R>2 for both Cy3 and Cy5. Fluorescent intensities of each microarray were normalized per subgrid using the NKI MicroArray Normalization Tools (http://dexter.nki.nl) to adjust for a variety of biases that affect intensity measurements (e.g. color-, print tips, local background bias) (Yang, et al., 2002, Nucleic Acids Res., 30:e15). All ratios were log2 transformed.
Microarray data analyses were performed with the software packages BRB Array Tools, developed by the Biometric Research Branch of the US National Cancer Institute, (http://linus.nci.nih.gov/BRB-ArrayTools.html), and Spotfire (www.spotfire.com, Goteborg, Sweden and Sommerville, Mass.). BRB was implemented for statistical analysis of microarray data whereas Spotfire was used for cluster analysis. The class comparison tool in BRB combines a univariate F-test and permutation test (n=2000) in order to find discriminating genes and to confirm their statistical significance. In the class comparison a significance level of 0.05 was chosen in order to limit the number of false negatives.
Spotfire was used to perform hierarchical clustering. To analyze microarray data from different batches of slides, genes were Z-score normalized per batch. The Z-score was defined as [value—mean]/SD. After normalization, microarray data were clustered via complete linkage. The similarity measure for clustering was based on cosine correlation and average value.
Sensitivity, specificity, positive and negative predictive value (PPV and NPV, respectively) and odds ratios (OR) were calculated and presented with their 95% confidence interval (CI). The data are shown in Table 2. The performance of the signature in the validation set was determined via the likelihood ratio of the Chi square test. A supervised learning approach was applied to reduce the 81 differentially expressed genes to a smaller 44-gene predictive signature. First, all 81 genes were rank ordered on the basis of their significance as calculated with the BRB class comparison tool. Next, starting with the most significant gene, the Pearson correlation coefficient of expression with the other 80 genes was calculated. Succeeding genes were excluded from the signature as long as their expression correlated significantly (P<0.05) with the most significant gene. The first gene of the 81 gene profile that did not correlate with expression of the most significant gene was added to the final signature, and the whole procedure of expression correlation analysis with this second gene was repeated with the remaining less significant genes. In this way, genes with overlap in their expression were removed and the 44-gene predictive signature was derived.
The predictive score for the traditional-based model included menopausal status, disease free interval (DFI>12 months versus DFI≦12 months after primary surgery), dominant site of relapse (relapse to viscera or bone versus relapse to soft tissue), log estrogen receptor (ER) and log progesterone receptor (PgR) levels. In the analyses of progression free survival, the Cox proportional hazards model was used to calculate the hazard ratios (HRs) and 95% CI. Survival curves were generated using the method of Kaplan and Meier (1958, J. Am. Stat. Assoc., 53:457-481) and a log rank test for trend was used to test for differences. Correlation between microarray data and real-time PCR data was determined with Spearman rank correlation test. Computations were performed with the STATA statistical package, release 7.0 (STATA Corp., College Station, Tex.). All p-values are two-sided.
For the validation of the 44-gene signature, a classification algorithm (Gene Prediction Tool (GPT)) was developed that is comparable to the Compound Covariate Predictor (CCP) from BRB Array Tools. In detail, GPT applies two cut-off values instead of the midpoint used in the CCP tool for classification. The two thresholds are the median values of progressive disease and objective response and are defined in the tumors of the training set.
To obtain a robust classification algorithm, genes from the signature only become classifiers whenever the expression values are outside the two thresholds and as a result mainly represent one class, either progressive disease or objective response. When the expression level falls between the two cut-off values, the gene is excluded as classifier because the value can represent both response classes, i.e. progressive disease but also objective response. The gene classifiers from the predictive 44-gene signature are identified for each tumor from the validation set using the algorithms described herein. Finally, the ratio between the identified response predicting genes and resistance predicting genes determines the predicted signature-based response outcome.
To select discriminatory genes for the type of response to tamoxifen, a training set of 46 tumors was defined that comprised primary tumors of 25 patients with progressive disease (PD) and of 21 patients with objective response (OR, see
After supervised hierarchical clustering (shown in
In a validation set of 66 tumors, the predictive 44-gene signature correctly classified 27 of 35 patients with progressive disease (77% sensitivity, 95% CI: 0.59-0.89) and 15 of 31 patients with objective response (48% specificity, 95% CI: 0.31-0.67) with an odds ratio of 3.16 (95% CI: 1.10-9.11, p=0.03). In univariate analysis for response, the predictive signature appeared to be superior, i.e. more than 2-fold higher odds ratio, to most traditional factors (i.e. menopausal status, disease-free interval, first dominant site of relapse, estrogen and progesterone receptor status), of which only estrogen receptor-level (odds ratio, 1.54; 95% CI: 1.00-2.40; p=0.05) and progesterone receptor-level (odds ratio, 1.37; 95% CI: 1.05-1.79; p=0.02) showed significant predictive value. In multivariate analysis for response, the signature-based classification did not significantly (increase in X2=1.45) add to the traditional based-factor score (data not shown).
In addition, in univariate analysis only the 44-gene signature (hazard ratio, 0.54 [95% CI: 0.31-0.94]; p=0.03) and progesterone receptor-level (hazard ratio, 0.83 [95% CI: 0.73-0.96]; p=0.01) were significantly correlated with a longer time until tumor progression and this was retained for the signature in the multivariable analysis (hazard ratio, 0.48 [95% CI: 0.26-0.91]; p=0.03). Progesterone receptor is also independent, but with a less striking hazard ratio (0.82 [95% CI: 0.71-0.94]; p=0.01). After addition of the signature-based classification to the traditional-factors-based score the increase in 2 was 5.18 (df=1, p=0.02), indicating that the predictive signature independently contributed to the traditional predictive factors for progression free survival. In Kaplan-Meier analyses, the median difference in progression free survival time for patients with a favorable and poor response was 2-fold longer when the 44-gene signature (
aMenopausal status: post- vs premenopausal;
bDFI: >12 months vs ≦12 months;
c44-gene signature: sensitive vs resistant.
The mRNA expression levels of 8 genes of the 81-gene signature were analyzed by quantitative real-time PCR. The genes included: CASP2, DLX2, USP9X, CHD6, MST4, RABEP, SIAH2, and TNC. qPCR data was correlated with microarray data. Spearman rank correlations were positive for all genes but MST4.
The 81-gene signature described above in Example 1 was analyzed for the functional aspects of the genes contained in the signature. The genes were examined for functional relationships using Ingenuity Pathway Analysis tools. (Mountain View, Calif.) The signature contains 15 ESTs and 66 known genes (see
The patterns of expression of many genes that are associated with anti-estrogen, for example, tamoxiphen resistance and sensitivity are highly complex. The 81 differentially expressed genes includes, as expected, genes regulated by or associated with estrogen (receptor) action (van 't Veer, et al., 2002, Nature, 415:530-6; Tang, et al., 2004, Nucleic Acids Res., 32 Database issue: D533-6; Pusztai, et al., 2003, Clin. Cancer Res., 9:2406-15; Gruvberger, et al., 2001, Cancer Res., 61:5979-84; Charpentier et al., 2000, Cancer Res., 60:5977-83; Frasor, et al. 2003, Endocrinology, 144:4562-74), but also genes involved in extracellular matrix formation and apoptosis.
Seventeen genes were regulated by or associated with estrogen (receptor) action, of which 9 genes showed upregulation (LOC51186; TSC22; TIMP3; SPARC; GABARAPL1; CFP1; LDHA; ENO2; Hs. 99743) and 8 genes downregulation (TXN2; CDC42BP4; HLA-C; PSME1; Hs. 437986; SIAH2; UGCG; FMNL) in the primary tumors of patients who were resistant to tamoxifen therapy for recurring breast cancer (see
A cluster of 6 genes was identified as associated with the extracellular matrix (ECM). These genes, TIMP3, FN1, LOX, COL1A1, SPARC, and TNC were overexpressed in the primary tumors of patients that demonstrated resistance to anti-estrogen, for example, tamoxiphen therapy for treatment of recurring breast cancer (progressive disease).
Besides cytostatic effects, the anti-estrogen tamoxifen is known to have cytolytic effects by induction of apoptosis, as reviewed by Mandlekar and Kong (Mandlekar, et al. 2001, Apoptosis, 6:469-77). Based on Swiss prot, PubMed, and Ingenuity analysis information, nine genes (LOC51186; TSC22; TIMP3; SPARC; GABARAPL1; CFP1; LDHA; ENO2; Hs. 99743) in the 81-gene signature are related to programmed cell death, of which three genes inhibit apoptosis (API5, NPM1, and TXN2) whereas three other genes induced apoptosis (CASP2, MAP2K4, and SIAH2). Interestingly, the two latter genes (MAP2K4, SIAH2) induce the apoptotic machinery of fibroblasts.
Seven genes of the Signature were associated with apoptosis (IL4R, LDHA, MAP2K4, NPM1, SIAH2, CASP2, and TXN2), whereas two other signature genes (API5, BNIP3) were related to anti-apoptosis processes.
In general, the expression patterns indicate that anti-estrogen, for example, tamoxiphen resistance is mainly associated with inhibition of apoptosis. Interestingly, 4 apoptosis genes (API5, NPM1, LDHA, and BNIP3) were upregulated and 5 genes (IL4R, MAP2K4, SIAH2, CASP2, and TXN2) were downregulated in primary tumors of patients that were resistant to anti-estrogen, for example, tamoxiphen therapy for treatment of recurring breast cancer (progressive disease).
Ten genes selected from the 81-gene signature (CHD6, FN1, TNC, CASP2, EZH1, RABEP1, THRAP2, SIAH2, DEME-6, COX6C) were analyzed to date against 272 tumors. Uni- and multivariable analyses were performed to determine the response and duration of response (progression free survival), using the methods described above. In multivariable analysis the individual genes were compared with the clinically used model of traditional predictive factors (i.e., menopausal status, disease-free survival, dominant site of relapse, log ER, and log PR).
Specific calculation of threshold values (cutpoints) (Table 3) for prediction of Overall Response and Progression Free Survival were calculated as described above. As shown in Tables 3-6, marker genes DEME-6, CASP2, and SIAH2 were useful as individual markers of clinical outcome. Reliability of the prediction increased with combinations of the markers (see Tables 5 and 6).
None of the ECM genes in the signature is found in the 70-gene classifier for poor prognosis of node negative breast cancer patients by van 't Veer et al. (van 't Veer, et al., 2002, Nature, 415:530-6), suggesting that the ECM gene cluster is specific for the prediction of tamoxifen resistance. Furthermore, SPARC/osteonectin, a myoepithelial cell marker that is estrogen (co-)regulated, was recently described as an independent marker of poor prognosis in unselected breast cancers (Iacobuzio-Donahue, et al., 2002, Cancer Res., 62:5351-7; Mackay, et al., 2003, Oncogene, 22:2680-8; Jones, et al., 2004, Cancer Res., 64:3037-3045). In addition, a new cluster of genes linked to the immune system (FCGRT, PSME1, HLA-C, and NFATC3) was downregulated in the patients with progressive disease compared to those with objective response.
The 81-gene signature showed an overrepresentation of genes located to chromosome 17, but an under representation of genes located to chromosomes 4, 15, 18 and 21 (
The studies described above in Examples 1-4 demonstrate that expression array technology can be effectively and reproducibly used to classify primary breast cancer tumors according to a predicted resistance or sensitivity to anti-estrogen, for example, tamoxifen treatment for recurring breast cancer. An 81-gene signature with multiple individual genes predictive of response and outcome, alone or in combination with other genes is described and validated. A 44-gene signature is described that predicted anti-estrogen, for example, tamoxiphen therapy outcome in 112 breast cancer patients with ER positive recurrent disease. Overall, a prediction of anti-estrogen, for example, tamoxiphen resistance was accomplished with an accuracy of 80%. Moreover, the 44-gene gene signature predicted a significantly longer progression free survival time that is superior to the prediction obtained by a traditional factors-based score. Differences in RNA expression were confirmed by quantitative real-time PCR.
The predictive value of the 44-gene signature compares favorably and contributes independently with that of traditional prognostic factors, including the estrogen receptor, currently the validated factor for response prediction to hormonal therapy in breast cancer. The estrogen receptor, present in about 70-75% of breast cancers, correctly predicts response to tamoxifen in about 50-60% of the patients (Osborne, 1998, N. Engl. J Med. 339:1609-18), while the gene signature predicts resistance to tamoxifen in 77% of the patients in the validation set.
The present 44-gene signature, due to its significant association with time to treatment failure, may be used to classify patients based on time to treatment failure.
In general, the arrays used in these different studies comprise different genes/ESTs than those disclosed in the prior art. Of these arrays, approximately half of the genes show overlap. This could result in few overlapping genes in the generated gene-signatures. Therefore, comparison of pathways based on the extracted gene signatures from different studies could be more informative. At present, none of these differentially expressed genes that are regulated by or associated with estrogen (receptor) action have been directly linked by others with endocrine resistance in clinical samples. The data described herein provides a better understanding of endocrine resistance and provides novel potential therapeutic targets for individualized treatment.
A diagnostic assay was recently developed by Genomic Health, the Oncotype DX diagnostic assay based on a candidate gene selection (not genome wide) approach. This test provides a recurrence score for lymph node negative breast cancer patients with estrogen receptor positive tumors that have received adjuvant tamoxifen (Paik, et al., 2003, Breast Cancer Res. Treat., 82:S10). Their multiplex 21-gene test includes genes associated with proliferation, estrogen and HER2 action, invasion and 5 control genes. None of the genes however, overlap with the 81-gene signature that was selected through microarray based gene expression profiling.
Recently, Sgroi et al. (Ma, et al., 2004, Cancer Cell, 5:607-16) also analyzed tumors from patients with adjuvant tamoxifen therapy using microarray analyses. They extracted a two-gene ratio that predicts “a tumor's response to tamoxifen or its intrinsic aggressiveness, or both”. Interestingly Sgroi et al. (Ma, et al., 2004, Cancer Cell, 5:607-16) showed that HOXB13, located to 17q21 was overexpressed in tamoxifen resistant cases with recurrence after adjuvant tamoxifen. In the 81-gene signature, we observed 5 genes located to chromosome 17q21-22 that could be of importance for tamoxifen resistance. In this region, the signature gene COL1A1 was discriminative and highly expressed in the signature. Moreover, HOXB13, like COL1A1, is not positioned in the 17q21 HER2/ERBB2 amplicon (Hyman, et al., 2002, Cancer Res., 62:6240-5) but in the second of three regions (i.e. 17q12-HER2-, 17q21.2-HOXB2-7-, 17q23-PPM1D-) highly amplified in breast cancer. This implies that genes other than those of the ERBB2 amplicon region, like HOXB13 and COL1A1 are important for resistance to tamoxifen and present potential therapeutic targets.
The expression of the other 4 signature genes located to chromosome 17q does not correlate with the ERBB2 expression, since they (EZH1, FMNL, KIAA0563, and APPBP2) were down regulated in the tamoxifen resistant tumors. This region has been implicated for LOH in 30% of breast cancer cases (Osborne, et al., 2000, Cancer Res., 60:3706-12). Only recently, JUP/plakoglobin/gamma-catenin was identified as a LOH, whereas LOH of BRCA1 is frequently observed in high-grade tumors (Ding, et al., 2004, Br. J Cancer, 90:1995-2001). The signature gene EZH1 located between JUP and BRCA1 may, therefore, be another LOH candidate gene.
Numerous reports have described that ERBB2 amplification and over-expression in ER positive patients is associated with a reduction in response rate to first-line hormone therapy (Lipton, et al., 2003, J. Clin. Oncol., 21:1967-72; Ferrero-Pous, et al., 2000, Clin. Cancer Res., 6:4745-54; Wright, et al., 1989, Cancer Res., 49:2087-90). Since the expression patterns of the 5 signature genes on 17q21-q22 are not significantly correlated with ERBB2 expression in this array study, this indicates that another, yet unknown, mechanism may be activated.
An 81-gene signature of differentially expressed genes and a 44-gene signature that predicts anti-estrogen, for example, tamoxifen therapy resistance and time to progression in ER-positive breast cancer patients with recurrent disease have been developed. The gene signatures demonstrate a significantly better performance than the commonly used traditional clinical predictive factors in uni- and multivariate analyses, and (3). In contrast to the traditional factors site of relapse and disease free interval (DFI), the prediction of response can be derived from the gene-expression profile of primary tumors.
The 81-gene signature was validated with quantitative PCR analysis (as described above) on RNA obtained from a larger series of 272 tumors from breast cancer patients who underwent first-line tamoxifen therapy for advanced disease. Included were patients having stable disease. Of these, 59 showed an objective response, 120 had stable disease, and 93 had progressive disease.
Ten genes selected from the 81-gene signature (CHD6, FN1, TNC, CASP2, EZH1, RABEP1, THRAP2, SIAH2, DEME-6, COX6C) have been analyzed to date against all 272 tumors. Uni- and multivariable analyses have been performed to determine the response and duration of response (progression free survival). In multivariable analysis the individual genes were compared with the clinically used model of traditional predictive factors (i.e., menopausal status, disease-free survival, dominant site of relapse, log ER, and log PR).
Clinical implications for patients predicted to have a poor response to tamoxifen therapy are that these patients should be candidates for other treatments or novel therapies, based on different targets present in their tumor profiles. This will reduce the use of ineffective treatments.
indicates data missing or illegible when filed
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB04/04405 | 12/3/2004 | WO | 00 | 4/13/2007 |
Number | Date | Country | |
---|---|---|---|
60527608 | Dec 2003 | US |