The invention relates to the field of oncology. More specifically, the invention relates to a method for typing breast cancer cells. The invention provides means and methods for classification of breast cancer cells and provides a treatment protocol based on the typing of the cells.
About 70% of human breast cancers are ERα positive and depend on this hormone receptor for their proliferation (Harvey et al., 1999. J Clin Oncol 17: 1474-81), rendering ERα an ideal target for endocrine treatment. Tamoxifen is one of the most commonly used drugs in the management of ERα positive breast cancer. In early breast cancer, 5 years of adjuvant treatment with tamoxifen almost halves the rate of disease recurrence and reduces the annual breast cancer death rate by one-third (EBCTCG, 2005. Lancet 365: 1687-717). Despite this adjuvant treatment with tamoxifen, one-third of women still develop recurrent disease in the next 15 years (EBCTCG, 2005. Lancet 365: 1687-717), illustrating that endocrine resistance is a major problem in the management of breast cancer.
Several mechanisms may contribute to tamoxifen resistance. At presentation, not all ERα positive tumours are sensitive to tamoxifen. This intrinsic endocrine resistance can be the result of ERα phosphorylation (Musgrove and Sutherland, 2009. Nat Rev Cancer 9: 631-43; Michalides et al., 2004. Cancer Cell 5: 597-605; Campbell et al., 2001. J Biol Chem 276: 9817-24). In addition, intrinsic tamoxifen resistance is found to correlate with increased levels or activity of ERα co-activators (AIB1), growth factor receptors (EGFR, HER2, IGF1R), kinases (AKT and ERK1/2) or adaptor proteins (BCAR1, c-SRC and PAK1) (Musgrove and Sutherland, 2009. Nat Rev Cancer 9: 631-43; Beelen et al., 2012. Nature reviews Clinical oncology 9: 529-41). Loss of CDK10 expression (Iorns et al., 2008. Cancer Cell 13: 91-104) and loss of insulin-like growth factor binding protein 5 (IGFBP5) expression (Ahn et al., 2010. Cancer Res 70: 3013-3019) can also lead to tamoxifen resistance. Furthermore, high levels of lemur tyrosine kinase-3 (LMTK3) or CUEDC2 protein are associated with tamoxifen resistance (Giamas et al., 2011. Nat Med 17: 715-719; Pan et al., Nat Med 17: 708-149). Acquired endocrine resistance develops in a certain proportion of metastasized ERα-positive breast cancer that was initially sensitive to tamoxifen palliative treatment. One possible mechanism of this resistance is upregulation of the PI3K-mTOR pathway, leading to ligand independent phosphorylation of ERα at serine 167 by S6K1 (Yamnik et al., 2009. J Biol Chem 284: 6361-9; Yue et al., 2007. J Steroid Biochem Mol Biol 106: 102-10; Miller et al., 2010. J Clin Invest 120: 2406-13). It is nevertheless likely that additional mechanisms of unresponsiveness to endocrine treatment play a role, that remain to be identified.
To elucidate novel mechanisms of tamoxifen resistance in breast cancer, a shRNA screen was performed in the hormone-dependent human luminal breast cancer cell line ZR-75-1 to identify genes whose suppression can induce tamoxifen resistance. The present inventors surprisingly found that loss of USP9X enhances ERα/chromatin interactions in the presence of tamoxifen, leading to tamoxifen-stimulated gene expression of ERα target genes and cell proliferation.
The present inventors have developed a gene expression profile that is indicative of the activity of USP9X in a breast cancer cell in the presence of tamoxifen. Methods of typing a sample from a breast cancer patient to determine the presence or absence of activity of USP9X, comprise determining the level of expression of genes from the gene profile.
The invention provides a method of typing a sample from a breast cancer patient that is treated with tamoxifen, the method comprising determining a level of expression for USP9X and/or for at least two genes that are selected from Table 1 in a relevant sample from the breast cancer patient, whereby the sample comprises expression products from a cancer cell of the patient; comparing said determined level of expression of USP9X or of the at least two genes to the level of expression of USP9X or the at least two genes in a reference; and typing said sample as being responsive to treatment with tamoxifen or not, based on the comparison of the determined levels of expression.
In a preferred method according to the invention, the sample is typed by determining a level of RNA expression for at least two genes that are selected from Table 1 and comparing said determined RNA level of expression to the level of RNA expression of the at least two genes in a reference. Said reference is preferably a measure of the average level of said at least two genes in at least 10 independent individuals.
A further preferred method according to the invention comprises determining a level of expression of at least five genes from Table 1, more preferred 10 genes from Table 1, more preferred 20 genes from Table 1, more preferred 50 genes from Table 1, more preferred 100 genes from Table 1, more preferred all genes from Table 1.
The invention further provides a method of assigning anti-estrogen receptor-directed therapy (antiER) comprising tamoxifen to a breast cancer patient, comprising typing a sample from the breast cancer patient with a method according to the invention; and assigning anti-estrogen receptor-directed therapy comprising tamoxifen to a patient of which the sample is typed as being responsive to treatment with tamoxifen.
The invention further provides a method of assigning further antiER directed therapy or chemotherapy to a breast cancer patient, comprising typing a sample from the breast cancer patient with a method according to the invention; and assigning chemotherapy to a patient of which the sample is typed as being non-responsive to treatment with tamoxifen.
Said further antiER directed therapy comprises the administration of a selective estrogen receptor modulator not being tamoxifen, an aromatase inhibitor, preferably anastrozole, and/or GnRH or a GnRH-analogue.
Said chemotherapy preferably comprises administration of a platinum agent, preferably cisplatin, and/or a PARP inhibitor, preferably ABT-888.
(A) Set up of the screen. ZR-75-1 cells stably expressing the murine ecotropic receptor were infected with retroviral supernatants containing a selection of the NM pRS-shRNA library divided in 44 pools—each pool contains 285 distinct short hairpin RNA's against 95 genes—or pRS as control. After puromycin selection 2×105 cells of each pool and control were plated in 15 cm dishes and cultured in DMEM with 1 μM 4OH-tamoxifen for 4-6 weeks. Tamoxifen resistant individual colonies were isolated and one of the rescuing shRNAs was identified as USP9X.
(B) Knockdown of USP9X rescues tamoxifen induced growth arrest. ZR-75-1 cells were infected with the single shRNA against USP9X recovered from the initial screen or pRS-GFP as control. Cells were cultured for 4-6 weeks in the presence of 1 μM 4OH-tamoxifen. When colonies appeared, cells were fixed and stained.
(C) USP9X hit validation. Five independent shRNAs targeting different regions of the USP9X gene were designed and colony formation assays with ZR-75-1 cells infected with each shRNA were performed. Rescue from tamoxifen induced growth arrest by USP9X knockdown was validated by three independent shRNAs.
(D) Knockdown of USP9X decreases USP9X mRNA levels.
(E) Knockdown of USP9X decreases USP9X protein levels.
(F) Knockdown of USP9X rescues tamoxifen-induced growth arrest in T47D cells. T47D cells were infected with the shRNA against USP9X recovered from the initial screen, pRS-USP9X II or pRS-GFP as control. Cells were cultured for 4-6 weeks in the presence of 1 μM 4OHtamoxifen. When colonies appeared cells were fixed and subsequently stained.
(A) Knockdown of USP9X increases activity on an ERE luciferase reporter in serum supplemented DMEM, in the absence and presence of tamoxifen. Data are represented as mean and standard deviation (SD) of three independent experiments.
(B) Knockdown of USP9X increases ERE luciferase in hormone-deprived, estradiol and 4OH20 tamoxifen treated cells. Data are represented as mean and SD of three independent experiments.
(C) USP9X knockdown in the presence of estradiol increases mRNA levels of the ERα target genes PGR, TFF1 and ERα. Data are represented as mean and SD of three independent experiments.
(D) Knockdown of USP9X increases ERα and PR protein levels in hormone-deprived, estradiol or 4OH-tamoxifen treated cells.
(A) Exogenous expressed ERα binds to endogenous USP9X in Phoenix cells. 48 hours after transfection with ERα, immunoprecipitations were performed for either anti-ERα (third lane) or anti-USP9X (fourth lane) and Westerns were stained for ERα and USP9X. The first lane shows 10% input of the whole cell lysate (wcl), the second lane shows immunoprecipitation with normal mouse serum (nms) as control.
(B) Endogenous ERα binds to endogenous USP9X in ZR-75-1 breast cancer cells. (Experimental conditions were identical to A).
(A) ERα ChIP-seq signal in control cells (top part) and shUSP9X cells (lower part) in the presence of indicated ligand. Tag counts (Y-axis) and genomic locations (X-axis) are indicated.
(B) Heatmap visualization, depicting a vertical alignment of all identified peaks of control (shGFP, left) and USP9XKD (shUSP9X, right) raw read counts of veh, E2, or 4-OHT treated cells. Arrowhead indicates top of the peak and scale bar is indicated.
(C) Read count quantification of data presented in Fig. B showing enrichment of ERα/DNA interactions in the presence of 4-OHT in the shUSP9X cells compared to the control (shGFP) cells. Y-axis: average tag count (arbitrary units). X-axis shows distance from centre of the peak (−2.5 kb, +2.5 kb).
(D) Venn diagrams showing a significant increase in the number of ERα/chromatin binding events in the shUSP9X (right) cells compared to control (shGFP) cells (left) in the presence of 4-OHT, representing a subset of the E2-induced binding patterns. Numbers indicate binding events in each subgroup (veh; dark grey, E2; black, 4-OHT; light gray).
(E) Venn diagrams showing shared and unique peaks for control cells (left hand circles) and shUSP9X cells (right hand circles) under vehicle (left), E2 (middle) and 4-OHT (right) conditions. Numbers indicate binding events in each subgroup.
(F) Genomic distributions of peaks under all tested conditions. Locations are indicated relative to the most proximal genes. 4-OHT shUSP9X unique: unique binding sites in tamoxifen treated shUSP9X cells as compared to tamoxifen-treated control cells.
(G) De novo motif enrichment analysis identified ESR motifs enriched for 4-OHT shUSP9X unique peaks and peaks shared by 4-OHT-treated shGFP control cells and shUSP9X cells.
(A) Left panel: Venn diagram showing differentially expressed genes in control cells (shGFP) after treatment with E2 (black) or 4-OHT, (grey), as compared to vehicle control (p<0.05). The 1906 differentially expressed genes after 4-OHT treatment represent a subset of the 8794 E2 induced genes. Right panel: Venn diagram showing differentially expressed genes after E2 treatment in control cells (left hand circle) and differentially expressed genes in 4-OHT-treated shUSP9X cells compared to 4-OHT-treated control (right hand circle). Differentially expressed genes in 4-OHT-treated shUSP9X cells represent a subset of E2-responsive genes in the control cells.
(B) Proximal ERα binding events for differentially expressed genes in 4-OHT-treated shUSP9X cells. ERα binding events found only in 4-OHT-treated control cells (left), 4-OHTtreated shUSP9X cells (middle) or shared between both conditions (right) were analysed for proximal binding (<20 kb) to transcription start sites of differential expressed genes in 4-OHTtreated shUSP9X cells and 4-OHT-treated control cells. Y-axis shows absolute number of differentially expressed genes.
(C) Average ERα read count intensity of ERα chromatin binding sites in 4-OHT-treated shUSP9X cells, proximal to (<20 kb) TSS regions of genes, differential expressed between 4-OHT-treated shUSP9X cells and 4-OHT-treated control cells. Y-axis shows average read count (a.u.). X-axis distance from centre of the peak (−2.5 kb, +2.5 kb).
(D) USP9X-differentially expressed genes in the presence of 4-OHT, with proximal ERα binding sites, were analysed for containing genes from the Perou-signature basal and luminal genes. Y-as shows percentage.
(E) Heatmap showing differentially expressed genes between 250 patients with primary ERα-positive breast cancer who received adjuvant tamoxifen. X-axis: patients. Y-axis: genes.
(F) Left panel: A USP9X knockdown tamoxifen gene signature identifies breast cancer patients with poor outcome after adjuvant tamoxifen treatment. Kaplan-Meier survival curves for distant metastasis free survival (DMFS) in a publically available cohort of primary ERα positive breast cancer patients treated with adjuvant tamoxifen (n=250). Middle panel: The USP9X knockdown tamoxifen gene signature is validated in a second cohort of primary ERα positive breast cancer patients treated with adjuvant tamoxifen. Kaplan-Meier survival curves for DMFS in a cohort of primary ERα positive breast cancer patients treated with adjuvant tamoxifen (n=134). Right panel: The USP9X knockdown tamoxifen gene signature does not correlate with outcome in breast cancer patients who did not receive any adjuvant endocrine treatment. Kaplan-Meier survival curves for DMFS in a cohort of primary ERα positive breast cancer patients that did not receive adjuvant endocrine treatment (n=209).
The term USP9X, as used herein, refers to a ubiquitin specific peptidase 9 which is X-linked. Alternative names for this gene are ubiquitin specific protease 9, X-linked; FAF-X; Drosophila Fat Facets related, X-Linked (DFFRX); Fat Facets Protein-Related, X-Linked; and Ubiquitin Thioesterase.
The term tamoxifen, as used herein, refers to a compounds that bind to the estrogen receptor and that blocks the effects of the hormone estrogen on cancer cells, thereby lowering the chance that breast cancer cells will grow. The term tamoxifen includes the compound tamoxifen ((Z)2-[4-(1,2-diphenyl-1-butenyl) phenoxy]-N, N-dimethylethanamine 2-hydroxy-1,2,3-propanetricarboxylate (1:1)) and variants thereof such as toremifene (2-{4-[(1Z)-4-chloro-1,2-diphenyl-but-1-en-1-yl]phenoxy}-N,N-dimethylethanamine).
The term further antiER directed therapy, as used herein, refers to compounds that modulate the levels of estrogen, the binding of estrogen to the receptor, and/or gene activation by the estrogen receptor. The term further antiER directed therapy excludes tamoxifen. Examples of further antiER directed therapy are provided by selective estrogen receptor modulators apart from tamoxifen, GnRH or a GnRH-analogue and/or of an aromatase inhibitor.
The term typing refers to the classification of a sample from a cancer patient, preferably a breast cancer patient. Said typing is preferably used to predict whether the individual has a high risk of being or becoming resistant to treatment with anti-estrogen receptor-directed therapy selected from tamoxifen, or a low risk of being or becoming resistant to treatment with said anti-estrogen receptor-directed therapy. For this, the level of expression of USP9X or of at least two genes of the set of genes selected from Table 1 is determined in a relevant sample from the individual. Modulation of the level of expression of USP9X, when compared to the level of expression of USP9X in a reference, or modulation of the level of expression of the at least two genes of the set of genes selected from Table 1, compared to the level of expression of the at least two genes of the set of genes selected from Table 1 in a reference, is indicative of a high risk of being or becoming resistant to treatment with tamoxifen.
The term sample, as used herein, refers to a relevant sample comprising expression products from a cancer cell of the patient, preferably a breast cancer cell. Said sample is preferably derived from a primary or metastasized breast cancer. A sample comprising expression products from a cancer cell of an individual suffering from breast cancer is provided after the removal of all or part of a cancerous growth from the individual, for example after biopsy. For example, a sample comprising expression products may be obtained from a needle biopsy sample or from a tissue sample comprising breast cancer cells that was previously removed by surgery. The surgical step of removing a relevant tissue sample, preferably a part of the cancer, from an individual is not part of a method according to the invention. It is preferred that at least 10% of the cells or tissue from which a relevant sample comprising expression products is derived, are breast cancer cells, more preferred at least 20%, more preferred at least 30%, more preferred at least 50%. The sample may have been fixed, for example a formalin-fixed paraffin-embedded (FFPE) sample.
The term expression products, as is used herein, refers to protein expression products or, preferably, RNA expression products. A sample from an individual suffering from breast cancer comprising protein expression products from a cancer of the patient can be obtained in numerous ways, as is known to a skilled person. For example, proteins can be isolated from a sample using, for example, cell disruption and extraction of cellular contents. Suitable methods and means are known in the art, such as dounce pestles and sonication methods. In addition, preferred methods include reagent-based lysis methods using detergents. These methods not only lyse cells but also solubilize proteins. Cell disruption may be followed by methods for enrichment of specific proteins, including subcellular fractionation and depletion of high abundant proteins. Differences in protein expression between a sample from an individual suffering from cancer and a reference sample is studied, for example, by two-dimensional (2D) gel electrophoresis and/or mass spectrometry techniques such as, for example, electrospray ionization and matrix-assisted laser desorption ionization.
The term reference, as used herein, refers to a sample comprising expression products from a related or an unrelated source. A preferred reference comprises expression products from a cancer cell, preferably a breast cancer cell, that is known to be resistant to tamoxifen, from a cancer cell, preferably a breast cancer cell, that is known not to be resistant to tamoxifen, or from a mixture of resistant and non-resistant cancer cells.
The term functionally inactivated, as used herein, refers to an alteration that diminishes or abolishes the expression and/or activity of USP9X. Said alteration can be a genetic alteration, for example an insertion, a point mutation, or, preferably, two or more point mutations in the gene encoding USPX, or an alteration in one of more genes of which the expression product is involved, preferably required, in a USP9X-mediated activity or pathway.
The term target protein, as is used herein, refers to the USP9X protein and/or to a protein product of a gene that is depicted in Table 1.
Methods of Typing a Sample from a Breast Cancer Patient
The present inventors surprisingly found that downregulation of USP9X induces tamoxifen-stimulatory effects on ERα action, leading to resistance to ER-targeting therapy such as tamoxifen. Furthermore, it is shown that a tamoxifen-induced gene expression signature in USP9X knockdown cells can be used to identify cancer patients, especially breast cancer patients, with a poor outcome after tamoxifen treatment and that are likely not to benefit from further tamoxifen treatment.
As is indicated hereinabove, USP9X is an X-linked ubiquitin-specific peptidase. Ubiquitination serves a role in both protein degradation and regulation of protein function. The level of protein ubiquitination is highly regulated by two families of enzymes with opposing activities: the ubiquitin ligases, which add ubiquitin moieties to proteins and deubiquitinating enzymes (DUBs) that remove them. The X-linked deubiquitinase USP9X is a member of the family of DUB enzymes and regulates multiple cellular functions by deubiquitinating and stabilizing its substrates. USP9X has been shown to regulate, amongst others, cell adhesion molecules like 6-catenin and E-cadherin, cell polarity, chromosome segregation, NOTCH, mTOR and TGF-beta signalling as well as apoptosis (Taya et al., (1998) J Cell Biol 142, 1053-1062; Taya et al., (1999) Genes Cells 4, 757-767; Murray et al., (2004) Mol Biol Cell 15, 1591-1599; Théard et al., (2010) EMBO J 29, 1499-1509; Dupont et al., (2009) Cell 136, 123-35).
A shRNA screen in the hormone-dependent human luminal breast cancer cell line ZR-75-1 was employed to identify genes whose suppression can induce tamoxifen resistance. An unexpected role for USP9X in the response to tamoxifen was identified. Loss of expression products of USP9X enhance ERα/chromatin interactions in the presence of tamoxifen, leading to tamoxifen stimulated gene expression of ERα target genes and cell proliferation.
Furthermore, a Tamoxifen-Induced Gene Expression Signature (TIGES) was identified in USP9X knockdown cells that can be used to identify cancer patients, especially breast cancer patients, with a poor outcome after tamoxifen treatment. These genes, as indicated in Tables 1A and 1B, were identified as their relative level of expression was found to be modulated by the presence or absence of USP9X. The term relative is used to indicate that the level of expression was compared to the level of expression in a reference, for example pooled breast cancer samples. The expression of each of the genes depicted in Table 1 correlates with one of two phenotypes. This correlation is represented as a UP or DOWN, indicating upregulation (UP) in the absence of USP9X, and downregulation (DOWN) in the absence of USP9X. For example, upregulation of A1BG or AKT2, and downregulation of ABAT, is indicative of the presence of functionally inactived USP9X.
Methods of classifying a sample from a breast cancer patient that is treated with anti-estrogen receptor-directed therapy selected from tamoxifen according to the presence or absence of a TIGES profile in a breast cancer cell comprise determining the level of expression of at least 2 genes from the gene profile, as indicated in Table 1. The methods of the invention allow classifying a breast cancer sample as likely to become resistant to treatment with anti-estrogen receptor-directed therapy, or not. Therefore, the TIGES profile allows the functional classification of functional inactivation of USP9X in a breast cancer sample. In addition, the TIGES profile can also be used to classify a sample from a breast cancer patient in which a process or signaling pathway involving USP9X is functionally inactivated by functional inactivation of one or more genes encoding other necessary components of the process or pathway.
In a preferred method according to the invention, a level of expression of at least five genes from Table 1 is determined, more preferred a level of expression of at least ten genes from Table 1, more preferred a level of expression of at least twenty genes from Table 1, more preferred a level of expression of at least thirty genes from Table 1, more preferred a level of expression of at least forty genes from Table 1, more preferred a level of expression of at least fifty genes from Table 1, more preferred a level of RNA expression of all two hundred thirty four genes from Table 1.
Said tamoxifen-induced gene expression signature preferably comprises at least two genes from Table 1. Said at least two genes preferably comprise genes with the highest Z-scores. Said at least two genes preferably comprise zinc finger protein 608 ((Z-score −1.008943904) and BUB1 mitotic checkpoint serine/threonine kinase B (Z-score 1.065024239). Said at least two genes preferably comprise zinc finger protein 608 ((Z-score −1.008943904), calpain 2, (m/II) large subunit (Z-score −0.936786567), BUB1 mitotic checkpoint serine/threonine kinase B (Z-score 1.065024239) and centromere protein A (Z-score 1.01511874). Said at least two genes preferably comprise zinc finger protein 608 ((Z-score −1.008943904), calpain 2, (m/II) large subunit (Z-score −0.936786567), FBJ murine osteosarcoma viral oncogene homolog (Z-score −0.920787895), ets homologous factor (Z-score −0.912814779), chondroitin sulfate synthase 1 (Z-score −0.897709367), BUB1 mitotic checkpoint serine/threonine kinase B (Z-score 1.065024239), centromere protein A (Z-score 1.01511874), cell division cycle 45 (Z-score 0.983080062), cell division cycle associated 3 (Z-score 0.97567222), and solute carrier family 25 (mitochondrial thiamine pyrophosphate carrier), member 19 (Z-score 0.974852744).
It is further preferred that said tamoxifen-induced gene expression signature comprises v-myb avian myeloblastosis viral oncogene homolog-like 2 and chondroitin sulfate synthase 1 (P value 1.25E-06 (Loi); 2.32E-05 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2 and calpain 2, (m/II) large subunit (P value 1.56E-05 (Loi); 4.59E-05 (Buffa)), BUB1 mitotic checkpoint serine/threonine kinase B and calpain 2, (m/II) large subunit (P value 2.67E-06 (Loi); 1.37E-05 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2, isocitrate dehydrogenase 3 (NAD+) alpha, and calpain 2, (m/II) large subunit (P value 4.77E-08 (Loi); 2.19E-05 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2, isocitrate dehydrogenase 3 (NAD+) alpha, and calpain 2, (m/II) large subunit (P value 1.56E-06 (Loi); 4.59E-05 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2, BUB1 mitotic checkpoint serine/threonine kinase B and calpain 2, (m/II) large subunit (P value 4.90E-05 (Loi); 5.42E-06 (Buffa)), chondroitin sulfate synthase 1, BUB1 mitotic checkpoint serine/threonine kinase B and calpain 2, (m/II) large subunit (P value 4.77E-08 (Loi); 2.19E-05 (Buffa)), and/or v-myb avian myeloblastosis viral oncogene homolog-like 2, chondroitin sulfate synthase 1, isocitrate dehydrogenase 3 (NAD+) alpha, and calpain 2, (m/II) large subunit (P value 6.99E-09 (Loi); 1.70E-05 (Buffa)).
More preferably, said signature comprises v-myb avian myeloblastosis viral oncogene homolog-like 2, chondroitin sulfate synthase 1 and isocitrate dehydrogenase 3 (NAD+) alpha (P value 7.75E-06 (Loi); 3.34E-08 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2, chondroitin sulfate synthase 1, isocitrate dehydrogenase 3 (NAD+) alpha and BUB1 mitotic checkpoint serine/threonine kinase B (P value 8.95E-07 (Loi); 5.78E-08 (Buffa)), v-myb avian myeloblastosis viral oncogene homolog-like 2, chondroitin sulfate synthase 1, isocitrate dehydrogenase 3 (NAD+) alpha, BUB1 mitotic checkpoint serine/threonine kinase B and calpain 2, (m/II) large subunit (P value 6.36E-07 (Loi); 2.28E-07 (Buffa)), and/or chondroitin sulfate synthase 1, isocitrate dehydrogenase 3 (NAD+) alpha, BUB1 mitotic checkpoint serine/threonine kinase B and calpain 2, (m/II) large subunit (P value 3.70E-07 (Loi); 4.46E-07 (Buffa)).
The term P value (Loi) refers to the P-value obtained from a set of 250 ER+ patients that were treated with tamoxifen, as described in Loi et al., 2007. J Clin Oncol 25: 1239-46. The term P value (Buffa) refers to the P-value obtained from a set of 134 ER+ patients that were treated with tamoxifen, as described in Buffa et al., 2011. Cancer Res 71: 5635-45.
A preferred subset comprises calpain 2 (CAPN2). A further preferred subset comprises CAPN2 and BUB1B. A further preferred subset comprises MYBL2, IDH3A, CHSY1, BUB1B, CAPN2. A selection of MYBL2, IDH3A, CHSY1, BUB1B, CAPN2 gave rise to the largest survival differences among 5 independent cohorts that were tested: Cohort 1 (GSE6532; Loi et al., 2007. J Clin Oncol 25: 1239-46); cohort 2 (GSE12093; Zhang et al., 2009. Breast Cancer Res Treat 116: 303-9), cohort 3 (GSE26971; Filipits et al., 2011. Clin Cancer Res 17:6012-20), cohort 4 (GSE9195; Loi et al., 2008. BMC Genomics 9:239), and cohort 5 (GSE17705; Symmans et al., 2010. J Clin Oncol 28:4111-9).
Downregulation of USP9X and/or modulation of the expression of at least two of the genes identified in Table 1, can be monitored at the RNA and protein level. Quantitation of the expression of a gene at the protein level can be either in absolute amount (e.g., μg/ml) or a relative amount (e.g., relative intensity of signals). Usually such procedures are performed by dedicated biochemical assays, such as chromatographic, mass spectrometric or hybridization assays.
Preferred chromatographic assays include Western-blotting assays, following one- or two-dimensional gel electrophoresis.
Hybridization techniques, such as ELISA techniques, immunohistochemistry (IHC), and in situ hybridization, and are very suitable to determine the concentration of a protein in a biological sample. Such techniques preferably involve the production of a calibration curve of label intensity, for example fluorescence intensity, vs. protein concentration, or the use of a competitive ELISA format, wherein known amounts of unlabeled protein are provided in the test. Alternatively, multiple sandwich ELISA can be developed using as second antibody, for instance an antibody raised by peptide immunisation against a second epitope of the target protein (a second synthetic peptide), or against a determinant that is formed by a complex that is formed between the target protein and the antibody.
In this regard, it is preferred to generate a non-natural intermediate, for example an antibody-gene product complex, by reaction of the sample with a first antibody that is directed against the target protein, followed by the application of a detection agent that detects the antibody-target protein complex. It is noted that the antibody-target protein complex does not exist in nature.
Preferred mass spectrometric assays include liquid chromatography-mass spectrometry (LC-MS, or alternatively HPLC-MS), tandem mass spectrometry (MS-MS), matrix assisted laser desorption (MALDI); matrix assisted laser desorption/ionisation time-of-flight (MALDI-TOF), MALDI-Fourier transform ion cyclotron resonance (MALDI-FTICR).
Methods to quantify expression levels of USP9X and/or of at least two of the genes identified in Table 1 at the RNA level are known to a skilled person and include, but are not limited to, Northern blotting, quantitative Polymerase chain reaction (qPCR), also termed real time PCR (rtPCR), microarray analysis and RNA sequencing, preferably next generation sequencing such as whole transcriptome shotgun sequencing. The term qPCR refers to a method that allows amplification of relatively short (usually 100 to 1000 basepairs) of DNA sequences. In order to measure messenger RNA (mRNA), the method involves a reverse transcriptase to convert mRNA into complementary DNA (cDNA) which is then amplified by PCR. The amount of product that is amplified can be quantified using, for example, TaqMan® (Applied Biosystems, Foster City, Calif., USA), Molecular Beacons, Scorpions® and SYBR® Green (Molecular Probes). Methods such as self sustained sequence replication (3SR), loop mediated isothermal amplification (LAMP), strand displacement amplification (SDA), rolling circle amplification (RCA) and quantitative nucleic acid sequence based amplification (qNASBA) can be used as an alternative for qPCR, as is known to the skilled person.
RNA may be isolated from a sample by any technique known in the art, including but not limited to Trizol (Invitrogen; Carlsbad, Calif.), RNAqueous® (Applied Biosystems/Ambion, Austin, Tx), Qiazol® (Qiagen, Hilden, Germany), RNeasy Isolation Kit (Qiagen, Hilden, Germany) Agilent Total RNA Isolation Kits (Agilent; Santa Clara, Calif.), RNA-Bee® (Tel-Test. Friendswood, Tex.), and Maxwell™ Total RNA Purification Kit (Promega; Madison, Wis.). A preferred RNA isolation procedure involves the use of Qiazol® (Qiagen, Hilden, Germany). A further preferred RNA isolation procedure involves the use of the Qiagen RNeasy FFPE RNA isolation Kits (Qiagen, Hilden, Germany). RNA can be extracted from a whole sample or from a portion of a sample generated from the cell sample by, for example, section or laser dissection.
A preferred method for determining a level of RNA expression is microarray analysis. For microarray analysis, a hybridization mixture is prepared by extracting and labelling of RNA. The extracted RNA is preferably converted into a labelled sample comprising either complementary DNA (cDNA) or cRNA using a reverse-transcriptase enzyme and labelled nucleotides. A preferred labelling introduces fluorescently-labelled nucleotides such as, but not limited to, cyanine-3-CTP or cyanine-5-CTP. Examples of labelling methods are known in the art and include Low RNA Input Fluorescent Labelling Kit (Agilent Technologies), MessageAmp Kit (Ambion) and Microarray Labelling Kit (Stratagene).
A labelled sample may comprise two dyes that are used in a so-called two-colour array. For this, the sample is split in two or more parts, and one of the parts is labelled with a first fluorescent dye, while a second part is labelled with a second fluorescent dye. The labelled first part and the labelled second part are independently hybridized to a microarray. The duplicate hybridizations with the same samples allow compensating for dye bias.
More preferably, a sample is labelled with a first fluorescent dye, while a reference, for example a sample from a breast cancer pool or a sample from a relevant cell line or mixture of cell lines, is labelled with a second fluorescent dye (known as dual channel). The labelled sample and the labelled reference are co-hybridized to a microarray.
Even more preferred, a sample is labelled with a fluorescent dye and hybridized to a microarray without a reference (known as single channel).
The labelled sample can be hybridized against the probe molecules that are spotted on the array. A molecule in the labelled sample will bind to its appropriate complementary target sequence on the array. Before hybridization, the arrays are preferably incubated at high temperature with solutions of saline-sodium buffer (SSC), Sodium Dodecyl Sulfate (SDS) and bovine serum albumin (BSA) to reduce background due to nonspecific binding, as is known to a skilled person.
The arrays are preferably washed after hybridization to remove labelled sample that did not hybridize on the array, and to increase stringency of the experiment by reducing cross hybridization of the labelled sample to a partial complementary probe sequence on the array. An increased stringency will substantially reduce non-specific hybridization of the sample, while specific hybridization of the sample is not substantially reduced. Stringent conditions include, for example, washing steps for five minutes at room temperature 0.1× Sodium chloride-Sodium Citrate buffer (SSC)/0.005% Triton X-102. More stringent conditions include washing steps at elevated temperatures, such as 37 degrees Celsius, 45 degrees Celsius, or 65 degrees Celsius, either or not combined with a reduction in ionic strength of the buffer to 0.05×SSC or 0.01×SSC as is known to a skilled person.
Image acquisition and data analysis can subsequently be performed to produce an image of the surface of the hybridised array. For this, the slide can be dried and placed into a laser scanner to determine the amount of labelled sample that is bound to a target spot. Laser excitation yields an emission with characteristic spectra that is indicative of the labelled sample that is hybridized to a probe molecule. In addition, the amount of labelled sample can be quantified.
The level of expression, preferably mRNA expression levels of genes depicted in Table 1, is preferably compared to levels of expression of the same genes in a template. A template is preferably an RNA sample isolated from a tissue of a healthy individual, preferably comprising breast cells. A preferred template comprises a RNA sample from a relevant cell line or mixture of cell lines. The RNA from a cell line or cell line mixture can be produced in-house or obtained from a commercial source such as, for example, Stratagene Human Reference RNA. A further preferred template comprises RNA isolated and pooled from normal breast tissue that is adjacent to the cancer tissue.
A more preferred template comprises an RNA sample from an individual suffering from breast cancer, more preferred from multiple individuals suffering from breast cancer. It is preferred that said multiple samples are pooled from more than 10 individuals, more preferred more than 20 individuals, more preferred more than 30 individuals, more preferred more than 40 individuals, most preferred more than 50 individuals. A most preferred template comprises a pooled RNA sample that is isolated from tissue comprising breast cancer cells from multiple individuals suffering from breast cancer.
As an alternative, a static template can be generated which enables performing single channel hybridizations. A preferred static template is calculated by measuring the median/mean background-subtracted level of expression (for example green-median/MeanSignal or red-median/MeanSignal) of a gene across 1-5 hybridization replicates of a probe sequence. The level of expression may be normalized as is known by a skilled person. Subsequently, a log transformation of each gene/probe gene signal is generated. With this transformation, the variance is stabilized (as with linear values as the signal gets higher the variance also increases; it compresses the range of data) and it makes the data more normally distributed, which allows statistics to be applied to the data. The signal intensity measurements obtain a distribution that is closer to a normal distribution with the variation being independent of the magnitude, allowing statistics to be applied to the data.
Typing of a sample can be performed in various ways. In one method, a coefficient is determined that is a measure of a similarity or dissimilarity of a sample with said template. A number of different coefficients can be used for determining a correlation between the RNA expression level in an RNA sample from an individual and a template. Preferred methods are parametric methods which assume a normal distribution of the data.
The result of a comparison of the determined expression levels with the expression levels of the same genes in at least one template is preferably displayed or outputted to a user interface device, a computer readable storage medium, or a local or remote computer system. The storage medium may include, but is not limited to, a floppy disk, an optical disk, a compact disk read-only memory (CD-ROM), a compact disk rewritable (CD-RW), a memory stick, and a magneto-optical disk.
The expression data are preferably normalized. Normalization refers to a method for adjusting or correcting a systematic error in the measurements of detected label.
Systemic bias results in variation by inter-array differences in overall performance, which can be due to for example inconsistencies in array fabrication, staining and scanning, and variation between labelled RNA samples, which can be due for example to variations in purity. Systemic bias can be introduced during the handling of the sample in a microarray experiment.
To reduce systemic bias, the determined RNA levels are preferably corrected for background non-specific hybridization and normalized using, for example, Feature Extraction software (Agilent Technologies). Other methods that are or will be known to a person of ordinary skill in the art, such as a dye swap experiment (Martin-Magniette et al., Bioinformatics 21:1995-2000 (2005)) can also be applied to normalize differences introduced by dye bias. Normalization of the expression levels results in normalized expression values.
Conventional methods for normalization of array data include global analysis, which is based on the assumption that the majority of genetic markers on an array are not differentially expressed between samples [Yang et al., Nucl Acids Res 30: 15 (2002)]. Alternatively, the array may comprise specific probes that are used for normalization. These probes preferably detect RNA products from housekeeping genes such as glyceraldehyde-3-phosphate dehydrogenase and 18S rRNA levels, of which the RNA level is thought to be constant in a given cell and independent from the developmental stage or prognosis of said cell.
Therefore, a preferred method according to the invention further comprises normalizing the determined RNA levels of said set of at least ten of the genes listed in Table 1 in said sample.
Said normalization preferably comprises previously mentioned global analysis “median centering”, in which the “centers” of the array data are brought to the same level under the assumption that the majority of genes are not changed between conditions (with median being more robust to outliers than the mean). Said normalization preferably comprises Lowess (LOcally WEighted Scatterplot Smoothing) local regression normalization to correct for both print-tip and intensity-dependent bias (for dual channel arrays) or “quantile normalization” (which transforms all the arrays to have a common distribution of intensities) for single channel arrays
In a preferred embodiment, genes are selected of which the RNA expression levels are largely constant between individual tissue samples comprising cancer cells from one individual, and between tissue samples comprising cancer cells from different individuals. It will be clear to a skilled artisan that the RNA levels of said set of normalization genes preferably allow normalization over the whole range of RNA levels. An example of a set of normalization genes is provided in WO 2008/039071, which is hereby incorporated by reference.
The levels of expression of genes from the TIGES signature in a sample of a patient are compared to the levels of expression of the same genes in a reference. Said comparison may result in an index score indicating a similarity of the determined expression levels in a sample of a patient with the expression levels in the reference. For example, an index can be generated by determining a fold change/ratio between the median value of gene expression across samples that have been typed as being responsive to treatment with tamoxifen and the median value of gene expression across samples that are typed as being non-responsive to treatment with tamoxifen. The significance of this fold change/ratio as being significant between the two respective groups can be tested primarily in an ANOVA (Analysis of variance) model. Univariate p-values can be calculated in the model and after multiple correction testing (Benjamini & Hochberg, 1995, JRSS, B, 57, 289-300) can be used as a threshold for determining significance that the gene expression shows a clear difference between the groups. Multivariate analysis may also be performed in adding covariates such as hormone expression, tumor stage/grade/size into the ANOVA model. Significant genes can be imputed into a prediction model such as Diagonal Linear Discriminant analysis (DLDA) to determine the minimal and most reliable group of gene signals that can predict the factor (response to therapy).
As an alternative, an index can be determined by Pearson correlation between the expression levels of the genes in a sample of a patient and the expression levels in one or more breast cancer samples that are known to respond to tamoxifen, and the average expression levels in one or more breast cancer samples that are known not to respond to tamoxifen. The resultant Pearson scores can be used to provide an index score. Said score may vary between +1, indicating a prefect similarity, and −1, indicating a reverse similarity. Preferably, an arbitrary threshold is used to type samples as being responsive or as not being responsive. More preferably, samples are classified as responsive or as not responsive based on the respective highest similarity measurement. A similarity score is preferably displayed or outputted to a user interface device, a computer readable storage medium, or a local or remote computer system.
The present invention further provides a method of assigning treatment to a breast cancer patient, the method comprising typing a sample from the breast cancer patient with a method according to the invention, and assigning treatment comprising tamoxifen to a patient of which the sample is typed as being responsive to treatment with tamoxifen.
Tamoxifen and tamoxifen derivatives such as toremifene, are known antagonistic compounds of the estrogen receptor. Methods for providing tamoxifen and/or toremifene to an individual in need thereof suffering from breast are known in the art. For example, tamoxifen may be administered at 20 to 200 mg/kg per day, for example as Tamoxifen Citrate Tablets USP for oral administration. Toremifene similarly can be administered as toremifene citrate at 10 to 800 mg/d orally.
The present invention further provides a method of not assigning tamoxifen-comprising therapy to a breast cancer patient, comprising typing a sample from the breast cancer patient with a method according to the invention; and not assigning tamoxifen to a patient of which the sample is typed as being non-responsive to treatment with tamoxifen. Said method preferably comprises the assignment of further antiER directed therapy and/or chemotherapy to a breast cancer patient of which the sample is typed as being non-responsive to treatment with tamoxifen.
Said further antiER directed therapy comprises selective estrogen receptor modulators (SERM), not including tamoxifen, GnRH or a GnRH-analogue and/or of an aromatase inhibitor.
A preferred non-tamoxifen SERM is provided by fulvestrant (7α,17β)-7-{9-[(4,4,5,5,5-pentafluoropentyl)sulfinyl]nonyl}estra-1,3,5(10)-triene-3,17-diol), which is an estrogen receptor antagonist with no agonist effects, which works by down-regulating the estrogen receptor. It is administered as a once-monthly injection at 500 mg.
A further preferred non-tamoxifen SERM is provided by raloxifene ([6-hydroxy-2-(4-hydroxyphenyl)-benzothiophen-3-yl]-[4-[2-(1-piperidyl)ethoxy]phenyl]-methanone). It is an estrogen receptor antagonist in breast cells, including breast cancer cells. It can be orally administered at 60-240 mg/kg/day.
Yet a further preferred non-tamoxifen SERM is provided by lasofoxifene ((5R,6S)-6-phenyl-5-[4-(2-pyrrolidin-1-ylethoxy)phenyl]-5,6,7,8-tetrahydronaphthalen-2-ol). It is an estrogen receptor antagonist in breast cells, including breast cancer cells. It can be orally administered at 0.001 mg/kg-1.0 mg/kg/day.
A further preferred antiER directed therapy comprises the administration of an aromatase inhibitor. These non-steroidal inhibitors inhibit the synthesis of estrogen via reversible competition for the aromatase enzyme. Preferred aromatase inhibitors include anastrozole (2,2′-[5-(1H-1,2,4-triazol-1-ylmethyl)-1,3-phenylene]bis(2-methylpropanenitrile) and exemestane (6-Methylideneandrosta-1,4-diene-3,17-dione). Anastrozole can be orally administered at 1.0-10 mg/day. Exemestane can be orally administered at 25-50 mg/day
Yet a further preferred antiER directed therapy comprises the administration of gonadotropin-releasing hormone (GnRH), also known as Luteinizing-hormone-releasing hormone (LHRH) and luliberin. GnRH is a trophic peptide hormone responsible for the release of follicle-stimulating hormone (FSH) and luteinizing hormone (LH) from the anterior pituitary. GnRH is synthesized and released from neurons within the hypothalamus. The peptide belongs to gonadotropin-releasing hormone family. Administration of GnRH lowers the levels of oestrogen and progesterone, resulting in estrogen levels that resemble that of a menopausal or post-menopausal woman.
As is known to the skilled person, a GnRH-analogue, for example Leuprolide, is a synthetic peptide drug that is modeled after the human GnRH. A GnRH-analogue is designed to interact with the GnRH receptor and modify the release of pituitary gonadotropins FSH and LH for therapeutic purposes. The synthetic hormone is preferably injected (1 and 3 month depot injections are available) or prescribed as nasal spray. However, the nasal spray is rarely used, because a constant and regular drug level is difficult to maintain.
Yet a further preferred therapy comprises chemotherapy, which includes the use of a chemotherapeutic agent such as an alkylating agent such as nitrogen mustard, e.g. cyclophosphamide, mechlorethamine or mustine, uramustine or uracil mustard, melphalan, chlorambucil, ifosfamide; a nitrosourea such as carmustine, lomustine, streptozocin; an alkyl sulfonate such as busulfan, an ethylenime such as thiotepa and analogues thereof, a hydrazine/triazine such as dacarbazine, altretamine, mitozolomide, temozolomide, altretamine, procarbazine, dacarbazine and temozolomide, which are capable of causing DNA damage; an intercalating agent such as a platinum agent like cisplatin, carboplatin, nedaplatin, oxaliplatin and satraplatin; an antibiotic such as an anthracycline such as doxorubicin, daunorubicin, epirubicin and idarubicin; mitomycin-C, dactinomycin, bleomycin, adriamycin, mithramycin; an antimetabolite such as capecitabine and 5-fluorouracil, gemcitabine, a folate analogue such as methotrexate, hydroxyurea, mercaptopurine, thioguanine; a mitostatic agent such as eribulin, ixabepilone, irinotecan, vincristine, mitoxantrone, vinorelbine and a taxane such as paclitaxel and docetaxel; an inhibitor of the enzyme poly ADP ribose polymerase (PARP), a receptor tyrosine kinase inhibitor such as gefitinib, erlotinib, EKB-569, lapatinib, CI-1033, cetuximab, panitumumab, PKI-166, AEE788, sunitinib, sorafenib, dasatinib, nilotinib, pazopanib, vandetaniv, cediranib, afatinib, motesanib, CUDC-101, and imatinib mesylate; and kinase inhibitors such as a MEK inhibitor including CKI-27, RO-4987655, RO-5126766, PD-0325901, WX-554, AZD-8330, G-573, RG-7167, SF-2626, GDC-0623, RO-5068760, and AD-GL0001; a B-RAF inhibitor including CEP-32496, vemurafenib, GSK-2118436, ARQ-736, RG-7256, XL-281, DCC-2036, GDC-0879, AZ628, and an antibody fragment EphB4/Raf inhibitor; a serine/threonine kinase receptor inhibitor, including an Alk-1 inhibitor such as crizotinib, ASP-3026, LDK378, AF802, and CEP37440, and combinations thereof.
Said chemotherapy is preferably selected from a platinum agent like cisplatin, carboplatin, oxaliplatin and satraplatin; taxane including paclitaxel and docetaxel, a PARP inhibitor, doxorubicin, daunorubicin, epirubicin, cyclophosphamide, 5-fluorouracil, gemcitabine, eribulin, ixabepilone, methotrexate, mitomycin-C, mitoxantrone, vinorelbine, thiotepa, vincristine, capecitabine, a receptor tyrosine kinase inhibitor and/or irinotecan, and combinations thereof.
A preferred PARP inhibitor includes 3-aminobenzamide, 4-(3-(1-(cyclopropanecarbonyl)piperazine-4-carbonyl)-4-fluorobenzyl)phthalazin-1(2H)-one (AZD-2281), 8-fluoro-2-{4-[(methylamino)methyl]phenyl}-1,3,4,5-tetrahydro-6H-pyrrolo[4,3,2-ef][2]benzazepin-6-one phosphate (1:1) (AG014699), 2-[(2R)-2-Methylpyrrolidin-2-yl]-1H-benzimidazole-4-carboxamide dihydrochloride benzimidazole carboxamide (ABT-888), and (8S,9R)-5-fluoro-8-(4-fluorophenyl)-9-(1-methyl-1H-1,2,4-triazol-5-yl)-8,9-dihydro-2H-pyrido[4,3,2-de]phthalazin-3(7H)-one (BMN-673).
More preferably, said chemotherapy comprises administration of a platinum agent and/or a PARP inhibitor. A most preferred platinum agent is cisplatin. A most preferred PARP inhibitor is ABT-888.
The human breast cancer cell lines ZR-75-1 (ATCC CRL-1500), MDA-MB-231 (ATCC HTB-26) and T47D (ATCC HTB133) were cultured in DMEM supplemented with 10% FCS, 2 mM glutamine, 100 μg/ml penicillin, 100 μg/ml streptomycin, and 1 nM estradiol at 37° C. in 5% CO2. In proliferation assays, estradiol was replaced by DMSO (vehicle), 1 μM 4OHtamoxifen (hereinafter: tamoxifen) or 10-7 M fulvestrant. Phoenix cells (ATCC CRL-3214) were cultured at 37° C. in 5% CO2 in DMEM with 10% FCS, 2 mM glutamine, 100 μg/ml penicillin, and 100 μg/ml streptomycin.
Phoenix cells were transfected using calcium phosphate method. Viral supernatant was cleared through a 0.45 μm filter. Target cells were infected with the viral supernatant in the presence of polybrene (8 μg/ml) and the infection was repeated once. For transient transfection of ZR-75-1 cells Lipofectamine 2000 (Invitrogen) was used, according to the manufacturers protocol.
NKI shRNA Library
The construction of the library was described previously (Berns et al., 2004. Nature 428: 431-7). Briefly, the NKI shRNA library was designed to target 7914 human genes, using three shRNA vectors for every targeted gene. The shRNAs are cloned into a retroviral vector (pRetroSUPER (pRS)) to enable infection of target cells.
Cells were infected with retroviral supernatant and selected with puromycin (2.0 μg/ml). When the selection was completed 5×104 cells were seeded in 10 cm dishes and cultured in DMEM with 1 μM 4OH-tamoxifen for 4-6 weeks. When colonies appeared, cells were fixed in MeOH/HAc (3:1) and subsequently stained with 50% MeOH/10% HAc/0.1% Coomassie.
shRNA Screen and Recovery of shRNA Inserts
ZR-75-1 cells stably expressing the murine ecotropic receptor were infected with retroviral supernatants containing a selection of the NKI pRS-shRNA library (12,540 shRNA vectors targeting 4180 genes divided in 44 pools—each pool contains 285 distinct short hairpin RNA's against 95 genes) or pRS as control (Berns et al., 2004. Nature 428: 431-7). After puromycin selection (2 μg/ml) 2×105 cells of each pool and control were plated in 15 cm dishes and cultured in DMEM with 1 μM 4OHtamoxifen for 4-6 weeks. Individual colonies that grew out in the presence of tamoxifen were isolated and expanded. Genomic DNA was isolated using DNAzol (Life Technologies). PCR amplification of the shRNA inserts was performed with Expand Long Template PCR system (Roche) and the use of pRS-fw primer: 5′-CCCTTGAACCTCCTCGTTCGACC-3′ and pRS-rev primer: 5′-GAGACGTGCTACTTCCATTTGTC-3′. Products were digested with EcoRI/XhoI and recloned into pRS. Hairpins were sequenced with Big Dye Terminator (Perkin Elmer) using pRS-seq primer: 5′-GCTGACGTCATCAACCCGCT-3′.
For retroviral transduction of human breast cancer cells, ZR-75-1 cells and T47D cells were transfected with pBabeHygro-Ecotropic Receptor and selected with hygromycin (100 μg/ml) and subsequently infected with the supernatant of the Phoenix ecotrophic virus packaging cell line.
The short hairpin sequence targeting USP9X recovered from the NKI shRNA library was:
For the generation of additional shRNA vectors targeting USP9X the following 19-mer sequences cloned in pRetroSuper were used:
To identify genes causally involved in tamoxifen resistance, a loss-of-function genetic screen was performed in ZR-75-1 luminal breast cancer cells. We first stably expressed the murine ecotropic receptor (Scholz and Beato, 1996. Nucleic Acids Res 24: 979-980) in these cells and subsequently infected them with retroviral supernatants containing a selection of the NM pRS-shRNA library (12,540 shRNA vectors targeting 4180 genes) or pRS as control (Berns et al., 2004. Nature 428: 431-7) (
To investigate whether the escape from tamoxifen induced proliferation arrest was the result of an “on target effect of the shRNA”, 5 additional shRNAs targeting different regions of the USP9X gene were designed and tested for their ability to confer tamoxifen resistance.
Importantly, knockdown of USP9X did not rescue cells from a proliferation arrest induced by the estrogen receptor downregulator fulvestrant, illustrating that shUSP9X-effects on cell proliferation are ERα-dependent (data not shown). In line with these data, knockdown of USP9X in the ERα negative cell line MDA-MB-231 did not induce cell proliferation, even resulting in a growth disadvantage in these cells (not shown).
Monoclonal cell lines stably expressing pRS-USP9X or pRS-GFP as control were plated in triplicate in 6 wells plates in regular DMEM. The next morning cells were washed with PBS and fresh DMEM+10% FCS without Pen/Strep was added followed by Lipofectamine (Invitrogen) transfection according to the manufactures protocol with 1.75 μg ERE-TATA luciferase reporter plasmid vector and 0.5 μg pRL-CMV Renilla luciferase (Promega) per well. Eight hours after transfection cells were washed with PBS and supplied with fresh fenol red free DMEM with 10% charcoal stripped serum or DMEM with 10% FCS. 24 hours after transfection medium was refreshed with ligands as indicated and 48 hours after transfection cells were lysed with passive lysis buffer and the luciferase reaction was performed conform the manufactures protocol (Dual Luciferase Reporter Assay System, Promega). The Renilla luciferase activity was used to correct for differences in transfection efficiency. The relative reporter activity in the absence of ligand was used as a reference and set at 1.
Total RNA was isolated using TRIzol (Invitrogen) or using the Quick RNA MiniPrep kit (R1055 Zymo Research). From the total RNA, cDNA was generated using Superscript II (Invitrogen) with random hexamer primers (Invitrogen). cDNA was diluted and the QRT reaction was performed using SYBR green PCR master mix (Applied Biosystems). All QRT reactions were run in parallel for GAPDH to control for amount cDNA input. The QRT reaction was followed by a melting curve to confirm the formation of a single PCR product. The QRT reactions were run at an AB7500 Fast Real Time PCR system (Applied Biosystems). The following PCR primer sequences were used:
Next we examined whether the rescue from tamoxifen-induced proliferation arrest was the result of increased ERα signaling. Therefore, ZR-75-1 cell lines stably expressing pRSUSP9X or control pRS-GFP were created. First, we tested whether knockdown of USP9X increased ERα activity, as judged by the activity of a reporter construct having Estrogen Responsive Elements linked to luciferase (ERE-luciferase), under the conditions used in the shRNA screen.
For immunoprecipitation cells were lysed in ELB containing 250 mM NaCl, 0.1% NP-40, 50 mM Hepes pH 7.3, and Complete protease inhibitor cocktail from Roche. Supernatants of the lysates were incubated with either anti-USP9X (clone 1C4; Abnova/Sigma Aldrich), or anti-ERα (D-12; Santa Cruz) coupled to protein A/G sepharose beads. Normal mouse serum coupled to protein A/G sepharose beads was used as control. For Western blotting antibodies were used detecting USP9X (clone 1C4; Abnova/Sigma Aldrich), ERα (clone 1D5; Dako), Progesterone Receptor (clone 1A6; Novocastra), and beta-actin (clone AC-74; Sigma Aldrich A 5316).
Given the functional interaction between USP9X and ERα, we next tested whether ERα and USP9X physically interact. We expressed human ERα in Phoenix cells. Cells were lysed in mild detergent and the lysate was immunoprecipitated with anti-USP9X antibody or anti-ERα antibody and Western blotting was performed. As shown in
Chromatin Immunoprecipitations (ChIP) were performed as described before (Schmidt et al., 2009. Methods 48: 240-8). For each ChIP, 10 μg of antibody was used, and 100 μl of Protein A magnetic beads (Invitrogen). The antibody used was raised against ERα (SC-543; Santa Cruz).
ChIP DNA was amplified as described (Schmidt et al., 2009. Methods 48: 240-8). Sequences were generated by the Illumina Hiseq 2000 genome analyser (using 50 bp reads), and aligned to the Human Reference Genome (assembly hg19, February 2009). Enriched regions of the genome were identified by comparing the ChIP samples to mixed input using the MACS peak caller (Zhang et al., 2008. Genome Biol 9: R137) version 1.3.7.1.
ChIP-seq data snapshots were generated using the Integrative Genome Viewer IGV 2.1 (www.broadinstitute.org/igv/). Motif analyses were performed through the Cistrome (cistrome.org), applying the SeqPos motif tool (He et al., 2010. Nat Genet 42: 343-7). The genomic distributions of binding sites were analysed using the cis-regulatory element annotation system (CEAS) (Ji et al., 2006. Nucleic Acids Res 34: W551-4). The genes closest to the binding site on both strands were analysed. If the binding region is within a gene, CEAS software indicates whether it is in a 5′UTR, a 3′UTR, a coding exon, or an intron. Promoter is defined as 3 kb upstream from RefSeq 5′ start. If a binding site is >3 kb away from the RefSeq transcription start site, it is considered distal intergenic.
Normalised mRNA expression data for three patient series were downloaded from GEO: GSE6532 (Loi et al., 2007. J Clin Oncol 25: 1239-46), GSE22219 (Buffa et al., 2011. Cancer Res 71: 5635-45), and GSE2034 (Wang et al., 2005. Lancet 365: 671-9). From these, two sets of ERα-positive, tamoxifen-treated patients (Loi, n=250; Buffa, n=134), and one set of ERα-positive untreated patients (Wang, n=209) were extracted, for which followup was available. Probes in the Buffa and Wang data were median-centered before further processing. The Loi data had already been median-centered. The 526 genes of the USP9X knockdown tamoxifen signature were mapped to the corresponding microarray platforms by selecting all probes for matching genes, and ignoring genes not present on the array. For the Loi data, this selected 949 probe sets represent 488 different genes. For the Buffa data, 363 probes were selected representing 295 genes and for the Wang data, 792 probe sets representing 391 genes were available. 254 of the signature genes were present on all three array platforms. Patients were stratified into two groups by applying a hierarchical complete linkage clustering using Pearson correlation distance, and dividing by the first split of the clustering. Significant differences in distant metastasis free survival time between these two groups were tested for using the log-rank test. Survival times longer than ten years were right-censored. The array platform used for the untreated Wang data provides a subset of the probes available for the treated Loi data (792 out of 949).
To verify that this difference does not affect the comparison between treated and untreated, the Loi samples were additionally clustered based on this subset only. This clustering was found still to stratify patients according to prognosis (log-rank p=1.3×10-5). The directionality of USP9X knockdown tamoxifen classification genes in the good and poor outcome patient groups is shown in Table 1.
The effects of USP9X knockdown on ERα/chromatin interactions were tested for hormone-depleted (vehicle), estradiol and tamoxifen-conditions, using chromatin immunoprecipitation, followed by high-throughput sequencing (ChIP-seq). ZR-75-1 cell lines stably expressing pRS-USP9X or pRS-GFP (control) were plated in hormone depleted medium for 72 hours. Typically, ERα ChIP-seq experiments are performed after a treatment for 45 minutes with ligand (Carroll et al., 2005. Cell 122: 33-43; Hurtado et al., 2011. Nat Genet 43: 27-33). Since USP9X suppression causes long-term resistance to tamoxifen, we were interested in ERα biology after prolonged ligand treatment and the effects of USP9X knockdown thereon. Therefore, the cells were treated with vehicle, estradiol or 4OH-tamoxifen for 48 hours before the ChIP assay.
In control cells, estradiol treatment greatly enhanced ERα/chromatin interactions, while this was far less pronounced when treating the cells with 4OH-tamoxifen. USP9X knockdown had no effect on ERα/chromatin interactions in vehicle and estradiol treated cells, but significantly increased chromatin binding intensity upon 4OH-tamoxifen treatment as exemplified in
ERα rarely binds promoters (5%), and the vast majority of ERα binding events are found at distal enhancers (Carroll et al., 2005. Cell 122: 33-43.38). We could confirm these data for estradiol and 4OH-tamoxifen conditions, both in control and USP9XKD cells (
Collectively, these data show that USP9X knockdown induces ERα binding events, selectively in the presence of 4OH-tamoxifen, that represent a subset of estradiol-induced sites and do not deviate in normal ERα behaviour with respect to genomic distributions and DNA motif enrichment.
Transcriptome sequencing analysis of the cell line ZR-75-1 with stable USP9X knockdown or a control vector were performed using RNA-Seq. The reads (14-30 million 50 bp single-end) were mapped to the human reference genome (hg19) using TopHat (Trapnell et al., 2009. Bioinformatics 25: 1105-1127), which allows to span exon-exon splice junctions. TopHat was supplied with a known set of gene models (Ensembl version 64). The open-source tool HTSeq-Count was used to obtain gene expressions. This tool generates a list of the total number of uniquely mapped sequencing reads for each gene that is present in the provided Gene Transfer Format (GTF) file. In order to identify differentially expressed genes, the random sampling model in the R package DEGseq (Wang et al., 2010. Bioinformatics 26: 136-8.28) was used. We have taken a p-value of 0.05 as a cut-off to determine whether a gene is significantly differentially expressed. The input of this method is the absolute number of reads for a gene, which is the output of HTSeq-count. Genes with no expression across both samples in the comparison were discarded from the dataset. The expression levels of the remaining genes were added with 1 in order to avoid negative values after log 2 transformation during the normalization step within this method.
Our ChIP-seq analyses indicate that USP9X knockdown selectively increases ERα/chromatin interactions in the presence of tamoxifen that are normally found for estradiol conditions. We therefore asked whether USP9X knockdown in tamoxifen-treated cells would also give rise to a typical estradiol-responsive gene set. To address this, we performed RNAseq on ZR-75-1 cells stably expressing pRS-USP9X or pRS-GFP (control) that—after hormone depletion for 72 hours—were treated for 48 hours with vehicle, estradiol or 4OHtamoxifen. Comparing gene expression in both cell lines, we found that estradiol-treatment led to an altered expression of 8794 genes as compared to vehicle, while after 4OHtamoxifen treatment 1906 genes were differentially expressed. All altered transcripts under 4OH-tamoxifen conditions represented a subset of the estradiol-responsive genes (
The majority of genes that are differentially expressed upon tamoxifen treatment in the USP9XKD cells were shared with estradiol induction. ERα-positive breast tumors are hallmarked by a selective and specific enrichment of so-called ‘luminal-signature genes’ (Perou et al., 2000. Nature 406: 747-52). Therefore, the USP9X knockdown tamoxifen gene set, which was shared with the estradiol responsive gene list and which also showed enhanced proximal ERα binding events, (526 out of 4336 genes, see
The RNA-seq analyses revealed that the majority of genes that were differentially expressed upon tamoxifen treatment in the USP9XKD cells were a subgroup of estradiol induced genes (4336 out of 8794). Furthermore, integrating these results with the ChIP-seq data showed that a subgroup of these genes (526 out of 4336) is enriched for proximal ERα binding events. This particular subgroup of genes is expected to represent a direct ERα target gene signature in contrast to the (potentially indirectly regulated) genes that were not enriched for ERα binding. Since these directly ERα regulated genes would also be the genes that are directly affected under tamoxifen resistant conditions, differential expression of these particular genes in breast tumors could hallmark tamoxifen unresponsiveness.
To test this hypothesis, we investigated whether these genes were differentially expressed in a publically available data set of 250 patients with primary ERα positive breast cancer with known outcome (Loi et al., 2007. J Clin Oncol 25: 1239-46). All these patients received adjuvant tamoxifen. For relevant clinicopathological parameters, see Table 2. As visualised in a heatmap (
Gene expression data from five publically available studies were used for developing or validating the USP9X signature. All cohorts consist of ERα-positive, tamoxifen-treated breast cancer patients. Cohort 1 (GSE6532; Loi et al., 2007. J Clin Oncol 25: 1239-46) was used in our unsupervised clustering analysis to identify the two USP9X clusters. Furthermore, it was also used in the supervised gene selection procedures described below, and will henceforth be referred to as the training data. Data from four other studies were exclusively used for validating the trained classifier: cohort 2 (GSE12093; Zhang et al., 2009. Breast Cancer Res Treat 116: 303-9), cohort 3 (GSE26971; Filipits et al., 2011. Clin Cancer Res 17:6012-20), cohort 4 (GSE9195; Loi et al., 2008. BMC Genomics 9:239), and cohort 5 (GSE17705; Symmans et al., 2010. J Clin Oncol 28:4111-9). The complete data set for cohort 5 includes 102 samples that overlap with cohort 1. For the validation, we removed the overlapping samples from cohort 5.
The training data were used for supervised training of a classifier that assigns new tumor samples to one of the two USP9X clusters. The two clusters identified by the unsupervised clustering of the training data were used as the gold standard. For training the classifier, we used the nearest shrunken centroid (NSC) method (Tibshirani et al., 2002. Proc Natl Acad Sci USA 99:6567-72). In short, class centroids are estimated based on the within-class means of the signature genes. Then, a shrinkage parameter is tuned to shrink the within-class means towards the overall means per gene. Genes for which the within-class mean is fully shrunk to the overall mean do not discriminate between the two classes, and are therefore not used for classification. Because of this, tuning the shrinkage parameter yields an optimised subset of genes to use for classification. We tuned the NSC shrinkage parameter to maximise the cross-validated area under the ROC curve (AUC), using a 10-fold cross validation (CV) procedure. We tested this gene selection procedure on the training set in a nested cross validation set-up. Within each outer-CV iteration, the shrinkage parameter was tuned on 90% of the training samples using an internal CV as described above. Subsequently, the selected shrinkage parameter was used to classify the remaining 10% of the training samples. The cross-validated AUC of the outer-CV was 0.95, which confirms the validity of the gene selection procedure. Subsequently, we trained the final classifier by estimating class centroids and tuning the shrinkage parameter on the entire training set. The best cross-validated AUC performance was obtained by selecting 155.
We looked for even smaller sets of signature genes in a more stringent gene selection procedure. For this, we performed a similar CV procedure as above, but used L1-regularised logistic regression instead of NSC. This choice was made because L1 regularisation generally leads to sparser gene selections. We repeated the CV gene selection procedure 100,000 times, with randomly sampled fold assignments, and kept those genes that were selected in at least 99% of the iterations. Using this procedure, we selected 9 genes. Next, we tested for each subset of these 9 genes, whether clustering on the subset yields two clusters that show a significant difference in survival on the training set. A selection of 5 genes: MYBL2, IDH3A, CHSY1, BUB1B, CAPN2 gave rise to the largest survival differences among all subsets. However, most smaller subsets of these 5 do still separate good from poor survival to a large extent.
The NSC classifier was trained on the training data, selecting 155 genes in the process. We subsequently used it to classify tumors from cohorts 2, 3, 4, and 5. None of these cohorts was used in training the classifier or selecting the genes. Survival curves for the classifications are shown in
We also validated the minimal, 5 gene signature on the validation cohorts. A nearest centroid classifier for this signature was trained on the training data and subsequently used to classify the tumors in cohorts 2-5. The resulting survival curves are shown in
To establish the minimum signature size that still allows successful stratification of patients, we randomly sampled smaller subsets of genes, and evaluated their classification performance. For each gene set size between 2 and 50, we drew 200 random subsets from the 155 genes selected for the USP9X classifier. Nearest centroid classifiers based on the random subsets were evaluated in a 10-fold cross validation set-up. Next, the mean of the cross-validated areas under the ROC curve (AUC) were estimated per subset size.
Mean AUCs per subset size are shown in
Number | Date | Country | Kind |
---|---|---|---|
13197871.0 | Dec 2013 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/NL2014/050870 | 12/17/2014 | WO | 00 |