The invention relates to the field of methods and related compositions for predicting resistance to anticancer treatment. In certain embodiments, the invention relates to the field of methods and related compositions for predicting resistance to anticancer treatment in a cancer patient by detecting a reduced expression level of a SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP gene and/or protein in one or more cancer cells of the patient. In other embodiments, the invention relates to the field of methods and related compositions for predicting resistance to anticancer treatment by detecting one or more inactivating mutations in a SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP gene. In some embodiments, the invention relates to the field of methods and related compositions for predicting resistance to anticancer treatment by detecting dysfunction and/or inactivity of one or more SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP proteins. In some embodiments, the invention relates to the field of methods and related compositions for predicting resistance to anticancer treatment by detecting the expression levels of one or more TGF-beta pathway nucleic acids and/or proteins.
Activation of signaling pathways in cancer is often the result of genomic alterations (mutations, translocations, copy number gains and/or losses) in key components of these pathways. Cancer cells often depend on the continued presence of the signals that emanate from these genomic alterations and sudden inhibition frequently results in death of the cancer cells, a phenomenon coined “oncogene addiction” (Sharma and Settleman, 2007). The presence of specific changes in the genomes of cancer cells can therefore have strong predictive value for responsiveness to therapies that target these mutations (Pao and Chmielecki).
Such drug response biomarkers are urgently needed for the rational selection of patients for these therapies, as their clinical benefit is often limited due to the fact that only a subset of patients responds. Considering the high cost of targeted therapeutics, response biomarkers are not only a clinical necessity, but also an economic requirement to keep the cost of cancer care in check by reducing the number of patients that receive expensive drugs without experiencing therapeutic benefit.
Lung cancer is a leading cause of cancer deaths worldwide and tobacco smoking remains the major risk factor. Genomic alterations of receptor tyrosine kinases are frequently found in non-small cell lung cancers (NSCLC), the predominant histological subtype, and are particularly enriched (~40%) in non-smokers (Rudin et al., 2009). Lung cancers with activating mutations of the EGFR (epidermal growth factor receptor) respond well to treatment with EGFR inhibitors (gefitinib and erlotinib) in the clinic and constitute the largest subgroup of patients (~10%-20%) tractable for an effective tyrosine kinase inhibitor therapy (Lynch et al., 2004; Maemondo et al.; Rosell et al., 2009; Sharma et al., 2007). Recently, EML4-ALK translocations were identified in ~2%-5% of NSCLC, providing a second promising molecular target for the treatment of NSCLC (Soda et al., 2007). The fusion of the N-terminal part of EML4 (echinoderm microtubule associated protein like 4) with the C-terminal kinase domain of ALK (anaplastic lymphoma kinase) results in the stable dimerization and constitutive activation of the EML4-ALK fusion kinase. The dual tyrosine kinase inhibitor crizotinib potently inhibits ALK/MET and is currently being evaluated in clinical trials. The first phase I study with crizotinib in EML4-ALK positive advanced NSCLC demonstrated remarkable activity (Kwak et al.).
Despite these encouraging clinical results, lung cancers with EGFR mutations or EML4-ALK translocations do not respond equally well to these inhibitors (primary resistance) and all tumors develop resistance (acquired resistance) under prolonged treatment (Jackman et al.). Several acquired resistance mechanisms were identified in pre-clinical studies and also confirmed in specimens from relapsed patients who initially responded well to EGFR or ALK inhibitor treatment. Second site mutations of the EGFR (EGFR T790M) and MET amplifications account for ~50% of the cases of acquired resistance to EGFR inhibitors (Engelman et al., 2007; Hammerman et al., 2009; Kobayashi et al., 2005). The EGFR T790M gatekeeper mutation prevents binding of the inhibitors to the kinase domain, but preserves the activity of the kinase. The frequency of EML4-ALK second site mutations in relapsed tumors is currently unknown; such a mutation has been found in only a single case so far (Choi et al.).
Nevertheless, in a large number of cases the mechanism of resistance to EGFR or ALK inhibitors remains unknown and in particular the determinants of primary resistance are obscure. Understanding the relevant genes and signaling pathways that contribute to resistance of NSCLC cells to tyrosine kinase inhibitors might not only provide drug response markers to stratify treatment options, but might also delineate new therapeutic strategies to overcome the drug resistance.
Citation or identification of any document in this application is not an admission that such document is available as prior art to the present invention.
In certain embodiments, the invention provides a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) measuring expression levels of one or more SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins in the patient; and (b) comparing the expression levels of the one or more SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins in (a) with the expression levels of one or more reference SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins, wherein the one or more reference SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins are from a control sample, wherein a reduction in the expression of the one or more SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins in comparison to the one or more reference SWI/SNF complex and/or MEDIATOR complex nucleic acid and/or proteins is indicative of resistance to anticancer treatment in the patient.
In other embodiments, the invention provides a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating nucleic acid from the patient, wherein the nucleic acid comprises one or more SWI/SNF complex and/or MEDIATOR complex DNA and/or RNA; and (b) analyzing the nucleic acid of (a) for the presence of one or more inactivating mutations in the SWI/SNF complex and/or MEDIATOR complex DNA and/or RNA, wherein the presence of one or more inactivating mutations in the one or more SWI/SNF complex and/or MEDIATOR complex DNA and/or RNA analyzed in (b) is indicative of resistance to anticancer treatment in the patient.
In some embodiments, the invention relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating protein from the patient, wherein the protein comprises one or more SWI/SNF complex and/or MEDIATOR complex proteins; (b) analyzing the activity of the one or more SWI/SNF complex and/or MEDIATOR complex proteins in (a); and (c) comparing the activity of the one or more SWI/SNF complex and/or MEDIATOR complex proteins in (b) with the activity of one or more reference SWI/SNF complex and/or MEDIATOR complex proteins, wherein a difference in activity of the one or more SWI/SNF complex and/or MEDIATOR complex proteins from (b) in comparison to the one or more SWI/SNF complex and/or MEDIATOR complex reference proteins in (c) is indicative of resistance to anticancer treatment in the patient.
In certain embodiments, the expression levels of one or more SWI/SNF complex nucleic acids (e.g., DNA, RNA) and/or proteins are measured.
In certain embodiments, the expression levels of one or more MEDIATOR complex nucleic acids (e.g., DNA, RNA) and/or proteins are measured.
In some embodiments, the invention provides a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) measuring expression levels of one or more RAS-GAP nucleic acid and/or proteins in the patient; and (b) comparing the expression levels of the one or more RAS-GAP nucleic acid and/or proteins in (a) with the expression levels of one or more reference RAS-GAP nucleic acid and/or proteins, wherein the one or more reference RAS-GAP nucleic acid and/or proteins are from a control sample, wherein a reduction in the expression of the one or more RAS-GAP nucleic acid and/or proteins in comparison to the one or more reference RAS-GAP nucleic acid and/or proteins is indicative of resistance to anticancer treatment in the patient.
In other embodiments, the invention provides a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating nucleic acid from the patient, wherein the nucleic acid comprises one or more RAS-GAP DNA and/or RNA; and (b) analyzing the nucleic acid of (a) for the presence of one or more inactivating mutations in the RAS-GAP DNA and/or RNA, wherein the presence of one or more inactivating mutations in the one or more RAS-GAP DNA and/or RNA analyzed in (b) is indicative of resistance to anticancer treatment in the patient.
In yet other embodiments, the invention provides a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating protein from the patient, wherein the protein comprises one or more RAS-GAP proteins; (b) analyzing the activity of the one or more RAS-GAP proteins in (a); and (c) comparing the activity of the one or more RAS-GAP proteins in (b) with the activity of one or more reference RAS-GAP proteins, wherein a difference in activity of the one or more RAS-GAP proteins from (b) in comparison to the one or more RAS-GAP reference proteins in (c) is indicative of resistance to anticancer treatment in the patient.
In some embodiments, the expression levels of one or more RAS-GAP nucleic acids (e.g., DNA, RNA) are measured. In other embodiments, the expression levels of one or more RAS-GAP proteins are measured.
In some embodiments of the methods described herein for evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, the patient has lung cancer (e.g., non-small-cell lung cancer), breast cancer, ovarian cancer, head and neck cancer, bladder cancer, colorectal cancer, cervical cancer, mesothelioma, solid tumors, renal cell carcinoma, stomach cancer, sarcoma, prostate cancer, melanoma, thyroid cancer, brain cancer, adenocarcinoma, glioma, glioblastoma, esophageal cancer, neuroblastoma, and/or lymphoma.
In some embodiments, the resistance to anticancer treatment is resistance to treatment with a receptor tyrosine kinase inhibitor. Examples of receptor tyrosine kinase inhibitors include gefitinib, erlotinib, EKB-569, lapatinib, CI-1033, cetuximab, panitumumab, PKI-166, AEE788, sunitinib, sorafenib, dasatinib, nilotinib, pazopanib, vandetanib, cediranib, afatinib, motesanib, CUDC-101, imatinib mesylate, crizotinib, ASP-3026, LDK378, AF802, and CEP37440.
In some embodiments, the resistance to anticancer treatment is resistance to treatment with an inhibitor of ERK activation. In certain embodiments, the inhibitor of ERK activation inhibits a cellular protein that interacts directly with ERK. In other embodiments, the inhibitor of ERK activation inhibits a cellular protein that interacts indirectly with ERK. In yet other embodiments, the inhibitor of ERK activation is a receptor tyrosine kinase inhibitor.
Examples of SWI/SNF complex nucleic acids and/or proteins include ARID1A, ARID1B, ARID2, SMARCA2, SMARCA4, PBRM1, SMARCC2, SMARCC1, SMARCD1, SMARCD2, SMARCD3, SMARCE1, ACTL6A, ACTL6B, and SMARCB1.
Examples of MEDIATOR complex nucleic acids and/or proteins include MED22, MED11, MED17, MED20, MED30, MED19, MED18, MED8, MED6, MED28, MED9, MED21, MED4, MED7, MED31, MED10, MED1, MED26, MED2, MED3, MED25, MED23, MED5, MED14, MED16, MED15, CycC, CDK8, MED13, MED12, MED12L, and MED13L.
Examples of RAS-GAP nucleic acids and/or proteins include DAB2IP, NF1, and RASAL3.
In some embodiments, analyzing nucleic acid comprises sequencing the nucleic acid. In other embodiments, analyzing nucleic acid comprises subjecting the nucleic acid to MLPA. In yet other embodiments, analyzing nucleic acid comprises subjecting the nucleic acid to CGH. In certain embodiments, analyzing nucleic acid comprises subjecting the nucleic acid to FISH.
In certain embodiments, an inactivating mutation is selected from the group consisting of: point mutations, translocations, amplifications, deletions, and hypomorphic mutations.
In certain embodiments, nucleic acid in a method of the invention comprises one or more SWI/SNF complex genes. In other embodiments, the nucleic acid comprises one or more MEDIATOR complex genes. In yet other embodiments, the nucleic acid comprises one or more RAS-GAP genes.
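As a minimal sketch of the mutation analysis described above, the following flags panel genes that carry an inactivating mutation. It assumes that an upstream step (sequencing, MLPA, CGH, or FISH followed by annotation) has already produced variant calls; the record layout, the `inactivating` flag, and the function name are hypothetical and not part of the described methods.

```python
# Mutation categories the text lists for inactivating mutations.
INACTIVATING_CATEGORIES = {
    "point mutation", "translocation", "amplification",
    "deletion", "hypomorphic mutation",
}

# RAS-GAP genes named in the text; any other panel could be passed instead.
RAS_GAP_GENES = {"DAB2IP", "NF1", "RASAL3"}


def genes_with_inactivating_mutations(variant_calls, panel):
    """Return the sorted panel genes that carry an inactivating mutation.

    Each call is a dict with hypothetical keys:
      "gene"         - gene symbol
      "category"     - one of INACTIVATING_CATEGORIES
      "inactivating" - True if upstream annotation judged it inactivating
    """
    hits = set()
    for call in variant_calls:
        if (call["gene"] in panel
                and call["category"] in INACTIVATING_CATEGORIES
                and call["inactivating"]):
            hits.add(call["gene"])
    return sorted(hits)


# Hypothetical variant calls for one tumor sample.
calls = [
    {"gene": "NF1", "category": "deletion", "inactivating": True},
    {"gene": "DAB2IP", "category": "point mutation", "inactivating": False},
    {"gene": "TP53", "category": "deletion", "inactivating": True},
]
flagged = genes_with_inactivating_mutations(calls, RAS_GAP_GENES)
```

In this illustration only NF1 is flagged: the DAB2IP call is not annotated as inactivating, and TP53 is outside the panel.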
In certain embodiments, one or more SWI/SNF complex and/or MEDIATOR complex proteins analyzed are inactive. In further embodiments, the one or more SWI/SNF complex and/or MEDIATOR complex proteins are inactive due to one or more posttranslational modifications. In some embodiments, one or more RAS-GAP proteins analyzed are inactive. In further embodiments, the one or more RAS-GAP proteins are inactive due to one or more posttranslational modifications.
In some embodiments, the invention relates to a microarray comprising a plurality of polynucleotide probes each complementary and hybridizable to a sequence in a different gene that is a SWI/SNF complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer.
In other embodiments, the invention relates to a microarray comprising a plurality of polynucleotide probes each complementary and hybridizable to a sequence in a different gene that is a MEDIATOR complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer.
In some embodiments, the invention relates to a microarray comprising a plurality of polynucleotide probes each complementary and hybridizable to a sequence in a different gene that is a SWI/SNF complex and/or MEDIATOR complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer.
In other embodiments, the invention relates to a microarray comprising a plurality of polynucleotide probes each complementary and hybridizable to a sequence in a different gene that is a RAS-GAP gene that is a marker for resistance to anticancer treatment in a patient that has cancer.
In certain embodiments, a microarray of the invention comprises a plurality of probes, wherein the plurality of probes is at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the probes on the microarray.
In certain embodiments, in a microarray of the invention, the SWI/SNF complex gene that is a marker for resistance to anticancer treatment is selected from the group consisting of ARID1A, ARID1B, ARID2, SMARCA2, SMARCA4, PBRM1, SMARCC2, SMARCC1, SMARCD1, SMARCD2, SMARCD3, SMARCE1, ACTL6A, ACTL6B, and SMARCB1.
In other embodiments, in a microarray of the invention, the MEDIATOR complex gene that is a marker for resistance to anticancer treatment is selected from the group consisting of MED22, MED11, MED17, MED20, MED30, MED19, MED18, MED8, MED6, MED28, MED9, MED21, MED4, MED7, MED31, MED10, MED1, MED26, MED2, MED3, MED25, MED23, MED5, MED14, MED16, MED15, CycC, CDK8, MED13, MED12, MED13L, and MED12L.
In still other embodiments, in a microarray of the invention, the RAS-GAP gene is selected from the group consisting of: DAB2IP, NF1, and RASAL3.
In some embodiments, the invention relates to a kit, comprising at least one pair of primers specific for a SWI/SNF complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer, at least one reagent for amplification of the SWI/SNF complex gene, and instructions for use.
In other embodiments, the invention relates to a kit, comprising at least one pair of primers specific for a MEDIATOR complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer, at least one reagent for amplification of the MEDIATOR complex gene, and instructions for use.
In some embodiments, the invention relates to a kit, comprising at least one pair of primers specific for a SWI/SNF complex and/or a MEDIATOR complex gene that is a marker for resistance to anticancer treatment in a patient that has cancer, at least one reagent for amplification of the SWI/SNF complex and/or MEDIATOR complex gene, and instructions for use.
In other embodiments, the invention relates to a kit, comprising at least one pair of primers specific for a RAS-GAP gene that is a marker for resistance to anticancer treatment in a patient that has cancer, at least one reagent for amplification of the RAS-GAP gene, and instructions for use.
In certain embodiments, in a kit of the invention, the primers are specific for a SWI/SNF complex gene selected from the group consisting of ARID1A, ARID1B, ARID2, SMARCA2, SMARCA4, PBRM1, SMARCC2, SMARCC1, SMARCD1, SMARCD2, SMARCD3, SMARCE1, ACTL6A, ACTL6B, and SMARCB1.
In certain embodiments, in a kit of the invention, the primers are specific for a MEDIATOR complex gene selected from the group consisting of MED22, MED11, MED17, MED20, MED30, MED19, MED18, MED8, MED6, MED28, MED9, MED21, MED4, MED7, MED31, MED10, MED1, MED26, MED2, MED3, MED25, MED23, MED5, MED14, MED16, MED15, CycC, CDK8, MED13, MED12, MED13L, and MED12L.
In certain embodiments, in a kit of the invention, the primers are specific for a RAS-GAP gene selected from the group consisting of: DAB2IP, NF1, and RASAL3.
In certain embodiments, in a kit of the invention, the marker for resistance to anticancer treatment is a marker for resistance to a receptor tyrosine kinase inhibitor.
In certain embodiments, in a kit of the invention, the marker for resistance to anticancer treatment is a marker for resistance to an inhibitor of ERK activation. In some embodiments, the inhibitor of ERK activation inhibits a cellular protein that interacts directly with ERK. In some embodiments, the inhibitor of ERK activation inhibits a cellular protein that interacts indirectly with ERK. In other embodiments, the inhibitor of ERK activation is a receptor tyrosine kinase inhibitor.
In certain embodiments, the kit is a PCR kit. In other embodiments, the kit is an MLPA kit. In yet other embodiments, the kit is an RT-MLPA kit.
In some embodiments, the level of expression of one or more SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP genes in a method of the invention is measured by determination of their level of transcription, using a DNA array. In other embodiments, the level of expression of one or more SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP genes is measured by determination of their level of transcription, using quantitative RT-PCR.
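As a minimal sketch of how a quantitative RT-PCR readout could be converted into an expression level relative to a control sample, the following uses the comparative Ct (2^-ddCt) calculation. This calculation is not specified in the text; it is a commonly used approach shown here for illustration only, and it assumes approximately 100% amplification efficiency. The function name, the use of a housekeeping gene for normalization, and all Ct values are hypothetical.

```python
def relative_expression(ct_target_patient, ct_ref_patient,
                        ct_target_control, ct_ref_control):
    """Expression of a target gene in the patient sample relative to the
    control sample, by the comparative Ct (2^-ddCt) calculation.

    Assumes ~100% amplification efficiency (an assumption, not from the text).
    """
    # Normalize each sample's target Ct to its housekeeping-gene Ct.
    d_ct_patient = ct_target_patient - ct_ref_patient
    d_ct_control = ct_target_control - ct_ref_control
    # Difference between patient and control, converted to a fold value.
    dd_ct = d_ct_patient - d_ct_control
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values for a target gene and a housekeeping gene.
ratio = relative_expression(
    ct_target_patient=27.0, ct_ref_patient=18.0,
    ct_target_control=25.0, ct_ref_control=18.0,
)
# A ratio below 1 means lower expression in the patient sample than in the control.
reduced = ratio < 1.0
```

With these hypothetical values the target gene is expressed at one quarter of the control level, which would count as a reduction in expression relative to the control sample.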
In some embodiments, the level of expression of one or more SWI/SNF complex and/or MEDIATOR complex and/or RAS-GAP genes in a method of the invention is measured in a tumor sample from the patient. In certain further embodiments, the tumor sample is a lung tumor sample.
In some embodiments, the resistance to anticancer treatment is resistance to treatment with a B-RAF inhibitor. Examples of B-RAF inhibitors include CEP-32496, vemurafenib, GSK-2118436, ARQ-736, RG-7256, XL-281, DCC-2036, GDC-0879, AZ628, and antibody fragment EphB4/Raf inhibitors.
In some embodiments, resistance to anticancer treatment is resistance to treatment with a MEK inhibitor. Examples of MEK inhibitors include CKI-27, RO-4987655, RO-5126766, PD-0325901, WX-554, AZD-8330, G-573, RG-7167, SF-2626, GDC-0623, RO-5068760, and AD-GL0001.
In certain embodiments, in a kit of the invention, the marker for resistance to anticancer treatment is a marker for resistance to treatment with a B-RAF inhibitor. In other embodiments, the marker for resistance to anticancer treatment is a marker for resistance to treatment with a MEK inhibitor.
In certain embodiments, in the methods of the invention, the expression levels of SWI/SNF and/or MEDIATOR complex or RAS-GAP nucleic acid and/or proteins are measured in one or more cancer cells of the patient. In some embodiments, nucleic acid is isolated from one or more cancer cells of the patient. In other embodiments, protein is isolated from one or more cancer cells of the patient.
In certain embodiments, in a method of the invention, resistance to anticancer treatment in one or more cancer cells in a patient is primary resistance to anticancer treatment. In other embodiments, the resistance is secondary resistance to anticancer treatment.
In certain embodiments, the instant application relates to a method of treating resistance to one or more inhibitors of ERK activation in a patient in need thereof, comprising administering to the patient at least one inhibitor of the TGF-beta pathway in combination with the one or more inhibitors of ERK activation. In some embodiments, the inhibitor of ERK activation is selected from the group consisting of direct and indirect inhibitors of ERK activation. In certain embodiments, the direct inhibitor of ERK activation is a MEK inhibitor. In certain embodiments, the indirect inhibitor of ERK activation is selected from the group consisting of RTK inhibitors, RAS inhibitors, and B-RAF inhibitors.
In some embodiments, the resistance to one or more inhibitors of ERK activation is primary resistance. In other embodiments, the resistance to one or more inhibitors of ERK activation is secondary resistance. In yet other embodiments, the resistance to one or more inhibitors of ERK activation is evaluated and/or predicted according to a method as disclosed herein.
In other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) measuring expression levels of one or more TGFβ pathway nucleic acid and/or proteins in the patient; and (b) comparing the expression levels of the one or more TGFβ pathway nucleic acid and/or proteins in (a) with the expression levels of one or more reference TGFβ pathway nucleic acid and/or proteins, wherein the one or more reference TGFβ pathway nucleic acid and/or proteins are from a control sample, wherein an increase in the expression of the one or more TGFβ pathway nucleic acid and/or proteins in comparison to the one or more reference TGFβ pathway nucleic acid and/or proteins is indicative of resistance to anticancer treatment in the patient.
In yet other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating nucleic acid from the patient, wherein the nucleic acid comprises one or more TGFβ pathway DNA and/or RNA; and (b) analyzing the nucleic acid of (a) for the presence of one or more activating mutations in the TGFβ pathway DNA and/or RNA, wherein the presence of one or more activating mutations in the one or more TGFβ pathway DNA and/or RNA analyzed in (b) is indicative of resistance to anticancer treatment in the patient.
In some embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising (a) isolating protein from the patient, wherein the protein comprises one or more TGFβ pathway proteins; (b) analyzing the activity of the one or more TGFβ pathway proteins in (a); and (c) comparing the activity of the one or more TGFβ pathway proteins in (b) with the activity of one or more reference TGFβ pathway proteins, wherein a difference in activity of the one or more TGFβ pathway proteins from (b) in comparison to the one or more TGFβ pathway reference proteins in (c) is indicative of resistance to anticancer treatment in the patient.
In certain embodiments, the instant application relates to a method of treating cancer in a patient in need thereof, comprising administering to the patient an inhibitor of ERK activation in combination with an inhibitor of TGFβ pathway activation. In certain further embodiments, the cancer is selected from the group consisting of: liver cancer, lung cancer, breast cancer, ovarian cancer, head and neck cancer, bladder cancer, colorectal cancer, cervical cancer, mesothelioma, solid tumors, renal cell carcinoma, stomach cancer, sarcoma, prostate cancer, melanoma, thyroid cancer, brain cancer, adenocarcinoma, glioma, glioblastoma, esophageal cancer, neuroblastoma, subependymal giant cell astrocytoma, endometrial cancer, a hematological cancer, and lymphoma.
In certain embodiments, the inhibitor of ERK activation is selected from the group consisting of: RTK inhibitors, RAS inhibitors, B-RAF inhibitors, and MEK inhibitors. In a particular embodiment, the inhibitor of ERK activation is a MET inhibitor.
In certain embodiments, the expression levels of one or more TGFβ pathway nucleic acids are measured, wherein the TGFβ pathway nucleic acid is a TGFβ pathway target gene selected from the group consisting of: ALOX5AP, COL5A, TAGLN, ANGPTL4, LGALS1, IL11, LBH, and COL4A1.
In some embodiments, the inhibitor of TGFβ pathway activation is LY2157299. In certain embodiments, the inhibitor of TGFβ pathway activation inhibits MED12/TGFβ binding.
In some embodiments, the inhibitor of ERK activation is crizotinib or gefitinib. In certain embodiments, the inhibitor of ERK activation inhibits MED12/TGFβ binding.
In some embodiments, the instant application relates to a method of identifying an inhibitor of ERK activation, comprising: measuring MED12/TGFβ binding in the presence and absence of a test compound, wherein a reduction in the amount of MED12/TGFβ binding in the presence of the test compound in comparison to the absence of the test compound indicates an inhibitor of ERK activation has been identified.
In other embodiments, the instant application relates to a method of identifying an inhibitor of TGFβ pathway activation, comprising: measuring MED12/TGFβ binding in the presence and absence of a test compound, wherein a reduction in the amount of MED12/TGFβ binding in the presence of the test compound in comparison to the absence of the test compound indicates an inhibitor of TGFβ pathway activation has been identified.
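The screening comparison described above can be sketched as follows, assuming that MED12/TGFβ binding has been quantified as a positive signal in replicate wells with and without the test compound. The function names, the `min_reduction` cutoff, and the example readings are hypothetical; the text requires only a reduction in binding and does not specify a cutoff or a statistical test.

```python
from statistics import mean


def binding_reduction(without_compound, with_compound):
    """Fractional reduction in mean binding signal caused by the compound.

    Positive values mean less binding in the presence of the compound.
    """
    baseline = mean(without_compound)
    if baseline <= 0:
        raise ValueError("baseline binding signal must be positive")
    return (baseline - mean(with_compound)) / baseline


def is_candidate_inhibitor(without_compound, with_compound, min_reduction=0.0):
    """True if binding is reduced by more than `min_reduction`.

    `min_reduction` is a hypothetical cutoff; the default of 0.0 flags
    any reduction at all.
    """
    return binding_reduction(without_compound, with_compound) > min_reduction


# Hypothetical replicate readings (arbitrary units).
no_compound = [100.0, 100.0]
compound_a = [50.0, 50.0]     # binding halved
compound_b = [100.0, 120.0]   # binding not reduced

reduction_a = binding_reduction(no_compound, compound_a)
hit_a = is_candidate_inhibitor(no_compound, compound_a)
hit_b = is_candidate_inhibitor(no_compound, compound_b)
```

In practice a nonzero cutoff or a replicate-based statistical test would probably be preferred over the default, since any measurement noise could otherwise produce an apparent reduction.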
In yet other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12 nucleic acid and/or proteins in the patient; (b) measuring one or more markers of an EMT-like phenotype; and (c) comparing the expression levels of the one or more MED12 nucleic acid and/or proteins in (a) with the expression levels of one or more reference MED12 nucleic acid and/or proteins, wherein a reduction in the expression of the one or more MED12 nucleic acid and/or proteins in comparison to the one or more reference MED12 nucleic acid and/or proteins in (c), together with the one or more markers of an EMT-like phenotype measured in (b), is indicative of resistance to anticancer treatment in the patient.
In some embodiments, the nucleic acid in (a) is isolated from one or more cancer cells from the patient. In other embodiments, the protein in (a) is isolated from one or more cancer cells from the patient. In certain embodiments, the one or more markers of an EMT-like phenotype are measured in one or more cancer cells from the patient. In certain further embodiments, the cancer is selected from the group consisting of: liver cancer, lung cancer, breast cancer, ovarian cancer, head and neck cancer, bladder cancer, colorectal cancer, cervical cancer, mesothelioma, solid tumors, renal cell carcinoma, stomach cancer, sarcoma, prostate cancer, melanoma, thyroid cancer, brain cancer, adenocarcinoma, glioma, glioblastoma, esophageal cancer, neuroblastoma, subependymal giant cell astrocytoma, endometrial cancer, a hematological cancer, and lymphoma. In a particular embodiment, the cancer is colorectal cancer.
In certain embodiments, the resistance to anticancer treatment is resistance to treatment with a MEK inhibitor. In further embodiments, the MEK inhibitor is selected from the group consisting of: CKI-27, RO-4987655, RO-5126766, PD-0325901, WX-554, AZD-8330, G-573, RG-7167, SF-2626, GDC-0623, RO-5068760, and AD-GL0001.
In some embodiments, the resistance to anticancer treatment is resistance to treatment with a B-RAF inhibitor. In certain further embodiments, the B-RAF inhibitor is selected from the group consisting of: CEP-32496, vemurafenib, GSK-2118436, ARQ-736, RG-7256, XL-281, DCC-2036, GDC-0879, AZ628, and antibody fragment EphB4/Raf inhibitors.
In some embodiments, the one or more markers of an EMT-like phenotype are selected from mesenchymal markers. In certain embodiments, the one or more mesenchymal markers are selected from vimentin and N-cadherin.
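The combined criterion of reduced MED12 expression together with an EMT-like phenotype can be sketched as a simple two-condition rule. This is an illustration under stated assumptions: the function name and the input layout are hypothetical, marker detection is treated as a yes/no result from an upstream assay, and any reduction relative to the reference is counted because the text does not give a cutoff.

```python
# Mesenchymal markers named in the text.
MESENCHYMAL_MARKERS = ("vimentin", "N-cadherin")


def indicates_resistance(med12_level, med12_reference, markers_detected):
    """Apply the two-part rule: MED12 reduced AND an EMT-like marker detected.

    med12_level, med12_reference: expression levels on the same scale.
    markers_detected: dict mapping marker name to True/False (hypothetical
    output of an upstream assay); missing markers count as not detected.
    """
    med12_reduced = med12_level < med12_reference
    emt_like = any(markers_detected.get(m, False) for m in MESENCHYMAL_MARKERS)
    return med12_reduced and emt_like


# Hypothetical samples.
sample_1 = indicates_resistance(0.4, 1.0, {"vimentin": True})    # both conditions met
sample_2 = indicates_resistance(0.4, 1.0, {})                    # no EMT-like marker
sample_3 = indicates_resistance(1.2, 1.0, {"N-cadherin": True})  # MED12 not reduced
```

Only the first hypothetical sample satisfies both conditions.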
In other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12KD signature nucleic acid and/or proteins in one or more cancer cells of the patient; and (b) comparing the expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (a) with the expression levels of one or more positive reference MED12KD signature nucleic acid and/or proteins, wherein if expression of the one or more MED12KD signature nucleic acid and/or proteins in (a) is similar to the one or more positive reference MED12KD signature nucleic acid and/or proteins, then resistance to anticancer treatment is indicated in the patient. In certain embodiments, the expression of the one or more MED12KD signature nucleic acid and/or proteins in (a) is about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, or about 10-fold greater or lesser than the one or more positive reference MED12KD signature nucleic acid and/or proteins. In other embodiments, the expression of the one or more MED12KD signature nucleic acid and/or proteins in (a) is about the same as the one or more positive reference MED12KD signature nucleic acid and/or proteins.
In yet other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12KD signature nucleic acid and/or proteins in one or more cancer cells of the patient; and (b) comparing the expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (a) with the expression levels of one or more negative reference MED12KD signature nucleic acid and/or proteins, wherein if expression of the one or more MED12KD signature nucleic acid and/or proteins in (a) is greater or lesser than the expression of the one or more negative reference MED12KD signature nucleic acid and/or proteins, then resistance to anticancer treatment is indicated in the patient. In some embodiments, the one or more cancer cells of the patient in (a) are from cancer cells of the patient after the anticancer treatment, and wherein the negative reference MED12KD signature nucleic acid and/or proteins are from one or more cancerous cells of the patient prior to the anticancer treatment. In certain embodiments, the expression of the one or more MED12KD signature nucleic acid and/or proteins in (a) is greater than or equal to about 1.2 fold higher or lower than the expression of the one or more negative reference MED12KD signature nucleic acid and/or proteins.
In some embodiments, the one or more cancer cells of the patient in (a) are from one or more cancer cells of the patient prior to the anticancer treatment. In other embodiments, the one or more cancer cells of the patient in (a) are from one or more cancer cells of the patient after the anticancer treatment.
In certain embodiments, the negative reference MED12KD signature nucleic acid and/or proteins are from one or more non-cancerous cells of the patient. In some embodiments, the negative reference MED12KD signature nucleic acid and/or proteins are from one or more cells known to be sensitive to the anticancer treatment. In certain embodiments, the negative reference MED12KD signature nucleic acid and/or proteins is the average expression of the MED12KD signature nucleic acid and/or proteins in one or more tumor or cell line samples known to be sensitive to the anticancer treatment.
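By way of non-limiting illustration only, the comparison against a negative reference may be sketched as follows. The sketch assumes linear-scale, strictly positive expression values, a negative reference formed by averaging samples known to be sensitive, and the "about 1.2 fold higher or lower" criterion described above. The function names and the gene labels GENE_A and GENE_B are hypothetical placeholders and are not part of the disclosure.

```python
from statistics import mean


def fold_change(sample: float, reference: float) -> float:
    """Symmetric fold change (always >= 1) between two positive expression values."""
    if sample <= 0 or reference <= 0:
        raise ValueError("expression values must be positive")
    ratio = sample / reference
    # A 2-fold decrease and a 2-fold increase both report 2.0.
    return ratio if ratio >= 1 else 1 / ratio


def deviates_from_negative_reference(sample_expr, sensitive_exprs, threshold=1.2):
    """For each gene, report whether the sample is at least `threshold`-fold
    higher or lower than the average of the sensitive reference samples."""
    result = {}
    for gene, value in sample_expr.items():
        reference = mean(profile[gene] for profile in sensitive_exprs)
        result[gene] = fold_change(value, reference) >= threshold
    return result


# Hypothetical values: GENE_A is 2-fold above the averaged reference, GENE_B is unchanged.
patient = {"GENE_A": 3.0, "GENE_B": 1.0}
sensitive = [{"GENE_A": 1.0, "GENE_B": 1.0}, {"GENE_A": 2.0, "GENE_B": 1.0}]
flags = deviates_from_negative_reference(patient, sensitive)
```

How many flagged signature genes would be required before resistance is indicated is not fixed by this sketch and would be determined for the particular embodiment.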
In some embodiments, the one or more MED12KD signature nucleic acids are upregulated nucleic acids. In certain embodiments, the upregulated nucleic acids are selected from the upregulated nucleic acids presented in
In other embodiments, the one or more MED12KD signature nucleic acids are downregulated nucleic acids. In certain embodiments, the downregulated nucleic acids are selected from the downregulated nucleic acids presented in
In some embodiments, the resistance to anticancer treatment is resistance to treatment with a MEK inhibitor. In certain embodiments, the MEK inhibitor is selected from the group consisting of: CKI-27, RO-4987655, RO-5126766, PD-0325901, WX-554, AZD-8330, G-573, RG-7167, SF-2626, GDC-0623, RO-5068760, and AD-GL0001.
In some embodiments, the resistance to anticancer treatment is resistance to treatment with a B-RAF inhibitor. In certain embodiments, the B-RAF inhibitor is selected from the group consisting of: CEP-32496, vemurafenib, GSK-2118436, ARQ-736, RG-7256, XL-281, DCC-2036, GDC-0879, AZ628, and antibody fragment EphB4/Raf inhibitors.
In certain embodiments, the cancer is selected from the group consisting of: liver cancer, lung cancer, breast cancer, ovarian cancer, head and neck cancer, bladder cancer, colorectal cancer, cervical cancer, mesothelioma, solid tumors, renal cell carcinoma, stomach cancer, sarcoma, prostate cancer, melanoma, thyroid cancer, brain cancer, adenocarcinoma, glioma, glioblastoma, esophageal cancer, neuroblastoma, subependymal giant cell astrocytoma, endometrial cancer, a hematological cancer, and lymphoma.
In some embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12KD signature nucleic acid and/or proteins in one or more cancer cells of the patient; and (b) comparing the expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (a) with the expression levels of (i) one or more MED12KD signature nucleic acid and/or proteins from cells known to be resistant to said anticancer treatment and (ii) one or more MED12KD signature nucleic acid and/or proteins from cells known to be sensitive to said anticancer treatment, whereby the cancer cells of the patient are considered to be resistant if the difference in expression levels between the cells in (a) and the cells in (i) is smaller than the difference in expression levels between the cells in (a) and the cells in (ii).
In other embodiments, the instant application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12KD signature nucleic acid and/or proteins in one or more cancer cells of the patient; and (b) comparing the expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (a) with the expression levels of (i) one or more MED12KD signature nucleic acid and/or proteins from cells known to be resistant to said anticancer treatment and (ii) one or more MED12KD signature nucleic acid and/or proteins from cells known to be sensitive to said anticancer treatment, whereby the cancer cells of the patient are considered to be sensitive if the difference in expression levels between the cells in (a) and the cells in (i) is greater than the difference in expression levels between the cells in (a) and the cells in (ii).
In yet other embodiments, the present application relates to a method of evaluating and/or predicting resistance to anticancer treatment in a patient in need thereof, comprising: (a) measuring expression levels of one or more MED12KD signature nucleic acid and/or proteins in one or more cancer cells of the patient; and (b) comparing the expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (a) with the average expression levels of (i) one or more MED12KD signature nucleic acid and/or proteins taken from two or more cell samples, whereby the cancer cells of the patient are considered to be resistant if the difference in expression levels of the one or more MED12KD signature nucleic acid and/or proteins between the cells in (a) and the average expression levels of the one or more MED12KD signature nucleic acid and/or proteins in (i) is greater than a factor of 1.2.
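By way of non-limiting illustration only, the comparison of a patient sample with both a resistant reference and a sensitive reference may be sketched as follows. The text above does not specify how the "difference in expression levels" is computed. The sketch assumes, purely for illustration, a sum of absolute log2 differences over the shared signature genes. The function names, the gene labels, and the "indeterminate" outcome for equal differences are hypothetical and are not part of the disclosure.

```python
import math


def signature_distance(sample, reference):
    """Sum of absolute log2 differences over the genes present in both profiles.

    This metric is an assumption made for illustration; other measures of
    difference could be substituted.
    """
    genes = sample.keys() & reference.keys()
    if not genes:
        raise ValueError("profiles share no genes")
    return sum(abs(math.log2(sample[g]) - math.log2(reference[g])) for g in genes)


def classify(sample, resistant_ref, sensitive_ref):
    """Label the sample according to which reference profile it is closer to."""
    d_resistant = signature_distance(sample, resistant_ref)
    d_sensitive = signature_distance(sample, sensitive_ref)
    if d_resistant < d_sensitive:
        return "resistant"
    if d_resistant > d_sensitive:
        return "sensitive"
    return "indeterminate"  # equal differences are not addressed in the text above


# Hypothetical linear-scale expression values.
patient = {"GENE_A": 8.0, "GENE_B": 1.0}
resistant_reference = {"GENE_A": 8.0, "GENE_B": 2.0}
sensitive_reference = {"GENE_A": 1.0, "GENE_B": 4.0}
call = classify(patient, resistant_reference, sensitive_reference)
```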
These and other embodiments are disclosed in, or are obvious from and encompassed by, the following Detailed Description.
(A) Schematic outline of the crizotinib resistance barcode screen performed in H3122 cells. NKI human shRNA library polyclonal virus was produced to infect H3122 cells, which were then left untreated (control) or treated with 300 nM crizotinib for 14 or 28 days, respectively. After selection, shRNA inserts from both populations were recovered, labeled and hybridized to DNA oligonucleotide barcode arrays. (B) Analysis of the relative abundance of the recovered shRNA cassettes from the crizotinib barcode experiment. Averaged data from three independent experiments were normalized and log2 transformed. Among the 43 top shRNA candidates (M>2 and A>7), two independent shMED12 vectors (in light gray at end of arrow points) were identified. (F to H) Suppression of MED12 also confers resistance to EGFR inhibitors. (F) Colony formation assay of PC9 cells expressing pLKO control or independent lentiviral shMED12 vectors (#4 and #5) and cultured in 50 nM gefitinib or 50 nM erlotinib. The cells were fixed, stained and photographed after 10 (untreated) or 28 days (treated). (G) The level of knockdown of MED12 by each of the shRNAs was measured by examining the MED12 mRNA levels by qRT-PCR. Error bars denote SD. (H) The level of knockdown of MED12 protein was measured by western blotting.
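By way of non-limiting illustration only, the selection of candidates by the M>2 and A>7 criteria in panel (B) may be sketched as follows. The sketch assumes the conventional definitions used for two-channel array data, in which M is the log2 ratio of treated to control intensity and A is the average of the two log2 intensities. The legend does not state these definitions, and the function names, shRNA labels, and intensity values are hypothetical.

```python
import math


def ma_values(treated: float, control: float):
    """Return (M, A) for one shRNA from normalized, positive intensities."""
    log_treated = math.log2(treated)
    log_control = math.log2(control)
    m = log_treated - log_control          # enrichment in the treated population
    a = 0.5 * (log_treated + log_control)  # average signal strength
    return m, a


def select_hits(intensities, m_cutoff=2.0, a_cutoff=7.0):
    """Keep shRNAs that are both enriched (M > cutoff) and well detected (A > cutoff)."""
    hits = []
    for name, (treated, control) in intensities.items():
        m, a = ma_values(treated, control)
        if m > m_cutoff and a > a_cutoff:
            hits.append(name)
    return hits


# Hypothetical (treated, control) intensities.
example = {
    "shRNA_1": (2048.0, 128.0),  # M = 4, A = 9: passes both cutoffs
    "shRNA_2": (256.0, 128.0),   # M = 1: not enriched enough
    "shRNA_3": (64.0, 4.0),      # M = 4 but A = 4: signal too weak
}
hits = select_hits(example)
```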
The top panel represents a histogram of IC50 values for the MEK inhibitor, AZD6244, across the 152 cell lines. Below the histogram, the individual IC50 values are plotted using squares (sensitive cell lines) and circles (resistant cell lines). The panel on the left depicts the histogram for the expression levels of gene ZBED2. To the right of the histogram, the individual expression levels are plotted using plus signs (upregulated), crosses (normal expression) and stars (downregulated). The scatter plot depicts the IC50 values and gene expression for each cell line. In this case, a significantly large number of cell lines both show resistance to AZD6244 and are upregulated for ZBED2. These cell lines are found in the top-right area of the scatter plot and are indicated by plus signs inside of circles. The MED12 knockdown signature contains a significantly large number of such genes, indicating the potential predictive value of this signature.
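By way of non-limiting illustration only, one way to assess whether the number of cell lines that are both resistant and upregulated for a gene is larger than expected by chance may be sketched as follows. The legend does not name the statistical test that was used. The sketch assumes a one-sided hypergeometric test, and the function name and all counts other than the total number of cell lines are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(total: int, n_resistant: int, n_up: int, n_both: int) -> float:
    """Probability of observing at least `n_both` resistant cell lines among the
    `n_up` upregulated lines, if upregulated lines were drawn at random from
    `total` lines of which `n_resistant` are resistant."""
    max_overlap = min(n_resistant, n_up)
    denominator = comb(total, n_up)
    numerator = sum(
        comb(n_resistant, k) * comb(total - n_resistant, n_up - k)
        for k in range(n_both, max_overlap + 1)
    )
    return numerator / denominator


# 152 cell lines as in the figure; the remaining counts are hypothetical.
p_value = hypergeom_upper_tail(total=152, n_resistant=40, n_up=30, n_both=20)
```

A small p-value under this assumed test would correspond to the situation described in the legend, in which resistant and upregulated cell lines coincide more often than expected.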
The instant invention provides methods and related compositions pertaining to the identification of a tumor that will be resistant to treatment by a certain compound or class of compounds. In certain embodiments, the invention provides one or more markers for resistance to anticancer treatment in a patient. In some embodiments, the marker is a MEDIATOR complex and/or SWI/SNF complex gene.
Examples of MEDIATOR complex genes that may serve as a marker for resistance to anticancer treatment in a patient as described herein include MED22, MED11, MED17, MED20, MED30, MED19, MED18, MED8, MED6, MED28, MED9, MED21, MED4, MED7, MED31, MED10, MED1, MED26, MED2, MED3, MED25, MED23, MED5, MED14, MED16, MED15, CycC, CDK8, MED13, MED12, MED13L, and MED12L (see e.g., MED12L Gene ID: 116931 available from the National Center for Biotechnology Information (NCBI) website). See, e.g., Malik, S, Roeder, R G, “The metazoan Mediator co-activator complex as an integrative hub for transcriptional regulation” Nat Rev Genet. (2010) 11(11):761-72.
Examples of SWI/SNF complex genes that may serve as a marker for resistance to anticancer treatment in a patient as described herein include ARID1A, ARID1B, ARID2, SMARCA2, SMARCA4, PBRM1, SMARCC2, SMARCC1, SMARCD1, SMARCD2, SMARCD3, SMARCE1, ACTL6A, ACTL6B, and SMARCB1. See, e.g., Reisman, D et al. “The SWI/SNF complex and cancer” Oncogene. (2009) 28(14):1653-68.
In some embodiments, the invention provides methods whereby measurement of reduced expression of a MEDIATOR complex and/or SWI/SNF complex gene in one or more cancer cells of a patient identifies these cancer cells as cells that may be resistant to treatment by one or more receptor tyrosine kinase (RTK) inhibitors. RTKs are involved in a number of diverse physiological processes, including proliferation and differentiation, cell survival and metabolism, cell migration, and cell-cycle control (see, e.g., Lemmon, M A, Schlessinger, J “Cell Signaling by Receptor Tyrosine Kinases” Cell (2010) 141:1117-1134).
In addition, an overview of non-small cell lung cancer signaling pathways may be found at www(dot)n-of-one(dot)com/cancer-news-info/egfr/ and the figure presented therein adapted from Herbst, et al. NEJM 2008.
Described herein is the use of a large-scale loss-of-function genetic screen to identify genes whose suppression can confer resistance to crizotinib in a NSCLC cell line harboring an EML4-ALK translocation. Applicants identify a key component of the transcriptional MEDIATOR complex, MED12, as a determinant of crizotinib response in NSCLC. Remarkably, Applicants find that suppression of MED12 also confers resistance to a range of targeted cancer drugs in other cancer types as well, including colon cancer, melanoma and liver cancer. Applicants identify an unexpected activity of MED12 in regulating TGFβ receptor signaling, as the major mechanism of drug resistance induction.
Applicants identify herein MED12 as a candidate biomarker of response to a range of targeted cancer drugs in a variety of cancer types through a previously unappreciated role of this protein in TGFβ receptor signaling. MED12 is a component of the MEDIATOR transcriptional adapter complex that serves as a molecular bridge between the basal transcription machinery and its upstream activators (Conaway et al., 2005). More specifically, MED12 is a subunit of the “kinase” module of the MEDIATOR complex, which also contains MED13, CYCLIN C and CDK8, whose gene sequence is amplified in some 50% of colon cancers (Firestein et al., 2008). The involvement of MEDIATOR components in responses to TKIs was unexpected, as most of the known genes that influence responses to TKIs involve components of signaling pathways that act downstream of or in parallel with these receptors. Applicants reconcile this apparent discrepancy by demonstrating that part of MED12 also resides in the cytosol, where it interacts with the TGFβ type II receptor to inhibit its activity. Consequently, downregulation of MED12 by RNAi strongly activates TGFβ signaling, as evidenced by phosphorylation of SMAD2 and induction of many canonical TGFβ target genes. Activation of TGFβ signaling has been linked previously to activation of ERK signaling (reviewed by (Zhang, 2009)). Consistent with this, Applicants observed activation of ERK signaling by MED12 suppression, which persists in the presence of drugs like crizotinib, gefitinib, vemurafenib, selumetinib and sorafenib (
Applicants' data indicate that MED12 suppression also induces an EMT-like phenotype, as judged by the upregulation of the mesenchymal markers Vimentin and N-cadherin (
In certain embodiments, cancer cells that undergo an EMT-like process do so through suppression of MED12 expression. Investigation of this would require biopsies of tumors that have progressed following exposure to targeted therapies, which are very rare in today's clinical practice. Applicants' data show that the changes of gene expression triggered by MED12 suppression (through analysis of a set of MED12KD signature genes) are prognostic for disease outcome in colon cancer (
In certain embodiments, identification of a reduced expression of a MEDIATOR complex and/or SWI/SNF complex gene in one or more cancer cells of a patient is indicative that the one or more cancer cells will be resistant to treatment by a compound or class of compounds, such as one or more receptor tyrosine kinase inhibitor compounds. Examples of RTK inhibitor compounds that cells expressing a reduced level of a MEDIATOR complex and/or SWI/SNF complex gene may be resistant to include gefitinib, erlotinib, EKB-569, lapatinib, CI-1033, cetuximab, panitumumab, PKI-166, AEE788, sunitinib, sorafenib, dasatinib, nilotinib, pazopanib, vandetanib, cediranib, afatinib, motesanib, CUDC-101, and imatinib mesylate. Other RTK inhibitors that cells expressing a reduced level of a MEDIATOR complex and/or SWI/SNF complex gene may be resistant to include the Alk-1 inhibitors crizotinib, ASP-3026, LDK378, AF802, and CEP37440.
In certain embodiments, identification of a reduced expression of a MEDIATOR complex and/or SWI/SNF complex gene in one or more cancer cells of a patient is indicative that the one or more cancer cells will be resistant to treatment by one or more ERK activation inhibitor compounds. Examples of ERK activation inhibitor compounds that cells expressing a reduced level of a MEDIATOR complex and/or SWI/SNF complex gene may be resistant to include compounds that inhibit the activity of a signaling protein upstream of ERK. Examples of signaling proteins upstream of ERK include MEK1, MEK2, A-RAF, B-RAF, RAF1, MOS, RTKs, and G-protein-coupled receptors. In certain embodiments, the compound that inhibits the activity of a signaling protein upstream of ERK inhibits a direct activator of ERK. Examples of direct ERK activators include MEK1 and MEK2. Examples of MEK inhibitors include CKI-27, RO-4987655, RO-5126766, PD-0325901, WX-554, AZD-8330, G-573, RG-7167, SF-2626, GDC-0623, RO-5068760, and AD-GL0001. In other embodiments, the compound that inhibits the activity of a signaling protein upstream of ERK inhibits an indirect activator of ERK. Examples of indirect ERK activators include A-RAF, B-RAF, RAF1, MOS, RTKs, and G-protein-coupled receptors. See, e.g., Roux, P P, Blenis, J “ERK and p38 MAPK-activated protein kinases: a family of protein kinases with diverse biological functions” Microbiol Mol Biol Rev. (2004) 68(2):320-44. Examples of B-RAF inhibitors include CEP-32496, vemurafenib, GSK-2118436, ARQ-736, RG-7256, XL-281, DCC-2036, GDC-0879, AZ628, and antibody fragment EphB4/Raf inhibitors.
In some embodiments, an inhibitor inhibits the wild-type version of a protein, such as wild-type B-RAF. In other embodiments, an inhibitor inhibits a mutant form of a protein, such as mutant B-RAF (e.g., V600E). In yet other embodiments, an inhibitor inhibits both the wild-type and mutant form of a protein (e.g., both wild-type B-RAF and B-RAF V600E).
In certain embodiments, identification of a reduced expression of a MEDIATOR complex and/or SWI/SNF complex gene in one or more cancer cells of a patient is indicative that the one or more cancer cells will be resistant to treatment by one or more compounds that are activators of one or more proteins that inactivate ERK. Examples of protein inactivators of ERK include phosphatases, such as the indirect inactivator of ERK, protein phosphatase 5 (PP5), which inactivates the ERK upstream activator, RAF1, by dephosphorylation.
In certain embodiments, the prognostic methods and compositions of the instant invention predict resistance to anticancer treatment to a combination of chemotherapeutic agents, wherein the at least two chemotherapeutic agents are administered at the same time and/or sequentially. In further embodiments, the invention provides methods wherein a measurement of reduced expression of a MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in one or more cancer cells of a tumor of a patient identifies the tumor as one that may be resistant to treatment by a combination of at least two ERK activation inhibitors. In other embodiments, the tumor is one that may be resistant to treatment by a combination of at least two compounds that activate one or more proteins upstream of ERK that inactivates ERK signaling.
In some embodiments, activation of the TGF-β (transforming growth factor beta) pathway rescues ERK activation in, for example, a cancer cell. Accordingly, in some embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of reduced expression of a MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in one or more cancer cells of a tumor of a patient identifies the tumor as one that may benefit from treatment with an inhibitor of the TGFβ pathway (e.g., a TGFβ inhibitor and/or inhibitor of one or more downstream signaling proteins in the TGF-β pathway) in combination with one or more ERK activation inhibitors. In other embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of reduced expression of a MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in one or more cancer cells of a tumor of a patient identifies the tumor as one that may benefit from treatment with an inhibitor of the TGF-β pathway in combination with one or more compounds that activate one or more proteins upstream of ERK that inactivates ERK signaling. In certain embodiments, the inhibitor of ERK activation is an RTK inhibitor. In other embodiments, the inhibitor of ERK activation is a B-RAF inhibitor. In yet other embodiments, the inhibitor of ERK activation is a MEK inhibitor. In still other embodiments, the inhibitor of ERK activation is a RAS inhibitor.
In other embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of increased expression of a TGFβ pathway gene in one or more cancer cells of a tumor of a patient identifies the tumor as one that may benefit from treatment with an inhibitor of the TGFβ pathway (e.g., a TGFβ inhibitor and/or inhibitor of one or more downstream signaling proteins in the TGFβ pathway) in combination with one or more ERK activation inhibitors. In certain embodiments, the patient is one in need of treatment with an ERK activation inhibitor. In other embodiments, the patient is one in need of treatment with an inhibitor of a TGFβ pathway gene or protein. In other embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of increased expression of a TGFβ pathway gene in one or more cancer cells of a tumor of a patient identifies the tumor as one that may benefit from treatment with an inhibitor of the TGFβ pathway in combination with one or more compounds that activate one or more proteins upstream of ERK that inactivates ERK signaling. In certain embodiments, the inhibitor of ERK activation is an RTK inhibitor. In other embodiments, the inhibitor of ERK activation is a B-RAF inhibitor. In yet other embodiments, the inhibitor of ERK activation is a MEK inhibitor. In still other embodiments, the inhibitor of ERK activation is a RAS inhibitor.
In other embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of increased expression of a TGFβ pathway gene in one or more cancer cells of a patient indicates the patient may be resistant to anticancer treatment. In other embodiments, the prognostic methods and compositions of the instant invention provide methods and compositions wherein a measurement of an activating mutation in a TGFβ pathway gene in one or more cancer cells of a patient identifies the one or more cancer cells as cells that may be resistant to anticancer treatment.
In some embodiments, the invention provides methods and compositions for the treatment of primary and/or secondary resistance to one or more anticancer agents in a patient in need thereof, comprising administration of at least one inhibitor of the TGFβ pathway in combination with the one or more anticancer agents to which primary and/or secondary resistance in the patient has developed. For example, in some embodiments, the invention relates to a method of treating secondary resistance to an inhibitor of ERK activation in a patient in need thereof, comprising administering to the patient at least one inhibitor of the TGFβ pathway (e.g., a TGFβ inhibitor) in combination with the inhibitor of ERK activation.
In certain embodiments, the invention provides methods and compositions related to a method of treating cancer in a patient in need thereof, comprising administering to the patient an inhibitor of ERK activation in combination with an inhibitor of TGFβ pathway activation. In some embodiments, the patient is treated without determining whether the patient would be likely to be resistant to one or more of the ERK activation and/or TGFβ pathway activation inhibitors.
In some embodiments, the markers of the instant invention enable the detection of resistance to anticancer treatment in a patient in combination with one or more known markers of hypersensitivity to a chemotherapeutic agent or class of agents. In certain embodiments, expression levels of one or more MEDIATOR complex and/or SWI/SNF complex genes (e.g., MED12, SMARCE1, and/or ARID1A) are measured in one or more cancer cells of a patient in combination with an array profile, such as a CGH (comparative genomic hybridization) array analysis.
In certain embodiments, the invention provides methods and compositions for identifying a cancer patient who would likely not benefit from a certain chemotherapeutic treatment. For example, an aspect of the invention is a method of screening cancer patients to determine those cancer patients more likely to benefit from a particular chemotherapy, such as RTK inhibitor chemotherapy, comprising obtaining a sample of genetic material from a tumor of the patient; and assaying for the presence of a genotype in the patient that is associated with resistance to the particular chemotherapy, the genotype characterized by an inactivating mutation in one or more MEDIATOR complex and/or SWI/SNF complex genes. In some embodiments, the genotype is further characterized by an inactivating mutation in one or more known markers for chemotherapeutic resistance. In some embodiments, the genetic material is nucleic acid that is characterized by a reduced expression (e.g., reduced mRNA levels) of one or more MEDIATOR complex and/or SWI/SNF complex genes. In further embodiments, reduced mRNA levels are assessed by evaluating the corresponding cDNA.
In a particular embodiment, the instant invention provides methods and compositions for the identification of a lung cancer patient who would likely not benefit from RTK inhibitor chemotherapy (e.g., the patient will be recurrence-free for a period of time less than a patient undergoing the same chemotherapy). In some embodiments, the methods of the instant invention predict whether a chemotherapeutic agent or other compound is likely to be cytotoxic to one or more cancer cells.
Cancers for which the prognostic methods and compositions of the instant invention may provide predictive results for resistance to anticancer treatment include cancers such as breast cancer (e.g., BRCA-1 deficient, stage-III HER2-negative), ovarian cancer (e.g., BRCA-1 deficient, epithelial ovarian cancer), lung cancer (e.g., non-small-cell lung cancer or small cell lung cancer, metastatic non-small cell lung cancer), liver cancer (e.g., hepatocellular carcinoma), head and neck cancer (e.g., metastatic squamous cell carcinoma of the head and neck (SCCHN), squamous cell carcinoma, laryngeal cancer, hypopharyngeal cancer, oropharyngeal cancer, and oral cavity cancer), bladder cancer (e.g., transitional cell carcinoma of the bladder), and colorectal cancer (e.g., advanced (non-resectable locally advanced or metastatic) colorectal cancer). Other cancers for which the methods and compositions of the invention may provide predictive results for resistance to anticancer treatment include cervical cancer (e.g., recurrent and stage IVB), mesothelioma, solid tumors (e.g., advanced solid tumors), renal cell carcinoma (e.g., advanced renal cell carcinoma), stomach cancer, sarcoma, prostate cancer (e.g., hormone refractory prostate cancer), melanoma, thyroid cancer (e.g., papillary thyroid cancer), brain cancer, adenocarcinoma, subependymal giant cell astrocytoma, endometrial cancer, glioma, glioblastoma, and other tumors that have metastasized to the brain, esophageal cancer, neuroblastoma, hematological cancers, and lymphoma.
In some embodiments, the cancer is one in which one or more RTK inhibitor drugs are employed either alone or in combination with other chemotherapeutic agents as a part of an anticancer treatment regimen. In other embodiments, the cancer is one in which one or more RTK inhibitor drugs are employed either alone or in combination with additional treatment regimens, such as surgical procedures, radiation, and/or other anticancer treatments. In certain embodiments, the cancer is one in which one or more RTK inhibitor agents are used as a first-line form of treatment. In yet other embodiments, the one or more RTK inhibitor drugs are employed in combination with an inhibitor of the TGF-beta pathway.
In certain embodiments, the instant invention relates to methods and compositions encompassing the detection of expression levels of a MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in one or more cells of a subject. Typically, the subject is a human patient who has or is suspected of having at least one type of cancer, and the expression levels of the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene are detected in a sample of one or more cells, typically one or more tumor cells, from the human patient, which are then compared with the expression levels of the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in a control sample. A control sample will generally be one in which the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene expression levels are known and correlated with resistance to anticancer treatment to a certain drug or group of drugs. In some embodiments, the control sample is one in which the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene expression levels are known and correlated with a lack of resistance to anticancer treatment to a certain drug or group of drugs. In certain embodiments, the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene expression levels in one or more tumor cells of a patient are compared with the expression levels in one or more normal cells of the patient, wherein a reduced expression in the one or more tumor cells in comparison to the normal cells of the patient is predictive of resistance to anticancer treatment to a certain drug or group of drugs. In some embodiments, more than one control sample is used for comparative purposes with the test sample from the subject. In certain embodiments, the expression levels of a MEDIATOR complex gene are detected. In other embodiments, the expression levels of a SWI/SNF complex gene are detected. In yet other embodiments, the expression levels of a RAS-GAP gene are detected.
In certain embodiments, the invention relates to a method for predicting a lung cancer patient's response to RTK inhibitor drug chemotherapy, such as gefitinib or erlotinib treatment. In some embodiments, the lung cancer patient has not yet received RTK inhibitor drug chemotherapy. In further embodiments, a sample of the lung cancer cells from the patient is analyzed for the levels of expression of a MEDIATOR complex and/or SWI/SNF complex gene, such as MED12, SMARCE1, and/or ARID1A, and/or a RAS-GAP gene, such as DAB2IP, NF1, and/or RASAL3. If expression levels of the MEDIATOR complex and/or SWI/SNF complex gene (e.g., MED12, SMARCE1, and/or ARID1A) and/or RAS-GAP gene (e.g., DAB2IP, NF1, and/or RASAL3) are low compared to expression levels in normal lung tissue, then the lung cancer cells in the patient are likely resistant to RTK inhibitor anticancer treatment.
In certain embodiments, the expression level of the MEDIATOR complex and/or SWI/SNF complex gene, such as MED12, SMARCE1, and/or ARID1A, and/or RAS-GAP gene, such as DAB2IP, NF1, and/or RASAL3, in cancer tissue is lower than the expression level of the gene in normal tissue. In predicting resistance to anticancer treatment of a tumor, cut-off levels of expression may be determined empirically for the subject cancer for which resistance to anticancer treatment is being assessed.
In other embodiments, the instant invention relates to methods and compositions encompassing the detection of one or more inactivating mutations in a MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP gene in one or more cells of a subject. Typically, the subject is a human patient who has or is suspected of having at least one type of cancer, and the nucleic acid of the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP is isolated from a sample of one or more cells, typically one or more tumor cells, from the human patient, and is then compared with the nucleic acid of the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP in a control sample. A control sample will generally be one in which the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP nucleic acid sequences are known and correlated with resistance to anticancer treatment to a certain drug or group of drugs. In some embodiments, the control sample is one in which the MEDIATOR complex and/or SWI/SNF complex and/or RAS-GAP nucleic acid sequences are known and correlated with a lack of resistance to anticancer treatment to a certain drug or group of drugs. In some embodiments, more than one control sample is used for comparative purposes with the test sample from the subject. In certain embodiments, the inactivating mutation is a point mutation. In some embodiments, the inactivating mutation is a hypomorphic mutation. In other embodiments, the inactivating mutation is a gene deletion. In yet other embodiments, the inactivating mutation is an amplification.
In some embodiments, the instant invention relates to methods and compositions encompassing evaluating the protein activity and/or sequence and/or posttranslational modification state of one or more RAS-GAP proteins and/or proteins in a MEDIATOR complex and/or SWI/SNF complex in one or more cells of a subject. Typically, the subject is a human patient who has or is suspected of having at least one type of cancer, and the RAS-GAP protein and/or protein of the MEDIATOR complex and/or SWI/SNF complex is isolated from a sample of one or more cells, typically one or more tumor cells, from the human patient, and is then compared with the RAS-GAP protein and/or protein of the MEDIATOR complex and/or SWI/SNF complex in a control sample. A control sample will generally be one in which the RAS-GAP protein and/or MEDIATOR complex and/or SWI/SNF complex protein sequences and/or activity and/or posttranslational modification state are known and correlated with resistance to anticancer treatment to a certain drug or group of drugs. In some embodiments, the control sample is one in which the RAS-GAP protein and/or MEDIATOR complex and/or SWI/SNF complex protein sequences and/or activity and/or posttranslational modification state are known and correlated with a lack of resistance to anticancer treatment to a certain drug or group of drugs.
Evaluation of protein activity includes assaying the enzymatic activity of the protein. In certain embodiments, the posttranslational modification status of the protein is assessed. In further embodiments, one or more posttranslational modifications (or lack thereof) are associated with protein dysfunction, such as reduced enzymatic activity by the protein. In some embodiments, the RAS-GAP and/or MEDIATOR complex and/or SWI/SNF complex protein in one or more cells of a subject is dysfunctional, and this dysfunction is indicative of resistance to one or more anticancer treatments. Examples of protein dysfunction include reduced or no enzymatic and/or binding activity of the protein; reduced or no protein expression; and/or improper protein modification, such as phosphorylation that results in inactivity of the protein.
The terms “marker” and “biomarker” are used interchangeably herein and refer to a gene, protein, or fragment thereof, the expression or level or activity of which changes between certain conditions. Where the expression or level or activity of the gene, protein, or fragment thereof correlates with a certain condition, the gene, protein, or fragment thereof is a marker for that condition.
“Resistant,” “resistance,” or “resistance to anticancer treatment” in the context of treatment of a cancer cell with a chemotherapeutic agent or other compound means that the chemotherapeutic agent or other compound is not likely to have an optimal effect on the cancer cell. In some embodiments, the compound is not likely to have any effect on the cancer cells. In certain embodiments, the effect of a compound on one or more cancer cells is reduced. In certain further embodiments, a tumor is likely to be less sensitive to a compound but not completely resistant to it. In certain embodiments, the compound is not likely to be cytotoxic to the cancer cell. In some embodiments, the compound is not cytotoxic to the cancer cell.
By “primary resistance” with regard to one or more cancer cells in a patient is meant cells that are naïve for anticancer treatment. For example, a tumor that demonstrates primary resistance to an anticancer treatment includes one that has never been treated with the anticancer drug or drugs but demonstrates or is predicted to demonstrate resistance to the anticancer drug or drugs once treatment has begun.
By “secondary resistance” with regard to one or more cancer cells in a patient is meant cells that have acquired resistance to an anticancer treatment. For example, a tumor that demonstrates secondary resistance to an anticancer treatment includes one that has been treated for a prolonged period of time with one or more anticancer drugs but resistance arises to the one or more anticancer drugs after treatment.
By “inactivating mutation” is meant a mutation in, for example, a nucleic acid that encodes a protein that is inactive. This includes, for example, mutations that result in the loss of protein expression and/or activity and includes genetic mutations such as point mutations, translocations, amplifications, deletions (including whole gene deletions), and hypomorphic mutations (e.g., where an altered gene product possesses a reduced level of activity or where the wild-type gene product is expressed at a reduced level). “Inactivating mutation” also includes biomarker dysfunctions due to post-translational protein regulation, for example, where a protein biomarker is inactive or exhibits impaired activity due to, for example, one or more posttranslational modifications, such as phosphorylation that results in protein inactivity.
The term “biomarker dysfunction” with regard to a protein or protein fragment refers to dysfunction of the protein or fragment thereof as a result of improper regulation at the posttranslational level, such as, for example, phosphorylation that results in protein inactivity.
By “MEDIATOR complex gene” is meant any gene encoding for a protein of the MEDIATOR complex.
By “reference MEDIATOR complex gene” is meant a MEDIATOR complex gene in a control sample, e.g., a normal sample such as a non-cancerous tissue sample. Typically, the expression levels of a reference MEDIATOR complex gene serve as a reference for comparative purposes with the levels of expression of the same MEDIATOR complex gene in a different sample, typically a test sample, such as a lung tumor sample.
By “SWI/SNF complex gene” is meant any gene encoding for a protein of the SWI/SNF complex.
By “reference SWI/SNF complex gene” is meant a SWI/SNF complex gene in a control sample, e.g., a normal sample such as a non-cancerous tissue sample. Typically, the expression levels of a reference SWI/SNF complex gene serve as a reference for comparative purposes with the levels of expression of the same SWI/SNF complex gene in a different sample, typically a test sample, such as a lung tumor sample.
By “RAS-GAP gene” is meant any gene encoding for a RAS-GAP protein.
By “reference RAS-GAP gene” is meant a RAS-GAP gene in a control sample, e.g., a normal sample such as a non-cancerous tissue sample. Typically, the expression levels of a reference RAS-GAP gene serve as a reference for comparative purposes with the levels of expression of the same RAS-GAP gene in a different sample, typically a test sample, such as a lung tumor sample.
By “TGFβ pathway gene” is meant any gene encoding for a protein in the TGFβ signaling pathway.
By “TGFβ pathway target gene” is meant any gene whose expression is regulated by TGFβ signaling.
By “reference TGFβ pathway gene” is meant a TGFβ signaling pathway gene in a control sample, e.g., a normal sample such as a non-cancerous tissue sample. Typically, the expression levels of a reference TGFβ pathway gene serve as a reference for comparative purposes with the levels of expression of the same TGFβ pathway gene in a different sample, typically a test sample, such as a lung tumor sample.
By “MED12KD signature” is meant the nucleic acid expression profile depicted in
By “positive reference MED12KD signature nucleic acid and/or proteins” is meant the nucleic acid expression profile of one or more genes depicted in
By “negative reference MED12KD signature nucleic acid and/or proteins” is meant the nucleic acid expression profile of one or more genes depicted in
The term “EMT-like phenotype” refers to a partial epithelial-mesenchymal transition (EMT), leading to the induction of mesenchymal markers such as vimentin (VIM) and N-cadherin (CDH2), but not the loss of at least one epithelial marker, such as E-cadherin. As described herein, MED12KD causes expression of the mesenchymal markers VIM and CDH2, indicating that an EMT-like process is initiated in MED12KD cells.
By “interact directly” is meant that a protein or other molecular compound binds and/or enzymatically interacts with a target protein. For example, MEK1 interacts directly with ERK.
By “interact indirectly” is meant that a protein or other molecular compound binds and/or enzymatically interacts with a cellular protein or other molecular compound that may itself interact with a second cellular protein and so forth until a final cellular protein interacts directly with a target protein. This includes any upstream activators of a target protein, such as ERK, in a signaling cascade, such as a receptor tyrosine kinase signaling cascade. Examples of proteins that interact indirectly with ERK include A-RAF, B-RAF, RAF1, MOS, RTKs, and G-protein-coupled receptors.
By “similar” in the context of the expression of one or more nucleic acid and/or proteins is meant that the expression levels of one or more nucleic acid and/or proteins in one sample is the same as or about the same as the expression levels of the one or more nucleic acid and/or proteins in a second sample. In certain embodiments, the expression levels of a gene are the same (e.g., no measurable difference) between two different samples. In other embodiments, the expression levels of a gene are about the same (e.g., within experimental margins of error) between two different samples.
In various aspects, determination of a level of expression of nucleic acid and/or protein in a test sample that is the same, greater than, or less than that produced by the corresponding nucleic acid and/or protein in a positive reference MED12KD signature is indicative of resistance to anticancer treatment in the tumor from which the test sample was derived. Accordingly, in certain embodiments detection of signal intensity from a test sample that is the same, within experimentally acceptable margins of error, as the signal intensity produced by the positive reference MED12KD signature sample is sufficient to classify the tumor from which the test sample was produced as anticancer treatment resistant. In certain embodiments, detection of signal intensity from a test sample that is greater, within experimentally acceptable margins of error, than the signal intensity produced by the positive reference MED12KD signature sample is sufficient to classify the tumor from which the test sample was produced as anticancer treatment resistant. In certain embodiments, detection of signal intensity from a test sample that is less, within experimentally acceptable margins of error, than the signal intensity produced by the positive reference MED12KD signature sample is sufficient to classify the tumor from which the test sample was produced as anticancer treatment resistant.
In certain embodiments, the deviation of signal intensity of the test sample from the positive reference MED12KD signature sample is measured as a percent difference. In certain embodiments, a test sample is deemed to have produced a signal that is greater than the positive reference MED12KD signature sample if the signal intensity of the test sample measures at a level selected from: the signal intensity of the positive reference MED12KD signature sample greater than 1%; greater than 2%; greater than 5%; greater than 10%; greater than 15%; greater than 20%; greater than 25%; greater than 30%; greater than 35%; greater than 40%; greater than 45%; greater than 50%; greater than 55%; greater than 60%; greater than 65%; greater than 70%; greater than 75%; greater than 80%; greater than 85%; greater than 90%; greater than 95%; or greater than 100%.
In certain embodiments, a test sample is deemed to have produced a signal that is less than the positive reference MED12KD signature sample if the signal intensity of the test sample measures at a level selected from: the signal intensity of the reference sample less 1%; less 2%; less 5%; less 10%; less 15%; less 20%; less 25%; less 30%; less 35%; less 40%; less 45%; less 50%; less 55%; less 60%; less 65%; less 70%; less 75%; less 80%; less 85%; less 90%; less 95%; or less 100% (or no signal produced by the test sample).
In certain embodiments, the deviation of signal intensity of the test sample from the positive reference MED12KD signature sample is measured as a fold difference, or a difference based upon unit signal production. In certain embodiments, a test sample is deemed to have produced a signal that is greater than the positive reference MED12KD signature sample if the signal intensity of the test sample is selected from: two-fold greater than; three-fold greater than; four-fold greater than; five-fold greater than; six-fold greater than; seven-fold greater than; eight-fold greater than; nine-fold greater than; ten-fold greater than; and more than ten-fold greater than the signal intensity of the positive reference MED12KD signature sample.
In certain embodiments, a test sample is deemed to have produced a signal that is less than the positive reference MED12KD signature sample if the signal intensity of the test sample is selected from: two-fold less than; three-fold less than; four-fold less than; five-fold less than; six-fold less than; seven-fold less than; eight-fold less than; nine-fold less than; ten-fold less than; and greater than ten-fold less than the signal intensity of the positive reference MED12KD signature sample.
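By way of illustration only, the percent and fold deviations discussed above could be computed as in the following Python sketch. The function names are hypothetical, and the convention that a test-to-reference ratio of 2 corresponds to "two-fold greater" (and a ratio of 0.5 to "two-fold less") is an assumption made for this example rather than a definition taken from this disclosure.

```python
def percent_difference(test_signal: float, reference_signal: float) -> float:
    """Signed percent deviation of the test signal from the reference signal.

    Positive values mean the test signal is greater than the reference;
    negative values mean it is less. -100.0 means no test signal at all.
    """
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return (test_signal - reference_signal) / reference_signal * 100.0


def fold_difference(test_signal: float, reference_signal: float) -> float:
    """Signed fold difference between test and reference signals.

    Assumed convention: +2.0 means two-fold greater than the reference,
    -2.0 means two-fold less than the reference.
    """
    if test_signal <= 0 or reference_signal <= 0:
        raise ValueError("signals must be positive to compute a fold difference")
    ratio = test_signal / reference_signal
    return ratio if ratio >= 1.0 else -1.0 / ratio
```

Under these assumptions, a test signal of 150 against a reference of 100 is a 50% increase, and a test signal of 50 against the same reference is a two-fold decrease.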
In certain embodiments where the expression of a nucleic acid and/or protein in a test sample is compared with the expression level of the same nucleic acid and/or protein in a positive reference MED12KD signature nucleic acid and/or protein sample, expression of the test sample nucleic acid and/or protein that is the same as (e.g., no measurable difference) or greater than (e.g., more than 10-fold greater than) the expression level of the nucleic acid and/or protein corresponding to an upregulated gene in the positive reference MED12KD signature, then resistance to anticancer treatment in the test sample is indicated.
In certain embodiments where the expression of a nucleic acid and/or protein in a test sample is compared with the expression level of the same nucleic acid and/or protein in a positive reference MED12KD signature nucleic acid and/or protein, expression of the test sample nucleic acid and/or protein that is the same as (e.g., no measurable difference) or less than (e.g., more than 10-fold less than) the expression level of the nucleic acid and/or protein corresponding to a downregulated gene in the positive reference MED12KD signature, then resistance to anticancer treatment in the test sample is indicated.
In various aspects, determination of a level of expression of nucleic acid and/or protein in a test sample that is greater than or less than that produced by the corresponding nucleic acid and/or protein in a negative reference MED12KD signature is indicative of resistance to anticancer treatment in the tumor from which the test sample was derived. Accordingly, in certain embodiments, detection of signal intensity from a test sample that is greater, within experimentally acceptable margins of error, than the signal intensity produced by the negative reference MED12KD signature sample is sufficient to classify the tumor from which the test sample was produced as anticancer treatment resistant. In certain embodiments, detection of signal intensity from a test sample that is less, within experimentally acceptable margins of error, than the signal intensity produced by the negative reference MED12KD signature sample is sufficient to classify the tumor from which the test sample was produced as anticancer treatment resistant.
In certain embodiments, the deviation of signal intensity of the test sample from the negative reference MED12KD signature sample is measured as a percent difference. In certain embodiments, a test sample is deemed to have produced a signal that is greater than the negative reference MED12KD signature sample if the signal intensity of the test sample measures at a level selected from: the signal intensity of the negative reference MED12KD signature sample greater than 1%; greater than 2%; greater than 5%; greater than 10%; greater than 15%; greater than 20%; greater than 25%; greater than 30%; greater than 35%; greater than 40%; greater than 45%; greater than 50%; greater than 55%; greater than 60%; greater than 65%; greater than 70%; greater than 75%; greater than 80%; greater than 85%; greater than 90%; greater than 95%; or greater than 100%.
In certain embodiments, a test sample is deemed to have produced a signal that is less than the negative reference MED12KD signature sample if the signal intensity of the test sample measures at a level selected from: the signal intensity of the reference sample less 1%; less 2%; less 5%; less 10%; less 15%; less 20%; less 25%; less 30%; less 35%; less 40%; less 45%; less 50%; less 55%; less 60%; less 65%; less 70%; less 75%; less 80%; less 85%; less 90%; less 95%; or less 100% (or no signal produced by the test sample).
In certain embodiments, the deviation of signal intensity of the test sample from the negative reference MED12KD signature sample is measured as a fold difference, or a difference based upon unit signal production. In certain embodiments, a test sample is deemed to have produced a signal that is greater than the negative reference MED12KD signature sample if the signal intensity of the test sample is selected from: one-fold greater than; one-and-half-fold greater than; two-fold greater than; three-fold greater than; four-fold greater than; five-fold greater than; six-fold greater than; seven-fold greater than; eight-fold greater than; nine-fold greater than; ten-fold greater than; and more than ten-fold greater than the signal intensity of the negative reference MED12KD signature sample.
In certain embodiments, a test sample is deemed to have produced a signal that is less than the negative reference MED12KD signature sample if the signal intensity of the test sample is selected from: one-fold less than; one-and-half-fold less than; two-fold less than; three-fold less than; four-fold less than; five-fold less than; six-fold less than; seven-fold less than; eight-fold less than; nine-fold less than; ten-fold less than; and greater than ten-fold less than the signal intensity of the negative reference MED12KD signature sample.
In certain embodiments where the expression of a nucleic acid and/or protein in a test sample is compared with the expression level of the same nucleic acid and/or protein in a negative reference MED12KD signature nucleic acid and/or protein sample, expression of the test sample nucleic acid and/or protein that is greater than (e.g., more than 1.2-fold greater than) the expression level of the nucleic acid and/or protein corresponding to an upregulated gene in the negative reference MED12KD signature, then resistance to anticancer treatment in the test sample is indicated.
In certain embodiments where the expression of a nucleic acid and/or protein in a test sample is compared with the expression level of the same nucleic acid and/or protein in a negative reference MED12KD signature nucleic acid and/or protein, expression of the test sample nucleic acid and/or protein that is less than (e.g., more than 1.2-fold less than) the expression level of the nucleic acid and/or protein corresponding to a downregulated gene in the negative reference MED12KD signature, then resistance to anticancer treatment in the test sample is indicated.
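The per-gene comparison rules described above for positive and negative reference MED12KD signatures can be sketched, purely for illustration, as a single decision function. The function name, the 5% tolerance used to stand in for "within experimental margins of error," and the default 1.2-fold cutoff (taken from the example value mentioned above) are assumptions for this sketch and would need to be set for any particular assay.

```python
def indicates_resistance(test_level: float,
                         reference_level: float,
                         gene_direction: str,
                         reference_kind: str,
                         tolerance: float = 0.05,
                         min_fold: float = 1.2) -> bool:
    """Apply the per-gene comparison rule for one signature gene.

    gene_direction: "up" or "down" (direction of the gene in the signature).
    reference_kind: "positive" or "negative" reference signature.
    tolerance:      assumed relative margin treated as "the same" level.
    min_fold:       assumed minimum fold change against a negative reference.
    """
    if test_level <= 0 or reference_level <= 0:
        raise ValueError("expression levels must be positive")
    if gene_direction not in ("up", "down"):
        raise ValueError("gene_direction must be 'up' or 'down'")

    ratio = test_level / reference_level

    if reference_kind == "positive":
        # Same as or beyond the resistant reference, in the gene's direction.
        if gene_direction == "up":
            return ratio >= 1.0 - tolerance
        return ratio <= 1.0 + tolerance

    if reference_kind == "negative":
        # Must deviate from the sensitive reference by more than min_fold.
        if gene_direction == "up":
            return ratio > min_fold
        return (1.0 / ratio) > min_fold

    raise ValueError("reference_kind must be 'positive' or 'negative'")
```

For example, under these assumed thresholds an upregulated signature gene measured at 130 against a negative reference level of 100 would indicate resistance, whereas a level of 110 would not.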
As used herein, the terms “drug,” “agent,” and “compound,” either alone or together with “chemotherapeutic” or “chemotherapy,” encompass any composition of matter or mixture which provides some pharmacologic effect that can be demonstrated in vivo or in vitro. This includes small molecules, antibodies, microbiologicals, vaccines, vitamins, and other beneficial agents. As used herein, the terms further include any physiologically or pharmacologically active substance that produces a localized or systemic effect in a patient.
The term “nucleic acid” encompasses DNA, RNA (e.g., mRNA, tRNA), heteroduplexes, and synthetic molecules capable of encoding a polypeptide and includes all analogs and backbone substitutes such as PNA that one of ordinary skill in the art would recognize as capable of substituting for naturally occurring nucleotides and backbones thereof. Nucleic acids may be single stranded or double stranded, and may contain chemical modifications. The terms “nucleic acid” and “polynucleotide” are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences which encode a particular amino acid sequence.
Unless otherwise indicated, nucleic acids are written left to right in 5′ to 3′ orientation, and amino acid sequences are written left to right in amino to carboxy orientation.
“Antisense” nucleic acids are DNA or RNA molecules that are complementary to at least a portion of a specific mRNA molecule (Weintraub, Scientific American 262:40, 1990). In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, forming a double-stranded molecule. This interferes with the translation of the mRNA since the cell will not translate an mRNA that is double-stranded. Antisense oligomers of at least about 15, about 20, about 25, about 30, about 35, about 40, or of at least about 50 nucleotides are preferred, since they are easily synthesized and are less likely to cause non-specific interference with translation than larger molecules. The use of antisense methods to inhibit the in vitro translation of genes is well known in the art (Marcus-Sekura, Anal. Biochem. 172:289, 1988).
Short double-stranded RNAs (dsRNAs; typically <30 nucleotides) can be used to silence the expression of target genes in animals and animal cells. Upon introduction, the dsRNAs enter the RNA interference (RNAi) pathway, which involves the production of shorter (20-25 nucleotide) small interfering RNAs (siRNAs) and assembly of the siRNAs into RNA-induced silencing complexes (RISCs). The siRNA strands are then unwound to form activated RISCs, which cleave the target RNA. Double-stranded RNA has been shown to be extremely effective in silencing a target RNA.
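The complementarity relationship described above can be made concrete with a brief sketch. The following Python function, whose name is hypothetical, derives a DNA antisense oligomer for a stretch of mRNA by taking the reverse complement; it assumes standard Watson-Crick pairing, an unmodified backbone, and sequences written 5′ to 3′, and it does not attempt to select target regions or predict efficacy.

```python
# Watson-Crick partners of each mRNA base, expressed as DNA bases.
_MRNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(mrna_segment: str) -> str:
    """Return the DNA antisense oligomer (5' to 3') for an mRNA segment (5' to 3').

    The oligomer is the reverse complement of the segment, so that it can
    hybridize to the mRNA in antiparallel orientation.
    """
    segment = mrna_segment.upper()
    try:
        return "".join(_MRNA_TO_DNA_COMPLEMENT[base] for base in reversed(segment))
    except KeyError as exc:
        raise ValueError(f"unexpected base in mRNA segment: {exc.args[0]!r}") from None
```

For instance, the mRNA segment 5′-AUGGCC-3′ would pair with the DNA oligomer 5′-GGCCAT-3′ under these assumptions.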
General methods of using antisense, ribozyme technology and RNAi technology, to control gene expression, or of gene therapy methods for expression of an exogenous gene in this manner are well known in the art. Each of these methods utilizes a system, such as a vector, encoding either an antisense or ribozyme transcript. The term “RNAi” stands for RNA interference. This term is understood in the art to encompass technology using RNA molecules that can silence genes. See, for example, McManus, et al. Nature Reviews Genetics 3: 737, 2002. In this application, the term “RNAi” encompasses molecules such as small interfering or short interfering RNA (siRNA), small hairpin or short hairpin RNA (shRNA), microRNAs, and small temporal RNA (stRNA). Generally speaking, RNA interference results from the interaction of double-stranded RNA with genes.
The antisense oligonucleotides can be of any length; for example, in alternative aspects, the antisense oligonucleotides are between about 5 to 100, about 10 to 80, about 15 to 60, or about 18 to 40 nucleotides in length. The optimal length can be determined by routine screening. The antisense oligonucleotides can be present at any concentration. The optimal concentration can be determined by routine screening. In certain embodiments, siRNA molecules are 12-28 nucleotides long, more preferably 15-25 nucleotides long, still more preferably 19-23 nucleotides long and most preferably 21-23 nucleotides long. In certain embodiments, preferred siRNA molecules are 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, or 29 nucleotides in length.
As used herein, the term “amino acid sequence” is synonymous with the terms “polypeptide,” “protein,” and “peptide,” and these terms are used interchangeably. Where such amino acid sequences exhibit activity, they may be referred to as an “enzyme.” The conventional one-letter or three-letter codes for amino acid residues are used herein.
As used herein, a “synthetic” molecule is produced by in vitro chemical or enzymatic synthesis rather than by an organism.
As used herein, the term “expression” refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation. The term “expression” also includes the protein product of a translated mRNA. The term “expression” as it refers to protein includes both protein levels and protein activity (e.g., protein binding, enzymatic activity, etc.). The term “expression” also refers to the transcription of non-translated nucleic acid (e.g., non-coding RNA).
A “gene” refers to the DNA segment encoding a polypeptide or RNA.
By “homolog” is meant an entity having a certain degree of identity with the subject amino acid sequences and the subject nucleotide sequences. As used herein, the term “homolog” covers identity with respect to structure and/or function, for example, the expression product of the resultant nucleotide sequence has the enzymatic activity of a subject amino acid sequence. With respect to sequence identity, preferably there is at least 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or even 99% sequence identity. These terms also encompass allelic variations of the sequences. The term, homolog, may apply to the relationship between genes separated by the event of speciation or to the relationship between genes separated by the event of genetic duplication.
Relative sequence identity can be determined by commercially available computer programs that can calculate % identity between two or more sequences using any suitable algorithm for determining identity, using, for example, default parameters. A typical example of such a computer program is CLUSTAL. Advantageously, the BLAST algorithm is employed, with parameters set to default-values. The BLAST algorithm is described in detail on the National Center for Biotechnology Information (NCBI) website.
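As a simplified illustration of what a percent identity figure represents, the following Python sketch scores two sequences that have already been aligned (for example, by one of the programs mentioned above). It is not an implementation of CLUSTAL or BLAST. The function name, the use of "-" as the gap character, and the choice of denominator (all aligned columns other than columns where both sequences have a gap) are assumptions for this example; alignment programs differ in how they define the denominator.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Identical, non-gap residues count as matches. Columns in which both
    sequences carry a gap are ignored; all other columns are counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    matches = 0
    columns = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap and res_b == gap:
            continue  # column carries no information
        columns += 1
        if res_a == res_b and res_a != gap:
            matches += 1

    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns
```

A result can then be compared against a chosen cutoff such as the 70% to 99% identity levels recited above.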
The homologs of the peptides as provided herein typically have structural similarity with such peptides. A homolog of a polypeptide includes one or more conservative amino acid substitutions, which may be selected from the same or different members of the class to which the amino acid belongs.
In one embodiment, the sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the secondary binding activity of the substance is retained. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine.
The present invention also encompasses conservative substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue with an alternative residue) that may occur e.g., like-for-like substitution such as basic for basic, acidic for acidic, polar for polar, etc. Non-conservative substitution may also occur e.g., from one class of residue to another or alternatively involving the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine. Conservative substitutions that may be made are, for example, within the groups of basic amino acids (Arginine, Lysine and Histidine), acidic amino acids (glutamic acid and aspartic acid), aliphatic amino acids (Alanine, Valine, Leucine, Isoleucine), polar amino acids (Glutamine, Asparagine, Serine, Threonine), aromatic amino acids (Phenylalanine, Tryptophan and Tyrosine), hydroxyl amino acids (Serine, Threonine), large amino acids (Phenylalanine and Tryptophan) and small amino acids (Glycine, Alanine).
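The groupings listed above lend themselves to a simple lookup. The following Python sketch, offered as an illustration only, encodes those groups using one-letter amino acid codes and reports whether a given substitution falls within at least one group. The function and variable names are hypothetical, and treating membership in any shared group as sufficient is an assumption of this example.

```python
# Conservative substitution groups as listed in the text (one-letter codes).
CONSERVATIVE_GROUPS = {
    "basic": set("RKH"),
    "acidic": set("ED"),
    "aliphatic": set("AVLI"),
    "polar": set("QNST"),
    "aromatic": set("FWY"),
    "hydroxyl": set("ST"),
    "large": set("FW"),
    "small": set("GA"),
}


def shared_groups(original: str, replacement: str) -> list[str]:
    """Names of the groups that contain both residues, in sorted order."""
    a, b = original.upper(), replacement.upper()
    return sorted(
        name for name, members in CONSERVATIVE_GROUPS.items()
        if a in members and b in members
    )


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one listed group."""
    if original.upper() == replacement.upper():
        return False  # not a substitution at all
    return bool(shared_groups(original, replacement))
```

Under this scheme, an arginine-to-lysine change is conservative (both basic), while an arginine-to-aspartic acid change is not.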
The present invention employs, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA and immunology, which are within the capabilities of a person of ordinary skill in the art. Such techniques are explained in the literature. See, for example, J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Books 1-3, Cold Spring Harbor Laboratory Press; Ausubel, F. M. et al. (1995 and periodic supplements; Current Protocols in Molecular Biology, ch. 9, 13, and 16, John Wiley & Sons, New York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 1996, DNA Isolation and Sequencing: Essential Techniques, John Wiley & Sons; M. J. Gait (Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, Irl Press; and, D. M. J. Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA Structure Part A: Synthesis and Physical Analysis of DNA Methods in Enzymology, Academic Press. Each of these general texts is herein incorporated by reference.
There are many methods known in the art for determining the genotype of a patient. Any method for determining genotype can be used for determining genotypes in the present invention. Such methods include, but are not limited to, amplimer sequencing, DNA sequencing, fluorescence spectroscopy, fluorescence resonance energy transfer (or “FRET”)-based hybridization analysis, high throughput screening, mass spectroscopy, nucleic acid hybridization, polymerase chain reaction (PCR), RFLP analysis and size chromatography (e.g., capillary or gel chromatography), all of which are well known to one of ordinary skill in the art.
Many methods of sequencing genomic DNA are known in the art, and any such method can be used, see for example Sambrook et al., Molecular Cloning; A Laboratory Manual 2d ed. (1989). For example, a DNA fragment of interest can be amplified using the polymerase chain reaction or some other cyclic polymerase mediated amplification reaction. The amplified region of DNA can then be sequenced using any method known in the art. Advantageously, the nucleic acid sequencing is by automated methods (reviewed by Meldrum, Genome Res. September 2000; 10(9):1288-303, the disclosure of which is incorporated by reference in its entirety), for example using a Beckman CEQ 8000 Genetic Analysis System (Beckman Coulter Instruments, Inc.). Methods for sequencing nucleic acids include, but are not limited to, automated fluorescent DNA sequencing (see, e.g., Watts & MacBeath, Methods Mol. Biol. 2001; 167:153-70 and MacBeath et al., Methods Mol. Biol. 2001; 167:119-52), capillary electrophoresis (see, e.g., Bosserhoff et al., Comb Chem High Throughput Screen. December 2000; 3(6):455-66), DNA sequencing chips (see, e.g., Jain, Pharmacogenomics. August 2000; 1(3):289-307), mass spectrometry (see, e.g., Yates, Trends Genet. January 2000; 16(1):5-8), pyrosequencing (see, e.g., Ronaghi, Genome Res. January 2001; 11(1):3-11), and ultrathin-layer gel electrophoresis (see, e.g., Guttman & Ronai, Electrophoresis. December 2000; 21 (18):3952-64), the disclosures of which are hereby incorporated by reference in their entireties. The sequencing can also be done by any commercial company. Examples of such companies include, but are not limited to, the University of Georgia Molecular Genetics Instrumentation Facility (Athens, Ga.) or SeqWright DNA Technologies Services (Houston, Tex.).
Any one of the methods known in the art for amplification of DNA may be used, such as for example, the polymerase chain reaction (PCR), the ligase chain reaction (LCR) (Barany, F., Proc. Natl. Acad. Sci. (U.S.A.) 88:189-193 (1991)), the strand displacement assay (SDA), or the oligonucleotide ligation assay (“OLA”) (Landegren, U. et al., Science 241:1077-1080 (1988)). Nickerson, D. A. et al. have described a nucleic acid detection assay that combines attributes of PCR and OLA (Nickerson, D. A. et al., Proc. Natl. Acad. Sci. (U.S.A.) 87:8923-8927 (1990)). Other known nucleic acid amplification procedures, such as transcription-based amplification systems (Malek, L. T. et al., U.S. Pat. No. 5,130,238; Davey, C. et al., European Patent Application 329,822; Schuster et al., U.S. Pat. No. 5,169,766; Miller, H. I. et al., PCT Application WO89/06700; Kwoh, D. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86:1173 (1989); Gingeras, T. R. et al., PCT Application WO88/10315), or isothermal amplification methods (Walker, G. T. et al., Proc. Natl. Acad. Sci. (U.S.A.) 89:392-396 (1992)) may also be used.
To perform a cyclic polymerase mediated amplification reaction according to the present invention, the primers are hybridized or annealed to opposite strands of the target DNA, the temperature is then raised to permit the thermostable DNA polymerase to extend the primers and thus replicate the specific segment of DNA spanning the region between the two primers. Then the reaction is thermocycled so that at each cycle the amount of DNA representing the sequences between the two primers is doubled, and specific amplification of gene DNA sequences, if present, results.
Any of a variety of polymerases can be used in the present invention. For thermocyclic reactions, the polymerases are thermostable polymerases such as Taq, KlenTaq, Stoffel Fragment, Deep Vent, Tth, Pfu, Vent, and UlTma, each of which is readily available from commercial sources. For non-thermocyclic reactions, and in certain thermocyclic reactions, the polymerase will often be one of many polymerases commonly used in the field, and commercially available, such as DNA pol I, Klenow fragment, T7 DNA polymerase, and T4 DNA polymerase. Guidance for the use of such polymerases can readily be found in product literature and in general molecular biology guides.
Typically, the annealing of the primers to the target DNA sequence is carried out for about 2 minutes at about 37-55° C., extension of the primer sequence by the polymerase enzyme (such as Taq polymerase) in the presence of nucleoside triphosphates is carried out for about 3 minutes at about 70-75° C., and the denaturing step to release the extended primer is carried out for about 1 minute at about 90-95° C. However, these parameters can be varied, and one of skill in the art would readily know how to adjust the temperature and time parameters of the reaction to achieve the desired results. For example, cycles may be as short as 10, 8, 6, 5, 4.5, 4, 2, 1, 0.5 minutes or less.
Also, “two temperature” techniques can be used where the annealing and extension steps may both be carried out at the same temperature, typically between about 60-65° C., thus reducing the length of each amplification cycle and resulting in a shorter assay time.
Typically, the reactions described herein are repeated until a detectable amount of product is generated. Often, such detectable amounts of product are between about 10 ng and about 100 ng, although larger quantities, e.g. 200 ng, 500 ng, 1 mg or more can also, of course, be detected. In molar terms, the amount of detectable product can be about 0.01 pmol, 0.1 pmol, 1 pmol, 10 pmol, or more. Thus, the number of cycles of the reaction that are performed can be varied; the more cycles are performed, the more amplified product is produced. In certain embodiments, the reaction comprises 2, 5, 10, 15, 20, 30, 40, 50, or more cycles.
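The relationship between cycle number and product amount can be illustrated with a short calculation. The following is a minimal sketch, assuming idealized amplification in which the product doubles at every cycle (efficiency of 1.0); real reactions plateau and rarely reach perfect doubling. The function names and the efficiency parameter are illustrative assumptions, not part of the described method.

```python
def amplified_amount(template_ng: float, cycles: int, efficiency: float = 1.0) -> float:
    """Product mass after a number of cycles.

    efficiency = 1.0 models perfect doubling per cycle (an idealization).
    """
    if template_ng <= 0 or cycles < 0 or not 0 < efficiency <= 1:
        raise ValueError("invalid template amount, cycle count, or efficiency")
    return template_ng * (1.0 + efficiency) ** cycles


def cycles_needed(template_ng: float, target_ng: float, efficiency: float = 1.0) -> int:
    """Smallest number of cycles at which the product reaches target_ng."""
    if template_ng <= 0 or target_ng <= 0 or not 0 < efficiency <= 1:
        raise ValueError("invalid template amount, target amount, or efficiency")
    cycles = 0
    amount = template_ng
    # Multiply step by step rather than using logarithms, which avoids
    # rounding surprises when the ratio is an exact power of the growth factor.
    while amount < target_ng:
        amount *= 1.0 + efficiency
        cycles += 1
    return cycles


if __name__ == "__main__":
    # 0.01 ng of template amplified to the 10 ng detection range
    print(cycles_needed(0.01, 10.0))
    print(amplified_amount(0.01, 10))
```

Under these idealized assumptions, a lower efficiency value can be passed to see how many additional cycles a less efficient reaction would require.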
For example, the PCR reaction may be carried out using about 25-50 μl samples containing about 0.01 to 1.0 ng of template amplification sequence, about 10 to 100 pmol of each generic primer, about 1.5 units of Taq DNA polymerase (Promega Corp.), about 0.2 mM dATP, about 0.2 mM dCTP, about 0.2 mM dGTP, about 0.2 mM dTTP, about 15 mM MgCl2, about 10 mM Tris-HCl (pH 9.0), about 50 mM KCl, about 1 μg/ml gelatin, and about 10 μl/ml Triton X-100 (Saiki, 1988).
Those of ordinary skill in the art are aware of the variety of nucleotides available for use in the cyclic polymerase mediated reactions. Typically, the nucleotides will consist at least in part of deoxynucleotide triphosphates (dNTPs), which are readily commercially available. Parameters for optimal use of dNTPs are also known to those of skill, and are described in the literature. In addition, a large number of nucleotide derivatives are known to those of skill and can be used in the present reaction. Such derivatives include fluorescently labeled nucleotides, allowing the detection of the product including such labeled nucleotides, as described below. Also included in this group are nucleotides that allow the sequencing of nucleic acids including such nucleotides, such as chain-terminating nucleotides, dideoxynucleotides and boronated nuclease-resistant nucleotides. Commercial kits containing the reagents most typically used for these methods of DNA sequencing are available and widely used. Other nucleotide analogs include nucleotides with bromo-, iodo-, or other modifying groups, which affect numerous properties of resulting nucleic acids including their antigenicity, their replicatability, their melting temperatures, their binding properties, etc. In addition, certain nucleotides include reactive side groups, such as sulfhydryl groups, amino groups, N-hydroxysuccinimidyl groups, that allow the further modification of nucleic acids comprising them.
In certain embodiments, oligonucleotides that can be used as primers to amplify specific nucleic acid sequences of a gene in cyclic polymerase-mediated amplification reactions, such as PCR reactions, consist of oligonucleotide fragments. Such fragments should be of sufficient length to enable specific annealing or hybridization to the nucleic acid sample. The sequences typically will be about 8 to about 44 nucleotides in length, but may be longer. Longer sequences, e.g., from about 14 to about 50, are advantageous for certain embodiments.
In embodiments where it is desired to amplify a fragment of DNA, primers having contiguous stretches of about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 nucleotides from a gene sequence are contemplated.
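As an illustration of what "contiguous stretches" of a given length means, the following minimal sketch enumerates every window of a chosen length from a sequence. The example sequence is made up for illustration and is not a gene sequence of the invention; selecting an actual primer would involve further criteria (specificity, melting temperature) that are not modeled here.

```python
def contiguous_stretches(sequence: str, length: int) -> list[str]:
    """Return every contiguous stretch of the given length, in 5'->3' order."""
    if length < 1:
        raise ValueError("length must be at least 1")
    # A sequence shorter than the requested length yields no stretches.
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


if __name__ == "__main__":
    toy_sequence = "ACGTACGTAC"  # hypothetical 10-nt sequence
    for candidate in contiguous_stretches(toy_sequence, 8):
        print(candidate)
```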
As used herein, “hybridization” refers to the process by which one strand of nucleic acid base pairs with a complementary strand, as occurs during blot hybridization techniques and PCR techniques.
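Base pairing between complementary strands can be sketched computationally as a reverse-complement comparison. The example below is a simplified illustration that only recognizes perfect Watson-Crick complementarity between DNA strands; it does not model mismatches, salt, or temperature, and the function names are assumptions introduced for illustration.

```python
# Translation table for Watson-Crick complements of DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return sequence.translate(_COMPLEMENT)[::-1]


def is_perfect_complement(strand_a: str, strand_b: str) -> bool:
    """True if strand_b (5'->3') pairs with strand_a (5'->3') at every position."""
    return reverse_complement(strand_a.upper()) == strand_b.upper()


if __name__ == "__main__":
    print(reverse_complement("AGTCAGTC"))
    print(is_perfect_complement("AGTCAGTC", "GACTGACT"))
```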
Whichever probe sequences and hybridization methods are used, one ordinarily skilled in the art can readily determine suitable hybridization conditions, such as temperature and chemical conditions. Such hybridization methods are well known in the art. For example, for applications requiring high selectivity, one will typically desire to employ relatively stringent conditions for the hybridization reactions, e.g., one will select relatively low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.10 M NaCl at temperatures of about 50° C. to about 70° C. Such high stringency conditions tolerate little, if any, mismatch between the probe and the template or target strand. It is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide. Other variations in hybridization reaction conditions are well known in the art (see for example, Sambrook et al., Molecular Cloning; A Laboratory Manual 2d ed. (1989)).
Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex, as taught, e.g., in Berger and Kimmel (1987, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego Calif.), and confer a defined “stringency” as explained below.
Maximum stringency typically occurs at about Tm-5° C. (5° C. below the Tm of the probe); high stringency at about 5° C. to 10° C. below Tm; intermediate stringency at about 10° C. to 20° C. below Tm; and low stringency at about 20° C. to 25° C. below Tm. As will be understood by those of ordinary skill in the art, a maximum stringency hybridization can be used to identify or detect identical nucleotide sequences while an intermediate (or low) stringency hybridization can be used to identify or detect similar or related polynucleotide sequences.
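The stringency ranges above can be expressed as a simple lookup on the difference between the Tm and the hybridization temperature. The following is a minimal sketch; because the text gives the ranges as approximate ("about"), the exact cut-offs and the handling of boundary values are assumptions made for illustration.

```python
def classify_stringency(tm_c: float, hybridization_temp_c: float) -> str:
    """Classify hybridization stringency by how far the temperature is below Tm.

    Boundaries follow the approximate ranges in the text; treating each
    boundary value as belonging to the more stringent class is an assumption.
    """
    offset = tm_c - hybridization_temp_c
    if offset < 0:
        raise ValueError("hybridization temperature is above the Tm")
    if offset <= 5:
        return "maximum"
    if offset <= 10:
        return "high"
    if offset <= 20:
        return "intermediate"
    if offset <= 25:
        return "low"
    return "below the low-stringency range"


if __name__ == "__main__":
    for temp in (65, 62, 55, 47):
        print(temp, classify_stringency(70, temp))
```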
In one aspect, the present invention employs nucleotide sequences that can hybridize to another nucleotide sequence under stringent conditions (e.g., 65° C. and 0.1×SSC (1×SSC=0.15 M NaCl, 0.015 M Na3 citrate, pH 7.0)). Where the nucleotide sequence is double-stranded, both strands of the duplex, either individually or in combination, may be employed by the present invention. Where the nucleotide sequence is single-stranded, it is to be understood that the complementary sequence of that nucleotide sequence is also included within the scope of the present invention.
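The SSC notation is a fold-dilution of a defined 1× buffer, so component concentrations at any strength follow by multiplication. The sketch below uses the 1×SSC definition given in the text; the total sodium figure additionally assumes that trisodium citrate contributes three sodium ions per formula unit and dissociates fully, which is an assumption added here for illustration.

```python
# 1x SSC composition as defined in the text (mol/L).
SSC_1X = {"NaCl": 0.15, "Na3_citrate": 0.015}


def ssc_molarity(fold: float) -> dict[str, float]:
    """Component molarities for a given SSC strength, e.g. fold=0.1 for 0.1x SSC."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return {name: molarity * fold for name, molarity in SSC_1X.items()}


def sodium_molarity(fold: float) -> float:
    """Approximate total Na+ molarity, assuming 3 Na+ per citrate and full dissociation."""
    components = ssc_molarity(fold)
    return components["NaCl"] + 3 * components["Na3_citrate"]


if __name__ == "__main__":
    print(ssc_molarity(0.1))     # stringent wash strength
    print(sodium_molarity(6.0))  # 6x SSC hybridization buffer
```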
Stringency of hybridization refers to conditions under which polynucleic acid hybrids are stable. Such conditions are evident to those of ordinary skill in the field. As known to those of ordinary skill in the art, the stability of hybrids is reflected in the melting temperature (Tm) of the hybrid which decreases approximately 1 to 1.5° C. with every 1% decrease in sequence homology. In general, the stability of a hybrid is a function of sodium ion concentration and temperature. Typically, the hybridization reaction is performed under conditions of higher stringency, followed by washes of varying stringency.
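The rule of thumb relating Tm to sequence homology can be turned into a quick estimate. The following minimal sketch applies the stated 1 to 1.5° C. decrease per 1% decrease in homology to a perfectly matched hybrid's Tm and returns the resulting range; the function name and the example values are assumptions for illustration, and the linear rule is only an approximation.

```python
def tm_range_for_identity(
    tm_perfect_c: float,
    percent_identity: float,
    drop_per_percent: tuple[float, float] = (1.0, 1.5),
) -> tuple[float, float]:
    """Estimated (lowest, highest) Tm for a hybrid with the given percent identity.

    drop_per_percent holds the minimum and maximum Tm decrease (deg C) per 1%
    decrease in homology, taken from the rule of thumb in the text.
    """
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    mismatch_percent = 100.0 - percent_identity
    min_drop, max_drop = drop_per_percent
    return (
        tm_perfect_c - max_drop * mismatch_percent,
        tm_perfect_c - min_drop * mismatch_percent,
    )


if __name__ == "__main__":
    # A hybrid with 90% identity whose perfectly matched Tm is 80 deg C
    print(tm_range_for_identity(80.0, 90.0))
```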
As used herein, high stringency includes conditions that permit hybridization of only those nucleic acid sequences that form stable hybrids in 1 M Na+ at 65-68° C. High stringency conditions can be provided, for example, by hybridization in an aqueous solution containing 6×SSC, 5×Denhardt's, 1% SDS (sodium dodecyl sulphate), 0.1 Na+ pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as non-specific competitor. Following hybridization, high stringency washing may be done in several steps, with a final wash (about 30 minutes) at the hybridization temperature in 0.2-0.1×SSC, 0.1% SDS.
It is understood that these conditions may be adapted and duplicated using a variety of buffers, e.g., formamide-based buffers, and temperatures. Denhardt's solution and SSC are well known to those of ordinary skill in the art as are other suitable hybridization buffers (see, e.g., Sambrook, et al., eds. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York or Ausubel, et al., eds. (1990) Current Protocols in Molecular Biology, John Wiley & Sons, Inc.). Optimal hybridization conditions are typically determined empirically, as the length and the GC content of the hybridizing pair also play a role.
Nucleic acid molecules that differ from the sequences of the primers and probes disclosed herein, are intended to be within the scope of the invention. Nucleic acid sequences that are complementary to these sequences, or that are hybridizable to the sequences described herein under conditions of standard or stringent hybridization, and also analogs and derivatives are also intended to be within the scope of the invention. Advantageously, such variations will differ from the sequences described herein by only a small number of nucleotides, for example by 1, 2, or 3 nucleotides.
Nucleic acid molecules corresponding to natural allelic variants, homologues (i.e., nucleic acids derived from other species), or other related sequences (e.g., paralogs) of the sequences described herein can be isolated based on their homology to the nucleic acids disclosed herein, for example by performing standard or stringent hybridization reactions using all or a portion of the known sequences as probes. Such methods for nucleic acid hybridization and cloning are well known in the art.
Similarly, a nucleic acid molecule detected in the methods of the invention may include only a fragment of the specific sequences described. Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic acids, a length sufficient to allow for specific hybridization of nucleic acid primers or probes, and are at most some portion less than a full-length sequence. Fragments may be derived from any contiguous portion of a nucleic acid sequence of choice. Derivatives and analogs may be full length or other than full length, if the derivative or analog contains a modified nucleic acid or amino acid, as described below.
Derivatives, analogs, homologues, and variants of the nucleic acids of the invention include, but are not limited to, molecules comprising regions that are substantially homologous to the nucleic acids of the invention, in various embodiments, by at least about 70%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or even 99% identity over a nucleic acid sequence of identical size or when compared to an aligned sequence in which the alignment is done by a computer homology program known in the art.
For the purposes of the present invention, sequence identity or homology is determined by comparing the sequences when aligned so as to maximize overlap and identity while minimizing sequence gaps. In particular, sequence identity may be determined using any of a number of mathematical algorithms. A nonlimiting example of a mathematical algorithm used for comparison of two sequences is the algorithm of Karlin & Altschul, Proc. Natl. Acad. Sci. USA 1990; 87: 2264-2268, modified as in Karlin & Altschul, Proc. Natl. Acad. Sci. USA 1993; 90: 5873-5877.
Another example of a mathematical algorithm used for comparison of sequences is the algorithm of Myers & Miller, CABIOS 1988; 4: 11-17. Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. Yet another useful algorithm for identifying regions of local sequence similarity and alignment is the FASTA algorithm as described in Pearson & Lipman, Proc. Natl. Acad. Sci. USA 1988; 85: 2444-2448.
Advantageous for use according to the present invention is the WU-BLAST (Washington University BLAST) version 2.0 software. WU-BLAST version 2.0 executable programs for several UNIX platforms can be downloaded from ftp://blast.wustl.edu/blast/executables. This program is based on WU-BLAST version 1.4, which in turn is based on the public domain NCBI-BLAST version 1.4 (Altschul & Gish, 1996, Local alignment statistics, Doolittle ed., Methods in Enzymology 266: 460-480; Altschul et al., Journal of Molecular Biology 1990; 215: 403-410; Gish & States, 1993; Nature Genetics 3: 266-272; Karlin & Altschul, 1993; Proc. Natl. Acad. Sci. USA 90: 5873-5877; all of which are incorporated by reference herein).
In all search programs in the suite the gapped alignment routines are integral to the database search itself. Gapping can be turned off if desired. The default penalty (Q) for a gap of length one is Q=9 for proteins and BLASTP, and Q=10 for BLASTN, but may be changed to any integer. The default per-residue penalty for extending a gap (R) is R=2 for proteins and BLASTP, and R=10 for BLASTN, but may be changed to any integer. Any combination of values for Q and R can be used in order to align sequences so as to maximize overlap and identity while minimizing sequence gaps. The default amino acid comparison matrix is BLOSUM62, but other amino acid comparison matrices such as PAM can be utilized.
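The effect of the Q and R parameters can be sketched as a gap-cost function. The example below assumes, following the wording above, that Q is the total penalty for a gap of length one and that R is charged for each additional residue; this reading is an assumption, and the software's documentation should be consulted for the exact scoring convention. The dictionary of defaults simply restates the values given in the text.

```python
# Default (Q, R) values as stated in the text.
DEFAULT_GAP_PENALTIES = {"BLASTP": (9, 2), "BLASTN": (10, 10)}


def gap_cost(length: int, q: int, r: int) -> int:
    """Penalty for a single gap of the given length.

    Assumes Q covers the first gapped residue and R each further residue.
    """
    if length < 1:
        raise ValueError("gap length must be at least 1")
    return q + r * (length - 1)


if __name__ == "__main__":
    for program, (q, r) in DEFAULT_GAP_PENALTIES.items():
        print(program, [gap_cost(k, q, r) for k in (1, 2, 3)])
```

Under this reading, a low R relative to Q favors a few long gaps over many short ones, which is one reason the two values are tuned separately.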
Alternatively or additionally, the term “homology” or “identity”, for instance, with respect to a nucleotide or amino acid sequence, can indicate a quantitative measure of homology between two sequences. The percent sequence homology can be calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences. Hence, the DNA sequence AGTCAGTC will have a sequence identity of 75% with the sequence AATCAATC (Nref=8; Ndif=2). “Homology” or “identity” can refer to the number of positions with identical nucleotides or amino acids divided by the number of nucleotides or amino acids in the shorter of the two sequences wherein alignment of the two sequences can be determined in accordance with the Wilbur and Lipman algorithm (Wilbur & Lipman, Proc Natl Acad Sci USA 1983; 80:726, incorporated herein by reference), for instance, using a window size of 20 nucleotides, a word length of 4 nucleotides, and a gap penalty of 4, and computer-assisted analysis and interpretation of the sequence data including alignment can be conveniently performed using commercially available programs (e.g., Intelligenetics™ Suite, Intelligenetics Inc. CA). When RNA sequences are said to be similar, or have a degree of sequence identity or homology with DNA sequences, thymidine (T) in the DNA sequence is considered equal to uracil (U) in the RNA sequence. Thus, RNA sequences are within the scope of the invention and can be derived from DNA sequences, by thymidine (T) in the DNA sequence being considered equal to uracil (U) in RNA sequences. Without undue experimentation, the skilled artisan can consult with many other programs or references for determining percent homology.
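The percent identity calculation can be sketched as follows. This is a minimal illustration that assumes the two sequences have already been aligned and are of equal length, with no gap characters; it takes Nref as the length of the reference sequence and treats U as equivalent to T, as described above. The function name is an assumption introduced for illustration.

```python
def percent_identity(reference: str, other: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Computes (Nref - Ndif) * 100 / Nref, where Nref is the reference length
    and Ndif is the number of non-identical positions. U is treated as T so
    that RNA and DNA sequences can be compared.
    """
    ref = reference.upper().replace("U", "T")
    oth = other.upper().replace("U", "T")
    if not ref or len(ref) != len(oth):
        raise ValueError("sequences must be non-empty and of equal aligned length")
    n_ref = len(ref)
    n_dif = sum(1 for a, b in zip(ref, oth) if a != b)
    return (n_ref - n_dif) * 100 / n_ref


if __name__ == "__main__":
    # The worked example from the text: Nref = 8, Ndif = 2
    print(percent_identity("AGTCAGTC", "AATCAATC"))
```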
In embodiments where expression of a particular gene is assessed by determining the expression of the protein product of the gene, any suitable assay for detecting protein levels and/or activity may be employed. For example, suitable protein activity assays include ubiquitination assays, kinase assays, protein-binding assays, DNA-binding and unwinding assays, and any other suitable assay for assessing the activity of the protein product of a translated gene according to the invention.
In order to determine the genotype or expression level of a particular SWI/SNF complex and/or MEDIATOR complex gene of a patient according to the methods of the present invention, it may be necessary to obtain a sample of genomic DNA or RNA from that patient. That sample of genomic DNA or RNA may be obtained from a sample of tissue or cells taken from that patient.
A sample may comprise any clinically relevant tissue sample, such as a tumor biopsy or fine needle aspirate, hair (including roots), skin, buccal swabs, saliva, or a sample of bodily fluid, such as blood, plasma, serum, lymph, ascitic fluid, cystic fluid, urine or nipple exudate. The sample may be taken from a human, or, in a veterinary context, from non-human animals such as ruminants, horses, swine or sheep, or from domestic companion animals such as felines and canines.
The tissue sample may be marked with an identifying number or other indicia that relates the sample to the individual patient from which the sample was taken. The identity of the sample advantageously remains constant throughout the methods of the invention thereby guaranteeing the integrity and continuity of the sample during extraction and analysis. Alternatively, the indicia may be changed in a regular fashion that ensures that the data, and any other associated data, can be related back to the patient from whom the data was obtained. The amount/size of sample required is known to those ordinarily skilled in the art.
Generally, the tissue sample may be placed in a container that is labeled using a numbering system bearing a code corresponding to the patient. Accordingly, the genotype of a particular patient is easily traceable.
In one embodiment of the invention, a sampling device and/or container may be supplied to the physician. The sampling device advantageously takes a consistent and reproducible sample from individual patients while simultaneously avoiding any cross-contamination of tissue. Accordingly, the size and volume of sample tissues derived from individual patients would be consistent.
According to the present invention, a sample of genomic DNA or RNA is obtained from the tissue sample of the patient of interest. Whatever source of cells or tissue is used, a sufficient amount of cells must be obtained to provide a sufficient amount of DNA or RNA for analysis. This amount will be known or readily determinable by those ordinarily skilled in the art.
DNA or RNA is isolated from the tissue/cells by techniques known to those ordinarily skilled in the art (see, e.g., U.S. Pat. Nos. 6,548,256 and 5,989,431, Hirota et al., Jinrui Idengaku Zasshi. September 1989; 34(3):217-23 and John et al., Nucleic Acids Res. Jan. 25. 1991; 19(2):408; the disclosures of which are incorporated by reference in their entireties). For example, high molecular weight DNA may be purified from cells or tissue using proteinase K extraction and ethanol precipitation. DNA may be extracted from a patient specimen using any other suitable methods known in the art.
In certain embodiments, target polynucleotide molecules are extracted from a sample taken from an individual afflicted with breast cancer. The sample may be collected in any clinically acceptable manner, but must be collected such that marker-derived polynucleotides (e.g., RNA) are preserved. mRNA or nucleic acids derived therefrom (e.g., cDNA or amplified DNA) are preferably labeled distinguishably from standard or control polynucleotide molecules, and both are simultaneously or independently hybridized to a microarray comprising one or more markers of resistance to anticancer treatment as described above. Alternatively, mRNA or nucleic acids derived therefrom may be labeled with the same label as the standard or control polynucleotide molecules, wherein the intensity of hybridization of each at a particular probe is compared.
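Comparing hybridization intensities between a sample and a control at each probe is commonly summarized as a log2 ratio. The following is a minimal sketch of that comparison; the probe names, the intensity values, and the cut-off of a two-fold reduction are hypothetical and are not taken from this disclosure, and background correction and normalization are omitted.

```python
import math


def log2_ratios(sample: dict[str, float], control: dict[str, float]) -> dict[str, float]:
    """Per-probe log2(sample / control) for probes present in both channels."""
    ratios = {}
    for probe, sample_intensity in sample.items():
        control_intensity = control.get(probe)
        # Skip probes that are missing or have non-positive intensity.
        if control_intensity is None or control_intensity <= 0 or sample_intensity <= 0:
            continue
        ratios[probe] = math.log2(sample_intensity / control_intensity)
    return ratios


def reduced_probes(ratios: dict[str, float], threshold: float = -1.0) -> list[str]:
    """Probes whose log2 ratio is at or below the threshold (hypothetical cut-off)."""
    return sorted(probe for probe, ratio in ratios.items() if ratio <= threshold)


if __name__ == "__main__":
    sample = {"probe_A": 500.0, "probe_B": 1200.0}    # hypothetical intensities
    control = {"probe_A": 2000.0, "probe_B": 1000.0}  # hypothetical intensities
    ratios = log2_ratios(sample, control)
    print(ratios)
    print(reduced_probes(ratios))
```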
Methods for preparing total and poly(A)+ RNA are well known and are described generally in Sambrook et al., MOLECULAR CLONING-A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989) and Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vol. 2, Current Protocols Publishing, New York (1994).
RNA may be isolated from eukaryotic cells by procedures that involve lysis of the cells and denaturation of the proteins contained therein. Cells of interest include wild-type cells (i.e., non-cancerous), drug-exposed wild-type cells, tumor- or tumor-derived cells, modified cells, normal or tumor cell line cells, and drug-exposed modified cells.
Additional steps may be employed to remove DNA. Cell lysis may be accomplished with a nonionic detergent, followed by microcentrifugation to remove the nuclei and hence the bulk of the cellular DNA. In one embodiment, RNA is extracted from cells of the various types of interest using guanidinium thiocyanate lysis followed by CsCl centrifugation to separate the RNA from DNA (Chirgwin et al., Biochemistry 18:5294-5299 (1979)). Poly(A)+ RNA is selected using oligo-dT cellulose (see Sambrook et al., MOLECULAR CLONING-A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989)). Alternatively, separation of RNA from DNA can be accomplished by organic extraction, for example, with hot phenol or phenol/chloroform/isoamyl alcohol.
If desired, RNase inhibitors may be added to the lysis buffer. Likewise, for certain cell types, it may be desirable to add a protein denaturation/digestion step to the protocol.
In certain embodiments, it is desirable to preferentially enrich mRNA with respect to other cellular RNAs, such as transfer RNA (tRNA) and ribosomal RNA (rRNA). Most mRNAs contain a poly(A) tail at their 3′ end. This allows them to be enriched by affinity chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as cellulose or Sephadex™ (see Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vol. 2, Current Protocols Publishing, New York (1994)). Once bound, poly(A)+ mRNA is eluted from the affinity column using 2 mM EDTA/0.1% SDS.
The sample of RNA can comprise a plurality of different mRNA molecules, each different mRNA molecule having a different nucleotide sequence. In a specific embodiment, the RNA sample is a mammalian RNA sample.
In a specific embodiment, total RNA or mRNA from cells is used in the methods of the invention. The source of the RNA can be cells of any animal, human, mammal, primate, non-human animal, dog, cat, mouse, rat, bird, yeast, eukaryote, etc. In specific embodiments, the method of the invention is used with a sample containing total mRNA or total RNA from 1×10⁶ cells or less. In another embodiment, proteins can be isolated from the foregoing sources, by methods known in the art, for use in expression analysis at the protein level.
In certain embodiments, expression of a biomarker according to the invention is measured using multiplex ligation-dependent probe amplification (MLPA) (see, e.g., WO 01/61033 and Schouten, J P et al. (2002) “Relative quantification of 40 nucleic acid sequences by multiplex ligation-dependent probe amplification” Nucleic Acids Res 30, e57) or reverse transcriptase MLPA (RT-MLPA) (see, e.g., Eldering, E et al. (2003) “Expression profiling via novel multiplex assay allows rapid assessment of gene regulation in defined signaling pathways” Nucleic Acids Res 31, e153). In RT-MLPA, mRNA is converted to cDNA by reverse transcriptase, followed by a normal MLPA reaction. In other embodiments, methylation-specific MLPA is employed to detect expression of a biomarker according to the instant invention (see, e.g., Nygren, A O et al. (2005) “Methylation-specific MLPA (MS-MLPA): simultaneous detection of CpG methylation and copy number changes of up to 40 sequences” Nucleic Acids Res 33, 14:e128).
As defined herein, a “nucleic acid array” refers to a plurality of unique nucleic acids (or “nucleic acid members”) attached to a support where each of the nucleic acid members is attached to a support in a unique pre-selected region.
In one embodiment, the nucleic acid member attached to the surface of the support is DNA. In another embodiment, the nucleic acid member attached to the surface of the support is either cDNA or oligonucleotides. In another embodiment, the nucleic acid member attached to the surface of the support is cDNA synthesized by polymerase chain reaction (PCR). In another embodiment, sequences bound to the array can be an isolated oligonucleotide, cDNA, EST or PCR product corresponding to any biomarker of the invention, and total cellular RNA is applied to the array.
Array technology and the various techniques and applications associated with it are described generally in numerous textbooks and documents. These include Lemieux et al., 1998, Molecular Breeding 4, 277-289; Schena and Davis, Parallel Analysis with Biological Chips, in PCR Methods Manual (eds. M. Innis, D. Gelfand, J. Sninsky); Schena and Davis, 1999, Genes, Genomes and Chips, in DNA Microarrays: A Practical Approach (ed. M. Schena), Oxford University Press, Oxford, UK, 1999; The Chipping Forecast (Nature Genetics special issue; January 1999 Supplement); Mark Schena (Ed.), Microarray Biochip Technology (Eaton Publishing Company); Cortese, 2000, The Scientist 14[17]:25; Gwynne and Page, Microarray analysis: the next revolution in molecular biology, Science, 1999 Aug. 6; and Ekins and Chu, 1999, Trends in Biotechnology, 17, 217-218.
Major applications for array technology include the identification of sequence (gene/gene mutation) and the determination of expression level (abundance) of genes. Gene expression profiling may make use of array technology, optionally in combination with proteomics techniques (Celis et al., 2000, FEBS Lett, 480(1):2-16; Lockhart and Winzeler, 2000, Nature 405(6788):827-836; Khan et al., 1999, 20(2):223-9). Other applications of array technology are also known in the art; for example, gene discovery, cancer research (Marx, 2000, Science 289: 1670-1672; Scherf et al., 2000, Nat Genet; 24(3):236-44; Ross et al., 2000, Nat. Genet. 2000 March; 24(3):227-35), SNP analysis (Wang et al., 1998, Science, 280(5366):1077-82), drug discovery, pharmacogenomics, disease diagnosis (for example, utilising microfluidics devices: Chemical & Engineering News, Feb. 22, 1999, 77(8):27-36), toxicology (Rockett and Dix (2000), Xenobiotica, 30(2): 155-77; Afshari et al., 1999, Cancer Res. 59(19):4759-60) and toxicogenomics (a hybrid of functional genomics and molecular toxicology).
In general, any library may be arranged in an orderly manner into an array, by spatially separating the members of the library. Examples of suitable libraries for arraying include nucleic acid libraries (including DNA, cDNA, oligonucleotide, etc. libraries), peptide, polypeptide and protein libraries, as well as libraries comprising any molecules, such as ligand libraries, among others.
The samples (e.g., members of a library) are generally fixed or immobilized onto a solid phase, preferably a solid substrate, to limit diffusion and admixing of the samples. In particular, the libraries may be immobilized to a substantially planar solid phase, including membranes and non-porous substrates such as plastic and glass. Furthermore, the samples are preferably arranged in such a way that indexing (i.e., reference or access to a particular sample) is facilitated. Typically the samples are applied as spots in a grid formation. Common assay systems may be adapted for this purpose. For example, an array may be immobilized on the surface of a microplate, either with multiple samples in a well, or with a single sample in each well. Furthermore, the solid substrate may be a membrane, such as a nitrocellulose or nylon membrane (for example, membranes used in blotting experiments). Alternative substrates include glass, or silica-based substrates. Thus, the samples are immobilized by any suitable method known in the art, for example, by charge interactions, or by chemical coupling to the walls or bottom of the wells, or the surface of the membrane. Other means of arranging and fixing may be used, for example, pipetting, drop-touch, piezoelectric means, ink-jet and bubblejet technology, electrostatic application, etc. In the case of silicon-based chips, photolithography may be utilised to arrange and fix the samples on the chip.
The samples may be arranged by being “spotted” onto the solid substrate; this may be done by hand or by making use of robotics to deposit the sample. In general, arrays may be described as macroarrays or microarrays, the difference being the size of the sample spots. Macroarrays typically contain sample spot sizes of about 300 microns or larger and may be easily imaged by existing gel and blot scanners. The sample spot sizes in microarrays are typically less than 200 microns in diameter and these arrays usually contain thousands of spots. Thus, microarrays may require specialized robotics and imaging equipment, which may need to be custom made. Instrumentation is described generally in a review by Cortese, 2000, The Scientist 14[11]:26.
Techniques for producing immobilized libraries of DNA molecules have been described in the art. Generally, most prior art methods described how to synthesize single-stranded nucleic acid molecule libraries, using for example masking techniques to build up various permutations of sequences at the various discrete positions on the solid substrate. U.S. Pat. No. 5,837,832 describes an improved method for producing DNA arrays immobilized to silicon substrates based on very large scale integration technology. In particular, U.S. Pat. No. 5,837,832 describes a strategy called “tiling” to synthesize specific sets of probes at spatially-defined locations on a substrate which may be used to produce the immobilized DNA libraries of the present invention. U.S. Pat. No. 5,837,832 also provides references for earlier techniques that may also be used. Arrays may also be built using photo deposition chemistry.
To aid detection, labels are typically used—such as any readily detectable reporter, for example, a fluorescent, bioluminescent, phosphorescent, radioactive, etc. reporter. Labelling of probes and targets is also disclosed in Shalon et al., 1996, Genome Res 6(7):639-45.
Examples of DNA arrays include where probe cDNA (500˜5,000 bases long) is immobilized to a solid surface such as glass using robot spotting and exposed to a set of targets either separately or in a mixture. This method is widely considered as having been developed at Stanford University (Ekins and Chu, 1999, Trends in Biotechnology, 1999, 17, 217-218).
Another example of a DNA array is where an array of oligonucleotides (20-25-mer oligos, preferably, 40-60 mer oligos) or peptide nucleic acid (PNA) probes is synthesized either in situ (on-chip) or by conventional synthesis followed by on-chip immobilization. The array is exposed to labelled sample DNA, hybridized, and the identity/abundance of complementary sequences is determined. Such a DNA chip is sold by Affymetrix, Inc., under the GeneChip® trademark. Agilent and Nimblegen also provide suitable arrays (e.g., genomic tiling arrays).
In other embodiments, high throughput DNA sequencing promises to become an affordable and more quantitative alternative for microarrays to analyze large collections of DNA sequences. Examples of high-throughput sequencing approaches are listed in E. Y. Chan, Mutation Research 573 (2005) 13-40 and include, but are not limited to, near-term sequencing approaches such as cycle-extension approaches, polymerase reading approaches and exonuclease sequencing, revolutionary sequencing approaches such as DNA scanning and nanopore sequencing and direct linear analysis. Examples of current high-throughput sequencing methods are 454 (pyro)sequencing, Solexa Genome Analysis System, Agencourt SOLiD sequencing method (Applied Biosystems), MS-PET sequencing (Ng et al., 2006, http://nar(dot)oxfordjournals(dot)org/cgi/content/full/34/12/e84).
As used herein, the term “probe” refers to a molecule (e.g., an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification), that is capable of hybridizing to another molecule of interest (e.g., another oligonucleotide). When probes are oligonucleotides they may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular targets (e.g., gene sequences). As described herein, it is contemplated that probes used in the present invention may be labelled with a label so that is detectable in any detection system, including, but not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems.
With respect to arrays and microarrays, the term “probe” is used to refer to any hybridizable material that is affixed to the array for the purpose of detecting a nucleotide sequence that has hybridized to said probe. Preferably, these probes are 25-60 mers or longer.
The present invention further encompasses probes according to the present invention that are immobilized on a solid or flexible support, such as paper, nylon or other type of membrane, filter, chip, glass slide, microchips, microbeads, or any other such matrix, all of which are within the scope of this invention.
The primers and probes described herein may be readily prepared by, for example, directly synthesizing the fragment by chemical means or by introducing selected sequences into recombinant vectors for recombinant production. Methods for making a vector or recombinants or plasmid for amplification of the fragment either in vivo or in vitro can be any desired method, e.g., a method which is by or analogous to the methods disclosed in, or disclosed in documents cited in: U.S. Pat. Nos. 4,603,112; 4,769,330; 4,394,448; 4,722,848; 4,745,051; 4,769,331; 4,945,050; 5,494,807; 5,514,375; 5,744,140; 5,744,141; 5,756,103; 5,762,938; 5,766,599; 5,990,091; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 5,591,639; 5,589,466; 5,677,178; 5,591,439; 5,552,143; 5,580,859; 6,130,066; 6,004,777; 6,130,066; 6,497,883; 6,464,984; 6,451,770; 6,391,314; 6,387,376; 6,376,473; 6,368,603; 6,348,196; 6,306,400; 6,228,846; 6,221,362; 6,217,883; 6,207,166; 6,207,165; 6,159,477; 6,153,199; 6,090,393; 6,074,649; 6,045,803; 6,033,670; 6,485,729; 6,103,526; 6,224,882; 6,312,682; 6,348,450 and 6,312,683; U.S. patent application Ser. No. 920,197, filed Oct. 16, 1986; WO 90/01543; WO91/11525; WO 94/16716; WO 96/39491; WO 98/33510; EP 265785; EP 0 370 573; Andreansky et al., Proc. Natl. Acad. Sci. USA 1996; 93:11313-11318; Ballay et al., EMBO J. 1993; 4:3861-65; Felgner et al., J. Biol. Chem. 1994; 269:2550-2561; Frolov et al., Proc. Natl. Acad. Sci. USA 1996; 93:11371-11377; Graham, Tibtech 1990; 8:85-87; Grunhaus et al., Sem. Virol. 1992; 3:237-52; Ju et al., Diabetologia 1998; 41:736-739; Kitson et al., J. Virol. 1991; 65:3068-3075; McClements et al., Proc. Natl. Acad. Sci. USA 1996; 93:11414-11420; Moss, Proc. Natl. Acad. Sci. USA 1996; 93:11341-11348; Paoletti, Proc. Natl. Acad. Sci. USA 1996; 93:11349-11353; Pennock et al., Mol. Cell. Biol. 1984; 4:399-406; Richardson (Ed), Methods in Molecular Biology 1995; 39, “Baculovirus Expression Protocols,” Humana Press Inc.; Smith et al., Mol. Cell. Biol. 1983; 3:2156-2165; Robertson et al., Proc. Natl. Acad. Sci. USA 1996; 93:11334-11340; Robinson et al., Sem. Immunol. 1997; 9:271; and Roizman, Proc. Natl. Acad. Sci. USA 1996; 93:11307-11312. Strategies for probe design are described in WO95/11995, EP 717,113 and WO97/29212.
In order to generate data from array-based assays, a signal is detected that signifies the presence or absence of hybridization between a probe and a nucleotide sequence. The present invention further contemplates direct and indirect labelling techniques. For example, direct labelling incorporates fluorescent dyes directly into the nucleotide sequences that hybridize to the array-associated probes (e.g., dyes are incorporated into the nucleotide sequence by enzymatic synthesis in the presence of labelled nucleotides or PCR primers). Direct labelling schemes yield strong hybridization signals, typically using families of fluorescent dyes with similar chemical structures and characteristics, and are simple to implement. In some embodiments comprising direct labelling of nucleic acids, cyanine or Alexa analogs are utilized in multiple-fluor comparative array analyses. In other embodiments, indirect labelling schemes can be utilized to incorporate epitopes into the nucleic acids either prior to or after hybridization to the microarray probes. One or more staining procedures and reagents are used to label the hybridized complex (e.g., a fluorescent molecule that binds to the epitopes, thereby providing a fluorescent signal by virtue of the conjugation of the dye molecule to the epitope of the hybridized species).
Oligonucleotide sequences used as probes according to the present invention may be labeled with a detectable moiety. Various labeling moieties are known in the art. Said moiety may be, for example, a radiolabel (e.g., 3H, 125I, 35S, 14C, 32P, etc.), detectable enzyme (e.g. horse radish peroxidase (HRP), alkaline phosphatase etc.), a fluorescent dye (e.g., fluorescein isothiocyanate, Texas red, rhodamine, Cy3, Cy5, Bodipy, Bodipy Far Red, Lucifer Yellow, Bodipy 630/650-X, Bodipy R6G-X and 5-CR 6G, and the like), a colorimetric label such as colloidal gold or colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.), beads, or any other moiety capable of generating a detectable signal such as a colorimetric, fluorescent, chemiluminescent or electrochemiluminescent (ECL) signal.
Probes may be labeled directly or indirectly with a detectable moiety, or synthesized to incorporate the detectable moiety. In one embodiment, a detectable label is incorporated into a nucleic acid during at least one cycle of a cyclic polymerase-mediated amplification reaction. For example, polymerases can be used to incorporate fluorescent nucleotides during the course of polymerase-mediated amplification reactions. Alternatively, fluorescent nucleotides may be incorporated during synthesis of nucleic acid primers or probes. To label an oligonucleotide with the fluorescent dye, one of the conventionally known labeling methods can be used (Nature Biotechnology, 14, 303-308, 1996; Applied and Environmental Microbiology, 63, 1143-1147, 1997; Nucleic Acids Research, 24, 4532-4535, 1996). An advantageous probe is one labeled with a fluorescent dye at the 3′ or 5′ end and containing G or C as the base at the labeled end. If the 5′ end is labeled and the 3′ end is not labeled, the OH group on the C atom at the 3′-position of the 3′ end ribose or deoxyribose may be modified with a phosphate group or the like, although no limitation is imposed in this respect.
Spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means can be used to detect such labels. The detection device and method may include, but is not limited to, optical imaging, electronic imaging, imaging with a CCD camera, integrated optical imaging, and mass spectrometry. Further, the amount of labeled or unlabeled probe bound to the target may be quantified. Such quantification may include statistical analysis. In other embodiments the detection may be via conductivity differences between concordant and discordant sites, by quenching, by fluorescence perturbation analysis, or by electron transport between donor and acceptor molecules.
In yet another embodiment, detection may be via energy transfer between molecules in the hybridization complexes in PCR or hybridization reactions, such as by fluorescence energy transfer (FET) or fluorescence resonance energy transfer (FRET). In FET and FRET methods, one or more nucleic acid probes are labeled with fluorescent molecules, one of which is able to act as an energy donor and the other of which is an energy acceptor molecule. These are sometimes known as a reporter molecule and a quencher molecule respectively. The donor molecule is excited with a specific wavelength of light for which it will normally exhibit a fluorescence emission wavelength. The acceptor molecule is also excited at this wavelength such that it can accept the emission energy of the donor molecule by a variety of distance-dependent energy transfer mechanisms. Generally the acceptor molecule accepts the emission energy of the donor molecule when they are in close proximity (e.g., on the same, or a neighboring molecule). FET and FRET techniques are well known in the art. See for example U.S. Pat. Nos. 5,668,648, 5,707,804, 5,728,528, 5,853,992, and 5,869,255 (for a description of FRET dyes), Tyagi et al. Nature Biotech. vol. 14, p 303-8 (1996), and Tyagi et al., Nature Biotech. vol 16, p 49-53 (1998) (for a description of molecular beacons for FET), and Mergny et al. Nucleic Acid Res. vol 22, p 920-928, (1994) and Wolf et al. PNAS vol 85, p 8790-94 (1988) (for general descriptions and methods for FET and FRET), each of which is hereby incorporated by reference.
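By way of non-limiting illustration, the distance dependence of such energy transfer is commonly described by the Förster relation, in which transfer efficiency falls off with the sixth power of the donor-acceptor separation. The following Python sketch evaluates that relation. The function name and the example distances are hypothetical and are not taken from the references cited above.

```python
def fret_efficiency(distance_nm: float, forster_radius_nm: float) -> float:
    """Return the Forster transfer efficiency E = 1 / (1 + (r / R0)**6).

    distance_nm       -- donor-acceptor separation r (hypothetical example input)
    forster_radius_nm -- Forster radius R0, the separation at which E = 0.5
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Forster radius must be > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


if __name__ == "__main__":
    # Illustrative separations around an assumed Forster radius of 5 nm.
    for r in (2.5, 5.0, 10.0):
        print(f"r = {r:4.1f} nm -> E = {fret_efficiency(r, 5.0):.3f}")
```

Under this relation, efficiency is one half at a separation equal to the Förster radius and drops steeply beyond it, which is consistent with the requirement that the reporter and quencher be in close proximity.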
The probes for use in an array of the invention may be greater than 40 nucleotides in length and may be isothermal.
In some embodiments, the probes, array of probes or set of probes will be immobilized on a support. Supports (e.g., solid supports) can be made of a variety of materials, such as glass, silica, plastic, nylon or nitrocellulose. Supports are preferably rigid and have a planar surface. Supports typically have from about 1-10,000,000 discrete spatially addressable regions, or cells. Supports having about 10-1,000,000 or about 100-100,000 or about 1000-100,000 cells are common. The density of cells is typically at least about 1000, 10,000, 100,000 or 1,000,000 cells within a square centimeter. In some supports, all cells are occupied by pooled mixtures of probes or a set of probes. In other supports, some cells are occupied by pooled mixtures of probes or a set of probes, and other cells are occupied, at least to the degree of purity obtainable by synthesis methods, by a single type of oligonucleotide.
Arrays of probes or sets of probes may be synthesized in a step-by-step manner on a support or can be attached in presynthesized form. One method of synthesis is VLSIPS™ (as described in U.S. Pat. No. 5,143,854 and EP 476,014), which entails the use of light to direct the synthesis of oligonucleotide probes in high-density, miniaturized arrays. Algorithms for design of masks to reduce the number of synthesis cycles are described in U.S. Pat. No. 5,571,639 and U.S. Pat. No. 5,593,839. Arrays can also be synthesized in a combinatorial fashion by delivering monomers to cells of a support by mechanically constrained flowpaths, as described in EP 624,059. Arrays can also be synthesized by spotting reagents on to a support using an ink jet printer (see, for example, EP 728,520).
Data analysis is also an important part of an experiment involving arrays. The raw data from an array experiment typically are images, which need to be transformed into matrices—tables where rows represent, for example, genes, columns represent, for example, various samples such as tissues or experimental conditions, and numbers in each cell for example characterize the expression of a particular sequence (for example, a second sequence that has ligated to the first (target) nucleotide sequence) in the particular sample. These matrices have to be analyzed further, if any knowledge about the underlying biological processes is to be extracted. Methods of data analysis (including supervised and unsupervised data analysis as well as bioinformatics approaches) are disclosed in Brazma and Vilo J (2000) FEBS Lett 480(1):17-24.
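By way of non-limiting illustration, the transformation of per-spot measurements into such a matrix can be sketched as follows in Python. The record layout, the function name and the intensity values are hypothetical, and image quantification is assumed to have been performed upstream.

```python
from typing import List, Optional, Tuple

Record = Tuple[str, str, float]  # (gene, sample, quantified intensity)


def build_expression_matrix(
    records: List[Record],
) -> Tuple[List[str], List[str], List[List[Optional[float]]]]:
    """Arrange quantified spot intensities as rows = genes, columns = samples.

    Missing gene/sample combinations are left as None rather than guessed.
    """
    genes = sorted({gene for gene, _, _ in records})
    samples = sorted({sample for _, sample, _ in records})
    lookup = {(gene, sample): value for gene, sample, value in records}
    matrix = [[lookup.get((gene, sample)) for sample in samples] for gene in genes]
    return genes, samples, matrix


if __name__ == "__main__":
    # Hypothetical intensities; the gene names are used only as row labels.
    demo = [
        ("MED12", "control", 1200.0),
        ("MED12", "knockdown", 300.0),
        ("SMARCE1", "control", 800.0),
    ]
    genes, samples, matrix = build_expression_matrix(demo)
    print(samples)
    for gene, row in zip(genes, matrix):
        print(gene, row)
```

Keeping missing cells explicit allows downstream supervised or unsupervised analysis to decide how to handle them.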
The materials for use in the methods of the present invention are ideally suited for preparation of kits. Oligonucleotides may be provided in containers that can be in any form, e.g., lyophilized, or in solution (e.g., a distilled water or buffered solution), etc. In one aspect of the present invention, there is provided a kit comprising a set of probes as described herein, an array and optionally one or more labels. In another aspect, there is provided an RT-MLPA kit comprising a set of reverse transcriptase primers as described herein, and appropriate ligases, buffers, and PCR primers. In the kits of the invention, a set of instructions will also typically be included.
The oligonucleotide primers and probes of the present invention have commercial applications in prognostic kits for the detection of the expression level of a gene, such as a MEDIATOR complex and/or SWI/SNF complex gene, in the tumor cells of a patient. A test kit according to the invention may comprise any of the oligonucleotide primers or probes according to the invention. Such a test kit may additionally comprise one or more reagents for use in cyclic polymerase mediated amplification reactions, such as DNA polymerases, nucleotides (dNTPs), buffers, and the like. A kit according to the invention may also include, for example, a lysing buffer for lysing cells contained in the specimen.
A test kit according to the invention may comprise a pair of oligonucleotide primers according to the invention and a probe comprising an oligonucleotide according to the invention. Advantageously, the kit further comprises additional means, such as reagents, for detecting or measuring the binding of the primers and probes of the present invention, and also ideally a positive and negative control.
The invention will now be further described by way of the following non-limiting examples.
The ALK inhibitors crizotinib and NVP-TAE684 potently inhibit the human NSCLC cell lines that harbor EML4-ALK translocations (Galkin et al., 2007; Koivunen et al., 2008; Soda et al., 2007). The NSCLC cell line H3122 carries the EML4-ALK translocation and is exquisitely sensitive to ALK inhibitors. To identify novel determinants of resistance to ALK inhibitors in NSCLC cell lines, Applicants performed a large-scale RNAi-based loss-of-function genetic screen using a collection of 24,000 short hairpin (shRNA) vectors targeting 8,000 human genes (Berns et al., 2004; Brummelkamp et al., 2002). Applicants used a barcoding technology to identify genes whose suppression causes resistance to ALK inhibitors (Brummelkamp et al., 2006; Holzel et al.). The entire shRNA library was introduced into H3122 cells by retroviral infection and cells were plated at low density with or without ALK inhibitors (
The MED12 gene encodes a component of the large Mediator complex (~2 MDa) that contains at least 33 different subunits and associates with RNA polymerase II at the promoters of genes (Malik and Roeder). Thereby, the Mediator complex is involved in transcriptional regulation. Initially it was thought that the Mediator complex is exclusively required for active transcription of genes, but recent studies suggest additional and broader roles in transcriptional regulation, such as epigenetic silencing. In particular, MED12 was implicated in contributing to silencing of neuronal genes in non-neuronal cells by the recruitment of the H3K9 histone methyltransferase EHMT2 (G9a) in a REST dependent manner (Ding et al., 2008). Interestingly, mutations in MED12 are causal for some rare mental retardation syndromes and aberrant gene regulation might contribute to the phenotypic manifestations of these diseases (Risheg et al., 2007; Schwartz et al., 2007). In general, only a few studies have addressed the specific function of individual components of the Mediator complex.
ARID1A and SMARCE1 are both components of the SWI/SNF chromatin-remodeling complex (Reisman et al., 2009). The SWI/SNF complex is also a large multi-subunit complex that contains two mutually exclusive but non-redundant subunits with ATPase activity. The ATPases SMARCA2 (BRM1) and SMARCA4 (BRG1) are required for the ATP dependent re-positioning of histones within the chromatin. This ATP-dependent chromatin remodeling activity impacts diverse chromatin related biological processes such as gene transcription and DNA repair. The SWI/SNF complex is conserved throughout evolution from yeast to man. Hence, it is remarkable that several subunits of the SWI/SNF complex have been identified as tumor suppressors. Deletions of SMARCB1 (INI1, BAF47) are found in malignant rhabdoid tumors, a highly aggressive childhood cancer (Versteege et al., 1998). Inactivating truncating mutations of ARID1A and PBRM1 were found in more than 50% and 40% of clear cell ovarian and renal cancer, respectively (Jones et al.; Varela et al.). SMARCA4 (BRG1) is frequently mutated in NSCLC cell lines, but also in primary tumors (Medina et al., 2008; Rodriguez-Nieto et al.). In conclusion, there is substantial evidence in the literature that specific components of the SWI/SNF complex function as tumor suppressors in a tumor type dependent manner, but the molecular basis of this selectivity remains unknown.
Validation of shRNA Barcode Screen Results
To validate the results of their screen, Applicants individually introduced the respective knockdown vectors from the NKI shRNA library against MED12 (#1 and #2), ARID1A and SMARCE1 into H3122 cells by retroviral infections and confirmed that all four shRNA vectors confer resistance to the ALK inhibitors crizotinib and NVP-TAE684 in H3122 cells (
To validate MED12 as a gene whose suppression confers resistance to crizotinib, Applicants individually introduced the two MED12 shRNA vectors (#1 and #2) from the library and one newly generated shRNA (#3) into H3122 cells by retroviral infection. Empty vector (pRS) or shRNA targeting GFP (shGFP) served as controls throughout the study. All three distinct MED12 knockdown vectors conferred resistance to both crizotinib and the second ALK inhibitor NVP-TAE684 in long-term colony formation assays (
Next, Applicants validated that ARID1A and SMARCE1 are on-target hits causally involved in the resistance to ALK inhibitors. As Applicants have only identified single shRNAs (shARID1A#1, shSMARCE1#1) against these genes from the barcode screen, they generated additional non-overlapping shRNAs against ARID1A and SMARCE1 (shARID1A#2, shSMARCE1#2) and introduced them into H3122 cells by retroviral infection. The independent shRNAs recapitulated the resistance to ALK inhibitors (
Next, Applicants introduced silent mutations into a human SMARCE1 cDNA expression construct and thereby generated two separate shRNA resistant (non-degradable, ND) forms of SMARCE1 (SMARCE1-ND) that cannot be targeted by shSMARCE1#1 and shSMARCE1#2. H3122 cells stably infected with pRS, shSMARCE1#1 or #2 were super-infected with retroviral expression constructs encoding for the respective non-degradable forms of SMARCE1 or the pMx empty control vector. Reconstitution of SMARCE1 restored sensitivity of SMARCE1 knockdown cells to ALK inhibitors (
Next, Applicants addressed the context dependency of their findings by studying independent NSCLC cell lines. The RAS/PI3K signaling cascade is a common denominator of all activated tyrosine kinases in NSCLC such as the EGFR (Pao and Chmielecki). Therefore, Applicants hypothesized that loss of MED12, SMARCE1 and ARID1A might also confer resistance to other tyrosine kinase inhibitors in cell lines that harbor respective activating mutations or amplifications.
NSCLC with activating mutations of the EGFR can be effectively treated with the EGFR inhibitors gefitinib and erlotinib. Several NSCLC cell lines with EGFR mutations (PC9, H3255) were identified that are exquisitely sensitive to gefitinib and erlotinib at low nanomolar concentrations. Applicants introduced MED12 specific shRNAs (shMED12_TRC#3 and #5) into PC9 cells (EGFRdelE746-A750). Suppression of MED12 rendered PC9 cells insensitive to the EGFR inhibitor gefitinib (
Next, Applicants asked whether ARID1A determines sensitivity to tyrosine kinase inhibitors in multiple NSCLC cell lines (context dependency). Applicants introduced the retroviral shRNA vectors against ARID1A (#1 and #2) or control vectors (pRS and shGFP) into PC9 (EGFRdelE746-A750) and H1993 (MET-amplified) cells (
Next, Applicants addressed whether SMARCE1 is also a determinant of tyrosine kinase inhibitor sensitivity in multiple NSCLC cell lines (context dependency). PC9 (EGFRdelE746-A750), H1993 (MET-amplified) and EBC-1 (MET-amplified) cells were stably infected with the retroviral shRNA constructs pRS, shSMARCE1#1 and #2 and were treated with the EGFR inhibitor gefitinib (PC9) or the MET inhibitor crizotinib (H1993, EBC-1). In all cases, suppression of SMARCE1 conferred resistance to the respective inhibitors (
Constitutive signaling from mutated receptor tyrosine kinases such as EGFR leads to activation of the RAS small GTP-binding proteins (KRAS, HRAS, NRAS). In particular, KRAS is one of the most frequently mutated genes in a variety of cancers including NSCLC. RAS mutations impair the intrinsic GTPase activity and therefore prevent the conversion of the active GTP-bound form into the inactive GDP-bound form (Karnoub and Weinberg, 2008). Introduction of constitutively active alleles of RAS into NSCLC cell lines renders them insensitive to tyrosine kinase inhibitors (data not shown). Therefore, inhibition of RAS is a key mechanism underlying the efficacy of tyrosine kinase inhibitors. Applicants reasoned that direct negative regulators of RAS proteins might be critical determinants of sensitivity to tyrosine kinase inhibitors in NSCLC cell lines. The human genome encodes 14 putative RAS-GTPase activating proteins (RAS-GAPs) that stimulate the GTPase activity of RAS proteins and promote the conversion of active GTP-loaded RAS into the inactive GDP-loaded form (Bernards, 2003). Applicants retrieved shRNAs covering the 14 putative human RAS-GAPs from the TRC shRNA collection and all shRNAs targeting the same gene were pooled together. Applicants infected PC9 cells with the 14 RAS-GAP pools in addition to the control vectors pLKO and shGFP. The cells were plated at low density and treated with the two EGFR inhibitors gefitinib and erlotinib or left untreated (
First, Applicants focused on the RAS-GAPs DAB2IP and NF1. NF1 is a bona fide tumor suppressor mutated in several cancers and also causal for the hereditary disease neurofibromatosis type I, a benign tumor syndrome with strong predisposition to several malignant cancers (Cichowski and Jacks, 2001). DAB2IP plays an important role in prostate cancer and loss of its expression is associated with an aggressive metastatic disease (Min et al.). To validate the results of Applicants' focused shRNA mini-screen, Applicants individually introduced the five DAB2IP shRNAs from the TRC shRNA collection into PC9 cells (
Given that loss of MED12 or SMARCE1 causes resistance to multiple tyrosine kinase inhibitors in NSCLC cell lines, Applicants asked whether the activity of downstream components of receptor tyrosine kinase signaling is altered. ERK is a key downstream component, and its phosphorylation status, which can be determined with antibodies specific for the phosphorylated form of ERK, correlates positively with its activation. Applicants infected H3122 cells with two independent control shRNA vectors or shRNAs targeting either MED12 or SMARCE1 and confirmed loss of MED12 or SMARCE1 protein by immunoblotting (
Applicants' finding that MED12 suppression confers resistance to both ALK and EGFR inhibitors in NSCLCs suggests that MED12 might act on a critical pathway downstream of both ALK and EGFR. As pointed out above, RAS signaling is downstream of all activated RTKs in NSCLC (Pao and Chmielecki, 2010). Applicants first asked which components of the RAS pathway could cause resistance to RTK inhibition in H3122 and PC9 cells by expressing active alleles of these genes (
If suppression of MED12 leads to ERK activation, one would expect that MED12 loss might also confer resistance to other cancer drugs targeting the MAPKs upstream of ERK. The small molecule drug PLX4032 (vemurafenib) has proven to be very effective in the treatment of melanoma with BRAFV600E mutations and the MEK inhibitor AZD6244 (selumetinib) is being tested in clinical trials for the treatment of several cancers. A375 melanoma cells harboring the BRAFV600E mutation are highly sensitive to PLX4032 and AZD6244. Consistent with Applicants' observations made in NSCLC models, Applicants found that suppression of MED12 in A375 cells caused ERK activation (
To extend their findings even further, Applicants asked whether MED12 also confers resistance to a class of multi-kinase inhibitors. Sorafenib targets multiple tyrosine kinases and RAF kinases and is used clinically to treat advanced renal cell carcinoma and hepatocellular carcinoma (HCC). HCC Huh-7 cells are sensitive to sorafenib, but became resistant after knockdown of MED12 (
As a first step in expanding Applicants' finding in NSCLC, they examined the potential role of MED12 in drug responses to BRAF and MEK inhibitors in BRAFV600E melanomas where activation of ERK is a common feature of resistant tumors. Since MED12 knockdown leads to higher levels of ERK phosphorylation in NSCLC cells, Applicants asked if MED12 is also critical for drug responses to BRAF and MEK inhibitors in BRAFV600E melanoma cells. A375 (BRAFV600E) melanoma cells stably expressing the retroviral shRNA constructs pRS, shGFP, shSMARCE1#1 and #2 were treated with the BRAFV600E inhibitor PLX4720 or MEK inhibitor PD-0325901. In all cases, suppression of MED12 conferred resistance to the respective inhibitors (
In addition, Applicants observed similar effects in the melanoma cell line SK-MEL-28, which expresses BRAFV600E. In particular, Applicants demonstrate that downregulation of MED12 induces resistance to the BRAF inhibitor PLX4032 in SK-MEL-28 cells.
shRNA Barcode Screen
The human NKI shRNA library was used and the barcode screen was performed as described (Berns et al., 2004; Brummelkamp et al., 2006). Additional details can be found at http://www(dot)screeninc(dot)nki(dot)nl.
Single cell suspensions of the lung cancer cell lines were seeded into 6-well plates (2×10^4 cells/well) and cultured both in the absence and presence of the ALK inhibitors. At the endpoints of colony formation assays, cells were fixed with formaldehyde, stained with crystal violet (0.1% w/v) and photographed. All relevant assays were performed independently at least three times. All knockdown and overexpression experiments were done by retroviral or lentiviral infections.
H3122, PC9, H1993, EBC-1, H3255, SK-CO-1, and SW1417 cells were cultured in RPMI with 8% heat-inactivated fetal bovine serum, penicillin and streptomycin at 5% CO2. 293T, Phoenix, A375, SK-MEL-28, and Huh-7 cells were cultured in DMEM with 8% heat-inactivated fetal bovine serum, penicillin and streptomycin at 5% CO2. Subclones of each NSCLC cell line expressing the murine ecotropic receptor were generated and used for all experiments shown. Retroviral infections were performed using Phoenix cells as producers of retroviral supernatants using 2.5-3 μg of plasmid DNA as described (http://www(dot)stanford(dot)edu/group/nolan/retroviral systems/phx(dot)html). 293T cells were used as producers of lentiviral supernatants by co-transfecting 3rd generation lentiviral packaging constructs (2 μg of plasmid DNA) along with the pLKO shRNA vectors (2 μg of plasmid DNA). For transfections of 293T cells, Applicants seeded 1.8×10^6 cells in a 6-well dish in the morning and transfected the cells 6-8 hours later. For transfections of Phoenix cells, Applicants seeded 1.0×10^6 cells in a 6-well dish in the morning and transfected the cells 6-8 hours later. Cells were refreshed the next day in the morning and afternoon. Viral supernatant was harvested the day thereafter for infections of the target cells. The calcium phosphate method was used for the transfection of Phoenix and 293T cells. Infected NSCLC cells were selected for successful retroviral integration using 2 μg/ml of puromycin.
Crizotinib (S1068), NVP-TAE684 (S1108), gefitinib (S1025), erlotinib (S1023), PLX4032 (S1267) and AZD6244 (S1008) were purchased from Selleck Chemicals. The TRC human genome-wide shRNA collection (TRC-Hs1.0) was purchased from Open Biosystems (Huntsville, USA). Further information is available at http://www(dot)broad(dot)mit(dot)edu/genome bio/trc/rnai(dot)html. Antibodies against MED12 (A300-774A), SMARCE1 (A300-810A), DAB2IP (A302-439A) and NF1 (A300-140A) were from Bethyl Laboratories; the antibody against Vimentin (RV202) was from Abcam; the antibody against N-cadherin (ab18203) was from Cell Signaling; antibodies against NF1 (SC-67), HSP90 (H-14), p-ERK (E-4), ERK1 (C-16), ERK2 (C-14), CDK8 (D-9), Lamin A/C (636), SP1 (PEP2) and α-TUBULIN (H-183) were from Santa Cruz Biotechnology; the antibody against ARID1A (H00008289-M01) was from Abnova. A mixture of ERK1 and ERK2 antibodies was used for detection of total ERK.
All retroviral shRNA vectors were generated by ligating synthetic oligonucleotides (Invitrogen) against the target genes into the pRetroSuper (pRS) retroviral vector as described (Brummelkamp et al., 2002). The following RNAi target sequences were used for this study.
All lentiviral shRNA vectors (TRCN number) were retrieved from the arrayed human TRC shRNA library. Additional information about the shRNA vectors can be found at http://www.broadinstitute.org/rnai/public/clone/search using the TRCN number.
The mouse Med12 expression constructs were generated by the following steps:
1), A linker containing the first 89 bp of the Med12 open reading frame (ORF) and multiple restriction sites was cloned into the pcDNA3.1(+) vector using the NheI and BamHI restriction sites and was sequence verified. The oligo sequence of the top strand of the linker is CTAGCTCGAGTCGACCATGGCGGCTTTCGGGATCTTGAGCTATGAACACCGACCCCTGAAGCGGCTGCGGCTGGGGCCTCCCGATGTGTACCCTCAG and that of the bottom strand is GATCCTGAGGGTACACATCGGGAGGCCCCAGCCGCAGCCGCTTCAGGGGTCGGTGTTCATAGCTCAAGATCCCGAAAGCCGCCATGGTCGACTCGAG.
2), A PCR fragment of partial Med12 (from 89 to 1777 bp) was generated using a forward primer (CAGGATCCCAAACAGAAGGAGGATGAACTGACGGCTTTGAATGTAA), a reverse primer (TGGGAGAAGACATCATGTCG) and a Med12 partial cDNA as the template (IMAGE id: 6830443). This PCR fragment was then cloned into the pcDNA3.1(+)-Med12 (first 89 bp) vector described in step 1 using the BamHI and EcoRI restriction sites and was sequence verified. Note that a silent mutation (A to G) at 81 bp of the Med12 ORF was introduced in the forward PCR primer to generate a BamHI site in the PCR fragment.
3), An EcoRI/NotI fragment (containing from 1778 to 6573 bp of Med12 ORF) from the Med12 partial cDNA (IMAGE id: 6830443) was cloned into the pcDNA3.1 (+)-Med12 (first 1777 bp) described above by EcoRI and NotI restriction sites to generate the pcDNA3.1(+)-Med12 (full-length).
4), The XhoI/NotI fragment containing the full-length Med12 ORF from pcDNA3.1(+)-Med12 was then cloned into the retroviral expression vector pMX-IRES-blasticidine using the XhoI and NotI restriction sites.
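By way of non-limiting illustration, the linker oligonucleotides of step 1 can be checked in silico before ordering. The Python sketch below tests whether the two strands anneal with perfect complementarity outside their 5′ ends and reports the resulting four-nucleotide 5′ overhangs, which would be expected to be compatible with NheI-cut (CTAG) and BamHI-cut (GATC) vector ends. The sequences are those given in step 1 with extraction whitespace removed. The function names and the fixed overhang length of four are assumptions made for the sketch.

```python
from typing import Optional, Tuple

# Watson-Crick complement table for unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Linker strands from step 1, each written 5' to 3'.
TOP = (
    "CTAGCTCGAGTCGACCATGGCGGCTTTCGGGATCTTGAGCTATGAACACCGACCC"
    "CTGAAGCGGCTGCGGCTGGGGCCTCCCGATGTGTACCCTCAG"
)
BOTTOM = (
    "GATCCTGAGGGTACACATCGGGAGGCCCCAGCCGCAGCCGCTTCAGGGGTCGGT"
    "GTTCATAGCTCAAGATCCCGAAAGCCGCCATGGTCGACTCGAG"
)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def annealed_overhangs(top: str, bottom: str, overhang: int = 4) -> Optional[Tuple[str, str]]:
    """Return the 5' overhangs (top, bottom) if the strands pair perfectly
    outside those overhangs, otherwise None."""
    if reverse_complement(bottom[overhang:]) != top[overhang:]:
        return None
    return top[:overhang], bottom[:overhang]


if __name__ == "__main__":
    print(annealed_overhangs(TOP, BOTTOM))
```

A result of None would indicate a mismatch between the strands, for example from a transcription error in one of the sequences.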
The human SMARCE1 expression construct and the non-degradable (ND) forms thereof were generated by PCR amplifying SMARCE1 from H3122 cDNA using the following primers:
Reverse, GAATAAGTGTTGCCTTGTTTTGTGCTCGAGACTG. The fragment was cloned into the retroviral expression vector pMX-IRES-blasticidine using the EcoRI and XhoI restriction sites in the multiple cloning site and sequence verified. The SMARCE1-ND that is resistant against shSMARCE1#1 was generated by site directed mutagenesis using the following primer pair:
The SMARCE1-ND that is resistant against shSMARCE1#2 was generated by site directed mutagenesis using the following primer pair:
Reverse, GAGATTGTCTCTGTCGGCTCTCCTCCTCTAAAGCAGCTTC. Both SMARCE1-ND clones were sequence verified.
Retroviral expression constructs (pBabe) for KRASG12V (#12544), MEK-DD (#15268), RALAQ75L (#19719), RALBQ72L (#19721), PIK3CAH1047R (#12524) and pCMV5BTGFbeta receptor II (#24801) were obtained from Addgene and sequence validated. The pBabe-BRAFV600E plasmid was a kind gift of Daniel Peeper. The cDNA encoding Myr-AKT was cloned into pBabe-puro and validated by sequencing. These active alleles of RAS effector pathways were also described previously (Holzel et al., 2010).
Quantitative RT-PCR (qRT-PCR)
qRT-PCR assays were carried out to measure mRNA levels of genes using the 7500 Fast Real-Time PCR System (Applied Biosystems). Total RNA was isolated using Trizol (Invitrogen) and 1 μg of total RNA was used for cDNA synthesis using SuperScript II reverse transcriptase (Invitrogen) and random hexamer primers (Invitrogen). Relative mRNA levels of each gene shown were normalized to the expression of the housekeeping gene GAPDH. The sequences of the primers for assays using SYBR® Green master mix (Roche) are listed below (h, human; m, mouse).
The studies described herein show that suppression of MED12 leads to ERK activation and thus confers what in some embodiments is a “multi targeted-drug resistance” phenotype. To gain further mechanistic insights, Applicants set out to screen a lentiviral shRNA library representing the full complement of 518 human kinases (the “kinome”, (Manning et al., 2002)) and 17 additional kinase-related genes (
Next, Applicants asked whether activation of TGFβ signaling alone is sufficient to cause resistance to the cancer drugs studied above. In the absence of exogenous TGFβ, proliferation of the H3122 cells was greatly inhibited by crizotinib. In contrast, cells treated with TGFβ in combination with crizotinib continued to proliferate in a TGFβ-dosage dependent manner (
Recombinant TGFβ treatment also conferred resistance to EGFR inhibitors in PC9 and H3255 NSCLC cells (
The fact that TGFβ signaling is required for the drug resistance driven by MED12 suppression and that activation of TGFβ signaling phenocopies MED12KD in mediating drug resistance suggested that MED12 can act as a suppressor of TGFβ signaling. Applicants explored this possibility by studying differential gene expression by unbiased transcriptome sequencing analysis using next generation sequencing (RNA-Seq) for the same panel of cell lines tested above (H3122, PC9, SK-CO-1, A375 and Huh-7), for both the parental cells and multiple MED12KD derivatives thereof. The genes deregulated by MED12KD (>2 fold) in at least three out of five cell lines used are listed in
To further elucidate the molecular mechanism by which MED12 suppresses TGFβ signaling, Applicants studied the effect of knockdown of MED12 on expression and activation of key components of the TGFβ signaling pathway. Strikingly, Applicants found that suppression of MED12 resulted in a strong induction of TGFβR2 protein levels in H3122 and PC9 cells (
Since MED12 is part of the MEDIATOR transcriptional complex that functions in the nucleus, Applicants assumed that MED12 would act on TGFβR2 through a transcriptional step. However, there was only a marginal increase of TGFβR2 mRNA upon MED12 knockdown (
The observation of the cytoplasmic localization of MED12 prompted Applicants to examine a potential physical interaction between MED12 and TGFβR2. Since low expression of endogenous TGFβRs in most cell types hinders the study of physical interaction with TGFβRs, Applicants performed co-immunoprecipitation experiments using Phoenix cells cotransfected with TGFβR2 and MED12. As indicated in
As described above, MED12 suppression leads to activation of TGFβ signaling and expression of mesenchymal markers, suggestive of a partial EMT-like process. Recently, EMT has been identified as a program in human CRC that correlates with poor prognosis (Loboda et al., 2011). Applicants therefore asked whether MED12KD indeed induces an EMT-like process and whether the processes induced by MED12KD are likewise associated with poor survival in CRC.
Applicants first compared the 237 genes that were upregulated in the MED12KD signature (as described herein;
To further substantiate Applicants' finding that MED12 suppression confers resistance to cancer drugs targeting the MEK-ERK pathway downstream of RTKs, Applicants asked if the MED12KD signature could predict responses to MEK inhibitors in a large and heterogeneous panel of cancer cell lines of different tissue types. Since MEK inhibitors are currently being evaluated for the treatment of tumors having activating mutations in RAS or BRAF, Applicants focused their studies on 152 tumor cell lines harboring either RAS or BRAF mutations for which the IC50 values of four different MEK inhibitors and gene expression patterns have been determined (
Applicants have demonstrated that TGFβ activation by either MED12 loss or recombinant TGFβ stimulation confers resistance to multiple targeted cancer drugs in a range of cancer types. It is therefore of potential clinical relevance to explore new treatment strategies to target drug resistant tumors having acquired elevated TGFβ signaling. Since inhibition of TGFβR2 by RNAi re-sensitized MED12KD NSCLC cells to TKIs (
To test this concept, Applicants cultured control or MED12KD H3122 cells in the absence and the presence of crizotinib, the TGFβR inhibitor LY2157299 or the combination of crizotinib and LY2157299 (
Pooled “Dropout” shRNA Screen
A Kinome shRNA library targeting the full complement of 518 human kinases and 17 kinase-related genes was constructed from the TRC human genome-wide shRNA collection (TRCHs1.0). The Kinome library was used to generate pools of lentiviral shRNA to infect H3122 cells stably expressing shMED12. Cells were cultured in the presence or absence of crizotinib. Massively parallel sequencing was applied to determine the abundance of each shRNA in the cells. shRNAs prioritized for further analysis were selected by their fold depletion upon crizotinib treatment.
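By way of non-limiting illustration, the selection step can be sketched as follows, assuming that per-shRNA abundances are available for untreated and crizotinib-treated cells. The function name `fold_depletion`, the pseudocount and the shRNA identifiers in the usage note are hypothetical and are not part of the protocol described herein.

```python
def fold_depletion(untreated, treated, pseudo=1e-6):
    """Return {shRNA: untreated/treated abundance ratio}, highest ratio first.

    The small pseudocount is an assumption of this sketch; it avoids division
    by zero when an shRNA is completely lost under drug treatment.
    """
    ratios = {
        sh: (untreated[sh] + pseudo) / (treated.get(sh, 0.0) + pseudo)
        for sh in untreated
    }
    # Sort so that the most strongly depleted shRNAs come first.
    return dict(sorted(ratios.items(), key=lambda kv: kv[1], reverse=True))
```

For example, calling `fold_depletion({"shA": 0.010, "shB": 0.010}, {"shA": 0.001, "shB": 0.010})` ranks the hypothetical `shA` ahead of `shB`, because only `shA` loses abundance in the treated population.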
Cells were seeded into 6-well plates (2-5×10⁴ cells/well) and cultured both in the absence and presence of drugs as indicated. More details are described in Huang et al., 2009. All knockdown and overexpression experiments were done by retroviral or lentiviral infection. All relevant assays were performed independently at least three times.
Transcriptome sequencing analysis of cell lines was performed using RNA-Seq. To rule out “off-target” effects, Applicants considered genes that are significantly deregulated in the same direction by two independent shMED12 vectors. The MED12KD gene signature was then assembled from genes that were more than 2-fold up- or downregulated upon MED12 knockdown in at least three out of five cell lines. This signature was employed to hierarchically cluster a dataset consisting of gene expression data for 231 CRC tumor samples. Differences in disease-specific survival were determined using Kaplan-Meier statistics.
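A minimal sketch of the signature assembly rule is given below, assuming that log2 fold changes upon MED12 knockdown have already been computed per gene and per cell line. The function name, the input layout and the gene names in the usage note are hypothetical. The requirement of concordance between two independent shMED12 vectors is not modeled here.

```python
def med12kd_signature(log2fc, min_lines=3, threshold=1.0):
    """Select genes changed more than 2-fold (|log2FC| > 1) in the same
    direction in at least `min_lines` cell lines.

    `log2fc` maps gene -> {cell_line: log2 fold change upon knockdown}.
    Returns (upregulated_genes, downregulated_genes).
    """
    up, down = [], []
    for gene, per_line in log2fc.items():
        n_up = sum(v > threshold for v in per_line.values())
        n_down = sum(v < -threshold for v in per_line.values())
        if n_up >= min_lines:
            up.append(gene)
        elif n_down >= min_lines:
            down.append(gene)
    return up, down
```

With five cell lines and `min_lines=3`, a gene cannot satisfy both conditions at once, so each gene is placed in at most one of the two lists.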
An EMT signature was created by combining EMT expression signatures published by Taube et al. (Taube et al., 2010) and Loboda et al. (Loboda et al., 2011), and from the SABiosciences EMT PCR array (SABiosciences, Frederick, Md.). All genes were annotated as down- or upregulated during EMT according to the source. Genes with annotation of conflicting expression changes in several sources were excluded. All gene symbols were translated to probe set identifiers.
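The merging rule described above can be sketched as follows, assuming each source is represented as a mapping from gene symbol to the annotated direction of change during EMT. The function name and data layout are hypothetical, and the translation of gene symbols to probe set identifiers is not shown.

```python
def merge_emt_signatures(sources):
    """Merge {gene: 'up' | 'down'} annotations from several sources,
    dropping genes whose direction conflicts between sources."""
    merged, conflicting = {}, set()
    for annotation in sources:
        for gene, direction in annotation.items():
            if gene in merged and merged[gene] != direction:
                conflicting.add(gene)
            merged.setdefault(gene, direction)
    return {g: d for g, d in merged.items() if g not in conflicting}
```

A gene annotated as upregulated in one source and downregulated in another is removed entirely, while a gene that appears in only one source is retained with its annotation.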
Drug response data (IC50 values) and gene expression levels were obtained from COSMIC (Forbes et al., 2010) for 152 cell lines that have activating mutations in RAS or BRAF. The IC50 values were classified as sensitive or resistant and gene expression levels were classified as normal, up- or downregulated. For each pair of a gene and a MEK inhibitor an overlap enrichment test was applied to evaluate if significantly many cell lines were both upregulated for the gene and resistant to the MEK inhibitor. The number of significant associations within the MED12 signature gene set was counted and compared to 100,000 randomly drawn sets of the same size and variance distribution to evaluate the significance of the MED12 signature.
Subcellular fractionation experiments were performed according to the manufacturer's protocol using the NE-PER Nuclear and Cytoplasmic Extraction Kit (78835) purchased from Thermo Scientific.
shRNA “Dropout” Screen with a Custom TRC Kinome Library
Lentiviral plasmids (pLKO.1) encoding shRNA that target kinome candidates were listed in
Lentiviral supernatants were generated as described at http://www(dot)broadinstitute(dot)org/rnai/public/resources/protocols. H3122 cells stably expressing shMED12#3 were infected separately by the 7 virus pools (multiplicity of infection of 1). Cells were then pooled and plated at 300,000 cells per 15 cm dish in the absence or presence of 300 nM crizotinib (5 dishes for each condition) and the medium was refreshed twice per week for 10 days. Genomic DNA was isolated as described (Brummelkamp et al., 2006). shRNA inserts were retrieved from 8 μg genomic DNA by PCR amplification (PCR1 and PCR2, see below for primer information) using the following conditions: (1) 98° C., 30 s; (2) 98° C., 10 s; (3) 60° C., 20 s; (4) 72° C., 1 min; (5) to step 2, 15 cycles; (6) 72° C., 5 min; (7) 4° C. Indexes and adaptors for deep sequencing (Illumina) were incorporated into the PCR primers. 2.5 μl of PCR1 product was used as the template for the PCR2 reaction. PCR products were purified using the Qiagen PCR purification kit according to the manufacturer's manual. Samples were quantified by BioAnalyzer to ensure that samples generated under different conditions were pooled at the same molar ratio before being analyzed on an Illumina Genome Analyzer.
The shRNA stem sequence was extracted from each sequencing read and aligned to the TRC library. The matched reads were counted and the counts were transformed to an abundance that was assigned to the corresponding shRNA.
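The read-processing step can be illustrated with the following sketch. It assumes that the stem occupies a fixed window in each read, that exact matching stands in for alignment, and that abundance is expressed as the fraction of matched reads; none of these details is specified above, and the function and parameter names are hypothetical.

```python
from collections import Counter


def shrna_abundance(reads, library, start=0, length=21):
    """Count reads whose stem matches a library shRNA and return fractions.

    `library` maps stem sequence -> shRNA identifier. `start` and `length`
    locate the stem within a read and are assumptions of this sketch.
    """
    counts = Counter()
    for read in reads:
        stem = read[start:start + length]
        if stem in library:  # exact match used in place of alignment
            counts[library[stem]] += 1
    total = sum(counts.values())
    return {sh: n / total for sh, n in counts.items()} if total else {}
```

Reads without a matching stem are ignored, so the returned fractions sum to one over the shRNAs that were actually detected.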
Primers used are as follows:
Total mRNA of each sample was converted into a library of template molecules suitable for subsequent cluster generation using the reagents provided in the Illumina® TruSeq™ RNA Sample Preparation Kit, following the manufacturer's protocol. Sequence reads were generated using Illumina HiSeq 2000 with TruSeq™ v3 reagent kits and software. The reads (between 20-45 million 50 bp paired-end reads per sample) were mapped to the human reference genome (build 37) using TopHat (v. 1.3.1, (Trapnell et al., 2009)), which allows reads to span exon-exon splice junctions. The open-source tool HTSeq-count (v. 0.5.3p3), available from EMBL, was then used to generate a list of the total number of uniquely mapped reads (between 16-33 million pairs of reads per sample) for each gene that is present in the provided Gene Transfer Format (GTF) file.
In order to determine which genes are differentially expressed between samples, the R package DEGseq (Wang et al., 2010) was used, which takes the output of HTSeq-count as input. The method used to identify differentially expressed genes is the MA-plot-based method with technical replicates (MATR), which makes use of the presence of technical replicates. Genes that had no expression in any sample in the comparison were discarded from the dataset. The expression levels of all remaining genes in the dataset were incremented by 1 in order to avoid negative values after log2 transformation. Normalization for the number of reads is performed within this method and the cut-off for differentially expressed genes is based on a p-value of 0.05.
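For orientation only, the quantities plotted in a standard MA plot can be computed as sketched below, including the addition of 1 before log2 transformation. This is a generic illustration with a hypothetical function name; it does not reproduce the internal normalization or statistical testing of DEGseq.

```python
import math


def ma_values(count_1, count_2):
    """Return (M, A) for one gene from two read counts, after adding 1.

    M is the log2 ratio and A is the mean log2 intensity.
    """
    x = math.log2(count_1 + 1)
    y = math.log2(count_2 + 1)
    return x - y, (x + y) / 2
```

Because 1 is added to every count, a gene with zero reads in one sample yields log2(1) = 0 rather than an undefined value.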
Gene expression datasets GSE14333 (Jorissen et al., 2009), GSE17536 and GSE17537 (Smith et al., 2010) were downloaded from the Gene Expression Omnibus (Barrett et al., 2011).
Samples duplicated between GSE14333 and GSE17536 were removed from GSE14333, resulting in a final dataset comprising 389 tumor samples. Expression data were first normalized together using the RMA method as implemented in the affy package (Gautier et al., 2004) for R/Bioconductor (Gentleman et al., 2004) and then mean-centered separately for each dataset. The hclust method was employed for hierarchically clustering the samples based on the MED12KD signature and Pearson correlation distance. The survival and Design packages were used for performing a Kaplan-Meier survival time analysis and plotting survival curves, respectively.
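As a hedged illustration of the distance measure, the sketch below computes one common form of Pearson correlation distance, 1 − r, between two expression profiles. Whether this exact form was used is an assumption, the function name is hypothetical, and the hierarchical clustering itself (hclust in R) is not reproduced.

```python
import math


def pearson_distance(x, y):
    """Return 1 - Pearson r between two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return 1.0 - cov / (sx * sy)
```

Under this definition, perfectly correlated profiles have a distance close to 0 and perfectly anti-correlated profiles have a distance close to 2. Profiles with zero variance are not handled in this sketch.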
The predictive value of the MED12 knockdown signature was assessed using the Catalogue Of Somatic Mutations In Cancer (COSMIC), which is part of the Cancer Genome Project (CGP) (Forbes et al., 2010). From COSMIC Applicants collected the IC50 values of four MEK inhibitors (AZD6244, CI-1040, PD-0325901 and RDEA119) for 152 cell lines that have a mutation in KRAS, HRAS, NRAS and/or BRAF. For these cell lines Applicants also obtained gene expression levels for 11354 genes from COSMIC.
The IC50 values across the 152 cell lines for each MEK inhibitor were discretized into “sensitive” and “resistant” using a simple discretization strategy. Briefly, if the distribution of IC50 values was not unimodal (using Hartigan's dip test (Hartigan and Hartigan, 1985), p<0.05), a two component Gaussian mixture model was used to assign the cell lines to the sensitive or resistant category. Otherwise, an outlier detection strategy was used to call the cell lines that are far to the left of the bulk of the data (i.e., low IC50 values) as sensitive and the others as resistant. Overall, about 18% of the cell lines were called sensitive for each of the MEK inhibitors.
The same strategy was used to discretize the expression levels of each gene into “downregulated”, “normal”, and “upregulated.” In this case, either a two or three component mixture model was used for multimodal distributions (using the BIC to choose the number of components), and for unimodal distributions the outlier scheme called cell lines to the right of the bulk (i.e. high expression levels) as upregulated and those to the left (i.e. low expression levels) as down-regulated.
Next, for each pairing of a gene and a MEK inhibitor a simple enrichment test (i.e. hypergeometric test) was applied to evaluate if significantly many cell lines were both upregulated for the gene and resistant to the MEK inhibitor. For the four MEK inhibitors, AZD6244, CI-1040, PD-0325901 and RDEA119, Applicants respectively detected 474, 807, 856 and 681 genes at p<0.05.
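The enrichment test for a single gene and MEK inhibitor pair can be sketched as an upper-tail hypergeometric probability. The one-sided formulation, the function name and the argument names are assumptions made for illustration.

```python
from math import comb


def hypergeom_pvalue(n_total, n_up, n_resistant, n_both):
    """P(overlap >= n_both) when n_up cell lines are drawn at random from
    n_total lines, of which n_resistant are resistant."""
    upper = min(n_up, n_resistant)
    tail = sum(
        comb(n_resistant, k) * comb(n_total - n_resistant, n_up - k)
        for k in range(n_both, upper + 1)
    )
    return tail / comb(n_total, n_up)
```

In the setting described above, `n_total` would be 152, `n_up` the number of cell lines in which the gene is called upregulated, `n_resistant` the number called resistant to the inhibitor, and `n_both` the number in both categories.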
Applicants evaluated whether there was an overrepresentation of the MED12 signature genes in these sets of genes. Of the 237 genes upregulated after MED12 knockdown, 170 are part of the gene expression set of COSMIC. Of the 22 genes downregulated after MED12 knockdown, only 12 are present in the gene expression set. Because the latter set is very small, Applicants decided to focus only on the set of 170 upregulated genes. In these 170 genes, and the four MEK inhibitors, AZD6244, CI-1040, PD-0325901 and RDEA119, Applicants detected 22, 36, 35, and 26 genes at p<0.05, respectively. Seven genes were found in all of the four groups. The association of gene expression with response to AZD6244 for one of these genes, ZBED2, is depicted in
In order to determine the statistical significance of the number of genes in the MED12 signature whose gene expression was found to be associated with each of the inhibitors, Applicants compared these numbers to what would be expected under the null hypothesis. More specifically, Applicants randomly drew 100,000 sets of 170 genes with the same distribution of expression variance across the dataset as the 170 MED12 upregulated signature genes. Applicants computed a permutation test p-value, which indicates the fraction of times (out of 100,000) that the randomly drawn gene set showed more significantly associated genes than the 170 MED12 signature genes. These p-values are 0.009, 0.004, 0.007 and 0.013 for AZD6244, CI-1040, PD-0325901 and RDEA119, respectively. These numbers are found in
Applicants observed that the variance of genes in the MED12 signature was higher than the average for the complete expression dataset. Applicants focused on random gene sets with the same variance distribution, since genes with no or low variance across the dataset can never be significantly associated with the varying IC50 values, and should therefore not be part of the random gene sets.
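The permutation procedure can be sketched as follows. Matching the variance distribution by sampling the same number of genes from each variance bin is an assumption of this sketch, as are the function name, the argument names and the default number of draws.

```python
import random


def permutation_pvalue(observed, significant, bins, signature_bins,
                       n_draws=1000, seed=0):
    """Fraction of variance-matched random gene sets that contain more
    significantly associated genes than `observed`.

    `bins` maps variance bin -> list of genes in the dataset;
    `signature_bins` maps variance bin -> number of signature genes in it;
    `significant` is the set of genes associated with resistance.
    """
    rng = random.Random(seed)
    exceed = 0
    for _ in range(n_draws):
        hits = 0
        for b, k in signature_bins.items():
            # Draw k genes without replacement from the same variance bin.
            hits += sum(g in significant for g in rng.sample(bins[b], k))
        if hits > observed:
            exceed += 1
    return exceed / n_draws
```

With `n_draws=100000` and `observed` set to the number of signature genes associated with a given inhibitor, the returned fraction corresponds to the permutation test p-value described above.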
Having thus described in detail embodiments of the present invention, it is to be understood that the invention defined by the above paragraphs is not to be limited to particular details set forth in the above description as many apparent variations thereof are possible without departing from the spirit or scope of the present invention.
Each patent, patent application, and publication cited or described in the present application is hereby incorporated by reference in its entirety as if each individual patent, patent application, or publication was specifically and individually indicated to be incorporated by reference.
This application claims the benefit of priority of U.S. Provisional Application Ser. No. 61/471,601 filed Apr. 4, 2011; U.S. Provisional Application Ser. No. 61/472,165, filed Apr. 5, 2011; and U.S. Provisional Application Ser. No. 61/610,349 filed Mar. 13, 2012, which are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US12/32202 | 4/4/2012 | WO | 00 | 5/8/2014 |
Number | Date | Country | |
---|---|---|---|
61471601 | Apr 2011 | US | |
61472165 | Apr 2011 | US | |
61610349 | Mar 2012 | US |