This disclosure relates to gene analysis and, in particular, to gene expression profiling for identifying molecular signature of cancer disease, in particular the G1 phase of the cell cycle, such as myeloproliferative disorders (MPD) or breast cancer and studying cancer.
Myeloproliferative disorders (MPD) are clonal proliferative diseases of the hematopoietic stem cells. After an initial phase they may progress to an acute syndrome. V617F mutation of the JAK2 kinase are found in polycythemia vera, essential thrombocythemia, and idiopathic myelofibrosis. BCR-ABL fusion occurs in chronic myeloid leukemia, and a variety of fusions involving PDGF and FGF receptors are found in other MPDs. Fusion kinases that result from a translocation are made of a constitutively activated kinase domain and an N-terminal region encoded by a partner gene. The activated kinase triggers sustained proliferation and survival of the hematopoietic cells but maturation is not affected.
Because of the side effects and risk of cancer therapy as well as resistance of certain patient to the treatment with certain drug, it would be desirable to offer new means to monitor cancer treatment and to be able to identify patients in need of such treatment.
We provide methods for analyzing differential gene expression associated with cancer disease, more particularly, the G1 phase of the cell cycle such as myeloproliferative disorders (MPD) or breast cancer, comprising detecting the upregulation and/or downregulation of a pool of polynucleotide sequences in a cell or tissue sample, the pool corresponding to all or part the polynucleotide sequences, subsequences or complement thereof, of the genes listed in Tables 1, 2 and 3.
More particularly, the methods are carried out on a tumor cell or tissue sample. It may be any sample that may be taken from a patient, such as for example serum, plasma, urine or a biopsy sample.
Upregulation and/or downregulation of a pool of polynucleotide sequences according to the method of the present invention identify a molecular signature of activated MPD kinase.
Table 2 displays 188 upregulated genes/EST and table 3 displays 48 genes/EST downregulated in activated kinase-expressing cells.
Table 1 represents the most significant (p-value inferior at 2·10−2) and most often represented (including at least 3 genes) biological processes. Many of the upregulated genes encode nucleolar proteins involved in “ribosome biogenesis” (GO:0007046, 6 genes, p=4.28·10−11), “rRNA processing” (GO:0006364; 7 genes, p=3.07·10−11), and “protein biosynthesis” (GO:0006412, 9 genes, p=3.60·10−0.5).
Tables 1, 2 and 3 indicate the name of the gene (gene Symbol). We define the nucleotide sequences by the name of the gene or fragments thereof. Each polynucleotide sequence in Tables 2 and 3 may be considered as a marker of the corresponding gene. Each marker corresponds to a gene in the human genome, i.e., such marker is identifiable as all or a portion of a gene. Any RNA transcribed from a marker gene (mRNAs), any cDNA or cRNA produced therefrom, and any nucleic acid derived therefrom, such as synthetic nucleic acid having a sequence derived from the gene corresponding to the marker gene are also encompassed by the present invention.
A “pool of polynucleotide sequences” may comprise one or more sequences, preferably 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 350, 400, 450, 500 sequences.
The number of genes may vary in the range of from 1 to the total number of genes described in Tables 1, 2 or 3, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 188 genes, depending of the number of genes in said tables.
The pool of polynucleotide sequences may comprise the polynucleotide sequences, subsequences or complement thereof, of the genes listed in Tables 2. More particularly, the methods relate to upregulated genes encoding nucleolar proteins (CIRH1A, LARP1, NOL1, NOL11, NOL5, NOL5A, NOLA1, NOLA2, NOLC1, MKI67IP, SFRS2, SURF6), ribosomal proteins (RPL3, RPL12, RPL41, RPS9, RRS1), small nuclear ribonucleoproteins and interactors (U3/MPHOSPH10, LSM2, RNU22, RNU3IP2), components of RNA polymerase I (POLR1A, POLR1B), II (POLR2H, TAF9) and III (POLR3E, POLR3H), DEAD-box (DDX18, DDX56) and WD repeat (WDR4, WDR43, WDR74, WDR77, GRWD1, PWP1) proteins, eukaryotic initiation and elongation factors (EIF1A, EIF3S1, EIF3S4, EEF1E1), and components of the exosome (EXOSC1, EXOSC2, EXOSC6).
The genes also relate to upregulated genes encoding proteins of the NOL5A-associated preribosomal ribonucleoprotein complex involved in pre-rRNA processing: NOL5A, PPAN, NOLC1, and BXDC2. The gene encoding EBNA1BP2 was upregulated; it encodes a protein that binds to nucleolar FGF3 and is regularly upregulated in tumors.
The genes further relate to the most upregulated sequence GAS5, a non-protein-coding multiple small nucleolar RNA (snoRNA).
Other significant processed genes includes “protein folding” (4 genes), “ubiquitin-dependent protein catabolism” (3 genes), “nuclear mRNA splicing, via spliceosome” (3 genes), and “regulation of cell cycle” (3 genes).
A second major category of upregulated genes encode CCND2 (cyclin D2) and CDC25A, two major regulators needed for G1 progression.
Moreover, the list of upregulated genes included MYC. Many genes upregulated by MYC and NMYC oncogenes were also upregulated in our experiments, including CCND2, CDC25A and others (DDX18, EBNA1BP2, EEF1E1, MAT2A, MKI67IP, NOL5A, NOLA1, PHB, SFRS2, SHMT1, SLC16A1, SURF6, SRM, RPL3, RPL12, RPL41, RPS9 and RRS1).
The pool of polynucleotide sequences may include all or part the polynucleotide sequences, subsequences or complement thereof, of the genes listed in Tables 3. More particularly, the pool relates to downregulated genes encoding proteins with known or potential inhibitory function such as PIAS3, an inhibitor of STAT3, one of the main substrates of MPD kinases, and regulator of CDC25A, Erbin, and PLZF/ZBTB16, a MYC repressor.
The detection of over or under expression of polynucleotide sequences may be carried out by FISH or IHC. The detection may be performed on DNA microarrays. The level of the RNA transcripts can be measured by any available technique such as quantitative PCR.
We further provide a polynucleotide library useful for the molecular characterization of a cancer comprising or corresponding to a pool of polynucleotide sequences either upregulated or downregulated in tissue, said pool corresponding to all or part of the polynucleotide sequences selected as defined above.
The polynucleotide library may be immobilized on a solid support, for example selected from the group comprising nylon membrane, nitrocellulose membrane, glass slide, glass beads, membranes on glass support or silicon chip, plastic support.
We further provides a method for analysing differential gene expression associated with cancer disease, comprising:
The polynucleotide sample may be labelled before reaction step (b). The label may be selected from the group consisting of at least one of radioactive, calorimetric, enzymatic, molecular amplification, bioluminescent or fluorescent labels. Preferably, the label is calorimetric, e.g., biotin or digoxygenin.
The method may further comprise:
The polynucleotide sample may be cDNA, RNA or mRNA. The mRNA may be isolated from the polynucleotide sample and cDNA may be obtained by reverse transcription of said mRNA.
We further provide a method for determining the nature of a cancer from a cancer cell or tissue comprising the implementation of the method on nucleic acids from cells or tissues from a patient.
More particularly, the method is useful for profiling samples of various human MPDs. It may help define the initial steps of oncogenesis, for which G1 activation may be important. Cell quiescence may be defined by the absence of protein synthesis, whereas proliferation starts with nucleolar activity, ribosome biogenesis and rRNA processing. Second, MPDs can benefit from treatment targeting not only the activated kinases but also the PI3K/AKT/TOR signaling pathway and the G1 phase of the cell cycle, in synergy with anti-kinase drugs and/or in case of resistance. G1 targeting is frequent in many types of cancer. The signature can be used in transcriptome studies of any type of cancer to identify G1 activation, classify tumors, and use the appropriate drugs.
Therefore, we further provide a method of prognosis or diagnostic or prediction of tumours susceptible to a molecule acting on the G1 phase of the cell cycle such as rapamycine and its derivatives, and therefore we provide a method for monitoring the treatment of a patient with a cancer comprising the implementation of the method of analysis disclosed hereabove on nucleic acids from a patient to identify the corresponding signature.
We also provide a method of selection of patient susceptible to be treated with molecule acting on the G1 phase of the cell cycle such as rapamycine or its derivative, comprising the implementation of the method hereabove on nucleic acids from a patient in order to identify the corresponding signature.
This signature provides tools for classifying basal tumors for which the high proliferation results from the PI3K/TOR pathway. Therefore, we provide methods for determining whether a tumor has an activated proliferation according to PI3K/TOR and is sensitive to chemotherapy.
Myeloproliferative disorders (MPD) are clonal hematopoietic diseases characterized by the proliferation and expansion of one or several myeloid cell lineages in the bone marrow. During the chronic phase, the cells follow their normal differentiation pathway and become mature blood cells. During a second phase, an acute syndrome may occur. The conventional classification separates MPDs in clinical entities. These include chronic myeloid leukemia (CML), chronic neutrophilic leukemia, chronic eosinophilic leukemia, polycythemia vera (PV), essential thrombocythemia (ET) and idiopathic myelofibrosis (IMF). MPDs with different syndromes and molecular abnormalities are grouped in non-classical MPDs. Systemic mastocytosis is not classically included in MPDs but is a related disease.
We studied the gene expression profiles of murine Ba/F3 cells transfected by various oncogenic MPD kinases by using whole-genome Affymetrix 430 2.0 mouse oligonucleotide microarrays (www.Affymetrix.com). Ba/F3 cells were grown in RPMI medium supplemented with 10% fetal calf serum (FCS) and IL3. Expression of an MPD kinase bypasses the IL3 dependence of Ba/F3 cells. RNA was extracted by using Trizol (Trizol Reagent, Invitrogen Life Technologies, Carlsbad, Calif.) from frozen pellets of: i)—Ba/F3 cells, ii)—Ba/F3 transfected with different pCDNA expression vectors expressing a mutant, kinase-defective FOP-FGFR1 KD or an oncogenic MPD kinase (BCR-ABL, FOP-FGFR1, CEP1-FGFR1, BCR-FGFR1), and iii)—Ba/F3 transfected by the MIGR vector, either empty (MIGR) or with JAK2 (JAK2 WT, mutated V617F JAK2 or mutated V617F IND JAK2 i.e. able to grow independently of IL3). Before RNA extraction, cells were starved for 7 hours in RPMI plus 0.5% FCS. RNA integrity was controlled by microanalysis (Agilent Bioanalyzer, Palo Alto, Calif.). Preparation of cRNA, hybridizations, washes, detection and quantification were done as recommended by the supplier (Affymetrix). Data were analyzed by the RMA (Robust Multichip Average) method in R using Bioconductor and associated package. Before analysis, a filtering process removed from the dataset the genes with low and poorly measured expression as defined by an expression value inferior to 100 units in all samples, retaining 17.885 genes/ESTs. For paired samples, RNA was prepared independently from different cultures of cells. The correlation between paired samples ranged between 0.97 and 0.98.
Gene expression profiles of Ba/F3 cells transfected by fusion or mutated kinases (9 samples: BCR-ABL, 2 BCR-FGFR1, 2 CEP1-FGFR1, 2 FOP-FGFR1, 2 V617F JAK2) were compared to that of control cells (6 samples) including parental Ba/F3 cells (4 samples) and Ba/F3 expressing a kinase-defective mutant of FOP-FGFR1 (2 samples). Supervised analysis, based on 17,885 filtered probe-sets, identified 294 differentially expressed probe sets (theoretical number of produced false positives=1.7) (
To translate the RNA expression profiles into functionality, discriminator genes/ESTs were interrogated by Onto-Express.7 Table 1 represents the most significant (p-value inferior at 3·10−2) and most often represented (including at least 3 genes) biological processes. Many of the upregulated genes encode nucleolar proteins involved in “ribosome biogenesis” (GO:0007046, 6 genes, p=4.28·10−11), “rRNA processing” (GO:0006364; 7 genes, p=3.07·10−11), and “protein biosynthesis” (GO:0006412, 9 genes, p=3.60·10−05). Upregulated genes encode nucleolar proteins (CIRH1A, LARP1, NOL1, NOL11, NOL5, NOL5A, NOLA1, NOLA2, NOLC1, MKI67IP, SFRS2, SURF6), ribosomal proteins (RPL3, RPL12, RPL41, RPS9, RRS1), small nuclear ribonucleoproteins and interactors (U3/MPHOSPH10, LSM2, RNU22, RNU3IP2), components of RNA polymerase I (POLR1A, POLR1B), II (POLR2H, TAF9) and III (POLR3E, POLR3H), DEAD-box (DDX18, DDX56) and WD repeat (WDR4, WDR43, WDR74, WDR77, GRWD1, PWP1) proteins, eukaryotic initiation and elongation factors (EIF1A, EIF3S1, EIF3S4, EEF1E1), and components of the exosome (EXOSC1, EXOSC2, EXOSC6). Upregulated genes also encode proteins of the NOL5A-associated preribosomal ribonucleoprotein complex involved in pre-rRNA processing: NOL5A, PPAN, NOLC1, and BXDC2. The gene encoding EBNA1BP2 was upregulated; it encodes a protein that binds to nucleolar FGF3 and is regularly upregulated in tumors. The most upregulated sequence was GAS5, a non-protein-coding multiple small nucleolar RNA (snoRNA).
Other significant processes included “protein folding” (GO:0006457; 4 genes, p=6.72·10−03), “ubiquitin-dependent protein catabolism” (GO:0006511, 3 genes, p=1.47·10−02), “nuclear mRNA splicing, via spliceosome” (GO:0000398, 3 genes, p=1.65·10−02), and “regulation of cell cycle” (GO:0000074, 3 genes, p=1.65·10−02). The second major category of upregulated genes encode CCND2 (cyclin D2) and CDC25A, two major regulators needed for G1 progression. CCND2 RNA was found upregulated by BCR-ABL in previous gene expression studies. Cyclin D2 is necessary for BCR-ABL-induced activity. Inhibition of V617F JAK2 correlates with decreased expression of cyclin D2. Other G1 cyclins may play a role in the oncogenic activity of fusion kinases but cyclin D2 seems to be a rate-limiting element. We used Western blot analysis to validate the differential expression of cyclin D2. The amount of cyclin D2 protein was increased in Ba/F3 cells expressing activated kinases as compared to controls (
MYC directly or indirectly regulates the G1 phase of the cell cycle. The list of upregulated genes included MYC. Many genes upregulated by MYC and NMYC oncogenes were also upregulated in our experiments, including CCND2, CDC25A and others (DDX18, EBNA1BP2, EEF1E1, MAT2A, MKI67IP, NOL5A, NOLA1, PHB, SFRS2, SHMT1, SLC16A1, SURF6, SRM, RPL3, RPL12, RPL41, RPS9 and RRS1). This similarity suggests that MYC proteins and MPD kinases have similar oncogenic effects, whose main target would be the CDKN2-RB protein pathway during the G1 phase of the cell cycle. Once induced, MYC may in turn act on the transcription of G1/S regulators and genes involved in protein synthesis. MPD fusion kinases are thought to target the hematopoietic stem cell. Activation of MYC is in perfect agreement with what we know of stem cell proliferation. A similar program was also turned on by IL3 stimulation (not shown).
Downregulated genes were more difficult to classify with Onto-Express, but several encode proteins with known or potential inhibitory function such as PIAS3, an inhibitor of STAT3, one of the main substrates of MPD kinases, and regulator of CDC25A, Erbin, and PLZF/ZBTB16, a MYC repressor.
We tested the validity of our classification by the “leave-one-out” cross-validation method. Iteratively, one of the 15 samples was removed, and a multigene predictor was generated from the remaining samples: 93% of samples were correctly assigned by the predictors with a sensitivity of 89% and a specificity of 100%.
Thus, in Ba/F3 cells, MPD fusion kinases induce both G1 activators and protein synthesis components, thus starting the cell proliferation machinery. This effect may be mediated by the PI3 kinase-AKT-TOR pathway, which controls and coordinates both protein synthesis and early phases of the cell cycle. Prominent downstream targets of the AKT pathway are cyclins D1, D2 and MYC.
The subject matter of the references set forth below are hereby incorporated by reference in their entirety:
indicates data missing or illegible when filed
indicates data missing or illegible when filed
This Application claims priority to U.S. Provisional Application No. 60/874,357 filed on 12 Dec. 2006; the entire contents of which are hereby incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60874357 | Dec 2006 | US |