The present invention relates to polynucleotides comprising Alu-derived exons derived by alternative splicing from Alu elements, which are involved in cancer and genetic diseases. The present invention further relates to use of oligonucleotides for identification of the sequences associated with Alu exonization for diagnosis and treatment of cancer and genetic diseases.
Alu elements are short, interspersed elements (SINEs) about 300 nucleotides in length, which amplify in primate genomes through a process of retroposition. Alu elements represent a significant fraction of noncoding DNA.
Numerous studies describe splicing-mediated insertions of part of Alu sequence into mature mRNAs, a process known as exonization of Alu elements. It was recently described by some of the applicants of the present invention that 5.2% of the alternatively spliced exons observed in public mRNA and EST databases originate from Alu sequence, whereas none of the constitutively spliced exons are Alu-derived (Sorek et al., Genome Res., 12:1060-1067, 2002). The results indicate that internal exons containing an Alu sequence are predominantly, if not exclusively, alternatively spliced. Sorek et al. postulated that the absence or dearth of constitutive Alu-containing exons in the human genome indicates the negative selection of such exons, which probably interfere with normal protein production.
A study published after the priority date of the present application by the inventors of the present invention discloses that two positions along the inverted Alu sequence are most commonly used as a 3′ splice-site in Alu exonization (Lev-Maor et al., Science 300:1288-1291, July 2003). Another study published after the priority date of the present application by the inventors of the present invention discloses that certain mutational changes are required in order to create functional 5′splice-sites in Alu elements (Sorek et al., Molecular Cell, 14:221-231, April 2004).
A putative role for an Alu upstream motif or cis element in glaucoma pathogenesis is disclosed in U.S. Pat. No. 6,171,788. This patent discloses that transcription of Trabecular Meshwork Inducible Glucocorticoid Response (TIGR) promoter molecules can be effected by agents capable of altering the biochemical properties or concentration of nuclear factors or their homologues, including nuclear factors bound to Alu elements. Such agents can be used in the study and treatment of glaucoma.
Association of Alu introns with hereditary types of cancer is disclosed in U.S. Pat. No. 6,733,966. This patent discloses unusual high concentration of Alu-elements in the BRCA1 gene intronic regions favoring the induction of large genomic deletions and inversions in a situation of increased genomic instability.
Predisposition to genetic diseases determined by detecting mutations in DNA sequences is disclosed in numerous publications. For example, U.S. Pat. No. 5,849,483 discloses a high-throughput method for screening nucleic acid samples to identify target sequences or genetic alterations in target sequences present in the nucleic acid samples including randomly permuted alterations in nucleic acid sequences of interest. Methods of detecting mutations in DNA sequences using a competitive oligonucleotide priming system is disclosed in U.S. Pat. No. 6,015,675. Mass spectrometry-based processes for detecting particular nucleic acid molecules and sequences in the molecules are disclosed in U.S. Pat. No. 6,258,538. Depending upon the sequence to be detected, the processes can be used to diagnose a genetic disease or a chromosomal abnormality, a predisposition to a disease or condition, or infection by a pathogen, or for determining identity or heredity.
A variety of ‘wet laboratory’ techniques for identification of specific genetic sequences or genetic markers are known in the art. For example, U.S. Pat. No. 6,528,256 discloses a method for the identification and isolation of specific genetic sequences or genetic markers of cDNA or of genomic DNA derived from the cells, tissues or organs of an organism, particularly a mammal, employing an Amplification Fragment Length Polymorphism (AFLP)-based technique. AFLP is disclosed in EP 0 534 858 and is PCR-based with specific combinations of restriction endonucleases and adapters of discrete sequences, as well as primers that contain the common sequences of the adapters. In this way, a sequence or fragment of DNA in a complex sample may be specifically amplified and used for further analysis.
Identification of specific genetic sequences or genetic markers can be also effected by computational algorithms, by mining data from existing databases.
Treating genetic diseases by preventing expression of undesired genetic elements is known in the art. U.S. Pat. No. 5,681,747 discloses methods for inhibiting human-PKCα expression with an oligonucleotide specifically hybridizable to a portion of the 3′-untranslated region of PKCα. A method for gene therapy using small fragment homologous replacement is disclosed in U.S. Pat. No. 6,010,908. The method introduces small fragments of exogenous DNA into regions of endogenous genomic DNA virtually homologous to the exogenous DNA. The exogenous DNA fragment contains sequence modification that correct mutations in the endogenous DNA or introduce mutations that alter cellular or an infecting pathogen phenotype. U.S. Pat. No. 6,506,559 discloses methods for inhibiting gene expression in vitro using certain isiRNA constructs that mediate RNAi. A strategy for suppressing expression of an endogenous gene comprising providing suppression effectors able to bind to the non-coding regions of a gene to be suppressed, to prevent the functional expression thereof, is disclosed in U.S. Pat. No. 6,713,457. The suppression effectors may be antisense nucleic acids, and the non-coding regions can include the transcribed but non-translated regions of a gene. The strategy can also introduce a replacement gene. International Publication No. WO 99/53050 discloses certain methods for decreasing the phenotypic expression of a nucleic acid in plant cells using certain double strand RNAs. International Publication No. WO 01/49844 discloses specific DNA constructs for use in facilitating gene silencing in targeted organisms. International Publications Nos. WO 02/055692, WO02/055693, and EP 1144623 B1 disclose methods for inhibiting gene expression using RNAi. International Publications Nos. WO 99/49029 and WO01/70949, and AU 4037501 describe certain vector expressed siRNA molecules.
Nowhere in the background art it is taught or suggested that Alu-derived silent sequences may be transformed into exonic sequences and that such exonized Alu-derived sequences may be involved in the pathogenesis of genetic disease.
The present invention relates generally to identification and use of genetic elements that are hitherto uncharacterized markers of diseases or predisposition to genetic diseases. In particular, the present invention relates to genetic markers comprising Alu-derived exons derived from Alu introns.
It is now disclosed that the exonization of mutated Alu sequences is a factor that may be linked to the loss or alteration of normal gene function. The detection of exons derived from Alu introns is now disclosed to provide novel markers useful for prognosis, diagnosis or treatment of a disease or condition selected from the group consisting of a genetic disease, a chromosomal abnormality, a genetic predisposition, and a viral infection.
The present invention is based in part on the unexpected discovery that Alu elements or fragments thereof, predominantly in their antisense orientation inserted into mature mRNAs by way of splicing (‘exonization’), are associated with genetic disease and cancer.
Thus, the present invention relates to method of identifying, inhibiting and treating diseases, including but not limited to genetic diseases and cancer, utilizing Alu-derived genetic markers. The Alu-derived genetic markers of the invention can be used for detection or treatment of a disease and can be also used for identifying alternatively spliced mRNA isoforms containing same that are prevalent in the disease, thereby enabling use of the mRNA isoform for detection or treatment of said disease. The first step of the latter approach can be conducted in silico, using well known EST, cDNA and mRNA libraries for the identification of alternatively spliced mRNA isoforms containing Alu-derived exons.
According to one aspect the present invention provides an isolated polynucleotide comprising at least one Alu-derived exon comprising a sequence derived by alternative splicing of an Alu element, the Alu element comprising at least one mutated splice site. According to particular embodiments the mutated splice site is as set forth in any one of SEQ ID NOS:1-58.
According to one embodiment, the sequence comprising the mutated splice site of the Alu element is at least 80% homologous, preferably 85%, more preferably 90%, most preferably 95% or more homologous to any one of SEQ ID NOS:1-58.
Without wishing to be bound by any particular theory or mechanism of action, Alu introns transform into Alu exons upon evolution of certain mutations within pseudo splice-sites (SS) in the intronic Alu sequences, a process referred to herein as exonization.
Surprisingly, the inventors of the present invention discovered particular positions along Alu sequences that are most commonly used as splice-sites in Alu exonization. Based on these findings the inventors of the present invention identified 7,810 Alu sequences within EST libraries (Sorek et al. 2004, ibid).
According to yet another embodiment, the isolated polynucleotides of the present invention comprise alternative splice-sites, also referred herein as pseudo splice-sites. According to yet another embodiment, the polynucleotides of the present invention comprise sequences downstream to the 3′ splice-site that is involved in Alu exonization. According to certain embodiments, the 3′ splice-site is within positions 275 to 279 of the consensus Alu sequence, corresponding to positions 12 to 16 of SEQ ID NOS:1-32. According to yet another embodiment, the polynucleotides of the present invention comprise sequences upstream to the 5′ splice-site that is involved in Alu exonization. According to certain embodiments, the 5′ splice-site is in positions 154 to 160 within the consensus Alu sequence, corresponding to positions 13 to 19 of SEQ ID NOS: 33-58.
It is now disclosed for the first time that that out of the at least 238,000 antisense Alu sequences located within introns in the human genome 52,935 Alus carry a potential ADAR2-like 3′SS (Slavov et al., GENE, 299:83, 2002) and 23,012 carry a potential PGT-like 3′SS (NCBI GenBank Access No.: AA225691) suggesting that many of these silent intronic Alu elements might be susceptible to exonization by the same single point mutation, and are thus under strict selective pressure. Such point mutations in human genomic antisense Alu sequences may be the molecular basis for predisposition to previously uncharacterized genetic diseases.
According to another aspect the present invention provides an oligonucleotide comprising at least a portion of an Alu-derived exon selected from an oligonucleotide probe and an oligonucleotide primer, the Alu-derived exon comprising a sequence derived by alternative splicing of an Alu element, the Alu element comprising at least one mutated splice site. According to particular embodiments the mutated splice site is as set forth in any one of SEQ ID NOS:1-58.
According to one embodiment, said oligonucleotide further comprises a detectable label. The labels include, but are not limited to, fluorophores, chromophores, radioactive isotopes, electron dense reagents, enzymes, enzymatic substrates and ligands.
It is to be understood explicitly that the scope of the present invention encompasses homologs, analogs, variants and derivatives, including but not limited to shorter and longer polynucleotides or oligonucleotides, with one or more nucleic acid substitution, as well as nucleic acid derivatives, non-natural nucleic acids and synthetic nucleic acids as are known in the art.
According to one embodiment, the present invention provides a vector comprising the isolated polynucleotide of the invention. According to another embodiment, the present invention provides a vector comprising the oligonucleotide of the invention. According to yet another embodiment, the present invention provides a construct comprising the oligonucleotide of the invention.
According to yet another aspect, the present invention provides a method for diagnosing, detecting or predisposition to a disease, comprising determining the presence in a biological specimen of a polynucleotide comprising a sequence selected from: a) an Alu-derived sequence comprising at least one mutated alternative splice site; and b) an alternatively spliced mRNA isoform comprising an Alu-derived exon.
According to one embodiment, the Alu-derived sequence of a) comprises a sequence as set forth in any one of SEQ ID NOS: 1-58, fragments or extensions thereof.
According to another embodiment, the disease is selected from the group consisting of: cancer, a genetic disease, a chromosomal abnormality, a genetic predisposition and a viral infection. According to another embodiment, the genetic disease is selected from the group consisting of: Alport syndrome, Sly syndrome, CCFDN (congenital cataracts, facial dysmorphism, and neuropathy) syndrome and OAT deficiency.
According to yet another embodiment, cancer is selected from the group consisting of: prostate cancer, breast cancer, ovarian cancer, lung cancer, melanoma, renal cancer, bladder cancer, fibrosarcoma, hepatocellular carcinoma, osteocarcinoma, primary ductal carcinoma, giant cell sarcoma, ductal carcinoma, Hodgkin's disease, colorectal carcinoma, lymphoma, transitional cell carcinoma, uterine sarcoma, adenocarcinoma, plasmacytoma, epidermoid carcinoma, Burkitt's lymphoma, Ewing's sarcoma, gastric carcinoma, squamous cell carcinoma, neuroblastoma and rhabdomyosarcoma.
According to one embodiment, the method comprises:
According to yet another embodiment, the biological specimen is derived from a human or non-human primate. According to yet another embodiment, the biological specimen is selected from a tissue sample, fluids, cells, cellular extract, tissue extract and genetic material extracted from same, including but not limited to DNA and RNA. According to yet another embodiment, the fluids are selected from the group consisting of: blood, urine and saliva.
According to another embodiment, the method further comprises:
According to one embodiment, the method comprises identifying the alternatively spliced mRNA isoforms in silico from a database of sequences selected from the group consisting of: cDNA, EST, and mRNA.
According to another embodiment, the alternatively spliced mRNA isoform is identified by a method selected from a group consisting of: Northern blot analysis, RNase protection, in situ hybridization and selective hybridization to arrayed cDNA libraries.
According to yet another embodiment, the method further comprises contacting said biological specimen with a control nucleic acid probe capable of forming a hybrid with at least one control gene or fragments thereof. According to yet another embodiment, the at least one control gene is selected from the group consisting of: β-actin, glycer-aldehyde-phophate-dehydrogenase (GAPDH), S16 rRNA, phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR), adenosine deaminase, phosphoglycerol kinase (PGK), pyruvate kinase, phosphoglycerol mutase.
According to yet another embodiment, the method further comprises contacting the nucleic acid probe with a biological standard. According to certain embodiments, the biological standard is derived from ostensibly healthy tissue of a human or a non-human primate and said biological specimen is derived from a non-healthy tissue derived from the human or the non-human primate. Alternatively, said biological specimen is derived from a non-healthy tissue derived from a different human or non-human primate.
According to yet another embodiment, the nucleic acid probe is labeled. According to certain embodiments, the labels are selected from fluorophores, chromophores, radioactive isotopes, electron dense reagents, enzymes, and ligands having specific binding partners.
According to yet another embodiment, the method comprising:
According to yet another embodiment, the plurality of primers comprises at least one primer that is labeled to enable detection of the amplified products. According to some alternative embodiments, the amplified products are separated by means of molecular size exclusion.
According to yet another embodiment, the reaction mixture comprises the four nucleoside triphosphates: dATP, dCTP, dTTP, and dGTP. According to an alternative embodiment, the reaction mixture comprises dATP, dCTP, dTTP, dGTP, and an analog of dGTP, including but not limited to, inosine, 7-deaza-guanosine, 7-deaza inosine deoxyribonucleotides and 2′-deoxy analogs thereof.
According to another embodiment, the genetic material is extracted from tissue, cells and body fluids. According to yet another embodiment, the fluids are selected from the group consisting of: blood, sperm, urine and saliva.
According to a preferred embodiment, the alternatively spliced mRNA isoform is an ACAD-9 mRNA (Zhang et al., Biochem. Biophys. Res. Commun., 4:297:1033, 2002). According another preferred embodiment, the disease is ovarian cancer. According to certain embodiment, the alternatively spliced mRNA isoform is TIF-IA mRNA. According to another certain embodiment, the disease is lymphoma.
According to yet another aspect, the present invention provides a method for preventing or inhibiting a disease in a subject in need thereof, comprising treating the subject in need thereof with a therapeutically effective amount of a pharmaceutical composition containing as an active ingredient an oligonucleotide capable of silencing the polynucleotide of the invention or fragments thereof.
According to one embodiment, the silencing polynucleotide is selected from the group consisting of: antisense nucleotide sequence, sense nucleotide sequence, short interfering RNA, ribozyme and aptamer.
Other objects, features and advantages of the present invention will become clear from the following description and drawings.
The present invention provides novel genetic elements that are hitherto uncharacterized markers of cancer, genetic diseases or predisposition to genetic disease. It is now disclosed that these novel markers comprise exons derived from Alu elements and that these novel markers are useful for prognosis, diagnosis or treatment of genetic diseases.
According to one aspect the present invention provides an isolated polynucleotide comprising at least one Alu-derived exon comprising a sequence derived by alternative splicing of an Alu element, the Alu element comprising at least one mutated splice site. According to particular embodiments the mutated splice site is as set forth in any one of SEQ ID NOS:1-58.
The terms “Alu element” or “Alu sequence” are interchangeably used herein to describe short, interspersed (with an average spacing of 4 Kb), repetitive elements (SINEs) about 300 nucleotides in length, which amplify in primate genomes through a process of retroposition. These elements have reached a copy number of about 1.4 million in the human genome, comprising more than 10% of it. A typical Alu is a dimer, built of two similar sequence elements (left and right arms) that are separated by a short A-rich linker. Most Alus have a long poly-A tail of about 20-100 bases.
According to one embodiment, the mutated splice site of the Alu element is present in a sequence having at least 80% homology, preferably 85%, more preferably 90%, most preferably 95% or more homology to any one of SEQ ID NOS:1-58.
The term “X % homology” is not intended to be limited to sequences having an X % homology over the entire length of the polynucleotide but is intended to include X % homology occurring in identified functional areas therein, preferably, in areas comprising splice site involved in Alu exonization.
Without wishing to be bound by any particular theory or mechanism of action, Alu introns transform into Alu exons, also termed herein Alu-derived sequences, upon evolution of certain mutations within pseudo splice-sites (SS) in the intronic Alu sequences, a process referred herein as exonization.
The term “exonization” as used herein is to be construed in its most general sense and refers to insertion of genetic material, preferably introns, more preferably polynucleotides derived from Alu gene, into mature mRNAs by way of splicing (
Alternative splicing is a major mechanism that diversifies the genetic information by producing more than one mRNA from a single gene. Bioinformatics studies showed that 32-59% of human genes are involved in alternative splicing. Alternative splicing is often regulated either according to cell type, developmental stage, and sex, or in response to an external stimulus. Aberrant regulation of alternative splicing has been implicated in an increasing number of human diseases, including cancer The availability of the complete human genome sequence has made it clear that gene number is not the sole determinant of proteome complexity. Sequencing of the human genome showed that humans possess >30,000 protein coding genes, which is only 1.5-fold the number of genes in D. melanogaster. This surprisingly low number contrasts with the number of human proteins, estimated to be more than 90,000. Clearly, alternative splicing contributes to this huge number of proteins that are generated from the low number of genes.
The inventors of the present invention discovered particular positions within Alu sequences that are most commonly used as splice-sites in Alu exonization. Based on these findings the inventors of the present invention identified 7,810 Alu sequences (Sorek et al. 2004, ibid).
Thus, according to another embodiment, the polynucleotides of the present invention comprise alternatively spliced mRNAs that result from mutated splice-sites. According to some embodiments, the mutated splice-sites are 3′ splice-site as set forth in positions 12 to 16 of SEQ ID NOS:1-32 (corresponding to positions 275 to 279 in the consensus Alu sequence;
In referring to 5′ or 3′ for a polynucleotide sequence, the direction of transcription is with 5′ being upstream from 3′. Thus, sequences downstream to the 3′ splice-site within an Alu element, as set forth in positions 12 to 16 of SEQ ID NOS:1-32, correspond to the 5′ end of the exonized Alu and sequences upstream to the 5′ splice-site as set forth in positions 13 to 19 of SEQ ID NOS: 33-58 correspond to the 3′ end of the exonized Alu.
According to another aspect the present invention provides an oligonucleotide comprising at least a portion of an Alu-derived exon selected from an oligonucleotide probe and an oligonucleotide primer. According to one embodiment, said oligonucleotide further comprises a detectable label. The labels include, but are not limited to, fluorophores, chromophores, radioactive isotopes, electron dense reagents, enzymes, enzymatic substrates and ligands.
It is to be understood explicitly that the scope of the present invention encompasses homologs, analogs, variants and derivatives, including but not limited to shorter and longer oligonucleotides or polynucleotides, with one or more nucleic acid substitution, as well as nucleic acid derivatives, non-natural nucleic acids and synthetic nucleic acids as are known in the art, with the stipulation that the homologs, analogs, variants and derivatives must preserve the hybridization properties of the original molecule.
It is now disclosed for the first time that out of the at least 238,000 antisense Alu sequences located within introns in the human genome 52,935 Alus carry a potential ADAR2-like (Slavov et al., ibid) 3′SS and 23,012 carry a potential PGT-like (NCBI GenBank Access No.: AA225691) 3′SS suggesting that many of these silent intronic Alu elements might be susceptible to exonization by the same single point mutation, and are thus under strict selective pressure. Such point mutations in human genomic antisense Alu sequences may be the molecular basis for predisposition to previously uncharacterized genetic diseases.
The terms “polynucleotide” refers to a polymer of nucleotides comprising a series of nucleic acids in a 5′ to 3′ phosphate diester linkage. The polynucleotide may be a single or double stranded DNA sequence. If the polynucleotide of the present invention is a single stranded DNA, then its complementary nucleic acid sequence is also within the scope of the present invention. The term “oligonucleotide” as used herein refers to a short polynucleotide, comprising about 200 nucleotides, preferably about 100, more preferably about 50 nucleotides or less.
According to another embodiment, the polynucleotide of the invention is recombinant. As used herein, a “recombinant” nucleic acid or protein molecule is a molecule where the nucleic acid molecule which encodes the protein has been modified in vitro, so that its sequence is not naturally occurring, or corresponds to naturally occurring sequences that are not positioned as they would be positioned in a genome which has not been modified.
The term “genetic disease” as used herein refers preferably to diseases derived from somatic mutations. Somatic mutations are mutations occurring only in certain tissues, e.g., in the tumor tissue, and are not inherited in the germline. Germline mutations can be found in any tissues or cells and are inherited. Somatic gene mutations result when there is an alteration of the nucleotide sequence of a gene encoding sequence. Such change may include a substitution of one nucleotide for another, or the insertion or deletion of one or more nucleotides, resulting in a change in the corresponding amino acid sequence of which is encoded. Somatic gene mutations are distributed largely stochastically in that, with some exceptions, they are distributed at random among the genes of the genome. However, mutations are more likely to occur in cells which are undergoing mitosis, or cell division, because they often result from inaccurate DNA replication. Thus, a cell lineage which has undergone repeated divisions, as is the case in clonal amplification, has a higher probability of accumulating somatic gene mutations. Many diseases are believed to stem from somatic genetic mutation rather than inherited gene abnormalities, including different types of cancers and neurodegenerative diseases. Often, diseases, which are caused by somatic mutation, are age-related.
According to yet another embodiment, said polynucleotide further comprises a detectable label. The labels include, but are not limited to, fluorophores, chromophores, radioactive isotopes, electron dense reagents, enzymes, enzymatic substrates and ligands.
According to one embodiment, the present invention provides a vector comprising the isolated polynucleotide of the invention. According to another embodiment, the present invention provides a vector comprising the oligonucleotide of the invention. According to yet another embodiment, the present invention provides a construct comprising the oligonucleotide of the invention.
By “vector” is meant a polynucleotide molecule, preferably a DNA molecule derived, for example, from a plasmid, bacteriophage, or plant virus, into which a polynucleotide can be inserted or cloned. A vector preferably contains one or more unique restriction sites and can be capable of autonomous replication in a defined host cell including a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with the genome of the defined host such that the cloned sequence is reproducible. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector can also include a selection marker such as an antibiotic resistance gene that can be used for selection of suitable transformants. Examples of such resistance genes are known to those of skill in the art and include the nptII gene that confers resistance to the antibiotics kanamycin and G418 (Geneticin™) and the hph gene which confers resistance to the antibiotic hygromycin B.
According to another aspect, the present invention provides a method for diagnosing, detecting or predicting a disease, comprising determining the presence in a biological specimen of a polynucleotide comprising a sequence selected from: a) an Alu-derived sequence comprising at least one mutated alternative splice site; and b) an alternatively spliced mRNA isoform comprising an Alu-derived exon.
According to one embodiment, the Alu-derived sequence of a) comprises a sequence as set forth in any one of SEQ ID NOS: 1-58, fragments or extensions thereof.
According to another embodiment, the disease is selected from the group consisting of: cancer, a genetic disease, a chromosomal abnormality, a genetic predisposition and a viral infection. According to another embodiment, the genetic disease is selected from the group consisting of: Alport syndrome, Sly syndrome, CCFDN (congenital cataracts, facial dysmorphism, and neuropathy) syndrome and OAT deficiency.
According to yet another embodiment, the disease is cancer selected from the group consisting of: prostate cancer, breast cancer, ovarian cancer, lung cancer, melanoma, renal cancer, bladder cancer, fibrosarcoma, hepatocellular carcinoma, osteocarcinoma, primary ductal carcinoma, giant cell sarcoma, ductal carcinoma, Hodgkin's disease, colorectal carcinoma, lymphoma, transitional cell carcinoma, uterine sarcoma, adenocarcinoma, plasmacytoma, epidermoid carcinoma, Burkitt's lymphoma, Ewing's sarcoma, gastric carcinoma, squamous cell carcinoma, neuroblastoma and rhabdomyosarcoma.
According to another embodiment, the present invention provides a method for detecting a disease, comprising
According to another embodiment, the method further comprises:
According to yet another embodiment, the method comprises obtaining the alternatively spliced mRNA isoform by specific hybridizations, each hybridization is for of a single oligonucleotide probe complementary to a known sequence of an individual molecule. Hybridization methods include, but are not limited to, Northern blot analysis, RNase protection, in situ hybridization or selective hybridization to arrayed cDNA libraries (see Sambrook et al., Molecular cloning, A laboratory manual, Cold Spring Harbor press, NY, 1989) summarized infra.
(i) EST sequencing. The basic idea is to create cDNA libraries from tissues of interest, pick clones randomly from these libraries and then perform a single sequencing reaction from a large number of clones. Each sequencing reaction generates 300 base pairs or so of sequence that represents a unique sequence tag for a particular transcript. An EST sequencing project is technically simple to execute since it requires only a cDNA library, automated DNA sequencing capabilities and standard bioinformatics protocols. To generate meaningful amounts of data, however, high throughput template preparation, sequencing and analysis protocols must be applied. As such, the number of new genes identified as well as the statistical significance of the data is proportional to the number of clones sequenced as well as the complexity of the tissue being analyzed (Hillier et al., Genome Res. 6:807, 1996).
(ii) Subtractive cloning. Subtractive cloning offers an inexpensive and flexible alternative to EST sequencing and cDNA array hybridization. In this approach, double-stranded cDNA is created from the two-cell or tissue populations of interest, linkers are ligated to the ends of the cDNA fragments and the cDNA pools are then amplified by PCR. The cDNA pool from which unique clones are desired is designated the “tester”, and the cDNA pool that is used to subtract away shared sequences is designated the “driver”. Following initial PCR amplification, the linkers are removed from both cDNA pools and unique linkers are ligated to the tester sample. The tester is then hybridized to a vast excess of driver DNA and sequences that are unique to the tester cDNA pool are amplified by PCR. The primary limitation of subtractive methods is that they are not always comprehensive. The cDNAs identified are typically those, which differ significantly in expression level between cell-populations and subtle quantitative differences are often missed. In addition each experiment is a pair wise comparison and since subtractions are based on a series of sensitive biochemical reactions it is difficult to directly compare a series of RNA samples.
(iii) Differential display. Differential display is another PCR-based differential cloning method (Liang and Pardee, Science 257:967, 1992). In classical differential display, reverse transcription is primed with either oligo-dT or an arbitrary primer. Thereafter an arbitrary primer is used in conjunction with the reverse transcription primer to amplify cDNA fragments and the cDNA fragments are separated on a polyacrylamide gel. Differences in gene expression are visualized by the presence or absence of bands on the gel and quantitative differences in gene expression are identified by differences in the intensity of bands. Adaptation of differential display methods for fluorescent DNA sequencing machines has enhanced the ability to quantify differences in gene expression (Kato, Nucleic Acids Res. 18:3685, 1995).
A limitation of the classical differential display approach is that false positive results are often generated during PCR or in the process of cloning the differentially expressed PCR products. Although a variety of methods have been developed to discriminate true from false positives, these typically rely on the availability of relatively large amounts of RNA.
(iv) Serial analysis of gene expression (SAGE). this DNA sequence based method is essentially an accelerated version of EST sequencing (Valculescu et al., Science 270:484, 1995). In this method a digestible unique sequence tag of 13 or more bases is generated for each transcript in the cell or tissue of interest, thereby generating a SAGE library. Sequencing each SAGE library creates transcript profiles. Since each sequencing reaction yields information for twenty or more genes, it is possible to generate data points for tens of thousands of transcripts in modest sequencing efforts. The relative abundance of each gene is determined by counting or clustering sequence tags. The advantages of SAGE over many other methods include the high throughput that can be achieved and the ability to accumulate and compare SAGE tag data from a variety of samples, however the technical difficulties concerning the generation of good SAGE libraries and data analysis are significant.
Since a single human cell is estimated to express 10,000-30,000 genes, single probe methods to identify all sequences in a complex sample are ineffective and laborious. Thus, according to yet another embodiment, the method comprises obtaining the alternatively spliced mRNA isform in silico from a library of sequences selected from the group consisting of: cDNA, EST and mRNA.
The term “expressed sequence tags”, or “EST”, refers to sequences of randomly picked cDNA clones that were sequenced in a large-scale single-pass to generate over 500,000 human cDNA clones, ESTs, as described in Genome Res., 6:807, 1996. ESTs provide a large reservoir of information on human genes and are potentially powerful tools for discovery of disease genes and regional gene mapping to individual chromosomes particularly since less than 5% of the approximately 100,000 genes in the human genome have been sequenced and assigned biological functions. A number of ESTs from genes expressed in normal, precancerous, and cancerous tissues have been generated, and distributed according to various classifications, in a large number of EST libraries. In an attempt to provide more accurate genetic information, numerous methods have been developed for constructing full-length cDNA libraries from different tissue samples. For example, U.S. Pat. No. 6,265,165 provides a method for cloning of EST-specific full-length cDNA and/or full-length cDNA library. The method involves cDNA synthesis from highly enriched, homogeneously purified mRNAs and includes hybrid selection for purification of specific mRNAs from total RNA by employing antisense oligonucleotide primers of EST sequences in a single or multiplex approach. Devices and computer systems have been developed for collecting information about gene expression or expressed sequence tag (EST) expression in large numbers of tissue samples. For example, PCT application WO92/10588, incorporated herein by reference for all purposes, describes techniques for sequencing or sequence checking nucleic acids and other materials. Probes for performing these operations may be formed in arrays according to the methods of, for example, the pioneering techniques disclosed in U.S. Pat. No. 5,143,854; U.S. Pat. No. 5,571,639 and U.S. Pat. No. 6,308,170 all three incorporated herein by reference for all purposes.
According to a preferred embodiment, the alternatively spliced mRNA isoform is an ACAD-9 mRNA (Zhang et al., ibid). According another preferred embodiment, the disease is ovarian cancer. According to certain embodiment, the alternatively spliced mRNA isoform is TIF-IA mRNA. According to another certain embodiment, the disease is lymphoma.
According to yet another embodiment, the biological specimen is derived from a human or a non-human primate. According to yet another embodiment, the biological specimen is selected from a tissue sample, fluids, cells, cellular extract, tissue extract and genetic material extracted from same, including but not limited to DNA and RNA. According to yet another embodiment, the fluids are selected from the group consisting of: blood, urine, sperm and saliva. According to certain embodiments the biological specimen is fixed tissue microsections selected from: frozen section and paraffin sections among others. The method for detecting a disease in the microsection may be in-situ hybridization (ISH) as known in the art.
According to yet another embodiment, the nucleic acid probe is labeled. The labels include, but are not limited to, fluorophores, chromophores, radioactive isotopes, electron dense reagents, enzymes, and ligands having specific binding partners (e.g. biotin and avidin).
There are essentially four types of nucleic acid probes that can be used for hybridization according to the method of the present invention, wherein hybridization can be performed using in situ hybridization (ISH) protocols among others. The first type of probe is oligonucleotide probe. The oligonucleotide probes are typically produced synthetically by an automated chemical synthesis utilizing readily available deoxynucleotides. Designing the sequence of the probe is one of the more critical decisions required when using oligonucleotide probes and is not a matter of picking any region within the coding region of the target gene to bind to but requires careful design. These probes have the advantages of being resistant to RNases and are small, generally around 40-50 base-pairs. This is ideal for ISH because their small size allows for easy penetration into the cells or tissue of interest. In addition, because they are synthetically designed, it is possible to make a series of probes that have the same GC content; Since G/C base pairs bond more strongly than A/U base pairs, differences in GC content would require different hybridization conditions, so with oligonucleotides protocols can be standardized for many different probes irrespective of the target genes being measured. Another advantage of the oligonucleotide probes is that they are single stranded therefore excluding the possibility of renaturation. Benefits of using oligonucleotide probes.
The other type of probe that may be utilized for hybridization in accordance with the principles of the present invention is single stranded DNA probe. The single stranded DNA probes have similar advantages to the oligonucleotide probes except they are much larger, probably in the 200-500 bp size range. They can be produced by reverse transcription of RNA or by amplified primer extension of a PCR-generated fragment in the presence of a single antisense primer. That is, once the sequence of interest is amplified, a subsequent round of PCR is carried out using the first PCR product as template, but only using the anti-sense primers, thus producing single stranded DNA. This is therefore their disadvantage.
The third type of probe that may be utilized for hybridization in accordance with the principles of the present invention is the double stranded DNA probe. It can be produced by the inclusion of the sequence of interest in a bacteria which is replicated, lysed and the DNA extracted, purified and the sequence of interest is excised with restriction enzymes. On the other hand, if the sequence is known then by designing appropriate primers one can produce the relevant sequence very rapidly by PCR, potentially obtaining a very clean sample. The advantage of the bacterial preparation is that it is possible to obtain large quantities of the probe sequence in question. Because the probe is double stranded, it means that denaturation or melting has to be carried out prior to hybridization in order for one strand to hybridize with the mRNA of interest. These probes are generally less sensitive because of the tendency of the DNA strands to rehybridize to each other and are not as widely used today.
A forth type of probe that may be utilized for hybridization in accordance with the principles of the present invention is RNA probe (cRNA probe or riboprobe). RNA probes have the advantage that RNA-RNA hybrids are very thermostable and are resistant to digestion by RNases. This allows the possibility of post-hybridization digestion with RNase to remove non-hybridized RNA and therefore reduces the possibility of background staining. There are two methods of preparing RNA probes: (a) complimentary RNAs are prepared by an RNA polymerase-catalyzed transcription of mRNA in the 3′ to 5′ prime direction; and (b) in vitro transcription of linearized plasmid DNA with RNA polymerase can be used to produce the RNA probes. Here plasmid vectors containing polymerase from bacteriophages T3, T7 or SP6 are used.
These probes however can be difficult to work with as they are very sensitive to RNases (ubiquitous RNA degrading enzymes) and so, scrupulous sterile technique should be observed or these probes can easily be destroyed. RNA probes may still be used according to the teaching present invention, with in situ hybridization.
Oligonucleotide gene probes have multiple advantages over RNA or cDNA probes when used for in situ hybridization in the context of the present invention. The major advantages are as follows: stability (in particular, they are not degraded by RNAses), availability, faster and less expensive to use, easier to work with, more specific, better tissue penetration, better reproducibility and a wide range of labeling methods that do not interfere with target detection.
For detecting the desired oligonucleotides in frozen tissue sections, using ISH, the following probes and detection means are typically used:
For detecting the desired oligonucleotides in paraffin tissue sections, using ISH, the following probes and detection means are typically used:
There are several options for labeling a nucleic acid probe, which are applicable for use in the present invention. Classically oligonucleotide probes are either 5′ or 3′ end-labeled or 3′ tailed with modified nucleotides that have a “label” attached that can be detected after the probe has hybridized to its target. With end-labeling a single modified ddNTP (that incorporates the label) is added to either the 5′ or the 3′ end of the molecule enzymatically or during probe synthesis. 3′ tailing involves addition of a tail (on average 5-50 nucleotides long of modified dNTPs depending on the method used) using the enzyme terminal transferase (TdT). A recently developed non-enzymatic proprietary labeling technology, GreenGene™ allows for covalent 3′ labeling of oligonucleotides subsequent to probe synthesis. Using this technology an optimized number of labels (either Biotin, FITC, rhodamine or DIG) separated by proprietary linker molecules can be added to the 3′ end of an oligonucleotide probe in one chemical step. Radiolabeled probes, such as 35Sulphur (35S) radioisotope, may be used in the present invention. The radiolabeled probes may be then visualized by exposure of the tissue section or cells (to which the labeled oligonucleotide has be hybridized) against photographic film which is then developed. Instead of using photographic film to detect the probe within the tissue section, the slide containing the section of interest may also be dipped into a photographic emulsion which is allowed to dry. The slide is stored in the dark at −80° C. to allow the slide emulsion to become exposed. After the incubation period, the slides are then developed in the same way as normal photographic film wherein the black silver grains indicate the sites of the labeled transcripts. This is particularly useful for investigating gene expression on a cell by cell level. If immunocytochemistry is performed on the tissue before in situ hybridization, it is possible to examine gene expression in phenotypically defined cell populations at single cell resolution.
On the other hand several non-radioactive labels may be used in the present invention, include Biotin, digoxin and digoxigenin (DIG), alkaline phosphatase and the fluorescent labels, fluorescein (FITC), Texas Red and rhodamine.
Labeling the probe may be achieved using suitable kits, such as GreenStar™ labeled probes. Probes may be 5′ end labeled (addition of a biotin-, rhodamine-, FITC-modified nucleotide to the 5′ end) during the synthesis cycle if required. For 3′ tailing both radiolabels and non-radioactive labels are “attached” to the single stranded oligonucleotide probe by using the enzyme terminal transferase to add a tail of labeled dioxy nucleotide(s) (dNTPs, for example 35S-dATP, DIG-dUTP, Biotin-dUTP or FITC-dUTP) to the 3′ end of the oligonucleotide. Many commercial kits are available that allow to generate 3′ tailed probe (e.g. DIG tailing kits of Roche Molecular Biochemical). It is also possible to end label the 3′ end of the oligonucleotide probe if modified di-deoxy nucleotides (ddNTPs) are used as the substrate for TdT, for example by using DIG-ddUTP instead of DIG-dUTP with a slightly modified protocol. In this case TdT will add a single modified and labeled nucleotide to the 3′ end of the oligonucleotide facilitating later detection.
In contrast to adding the labeled nucleotides to either end of the oligonucleotide probe it is also possible to have labels incorporated into the oligonucleotide when it is being synthesized, for example by adding biotin- or FITC-labeled dATP in place of non-labeled dATP during synthesis so that a label or “tag” appears every time that the ATP nucleotide appears in the probe sequence. However with this method there is the strong possibility especially with short oligonucleotide probes of causing significant disruption of the hybridization of the probe to the target sequence in the tissue.
As mentioned, radiolabeled probes are detectable using either photographic film or photographic emulsion. The fluorescent labels described above are detectible “directly” by using a fluorescent microscope or plate reader to examine the tissue or cells on which the labeled oligonucleotide probe has hybridized to. The use of fluorescent labels with in situ hybridization has come to be known as FISH (fluorescent in situ hybridization) and one advantage of these fluorescent labels is that two or more different probes can be visualized at one time. Additionally FITC labeled probes can be detected using anti-FITC antibodies available from many scientific supply houses.
In contrast both Biotin and DIG labeled oligonucleotide probes generally require an intermediate step(s) before detection of the probe can occur and they are thus detected “indirectly” much like in a typical immunocytochemistry protocol where it would be unusual to have the label on the primary antibody, rather a secondary labeled antibody is used to detect the primary antibody in the tissue section. Specific anti-DIG antibodies can be used to detect the presence of a DIG-labeled probe. Digoxigenin (DIG) is a steroid isolated from the digitalis plant and as the blossoms and leaves are the only known source of digoxigenin, the anti-DIG antibodies are not likely to bind to other biological material. The digoxigenin is linked by a spacer arm containing 11 carbon atoms to the C-5 position of the uridine nucleotide. The advantage of using a DIG labeled probe is that it can be detected with antibodies conjugated to a number of different labels such as alkaline phosphatase, which results in a blue precipitate when the enzyme is incubated in the presence of the substrate NBT/BCIP (Tetrazoliun salt/5-bromo-4-chloro-3 idolyl-phosphate) or becomes a fluorescent label when incubated with HNPP (2-hydroxy-3-naphthoic acid-2′-phenylanilide phosphate). The Anti-DIG antibodies can be conjugated to other labels that require no development, such as FITC, Texas Red or Rhodamine. In some protocols anti-DIG antibodies may be unlabelled wherein a secondary conjugated antibody is used to visualize the probe. Being closely related in structure to digoxigenin, GreenStar™ digoxin hyperlabeled probes can be detected using the same methods and kits designed for digoxigenin detection.
Biotin is the other common compound used in the labeling of oligonucleotide probes. Linked to ATP (other nucleotides have also been biotinylated) it can be detected with antibodies but more often a 65 kd glygoprotein Avidin from egg white or Streptavidin from the fungi Streptomyces avidinii is used, as they have a high binding capacity to biotin and can be conjugated to a similar range of visual and fluorescent labels.
In general, of the two “indirect” labels it is thought that the DIG label is more sensitive than the Biotin label and that the DIG label allows comparable sensitivity to 35S radiolabeled probes.
Applying in situ hybridization for carrying out the present invention, apart from the necessary controls the most important issue is permeablization of the sectioned tissue in order to enable the probe to reach the target, that is the mRNA of the target gene located within the fixed tissue section or the fixed sample of cells. The act of fixation results in cross-linking of proteins, which may present an obstacle to good infiltration of the probe, and finally mRNA sequences are often surrounded by proteins which may mask the target sequence. Therefore permeabilization is essential. There are a number of different elements in permeabilization procedures, wherein the following three reagents are commonly used to permeabilize tissue: HCl (typically, incubation in 0.2M HCl for 20-30 min), detergents (Triton or SDS) and Proteinase K. Detergent treatment, usually with Triton X-100 or SDS, is frequently used to permeabilize the membranes by extracting the lipids. This is not usually required in tissue that has been embedded in wax, but for intact cells or cryostat sections these may be more critical steps. Proteinase K is an endopeptidase which is non-specific and attacks all peptide bonds, is active over wide pH range and not easily inactivated. It is used to remove protein that surrounds the target sequence. Optimal concentration have to be determined but a normal starting concentration is 1 μg/ml. Incubation has to be carefully monitored because if the digestion proceeds to far you could end up destroying most of the tissue or cell integrity.
Pretreatment/Prehybridization is generally carried out to reduce background staining. Many of the non-radioactive oligonucleotide probe detection methods utilize enzymes such as peroxidases or alkaline phosphatases to visualize the label. Therefore any endogenous tissue enzymes which could result in giving a very high background should be neutralized. This can be achieved with peroxidases by treating the tissues with 1% H2O2 in methanol for 30 minutes. For Alkaline phosphatases, the drug levamisole may be added to the substrate solution. In general, however, this is considered to be unnecessary since residual alkaline phosphatase activity is usually lost during hybridization.
Another commonly observed pre treatment when using RNA probes is acetylation with acetic anhydride (0.25%) in triethanolamine. This treatment is thought to be important for decreasing background but it also appears to inactivate RNases and may help in producing a strong signal. Luckily RNases are not an issue when oligonucleotide probes are used. Fixation of tissue effectively “secures the target RNA” i.e. the mRNA within the tissue from RNAse digestion and oligonucleotide probes are naturally resistant to RNases. This is why solutions do not need to be RNase free when using oligonucleotide probes with in situ hybridization.
Prehybridization involves incubating the tissue/section with a solution that is composed of all the elements of the hybridization solution, minus the probe although not all protocols use a prehybridization step.
The composition of the hybridization solution is critical in controlling the efficiency of the hybridization process. Hybridization depends on the ability of the oligonucleotide to anneal to a complementary mRNA strand just below its melting point (Tm; the value of the Tm is the temperature at which half of the oligonucleotide is present in a single stranded form).
The major factors that influence the hybridization of the oligonucleotide probe to the target mRNA are: Temperature; pH; monovalent cation concentration and presence of organic solvents. Typical hybridization solutions with a hybridization temperature of around 37° C. and an overnight incubation period are: dextran sulphate; the organic solvents formamide and DTT (dithiothreitol); SSC (NaCl+Sodium citrate); and EDTA. Other components are added to decrease the chance of nonspecific binding of the oligonucleotide probe, for example: ssDNA, tRNA (acts as a carrier RNA), polyA and Denhardts solution.
Following hybridization the material is washed to remove unbound probe or probe which has loosely bound to imperfectly matched sequences. Washing should be carried out at or close to the stringency condition at which the hybridization takes place with a final low stringency wash.
The terms “stringent” or “stringency condition” as used herein, refer to the conditions for hybridization as defined by the nucleic acid, salt, and temperature. These conditions are well known in the art and may be altered in order to identify or detect identical or related polynucleotide sequences. Numerous equivalent conditions comprising either low or high stringency depend on factors such as the length and nature of the sequence (DNA, RNA, base composition), nature of the target (DNA, RNA, base composition), milieu (in solution or immobilized on a solid substrate), concentration of salts and other components (e.g., formamide, dextran sulfate and/or polyethylene glycol), and temperature of the reactions (within a range from about 5° C. below the melting temperature of the probe to about 20-25° C. below the melting temperature). One or more factors be may be varied to generate conditions of either low or high stringency.
Of course an important part of any experimental procedure is the inclusion of controls. Thus, according to one embodiment, the method further comprises contacting said biological specimen with a control nucleic acid probe capable of forming a hybrid with at least one control gene or fragments thereof. According to yet another embodiment, the at least one control gene is selected from the group consisting of: β-actin, glycer-aldehyde-phophate-dehydrogenase (GAPDH), S16 rRNA, phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR), adenosine deaminase, phosphoglycerol kinase (PGK), pyruvate kinase, phosphoglycerol mutase.
According to yet another embodiment, the method further comprises contacting the nucleic acid probe with a biological standard. According to certain embodiments, the biological standard is derived from ostensibly healthy tissue, without any discernible pathology, of a human or a non-human primate and said biological specimen is derived from a non-healthy tissue derived from the human or a non-human primate. Alternatively, said biological specimen is derived from a non-healthy tissue derived from a different human or non-human primate.
In carrying out an in situ hybridization experiment the following controls are commonly used: polyd(T) probe (detects total mRNA poly A tails), nonsense probe (GeneDetect™), sense and antisense probes, pan-species actin probe and probes against house keeping sequences including, but not limited to, Actin and beta-tubulin.
In order to determine that the probe is only binding to the desired mRNA it is recommended to digest the tissue with RNase, in representative control samples, prior to hybridization with the oligonucleotide probe. The absence of binding after RNase treatment indicates that binding was indeed to RNA within the tissue.
For verifying specific versus non-specific binding the first control involves hybridization of the tissue with both labeled sense and antisense probes in parallel. The antisense probe in theory detects both the target mRNA and any non-specific targets it can bind to due to the chemical properties of the probe (but not due to the probe sequence). The sense control probe gives a measure of non-specific probe binding only due to the chemical properties of the probe. In essence if the sense probe detects nothing, then it may be deduced that any signal detected by the antisense probe is due to sequence-specific binding to mRNA and not due to binding to other targets within the cell. Competition studies with labeled and excess unlabeled probes also help distinguish between specific versus non-specific binding. This is because by definition specific binding is saturable (i.e. there are finite target mRNA molecules to which the probe can bind) while non-specific binding is not (there are infinite non-specific targets). Therefore excess unlabeled probe can displace (by competition for binding sites) the specific binding of the labeled probe (i.e. to the target mRNA) but not non-specific binding of the labeled probe. According to certain alternative embodiments, the tissue is prehybridized with 10× molar amount of: (1) unlabeled correct sequence oligonucleotide probe and (2) unlabelled nonsense probe before hybridizing with labeled probe. The nonsense probe should preferably have a similar CG content, a similar length and have no homology to the sequence of interest. It is important to note however that competition studies do not verify the identity of the mRNA to which the labeled probe is binding since both the labeled and unlabeled probes have the same sequence.
Another way to ensure that the probe is binding to the correct target sequence is by choosing a correct probe sequence, preferably in silico, from the start and having high stringency hybridization and wash conditions. In summary the following controls are performed in parallel with ISH experiments: poly(dT) probe hybridized to sections, RNase treatment of sections before labeled antisense probe hybridized, hybridize in parallel labeled sense and labeled antisense probes and hybridize labeled antisense probe in presence of a) 10× unlabeled antisense probe and separately in presence of b) 10×unlabeled nonsense probe.
According to yet another embodiment, the present invention provides a method for detecting a disease in a biological specimen utilizing polymerase chain reaction, the method comprising:
According to yet another embodiment, the method comprises selecting the alternatively spliced mRNA isforms from a library of sequences selected from the group consisting of: cDNA, EST and mRNA.
According to yet another embodiment, the plurality of primers comprises at least one primer that is labeled to enable detection of the amplified products. According to some alternative embodiments, the amplified products are separated by means of molecular size exclusion and the size of the amplified products.
According to yet another embodiment, the reaction mixture comprises the four nucleoside triphosphates: dATP, dCTP, dTTP, and dGTP. According to an alternative embodiment, the reaction mixture comprises dATP, dCTP, dTTP, dGTP, and an analog of dGTP, including but not limited to, inosine, 7-deaza-guanosine, 7-deaza inosine deoxyribonucleotides and 2′-deoxy analogs thereof.
According to another embodiment, the biological specimen comprises genetic material extracted from tissue, cells and fluids. According to yet another embodiment, the fluids are selected from the group consisting of: blood, urine and saliva.
According to a certain embodiment, the biological specimen is derived from a cancer patient. According to another certain embodiment, the standard is a biological specimen derived from a healthy individual.
According to a preferred embodiment, the alternatively spliced mRNA isoform is an ACAD-9 mRNA. According another preferred embodiment, the disease is ovarian cancer. According to certain embodiment, the alternatively spliced mRNA isoform is TIF-IA mRNA. According to another certain embodiment, the disease is lymphoma.
Labels may be attached to the primer using a variety of techniques and can be attached at the 5′ end, and/or the 3′ end and/or at an internal nucleotide as mentioned above. The label can also be attached to spacer arms of various sizes, which are attached to the primer. These spacer arms are useful for obtaining a desired distance between multiple labels attached to the primer. The fluorescent dyes useful for labeling the primers are commercially available and include fluorescein dyes, rhodamine dyes, and various derivatives thereof including, but not limited to, FAM (6-carboxyfluorescein), Lucifer yellow, TAMRA (6-carboxy-tetramethyl-rhodamine) or JOE (2′,7′-dimethoxy-4′,5′-6-carboxyrhodamine). Often functional groups are introduced into the phenyl group of the dyes to serve as a linkage site to an oligonucleotide.
The PCR is performed under conditions known in the art. Typically, the range of temperatures in which denaturation of DNA is achieved is between 90° C. to 105° C. for a time period of 0.5 to 6 minutes. The annealing process during which the primer hybridizes with its template is performed at a temperature of about 37° C. to about 70° C., preferably at a temperature of about 45° C. to 70° C. for about 0.5 to 5 minutes, more preferably for 1-3 minutes. The extension of the PCR products is achieved by heating the reaction mixture to a temperature of about 40° C. to 80° C. for a period of about 0.5 to 40 minutes. The number of PCR cycles is variable but preferably the number is at least 10-20 times and at most 30-40 times.
To improve the yield of the PCR extension products, at least one additional constituent selected from DMSO, formamide, Tween 20, and tetramethylammonium chloride (TAM) are typically added to the reaction mixture at the respective concentration of 1-5%, 0.1-1%, 15-100 mM. Preferably, DMSO is added at a concentration of about 5 to 15%.
Analysis of the fluorescently labeled PCR products is performed typically on a denaturing gel. The denaturing gel may be composed of polyacrylamide and urea in a ratio of 6% polyacrylamide and 7 mM urea. Alternatively, other ratios of polyacrylamide and urea or other denaturing gels may be used as known in the art.
Monitoring the amplified product during amplification may be performed using any of a variety of real time amplification methods. For example, certain methods involve monitoring the formation of amplification products directly using labels, which bind to the amplification product to form a complex that creates a detectable signal. Alternatively, the formation of amplification products can be monitored using probes, such as fluorescently labeled probes, which are complementary to the amplification products. During the amplification process, alteration of the probe generates a detectable signal, which correlates with the formation of the amplification product. Fluorogenic nuclease assays such as the “TaqMan” assay exemplify this type of approach, wherein a probe is used to monitor amplification product formation.
According to yet another aspect, the present invention provides a method for preventing or inhibiting a disease in a subject in need thereof, comprising treating the subject in need thereof with a therapeutically effective amount of a pharmaceutical composition containing as an active ingredient a polynucleotide capable of hybridizing to an oligonucleotide comprising a sequence selected from: Alu-derived exon, alternatively spliced mRNA isoform comprising one or more Alu-derived exons and fragments thereof.
According to another embodiment, the genetic disease is selected from the group consisting of: Alport syndrome, Sly syndrome, CCFDN (congenital cataracts, facial dysmorphism, and neuropathy) syndrome and OAT deficiency.
According to one embodiment, the polynucleotide is selected from the group consisting of: antisense nucleotide sequence, sense nucleotide sequence, short interfering RNA, ribozyme and aptamer.
In recent years, advances in nucleic acid chemistry and gene transfer have inspired new approaches for treating genetic diseases by gene interference. Antisense technology has been one of the most commonly described approaches in protocols to achieve gene-specific interference. For antisense strategies, stochiometric amounts of single-stranded nucleic acid complementary to the messenger RNA for the gene of interest are introduced into the cell.
International Publication No. WO 02/10365 provides a method for gene suppression in eukaryotes by transformation with a recombinant construct containing a promoter, at least one antisense and/or sense nucleotide sequence for the gene(s) to be suppressed, wherein the nucleus-to-cytoplasm transport of the transcription products of the construct is inhibited. In one embodiment, nucleus-to-cytoplasm transport is inhibited by the absence of a normal 3′ UTR. The construct can optionally include at least one self-cleaving ribozyme. The construct can also optionally include sense and/or antisense sequences to multiple genes that are to be simultaneously downregulated using a single promoter. Also disclosed are vectors, plants, animals, seeds, gametes, and embryos containing the recombinant constructs.
European Patent Application No. 0223399 A1 describes methods for the use of genetic engineering technology in plants to achieve useful somatic changes to plants, not involving the expression of any exogenous proteins, but instead controlling the expression of an endogenous protein or other DNA or RNA factor naturally introduced into the plant cells through outside agents, such as agents of disease or infection.
Antisense has recently become accepted as therapeutic moieties in the treatment of disease states. For example, U.S. Pat. No. 5,098,890 is directed to antisense oligonucleotide therapies for certain cancerous conditions. U.S. Pat. No. 5,135,917 provides antisense oligonucleotides that inhibit human interleukin-1 receptor expression. U.S. Pat. No. 5,087,617 provides methods for treating cancer patients with antisense oligonucleotides. U.S. Pat. No. 5,166,195 provides oligonucleotide inhibitors of HIV. U.S. Pat. No. 5,004,810 provides oligomers capable of hybridizing to herpes simplex virus Vmw65 mRNA and inhibiting replication. U.S. Pat. No. 5,194,428 provides antisense oligonucleotides having antiviral activity against influenza virus. U.S. Pat. No. 4,806,463 provides antisense oligonucleotides and methods using them to inhibit HTLV-III (human T-cell lymphotropic virus type III, known also as HIV) replication. Antisense oligonucleotides have been safely administered to humans and several clinical trials of antisense oligonucleotides are presently underway. It is, thus, established that oligonucleotides can be useful therapeutic instrumentalities and that the same can be configured to be useful in treatment regimes for treatment of cells and animals, especially humans.
Aptamers are specifically binding oligonucleotides for non-oligonucleotide targets that generally bind nucleic acids. The use of single-stranded DNA as an appropriate material for generating aptamers is disclosed in U.S. Pat. No. 5,840,567. Use of DNA aptamers has several advantages over RNA including increased nuclease stability, in particular plasma nuclease stability, and ease of amplification by PCR or other methods. RNA generally is converted to DNA prior to amplification using reverse transcriptase, a process that is not equally efficient with all sequences, resulting in loss of some aptamers from a selected pool.
The methods of the invention may be further utilized for screening short interfering RNAs (siRNAs; Fire et al., Nature 391:806, 1998). RNA interference refers to the process of sequence-specific post-transcriptional gene silencing in animals mediated by siRNAs. The corresponding process in plants is commonly referred to as post-transcriptional gene silencing or RNA silencing. The process of post-transcriptional gene silencing is thought to be an evolutionarily conserved cellular defense mechanism used to prevent the expression of foreign genes and is commonly shared by diverse flora and phyla.
RNA interference is a phenomenon in which double stranded RNA (dsRNA) reduces the expression of the gene to which the dsRNA corresponds. The phenomenon of RNAi was subsequently proven to exist in many organisms and to be a naturally occurring cellular process. The RNAi pathway can be used by the organism to inhibit viral infections, transposon jumping and to regulate the expression of endogenous genes (e.g. Zamore P D., Nat Struct Biol. 8:746, 2001).
International Publication No. WO 00/01846 discloses methods for identifying specific genes responsible for conferring a particular phenotype in a cell using specific dsRNA molecules. International Publication No. WO 01/29058 discloses specific genes involved in dsRNA-mediated RNAi. International Publication No. WO 99/07409 discloses specific compositions consisting of particular dsRNA molecules combined with certain anti-viral agents. International Publication No. WO 99/53050 discloses certain methods for decreasing the phenotypic expression of a nucleic acid in plant cells using certain dsRNAs. International Publication No. WO 01/49844 discloses specific DNA constructs for use in facilitating gene silencing in targeted organisms.
International Publications Nos. WO 02/055692, WO02/055693, and EP 1144623 B1 disclose methods for inhibiting gene expression using RNAi. International Publications Nos. WO 99/49029 and WO01/70949, and AU 4037501 describe certain vector expressed siRNA molecules. U.S. Pat. No. 6,506,559, discloses methods for inhibiting gene expression in vitro using certain siRNA constructs that mediate RNAi. U.S. Pat. No. 5,681,747 discloses methods for inhibiting human-PKCα expression with an oligonucleotide specifically hybridizable to a portion of the 3′-untranslated region of PKCα.
The following examples are to be construed in a non-limitative fashion and are intended merely to be illustrative of the principles of the invention disclosed.
Materials & Methods:
Building a Genomic Alu Profile
The Alu Gene database was used to extract full-length genomic Alu sequences that are found within introns in the antisense orientation of Alu. This search yielded 166,276 full-length Alus. Each of these Alus was aligned to its ancestral sequence using ClustalW 1.8. For each position along the Alu consensus, occurrence of each nucleotide was counted, and a frequency profile was built. In a similar manner, the sequences of 25 Alus that were exonized using position 157 as 5′ss were aligned to their consensus, and an “observed” profile was built. A parallel “expected” profile was calculated by multiplying the frequency profile matrix from the 166,276 non-exonized intronic Alus by 25 (the number of exonized sequences). Statistical significance between the observed and expected profile were calculated using Chi statistical test.
Plasmid Constructs
Oligonucleotide primers were designed to amplify (from human genomic DNA) a mini-gene that contains exons 7, 8, and 9 of the adenosine deaminase (ADAR2) gene, and exons 11, 12, and 13 of the putative glucosyltransferase (PGT) gene. Each primer contained an additional extension encoding a restriction enzyme sequence. The PCR product of ADAR2 and PGT (2.2 kb and 3 kb, respectively) was restriction digested and inserted between the KpnI/BglII sites in the pEGFP-C3 vector (Clontech). The hSlu7 cDNA was a kind gift from Robin Reed and was inserted as described above into pEGFP-C3. The U1 gene was cloned in the pCR vector.
Site-Directed Mutagenesis
Oligonucleotide primers containing the desired mutations were used to amplify the mutation-containing replica of the wild-type ADAR2 mini-gene plasmid or the U1 gene respectively. The products were treated with DpnI restriction enzyme (12U) (New England Biolabs) at 37° C. for 1 h. 1-4 μL of the mutant DNA was transformed into E.coli DH5α strain. Colonies were picked followed by mini-prep (QIAgene) and midi-prep (BRL). All plasmids were confirmed by sequencing.
Transfection, RNA Isolation and RT-PCR Amplification
293T, HeLa and HT1080 cells lines were cultured in Dulbecco's Modification of Eagle Medium, supplemented with 4.5 g/ml glucose (Biological Industries) and 10% fetal calf serum, and cultured in 60 mm dish under standard conditions at 37° C. with 5% CO2. Cells were grown to 50% confluence, and transfection was performed using Metafectene (Biontex) with 10 μg of plasmid DNA or using FuGENE6 (Roche) with 6 μg of plasmid DNA. Cells were harvested after 48 hr. Total cytoplasmic RNA was extracted using Tri Reagent (Sigma), followed by treatment with 1U of RNase-free DNase (Promega). Reverse transcription (RT) was preformed on 2 μg total cytoplasmic RNA for 1 hr at 42° C., using a pEGFP-C1-specific reverse primer and 2U of reverse transcriptase avian myeloblastosis virus (A-AMV, Roche).
The spliced cDNA products derived from the expressed mini-genes were detected by PCR, using the pEGFP-C3-specific reverse primer and an exon 7- or 11-forward primer (ADAR2 and PGT respectively): Amplification was performed for 30 cycles, consisting of 1 min at 94° C., 45 sec at 61° C., and 1 min at 72° C. The products were resolved on 2% agarose gel and confirmed by sequencing. The level of mRNA of the house-keeping gene, Glycerol-3-phosphate dehydrogenase, was used as the internal control for each transfection.
Nonsense-Mediated Decay (NMD)
The level of the isoforms are unaffected by the Nonsense-Mediated Decay. This was indicted by incubation of the transfected cells with 300 μg/ml puromycin (Sigma) for 4 hr before RNA collection.
Real-Time PCR
The LightCycler PCR and detection system (Roche) was used for quantification of the PCR products. The PCR reaction for each cDNA was performed twice, using specific primers—one amplified only the upper band (exons 7, 8, and 9) and the other amplified only the lower band (exons 7 and 9). The PCR mixture (Roche) contained Taq DNA polymerase, reaction mix (buffer, SYBR Green I dye, dNTPs with dUTP instead of dTTP, 13 mM magnesium chloride) and 12.5 pmol of primers. The samples were run for 45 cycles of repeated 10 sec at 95° C., 10 sec at 61° C., and 10 sec at 72° C. Another reaction was performed, using specific primers for the GAPDH gene, that was used as an endogenous expressed control.
To study the alternative splicing regulation of exonized Alus, a dataset of exonized Alus was compiled from the human genome. Analysis of this dataset revealed that two positions along the inverted Alu sequence are most commonly used as 3′ splice-site (3′SS) in Alu exonizations: position 279 (‘proximal AG’) and position 275 (‘distal AG’). The relationships between two near AGs in a 3′SS were well characterized previously in the context of constitutive splicing. To pinpoint the sequence determinants by which the spliceosome decides which of the two possible AGs is selected in the context of alternative splicing, the exonized Alus that use either of these AGs were aligned to their ancestor.
Rows 30-32 represent the 3′SS of Alu sequences that their constitutive exonization was shown to cause genetic diseases: Alport syndrome (COL4A3), Sly syndrome (GUSB) and OAT deficiency (OAT). The mutation causing Alport syndrome is marked light-blue (position −7 G to T); exonizaiton in Sly syndrome and OAT deficiency resulted from mutations in the 5′SS. As shown in
However, in five cases (
A mini-gene of ADAR2 gene (adenosine deaminase, involved in RNA editing) was cloned (
To characterize the relationship between proximal and distal AGs in the context of alternative splicing, a set (13) of mutations within the 3′SS was generated in the wild-type ADAR2 (
While the Alu-exon in the wild-type ADAR2 was included in 40% of the transcripts (
To check whether the proximal AG affects the selection of the distal AG, the proximal AG was mutated to UC or GA (
We then sought to understand whether the nucleotide composition between the two adjacent AGs affects 3′SS selection and ratio of alternative splicing. The two AGs are separated by an AC dinucleotide (
To test whether increased distance between the two AGs shifts the selection toward the proximal AG, additional nucleotides between the two AGs were introduced (
We further examined whether hSlu7 (Human Synergistic Lethal with U5 snRNA), a second step splicing factor, is involved in the activation of the proximal AG (
Rows 17-22 in
It was further assumed that, in normal conditions, these Alu exons would be skipped. For investigating this assumption, a gene encoding a putative glucosyltransferase (PGT) (
Indeed, when PGT mini-gene was transfected into HT1080 and to 293T cell-lines, only a single mRNA isoform appeared, corresponding to Alu-exon skipping (
Position −7 in the PGT mini-gene was mutated in order to test whether such mutation in a completely silent intronic Alu element would result in exonization. As seen in
Data for 26 exonized Alus (Sorek et al. 2002, ibid) are shown in
The comparison of adjacent regions of the exonized Alus with the Alu antisense consensus sequence allowed the identification of nucleotide positions along the Alu sequence that need to be changed or remain conserved in order to enable Alu exonization. Two types of 5′ss were found: Introns that start with GC (
The most significant change observed between exonized Alus and their ancestral sequences was at position 156 (position 2 of the intron), where a mutation from C to T generates a canonical GT 5′ss at positions 157-156. This occurred in 21 of the 25 (84%) exonized Alus. In those cases in which positions 157-156 remained GC as in the ancestral sequence (rows 1-4), position 155 (representing position 3 of the intron) was found to be mutated from G to A.
In the ancestral sequence, CG dinucleotides are found in positions 156-155, 154-153, and 152-151 (
Some of the changes observed in
Since sense and antisense-oriented Alus can be under different selective pressures in the human genome, we also compiled a dataset of 136,151 intronic Alus that are found in the sense orientation in introns and repeated the statistical comparison to exonized Alus. This yielded the same results, indicating positions 2 and 5 of the intron as significantly different between exonized and non-exonized Alus (
Significantly, the same mutation, from C to T at position 156, of an antisense Alu sequence in intron 6 of the CTDP1 gene was found to be the cause of CCFDN (congenital cataracts, facial dysmorphism, and neuropathy) syndrome (
Following these results we wished to understand how the alternative splicing of these exonized Alus is regulated. In mRNA splicing, the 5′ss is recognized by three small nuclear RNPs (complexes of snRNA and proteins), one of which (U1) basepairs across the 5′ss junction (potential base-pairing between positions −3 to +6; for clarity, “−” and “+” indicate positions upstream or downstream of the 5′ss, respectively). This base-pairing is a prerequisite step for splicing in most introns. Although the importance of the U1 snRNA:5′ss base-pairing is well established in constitutive splicing, the function of this base-pairing in alternative splicing is only partially understood. We, therefore, set to understand the manner in which U1 affects the alternative splicing of Alu exons.
We used a mini-gene containing the genomic sequence of adenosine deaminase gene, ADAR2, from exon 7 to 9, in which exon 8 is an alternatively spliced Alu exon (Lev-Maor et al., ibid). The 5′ss of this Alu exon is of the GC type (
Our results indicate that the GC 5′ss maintains alternative rather than constitutive splicing directly because the C in position 2 of the 5′ss un-pairs with U1. This conclusion is supported by the fact that a compensatory mutation in U1 (A to G in position 7), which restores the base-pairing with position 2, results in the constitutive splicing of the exon (
As expected from our in silico analysis (
Position 4 of the 5′ss affects the level of alternative splicing of the Alu exon. Mutation of that position from T to A, which strengthens the base-pairing with position 5 in U1, resulted in constitutive inclusion of the Alu exon (
To understand whether or not a cooperative effect between positions 3 and 4 exists, we produced double mutants and tested their inclusion ratios. Mutations that introduced a mis-pairing between U1 and position 3 (A to T) but allowed better base-pairing of U1 with position 4 (T to A), restored alternative splicing of the Alu exon with a ratio of inclusion of 78% (
The cooperative effect between positions 3 and 4 did not occur in double mutants in which position 3 was mutated to C and position 4 to A (
We further studied the importance of positions 5 and 6 in GC 5′ss. Mutations in these positions resulted in the total skipping of the Alu exon, which compensatory mutations in U1 failed to restore (
To study the regulation of Alu exons with GT in their 5′ss, we mutated the C in position 156 of the ADAR2 Alu exon to T, creating a GT 5′ss (
In contrast to the GC 5′ss, which required an obligatory A in position 3, mutation of that position to G in the GT 5′ss reduced the splicing from constitutive to alternative, but did not eliminate exon inclusion entirely (
To test the importance of position 4 in the GT 5′ss, we mutated it to A, which base-pairs with the WT U1. This maintained the constitutive inclusion of the Alu exon (
From these serial mutations in the 5′ss and the compensatory mutations in U1 we conclude the following: (i) U1:5′ss base-pairing is involved in both GC and GT 5′ss selection. (ii) The alternative splicing of ADAR2 Alu exon is maintained due to the un-pairing of U1 with position 2 of the 5′ss. (iii) The nucleotide composition of positions 3 and 4 in the intron, both in GT 5′ss and in GC 5′ss, control the delicate skipping/inclusion ratio depending on the canonical/non-canonical base-pairing of these positions with U1. (iv) An A in position 3 of the intron is more important in GC 5′ss than in GT 5′ss, possibly due to the need to avoid two successive non-pairing positions. (v) G in position 5 is essential for the selection of 5′ss in the two types of introns.
The results for position 4 of the 5′ss indicate that when this position is mutated to A the ADAR2 exon becomes constitutive. However, there are two cases among the 25 alternatively spliced Alu exons in our compilation that contain A in position 4 (
When position 6 in ADAR2 is mutated to C, the 5′ss score is reduced from 73 to 65 (
When position 4 in ADAR2 is mutated to A, the 5′ss score increases from 73 to 83, and the AG becomes −6.0 (
Overall, the 5′ss score for exonized Alus was similar to that for non-Alu alternative cassette exons, averaging 78.3 in the 25 Alu exons and 79.8 in the set of 243 non-Alu cassette exons from Sorek and Ast (Genome Res. 13:1631-7, 2003). The free energy value was also similar between the sets, with an average of −5.08 in the 25 Alu exons and −5.29 in the non-Alu cassette exons.
The results presented here and in Lev-Maor et al. (ibid) indicate that the sequence composition of both 3′ss and 5′ss determines the alternative splicing ratio of Alu exons. In addition, our statistical comparison between intronic Alus and exonized ones, shows that only two positions along the Alu sequence substantially differ between intronic and exonized Alus; both in the 5′ss sequence (
Still, it is possible that the splicing of Alu exons is regulated by splicing enhancers or silencers residing outside the splice sites. We used the ESE finder to search the sequence of the Alu exon of ADAR2 for such potential sequences. Seven potential binding sites for SR proteins were found, 3 of them in close proximity (
To examine whether other sites are involved in the regulation of the Alu exon splicing, we co-transfected 293T cells with the ADAR2 mini-gene and various plasmids containing the most abundant splicing regulatory proteins (SR proteins and hnRNP A1, the later was shown to promote exon-skipping (
The sequence elements needed for the creation of an alternatively spliced Alu exon are depicted in
The results presented in the present application demonstrate that an activation of an intronic Alu sequence (silent Alu sequence never spliced in) can be induced either by point mutations in certain positions along the Alu sequence, or by changing the concentration of the splicing regulatory proteins in the cell. Both high rate of mutations and aberrant concentration of splicing regulatory proteins are implicated in cancer cells.
If alternative RNA splicing isoforms are associated with specific cancer cell types, then they could potentially serve as diagnostic markers for cancer. This idea was tested herein. Alternative splicing isoforms of ACAD-9 gene (e.g. SEQ ID NO:64) containing Alu exons (e.g. SEQ ID NO:63) were classified as tumor or normal, based both on the source of the mRNA used to construct the relevant cDNA library and whether the isoform was present at a higher frequency in tumor cDNA libraries than in normal cDNA libraries. This analysis showed that 24 out of the 61 alternative RNA-splicing isoforms containing Alu exon were significantly associated with tumor libraries. The alternative-splicing isoforms were associated with ovary, brain, head-neck, lung, kidney, and prostate cancers. A partial list presented in Table 1.
To validate the computation-predicted alternative RNA-splicing isoforms, we selected two genes: ACAD-9 (AF327351) and TIF-IA.
ACAD-9 belongs to the family of Acyl-CoA dehydrogenses which are mitochondrial enzymes catalyzing the initial rate-limiting step in the b-oxidation of fatty acyl-CoA. The reaction provides main source of energy for human heart and skeletal muscle. The ACAD-9 gene contains 18 exons mapped to chromosome 3q26, it shares 65% similarity with VLCAD (very long chain acyl-CoA dehydrogense). Deficiency of this protein may leads to severe human genetic diseases, for example: sudden death in infancy, infantile cardiomyopathy, hypoketotic hypoglycemia, hepatic dysfunction. The protein is ubiquitously expressed in most normal tissues with high expression in heart, skeletal muscles, brain, kidney and liver and also in most cancer cell lines.
For ACAD-9 gene we identified 7 ESTs (Expressed Sequence Tag), from 2 different libraries, containing the Alu exon, all 7 ESTs were detected exclusively in human ovarian cancer. The exonized Alu insert is a 76 bases exon originated from intronic Alu in intron 4 (
The alternative splicing pattern of the ACAD-9 gene in ovarian cancer was examined in 16 tissue samples of human ovarian cancer (obtained from Gynecologic Oncology Group) and 3 samples of “normal” tissues (derived from the humans bearing ovarian cancer). Total cytoplasmic mRNA was isolated and cDNA synthesis was performed using primers that were designed to detect regular- and alternative-splicing products (directed to the flanking exons of the Alu element): Reverse primer—5′ CACTGATCCATCAGAATCAACG 3′ (SEQ ID NO: 61) and Forward primer—5′ GCACTACATCCTCAATGGCTC 3′ (SEQ ID NO: 62). Reaction products were analyzed by agarose gel electrophoresis (
The fate of ACAD-9 gene, in the level of genomic DNA, mRNA and proteins in is examined in a collection of 100 ovarian cancer tissues, normal tissues, and uterus cancer tissues.
TIF-IA is a 75 kDa protein that plays a major role in directing growth-dependent regulation of ribosomal DNA transcription. The amount or activity of TIF-IA fluctuates in response to cell proliferation. TIF-IA is also an essential initiation factor of RNA polymerase I (Pol I) transcription in growing cells. The fact that regulation of ribosomal gene transcription is a crucial element in controlling cell growth, and normally balances the effects of growth factors to prevent uncontrolled cell proliferation, appoints TIF-IA as a fundamental functional protein in adapting cellular biosynthetic activities to cell growth.
The TIF-IA gene was found to have six mRNA isoforms which contain an Alu exon. These TIF-IA isoforms were detected mostly in various cancers as shown in
Exonization of intronic Alu element derived from the Sx subfamily is presented schematically in
The function of the Alu exon(s) in TIF-IA is unknown. The most prevalent Alu exon (4 occurrences) is 180 bp long and does not posses a stop codon, therefore after splicing, the sequence translates to a protein with an additional domain whose function is yet unknown. Bioinformatics analysis reveals potential phosphorylation sites on this region.
The alternative splicing pattern of the TIF-IA gene in different leukemia (mostly lymphatic) cell-lines was examined. Total cytoplasmic mRNA was isolated and cDNA synthesis was performed using primers that were designed to detect regular- and alternative-splicing products (directed to the flanking exons of the Alu element; Forward primer: GTC GCG TTA GTT CGG CCC (SEQ ID NO:59); Reverse primer: CTT CAG CAA GAC TTC TGT CAC (SEQ ID NO:60). Reaction products were analyzed by agarose gel electrophoresis (
The presence of this RNA and is sought by screening the number and type of the mRNA isoforms generated from this gene on at least 100 primary samples obtained, with consent, from patients suffering from childhood acute lymphoblastic leukemia. These samples are collected as a part of the Israeli National Study of childhood leukemia treated on the international BFM protocol. Specificity is obtained by of collecting and performing a similar analysis on 100 samples of tissues from ovarian and uterus cancers on samples of normal peripheral blood cells or bone marrow cells (collected from donors for stem cell transplantation). The examination is carried out as follows:
To examine the effect TIF-IA mRNA containing an Alu exon on different cellular activities TIF-IA cDNA is cloned with and without the Alu exon. The ultimate goal of the TIF-IA cDNA cloning is to use it as the basis for functional studies in cell lines. Total RNA is isolated from several lymphoblastic cell lines known to express the two mRNA isoforms. cDNA is produced by reverse transcribing the mRNA and large amounts of cDNA encoding both isoforms of TIF-IA are obtained using PCR. The amplified product(s) are cloned into the pEGFP-C3 reporter plasmid (Clontech), which expresses the Green Fluorescence Protein (GFP), and into pcDNA3.1 (Invitrogen) which possesses the strong CMV promoter and a T7 RNA polymerase promoter and to MIGR1 retrovirus that is based on the MSCV promoter and expresses GFP under IRES control. Constructs are confirmed by sequencing.
I. Association of SRP9/14 with the Alu-exon. The aim of this experiment is to check whether SRP9/14 proteins bind the Alu exon of TIF-IA mRNA in the cells. RNA folding programs predicted that the Alu exon of TIF-IA contain a putative binding site for SRP9/14 proteins (
II. The effect of TIF-IA with the Alu-exon on translation. The purpose of the in vitro transcription and translation assay is to study the possible affect of the Alu exon on translation. The hypothesis behind this experiment is that mRNA containing an Alu exon that binds SRP9/14 proteins may effect normal translation. Therefore, TIF-IA cDNA isoforms inserted into the pcDNA3.1 plasmid are incubated in the T7 coupled transcription and translation reticulocyte lysate system (Promega) in the presence of [35S] methionine (Amersham). The protein products are resolved in 10% polyacrylamide SDS gel and the amount of protein produced is evaluated using a phosphorimager.
The effect of the Alu RNA on translation is examined by adding these RNAs to an in vitro translation system and exploring the level of translation of several reported genes. To investigate the effect of Alu RNA on the expression of a reporter gene in vitro, Alu is cloned in the pGEMluc plasmid containing the luciferase gene driven by the SP6 promoter. The construct is used as substrates for coupled in vitro transcription/translation using a rabbit reticulosyte lysate translation system supplemented with cytoplasmic extracts from Human HeLa cells. Luciferase activity is determined using a luminometer, and luciferase protein levels are determined by denaturing gel analysis of translation products. UV crosslinking experiments are carried out for determining the differences in binding of SRP proteins in the cytoplasmic extracts to Alu RNA according to the protocols of Malca et al. (Mol Cell Biol 23:3442-55, 2003).
Cytoplasmic extracts of HeLa cells are incubated with a labeled transcript. RNase A and T1 are added and proteins are resolved in SDS-polyacrlamide gels. An excess of unlabeled specific or non-specific Alu RNA are added to determine specificity of the protein binding.
III. The effect of T1F-IA with Alu exon on cell growth. Previous studies have shown that different regions of TIF-IA have a functional significant activity on polymerase I transcription and on cell growth (Zhao et al., Mol Cell 11:405-13, 2003). In order to test whether the Alu exon of TIF-IA enhances or abrogates TIF-IA activity thus positively or negatively effecting cell cycle or growth, a Fluorescence Activated Cell Sorter (FACS) analysis is performed. To test this, 293T and HeLa cells are transfected with cDNAs encoding both isoforms of TIF-IA co-expressed with the GFP protein. At several time points (24, 36, 48, and 60 hr) after transfection cells are stained with propidium iodide (0.05 mg/ml) and analyzed by dual-parameter flow cytometry (FACS). Initially, cells expressing GFP-tagged TIF-IA are identified and sub-grouped. Then, this group is used to measure the DNA content for cell cycle analysis. More than 50,000 events are recorded from each sample. The number of cells expressing any TIF-IA isoform and the percentage of each cell cycle stage is calculated. The results indicate that cells transfected with TIF-IA cDNA with the Alu exon are mostly in the S phase.
Because HeLa and 293T cells are of a different cell lineage than lymphoblastic leukemia similar experiments on the mouse IL3 dependent pre-B lymphoblastic cell line Baf-3 is carried out. This system is ideal for testing pro-survival factors since the cells undergo rapid and complete apoptosis upon withdrawal of IL3. Both isoforms of TIF-IA are expressed by these cells through retroviral infection. GFP positive cells are sorted and analyzed for cell cycle and growth properties. Growth factor dependency is tested by measuring proliferation and apoptosis in the presence of low concentration of IL3. Apoptosis is measured by FACS analysis using annexin positive- and propidium iodide negative-cells.
The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without undue experimentation and without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. The means, materials, and steps for carrying out various disclosed functions may take a variety of alternative forms without departing from the invention.
Number | Date | Country | Kind |
---|---|---|---|
IL 156072 | May 2003 | IL | national |
This application claims the benefit of provisional application 60/526,599 filed Dec. 4, 2003, the entire content of which is expressly incorporated herein by reference thereto.
Number | Date | Country | |
---|---|---|---|
60526599 | Dec 2003 | US |