The present invention relates to toxic sequences in polynucleotides in general, including polynucleotides comprising RNA, and RNAs that can participate in RNA-induced gene silencing.
A variety of molecules, including those that are proteinaceous, nucleic acid, lipid, or carbohydrate in nature, can induce cytotoxic effects (see, for instance, Gururaja, T. et al. (2003) “Cellular interacting proteins of functional screen-derived antiproliferative and cytotoxic peptides discovered using shotgun peptide sequencing” Chem. Biol. 10(10):927-37). Knowledge of the cellular specificity and mechanism of action of such molecules is valuable from both a research and therapeutic perspective. For instance, studies of anthrax toxins identified these factors as initiators of caspase-dependent apoptosis. Similarly, studies of toxin B from Clostridium difficile aided in the elucidation of the function of the Rho family of proteins in cell signaling (see, for instance, Schmidt, M. et al. (1996) “Inhibition of receptor signaling to phospholipase D by Clostridium difficile toxin B. Role of Rho proteins.” J. Biol. Chem. 271(5):2422-6).
Although proteins are the focus of most current drug discovery efforts, research has recently begun that aims to exploit nucleic acids as novel agents and targets for pharmaceutical development. Toxic RNA, DNA, or RNA-DNA hybrid sequences (either single stranded or double stranded) can be valuable as therapeutic agents or co-agents that can be used in collaboration with other molecules to, e.g, sensitize target cells to undergo apoptosis or necrosis. The targets of these molecules can be diverse. In some instances, the target(s) of a toxic oligonucleotide is nucleic acid in nature (DNA or RNA) and the mechanism of action is related to the relative degree of homology that the toxic sequence has for a specific target molecule (e.g., antisense and RNA interference (RNAi), Layery, K. S. et al. (2003) “Antisense and RNAi: powerful tools in drug target discovery and validation” Curr. Opin. Drug Discov. Devel. 6(4):561-9; Provost et al., (2002) “Ribonuclease Activity and RNA Binding of Recombinant Human Dicer” E.M.B.O. J., 21(21): 5864-5874; Tabara et al. (2002) “The dsRNA Binding Protein RDE-4 Interacts with RDE-1, DCR-1 and a DexH-box Helicase to Direct RNAi in C. elegans” Cell 109(7):861-71; Ketting et al. (2001) “Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in Developmental Timing in C. elegans” Genes and Development 20:2654-9; Martinez et al. (2002) “Single-Stranded Antisense siRNAs Guide Target RNA Cleavage in RNAi” Cell 110(5):563; Hutvagner & Zamore (2002) “A microRNA in a multiple-turnover RNAi enzyme complex” Science 297:2056). Alternatively, oligonucleotides can target protein sequences. In these instances, the mechanism of toxicity can involve the molecule assuming a three-dimensional structure that is capable of blocking a critical function of the target protein. Alternatively, the sequence of the oligonucleotide can be recognized by the protein and, for instance, eliminate the ability of that target from participating in critical cellular reactions.
There is a need in the art for knowledge of oligonucleotide sequences that induce such toxicity in cells. In some instances, such sequences need to be identified so that their use can be avoided. In other instances, toxic sequences can be identified and used to benefit in a variety of applications, including pharmaceutical applications. The following disclosure addresses these needs.
The present invention is directed to toxic polynucleotide sequences and methods of using and identifying them.
According to a first embodiment, the present invention provides a unimolecular polynucleotide, comprising at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said unimolecular polynucleotide is capable of forming an intramolecular duplex of 5 or more base pairs, and wherein said duplex comprises a sense region and an antisense region that are at least substantially complementary.
According to a second embodiment, the invention provides a double stranded polynucleotide, comprising at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG, (SEQ. ID NO.4) AGAC (SEQ. ID NO.5), UGGC, (SEQ. ID NO.6) NUUU, (SEQ. ID NO.7) wherein N is any nucleotide, or a complement of any of the foregoing, wherein said double stranded polynucleotide is capable of forming a duplex of 5 or more base pairs, and wherein said duplex comprises a sense strand and an antisense strand that are at least substantially complementary.
According to a third embodiment, the invention provides a composition for inducing a toxic response in a cell, comprising a nucleotide sequence GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said nucleotide sequence comprises a duplex region that is at least 5 base pairs in length, and wherein said duplex region comprises at least two regions that are at least substantially complementary.
According to a fourth embodiment, the invention provides a method of inducing a toxic response in a cell, said method comprising introducing into the cell a unimolecular polynucleotide or a double stranded polynucleotide, wherein said unimolecular polynucleotide or double stranded polynucleotide comprises at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said unimolecular polynucleotide or double stranded polynucleotide comprises a duplex region of 5 or more base pairs, and wherein said unimolecular polynucleotide or double stranded polynucleotide comprises a sense region and an antisense region that are at least substantially complementary.
According to a fifth embodiment, the invention provides a method for screening a library of nucleic acids for a toxicity region, comprising screening a database containing nucleic acid sequences and identifying those sequences that contain toxic motifs.
According to a sixth embodiment, the invention provides a transfection control method, comprising: (a) transfecting a first group of cells with one or more polynucleotides or double-stranded polynucleotides; (b) transfecting a second group of cells with a duplex RNA, wherein said duplex RNA comprises at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said duplex RNA is 5 or more base pairs in length, and wherein said duplex RNA comprises a sense region and an antisense region that are at least substantially complementary, and wherein said first and said second cells are transfected under similar conditions; (c) maintaining said first and said second groups of cells under conditions sufficient for cell growth; and (d) determining the level of cell viability in said second group of cells.
For a better understanding of the present invention together with other and further advantages and embodiments, reference is made to the following description taken in conjunction with the examples, the scope of which is set forth in the appended claims.
The preferred embodiments of the invention have been chosen for purposes of illustration and description but are not intended to restrict the scope of the invention in any way. The preferred embodiments of certain aspects of the invention are shown in the accompanying figures, wherein:
a illustrates survival rates of HeLa cells transfected with siRNA directed against raf1, mek1 (MAP2K1), mek2 (MAP2K2), mapk1, mapk3, PI3k-Ca, PI3k-Cb, Bcl2, Bcl13, SRD5A1, SRD5A2, or AR. Lipofectamine 2000 was used to introduce the duplexes into the cells. siRNA concentrations were 10 nanomolar. Four siRNAs were tested against each gene. Cell survival rate was measured 72 hours post-transfection using Alamar Blue. 1b illustrates the ratio of toxic and non-toxic siRNA in a second set of sequences. 1c illustrates the distribution of toxic and non-toxic siRNAs across a walk covering a portion of the DBI gene. Gray bars show toxicity. Black bars show gene silencing.
a illustrates the sequences of toxic and non-toxic duplexes and the frequency with which these motifs are found in toxic and non-toxic populations identified in
b-1 to 2b-3 illustrates the frequency of finding toxic siRNA in groups that have the UUU/AAA motif, (SEQ. ID NO.8/9) the GCCA/UGGC motif, (SEQ. ID NO.10/11) or no motif at all. Black bars represent toxic siRNA. Gray bars represent non-toxic siRNA.
c illustrates the distribution of toxic and non-toxic siRNA in a large collection (>290) siRNA used in a statistical analysis to identify toxic motifs. Black bars represent toxic siRNA, gray bars represent non-toxic siRNA.
d illustrates the relative frequency of various motifs in the RISC-entering strand and non-entering strand of toxic and non-toxic siRNA.
a illustrates the design of Ago2 (eIF2C2) knockdown experiments; 5b—iillustrate the results of control experiments. Knockdown of Ago2 prevents subsequent attempts to knockdown a reporter gene (EGFP) using EGFP-targeting siRNA; 5j illustrates that toxic siRNA are not toxic in an Ago2− cell; 5K illustrates that when toxic siRNA (19mers) are reduced to 17mers, toxicity is attenuated; 5l illustrates that addition of chemical modifications that eliminate off-target effects, attenuates toxicity.
a-e illustrate the level of cell death (apoptosis) induced in HeLa, PC3, MCF7, LnCAP, and BxPC3 cell lines using non-toxic and toxic siRNA. Abbreviations and additional sequences used in these experiments include: MAP2K2-1=m21, MAP2K2-3=m23, SRD5A2-1=s21, SRD5A2-3=s23, Luciferase (Luc) dx 1-2 (l 12, 5′-UUUGUGGACGAAGUACCGA, sense (SEQ. ID NO.14)), Luc dx 1-4=l 14 (5′ UGUUUGUGGACGAAGUACC, sense (SEQ. ID NO.15)), Luc dx 2-3 (l 23, 5′ GAGUUGUGUUUGUGGACGA, sense (SEQ. ID NO.16)), PPIB dx10=cyclophilin 10=c10 (5′-UUGGCUACAAAAACAGCAA, sense (SEQ. ID NO.17)), PPIB dx 5=cyclophilin 5=c5 (5′ AAAACAGUGGAUAAUUUUG, sense (SEQ. ID NO.18)), PPIB dx 8=cyclophilin 8=c8 (5′ GGAUAAUUUUGUGGCCUUA, sense (SEQ. ID NO.19)).
Unless stated otherwise, the following terms and phrases include the meanings provided below:
Agent that Stresses a Cell
The phrase “agent that stresses a cell” includes any agent known in the art, or that comes to be known in the art, that can induce—on its own or in combination with any of the compositions or methods of the present invention—a toxic or stress response in a cell, including but not limited to apoptosis and cell death. Agents that induce stress can be chemical in nature (e.g., H2O2), physical in nature (e.g., less than optimal temperatures), biological (e.g., viral infection), and more. Alternatively, cells can be stressed by the absence of essential agents such as growth factors (e.g., insulin), O2, and other factors. Further, cellular stress can be measured in a variety of ways including but not limited to monitoring cell viability (cell death), cell doubling times, cell morphology, and expression of genes or gene families including those related to hypoxia responses, heat shock responses, cell cycle regulation, the interferon response pathway and others.
Antisense Region
The phrase “antisense region” refers to a sequence of nucleotides in a polynucleotide that is at least substantially complementary to a sense region in the same polynucleotide (if the polynucleotide is a unimolecular polynucleotide having both a sense and antisense sequence, wherein the sense and antisense sequences are capable of annealing by reason of the polynucleotide forming intramolecular interactions such as, for example, a hairpin structure), or in a different polynucleotide (in the case of a double stranded polynucleotide that comprises two separate strands, one bearing a sense sequence and one bearing an antisense sequence, wherein the sense and antisense sequences are capable of annealing by reason of the two strands undergoing an intermolecular interaction to form, for example, a duplex).
Antisense Strand
The phrase “antisense strand” as used herein, refers to a polynucleotide that is at least substantially or 100% complementary to a target nucleic acid of interest. An antisense strand may comprise a polynucleotide that is RNA, DNA or chimeric RNA/DNA. For example, an antisense strand may be complementary, in whole or in part, to a molecule of messenger RNA, an RNA sequence that is not mRNA (e.g., tRNA, rRNA and hnRNA) or a sequence of DNA that is either coding or non-coding.
Complementary
The term “complementary” refers to the ability of polynucleotides to form base pairs with one another. Base pairs are typically formed by hydrogen bonds between nucleotide units in antiparallel polynucleotide strands. Complementary polynucleotide strands can base pair in the Watson-Crick manner (e.g., A to T, A to U, C to G), or in any other manner that allows for the formation of duplexes. As persons skilled in the art are aware, when using RNA as opposed to DNA, uracil rather than thymine is the base that is considered to be complementary to adenosine. However, when a U is denoted in the context of the present invention, the ability to substitute a T is implied, unless otherwise stated.
Perfect complementarity or 100% complementarity refers to the situation in which each nucleotide unit of one polynucleotide strand can hydrogen bond with a nucleotide unit of a second polynucleotide strand. Less than perfect complementarity refers to the situation in which some, but not all, nucleotide units of two strands can hydrogen bond with each other. For example, for two 20-mers, if only two base pairs on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 10% complementarity. In the same example, if 18 base pairs on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 90% complementarity. “Substantial complementarity” refers to polynucleotide strands exhibiting 79% or greater complementarity, excluding regions of the polynucleotide strands, such as overhangs, that are selected so as to be noncomplementary. (“Substantial similarity” refers to polynucleotide strands exhibiting 79% or greater similarity, excluding regions of the polynucleotide strands, such as overhangs, that are selected so as not to be similar.) Thus, for example, two polynucleotides of 29 nucleotide units each, wherein each comprises a di-dT at the 3′ terminus such that the duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on each strand are complementary, are substantially complementary since they are 96.3% complementary when excluding the di-dT overhangs.
Deoxynucleotide
The term “deoxynucleotide” refers to a nucleotide or polynucleotide lacking a hydroxyl group (OH group) at the 2′ and/or 3′ position of a sugar moiety. Instead it has a hydrogen bonded to the 2′ and/or 3′ carbon. Within an RNA molecule that comprises one or more deoxynucleotides, “deoxynucleotide” refers to the lack of an OH group at the 2′ position of the sugar moiety, having instead a hydrogen bonded directly to the 2′ carbon.
Deoxyribonucleotide
The terms “deoxyribonucleotide” and “DNA” refer to a nucleotide or polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, at its 2′ and/or 3′ position.
Duplex Region
The phrase “duplex region” or “duplex RNA” refers to the region in two complementary or at least substantially complementary polynucleotides that form base pairs with one another, either by Watson-Crick base pairing or any other manner that allows for a stabilized duplex between polynucleotide strands that are complementary or at least substantially complementary. For example, a polynucleotide strand having 21 nucleotide units can base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on each strand are complementary or at least substantially complementary, such that the “duplex region” has 19 base pairs. The remaining bases may, for example, exist as 5′ and 3′ overhangs. Further, within the duplex region, 100% complementarity is not required; substantial complementarity is allowable within a duplex region. Substantial complementarity refers to 79% or greater complementarity. For example, a mismatch in a duplex region consisting of 19 base pairs results in 94.7% complementarity, rendering the duplex region substantially complementary. Duplex regions or duplex RNA can be the result of the pairing of two separate strands. Alternatively, duplexes regions can be the result of pairing of two complementary regions existing within a unimolecular sequence.
Essential Gene
The term “essential gene” refers to a specific nucleotide coding sequence whose expression product is vital for cell survival. An essential gene may encode any one of a variety of different polypeptides whose function is indispensable for cell viability. Consequently, inactivation of an essential gene generally results in cell death and/or cell stress. Inactivation may occur through a variety of mechanisms occurring at the DNA, mRNA, and protein levels. A genetic variation, for example, such as a single nucleotide polymorphism (SNP), occurring within the DNA coding sequence itself may alter, diminish, or eliminate the biological function of the resulting expression product. Alternatively, an essential gene may be inactivated at the mRNA level through an siRNA-mediated RNA interference pathway.
Gene Silencing
The phrase “gene silencing” refers to a process by which the expression of a specific gene product is lessened or attenuated. Gene silencing can take place by a variety of pathways. Unless specified otherwise, as used herein, gene silencing refers to decreases in gene product expression that results from RNA interference (RNAi), a defined, though partially characterized pathway whereby small inhibitory RNA (siRNA) act in concert with host proteins (e.g., the RNA induced silencing complex, RISC, or the RNA-induced Initiation of Transcriptional Gene Silencing, RITS) to degrade messenger RNA (mRNA) in a sequence-dependent fashion or affect gene expression by other pathways or mechanisms, including but not limited to epigenetic mechanisms such as DNA and/or histone methylation. The level of gene silencing can be measured by a variety of means, including, but not limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA techniques, transcription-sensitive reporter constructs, expression profiling (e.g., DNA chips), and related technologies. Alternatively, the level of silencing can be measured by assessing the level of the protein encoded by a specific gene. This can be accomplished by performing a number of studies including Western Analysis, measuring the levels of expression of a reporter protein that has, for example, fluorescent properties (e.g., GFP) or enzymatic activity (e.g. alkaline phosphatases), or other procedures.
Nucleotide
The term “nucleotide” refers to a ribonucleotide or a deoxyribonucleotide or modified form thereof, as well as an analog thereof. Nucleotides include species that comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives and analogs.
Nucleotide analogs include nucleotides having modifications in the chemical structure of the base, sugar and/or phosphate, including, but not limited to, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, and substitution of 5-bromo-uracil; and 2′-position sugar modifications, including but not limited to, sugar-modified ribonucleotides in which the 2′-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN, wherein R is an alkyl moiety as defined herein. Nucleotide analogs are also meant to include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 2′-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, phosphorothioates and peptides.
Modified bases refer to nucleotide bases such as, for example, adenine, guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been modified by the replacement or addition of one or more atoms or groups. Some examples of types of modifications that can comprise nucleotides that are modified with respect to the base moieties include but are not limited to, alkylated, halogenated, thiolated, aminated, amidated, or acetylated bases, individually or in combination. More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2-propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2-amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1-methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 2-methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5-methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza-adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5-methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2-one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines and guanines, 5-substituted uracils and thymines, azapyrimidines, carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and alkylcarbonylalkylated nucleotides. Modified nucleotides also include those nucleotides that are modified with respect to the sugar moiety, as well as nucleotides having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4′-thioribose, and other sugars, heterocycles, or carbocycles.
The term nucleotide is also meant to include what are known in the art as universal bases. By way of example, universal bases include but are not limited to 3-nitropyrrole, 5-nitroindole, or nebularine. The term “nucleotide” is also meant to include the N3′ to P5′ phosphoramidate, resulting from the substitution of a ribosyl 3′ oxygen with an amine group.
Further, the term nucleotide also includes those species that have a detectable label, such as for example a radioactive or fluorescent moiety, or mass label attached to the nucleotide. Nucleotides can also be detected by their inherent mass.
Off-Target
The term “off-target” and the phrase “off-target effects” refer to any instance in which an siRNA or shRNA directed against a given target causes an unintended effect by interacting either directly or indirectly with another mRNA sequence, a DNA sequence or a cellular protein or other moiety. For example, an “off-target effect” may occur when there is a simultaneous degradation of other transcripts due to partial homology or complementarity between that other transcript and the sense and/or antisense strand of the siRNA or shRNA
Overhang
The term “overhang” refers to terminal non-base pairing nucleotide(s) resulting from one strand extending beyond the terminus of the complementary strand to which the first strand forms a doubled stranded polynucleotide. One or both of two polynucleotides that are capable of forming a duplex through hydrogen bonding of base pairs may have a 5′ and/or 3′ end that extends beyond the 3′ and/or 5′ end of complementarity shared by the two polynucleotides. The single-stranded region extending beyond the 3′ and/or 5′ end of the duplex is referred to as an overhang.
Pharmaceutically Acceptable Carrier
The phrase “pharmaceutically acceptable carrier” includes compositions that facilitate the introduction of dsRNA, dsDNA, or dsRNA/DNA hybrids into a cell and includes but is not limited to solvents or dispersants, coatings, anti-infective agents, isotonic agents, and agents that mediate absorption time or release of the inventive polynucleotides and double stranded polynucleotides.
Polynucleotide
The term “polynucleotide” refers to polymers of nucleotides, and includes but is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of regularly and irregularly alternating deoxyribosyl moieties and ribosyl moieties (i.e., wherein alternate nucleotide units have an —OH, then and —H, then an —OH, then an —H, and so on at the 2′ position of a sugar moiety), and modifications of these kinds of polynucleotides, wherein the attachment of various entities or moieties to the nucleotide units at any position are included. A polynucleotide comprises two or more nucleotides.
Polyribonucleotide
The term “polyribonucleotide” refers to a polynucleotide comprising two or more modified or unmodified ribonucleotides and/or their analogs. The term “polyribonucleotide” is used interchangeably with the term “oligoribonucleotide.”
Rational Design
The term “rational design” describes a set of criteria, developed at Dharmacon, Inc. that allow the identification of functional, highly functional, and hyperfunctional siRNAs. The set of criteria used to identify these siRNA have been incorporated into an algorithm that can be applied to any gene, regardless of the gene's origin. The criteria associated with rational design have been described in U.S. Provisional Patent Application Ser. No. 60/426,137, filed Nov. 14, 2002, entitled “Combinatorial Pooling Approach for siRNA Induced Gene Silencing and Methods for Selecting siRNA”; U.S. Provisional Patent Application Ser. No. 60/502,050, filed Sep. 10, 2003, entitled “Methods for Selecting siRNA”; U.S. patent application Ser. No. 10/714,333, filed Nov. 14, 2003, entitled “Functional and Hyperfunctional siRNA,” each of which is incorporated by reference herein.
Ribonucleotide and Ribonucleic Acid
The term “ribonucleotide” and the phrase “ribonucleic acid” (RNA), refer to a modified or unmodified nucleotide or polynucleotide comprising at least one ribonucleotide unit. A ribonucleotide unit comprises a hydroxyl group attached to the 2′ position of a ribosyl moiety that has a nitrogenous base attached in N-glycosidic linkage at the 1′ position of a ribosyl moiety, and a moiety that either allows for linkage to another nucleotide or precludes linkage.
RISC-Entering Strand
The term “RISC-entering strand” refers to the strand of a siRNA that preferably enters RISC. Determination of which strand preferably enters RISC can be made based on thermodynamic calculations that take into consideration end stability of the duplex as well as the average internal stability profile (AISP) of the entire siRNA.
RNA Interference and RNAi
The phrase “RNA interference” and the term “RNAi” are synonymous and refer to the process by which a polynucleotide or double stranded polynucleotide comprising at least one ribonucleotide unit exerts an effect on a biological process. The process includes but is not limited to gene silencing by degrading mRNA, interactions with tRNA, rRNA, hnRNA, cDNA and genomic DNA, as well as methylation of DNA with ancillary proteins.
Sense Region
The phrase “sense region” refers to a sequence of nucleotides in a polynucleotide that is at least substantially complementary to an antisense region in the same polynucleotide (if the polynucleotide is a unimolecular polynucleotide having both a sense and antisense sequence, wherein the sense and antisense sequences are capable of annealing by reason of the polynucleotide forming intramolecular interactions such as, for example, a hairpin structure), or in a different polynucleotide (in the case of a double stranded polynucleotide that comprises two separate strands, one bearing a sense sequence and one bearing an antisense sequence, wherein the sense and antisense sequences are capable of annealing by reason of the two strands undergoing an intermolecular interaction to form, for example, a duplex).
Sense Strand
The phrase “sense strand” refers to a polynucleotide that has the same nucleotide sequence, in whole or in part, as a target nucleic acid such as a messenger RNA or a sequence of DNA.
siRNA
The term “siRNA” refers to small inhibitory RNA duplexes that induce the RNA interference (RNAi) pathway. These molecules can vary in length (typically between 18-30 base pair) and contain varying degrees of complementarity to their target mRNA in the antisense strand. Some, but not all siRNA have unpaired, overhanging bases on the 5′ or 3′ end of the sense strand and/or the antisense strand.
An siRNA molecule can be bimolecular, comprising separate sense and antisense strands annealed through non-covalent interaction, or can be unimolecular, as when sense and antisense strands comprise regions of a hairpin structure that comprises a loop structure and, optionally, a stem region and/or terminal structure. Thus, a short hairpin RNA (shRNA) is a species of the genus siRNA.
Stressing a Cell
The phrase “stressing a cell” or “stress a cell,” includes placing a cell under conditions that are identified as being less than optimal for growth. Agents that induce stress can be chemical in nature (e.g., H2O2), physical in nature (e.g., less than optimal temperatures), biological (e.g., viral infection), and others. Alternatively, cells can be stressed by the absence of needed agents such as growth factors, O2, and other factors. Stressing a cell also includes inducing cell death by apoptosis or other means. Cellular stress can be measured in a variety of ways and includes monitoring cell viability (cell death), cell doubling times, cell morphology, and expression of genes or gene families including those related to hypoxia responses, heat shock responses, cell cycle regulation, the interferon response pathway and others. Further, stressing a cell or population of cells can make said cells more susceptible to secondary reagents, such as toxic agents. Methods for identifying toxicity regions are disclosed herein.
Target
The term “target” is used in a variety of different contexts herein and is defined by the context in which it is used. “Target mRNA” refers to a messenger RNA to which a given siRNA can be directed against. “Target sequence” and “target site” refer to a sequence within the mRNA to which the sense strand of an siRNA shows varying degrees of homology and the antisense strand exhibits varying degrees of complementarity. The term “siRNA target” can refer to the gene, mRNA, or protein against which an siRNA is directed. Similarly “target silencing” can refer to the state of a gene, or the corresponding mRNA or protein. The phrase “target of a toxic sequence” refers to the nucleotide or protein to which a given siRNA or shRNA interacts with to induce a state of stress in the cell. These differences in context reflect the mechanism(s) by which toxic sequences can exert their toxic effects on a cell.
Toxicity Region
The phrase “toxicity region” refers to a nucleotide sequence in a polynucleotide that confers upon the polynucleotide the ability to stress a cell.
Toxic Response
The phrase “toxic response” includes cellular responses to stress. Such responses can be identified by any suitable method in the art for measuring the effect of toxins on cells, or for measuring cell viability. Suitable methods include, for example, those methods used in the art to measure cell death (e.g., apoptosis), DNA replication, cell metabolism, and induction of one or more pathways associated with response to cell stress including hypoxia responses, heat shock responses, cell cycle regulation, the interferon response pathway and others.
Transfection
The term “transfection” refers to a process by which agents are introduced into a cell. The list of agents that can be transfected is large and includes, but is not limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more genes and organized into an expression plasmid, proteins, protein fragments, and more. There are multiple methods for transfecting agents into a cell including, but not limited to, electroporation, CaPO4-based transfections, DEAE-dextran-based transfections, lipid-based transfections, molecular-conjugate-based transfections (e.g., polylysine-DNA conjugates), microinjection and others.
All nucleotide sequences are written from 5′ to 3′, that is, the 5′ end of the sequence on the left, and the 3′ end of the sequence on the right.
The present invention will now be described in connection with preferred embodiments. These embodiments are presented to aid in an understanding of the present invention and are not intended, and should not be construed to limit the invention in any way. All alternatives, modifications and equivalents that may become apparent to those of ordinary skill upon reading this disclosure are included within the spirit and scope of the present invention.
This disclosure is not a primer on compositions and methods for performing RNA interference. Basic concepts known to those skilled in the art have not been set forth in detail.
The present invention includes a collection of novel toxic motifs and methods of use of said sequences. The toxic motifs include: GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, and complements of any of the foregoing.
Knowledge of said motifs can be valuable in a variety of fields. For example, in instances pertaining to the use of RNAi as a method of gene function analysis, unintentional introduction of an siRNA comprising a toxic motif into a cell can result in cell death and misinterpretation of the essential nature of said gene. Thus, under these conditions, knowledge of toxic motifs allows researches to better design and/or select for siRNA that lack such undesirable sequences.
Alternatively, for example, where siRNAs are being used as therapeutic reagents and are being designed to, for example, induce cell death in a population of cells such as, for example, diseased cells, siRNAs directed against a given target that contain toxic sequences are more likely to induce cell death than siRNAs that do not contain toxic sequences. siRNAs containing one or more toxic sequences can be used individually, or can be combined with one or more additional therapeutic agents to treat a given disease. In the latter case, duplex RNA carrying one or more toxic motifs can be used to sensitize the target, such as, for example, a diseased cell, to a second reagent.
Double stranded RNA carrying toxic motifs are also valuable as controls in experiments that require transfection of polynucleotides such as, for example, DNA and/or RNA, into cultured cells. In this instance, the level of cell death induced by a duplex carrying a toxic motif can be used to assess the success of the transfection procedure, thus minimizing the costs associated with processing samples and assessing data derived from failed transfections.
According to a first embodiment, the present invention provides a unimolecular polynucleotide, comprising at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said unimolecular polynucleotide is capable of forming an intramolecular duplex of 5 or more base pairs, and wherein said duplex comprises a sense region and an antisense region that are at least substantially complementary.
The unimolecular polynucleotide can have a sense region that comprises at least one toxicity region, and/or an antisense region that comprises at least one toxicity region. As experiments described in the Examples clearly demonstrate regarding the strand preference of the toxic motif, in the cases of NUUU (SEQ. ID NO.7), or UGGC motifs (SEQ. ID NO.6), preferably the motifs are present in the RISC-entering strand when the desired effect is to induce cellular toxicity.
The at least one toxicity region can be located in a variety of positions within the duplex.
Preferably, the unimolecular polynucleotide has an antisense region that comprises from 19 to 40 bases. The unimolecular polynucleotide can comprise a loop region, a stem region, and/or a terminal region. Preferably, the sense region and antisense region are more than substantially complementary over the range of base pairs, and more preferably 100% complementary over this range. Preferably the polynucleotide is RNA.
In the case of a unimolecular polynucleotide, the order or sequence in which each component appears in the sequence can vary, being either 5′ S-loop-AS or 5′-AS-loop-S. The preferred arrangement is 5′ AS-loop-S. The preferred length of both the sense and antisense regions is 19-40 nucleotides in length. Preferably, the unimolecular polynucleotide comprises a loop, wherein the loop preferably comprises 4-20 nucleotides. Moreover, the terminal region of the unimolecular polynucleotide can be blunt, or have overhangs of 1-5 nucleotides on the 3′ or 5′ end. Preferably, the overhangs are on the 3′ end of the molecule. Preferably, the 5′ end of the molecule has a phosphate group on the 5′ carbon.
In some cases, where strong toxicity is desired, the region of the duplex comprises RNA greater than 40 base pairs. Such duplexes (>40 base pairs) are also capable of inducing cell death by the interferon response pathway. Thus, in cases where toxic sequences are associated with longer (>40 base pair) duplexes, cell death is induced by at least two mechanisms: (1) the action of the toxic motif, and (2) the induction of the interferon response. More preferably, in cases where strong toxicity is desired, the toxic sequence is associated with a long (>40 base-pair) RNA duplex that also targets an essential gene or essential non-coding RNA (e.g., an miRNA) by the RNAi pathway. In these cases, cell death is the result of three separate actions: (1) the action of the toxic motif; (2) the induction of the interferon response pathway by the long double-stranded RNA; and (3) the loss of function of an essential gene or function.
According to a second embodiment, the invention provides a double stranded polynucleotide, comprising at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said double stranded polynucleotide is capable of forming a duplex of 5 or more base pairs, and wherein said duplex comprises a sense strand and an antisense strand that are at least substantially complementary.
In one preferred embodiment, the double stranded polynucleotide comprises a sense strand comprising at least one toxicity region. In another preferred embodiment, the double stranded polynucleotide comprises an antisense strand comprising at least one toxicity region.
The toxic motif can be located in a variety of positions within the duplex.
In cases where the double stranded polynucleotide is an siRNA, preferably the siRNA comprises from 1940 base pairs, or from 19-23 base pairs, exclusive of overhangs. Preferably, the sense strands and antisense strands are at least substantially complementary over the range of base pairs, and more preferably 100% complementary over this range. Preferably the polynucleotide is RNA.
The double stranded polynucleotide may also contain overhangs at either the 5′ or 3′ end of either the sense strand or the antisense strand. However, if there are any overhangs, they are preferably on the 3′ end of the sense strand and/or the antisense strand. Additionally, any overhangs are preferably six or fewer bases in length, more preferably two or fewer bases in length. Most preferably, there are either no overhangs, or overhangs of two bases on one or both of the sense strand and antisense strand.
The first 5′ terminal antisense nucleotide and/or the first 5′ terminal sense nucleotide may or may not be modified with a phosphate group attached to the 5′ carbon of the sugar moiety of the nucleotide. If there is a phosphate group, preferably there is only one phosphate group.
In some cases, where strong toxicity is desired, the region of the duplex comprises RNA and is greater than 40 base pairs. Such duplexes (>40 base pairs) are capable of inducing cell death by the interferon response pathway. Thus, in cases where toxic sequences are associated with longer (>40 base pair) RNA duplexes, cell death is induced by at least two mechanisms: (1) the action of the toxic motif, and (2) the induction of the interferon response. Even more preferably, in cases where strong toxicity is desired, the toxic sequence is associated with a long (>40 base-pair) RNA duplex that also targets an essential gene by the RNAi pathway. In these cases, cell death is the result of three separate actions: (1) the action of the toxic motif; (2) the induction of the interferon response pathway by the long double-stranded RNA; and (3) the loss of function of an essential gene by the RNAi pathway.
According to a third embodiment, the invention provides a composition for inducing a toxic response in a cell, comprising a nucleotide sequence GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said nucleotide sequence comprises a duplex region that is at least 5 base pairs in length, and wherein said duplex region is comprises at least two regions that are at least substantially complementary. The duplex region can be unimolecular such as, for example, a hairpin structure described in Embodiment 1, or can comprise two separate strands (such as the molecule(s) described in Embodiment 2. Preferably, the composition is an RNA or an siRNA.
According to a fourth embodiment, the invention provides a method of inducing a toxic response in a cell, comprising introducing into the cell a unimolecular polynucleotide or a double stranded polynucleotide, wherein said unimolecular polynucleotide or double stranded polynucleotide comprises at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said unimolecular polynucleotide or double stranded polynucleotide comprises a duplex region of 5 or more base pairs, and wherein said unimolecular polynucleotide or double stranded polynucleotide comprises a sense region and an antisense region that are at least substantially complementary.
Toxic motifs can be located in a variety of positions within the duplex.
In one preferred embodiment, the method further comprises exposing the cell to at least one agent or condition that stresses the cell, in addition to the cited toxicity sequence(s). The at least one agent or condition that stresses the cell can be any agent or condition known in the art that can stress a cell. Agents can include, for example, chemotherapeutic agents such as cisplatin, proleukin, Campath, 9-cis-retinoic acid, cyclophosphamide, dacarbazine, as well as other siRNA/antisense agents that are directed against specific targets. In cases where the agent that stresses the cell is a peroxide, the peroxide is preferably hydrogen peroxide.
Inducing a toxic response in a cell includes sensitizing a cell to stress. In this manner, a cell can be sensitized to the actions of other agents, stressors, or environments, such as, for example, those that induce apoptosis or cell death. Any agent or environment known in the art, or that comes to be known, that can induce stress, apoptosis, or cell death can be used in conjunction with the compositions and methods of the invention. Thus, for instance, conditions that deplete the local environment of, for example, necessary nutrients, growth factors, oxygen (e.g., hypoxia) or other necessary factors, are included herein. Such conditions can include, for example, radiation.
According to a fifth embodiment, the invention provides a method for screening a library of nucleic acids for a toxicity region, comprising screening a database containing nucleic acid sequences and identifying those sequences that contain toxic motifs. Screening can be accomplished visually or with the aid of a computer (i.e., in silico) to identify toxic motifs. A search of a library of sequences in one or more data files can be performed to identify library members that comprise one or more toxic regions. Thus, for example, the siRNA library described in U.S. patent application Ser. No. 10/714,333, filed Nov. 14, 2003, entitled “Functional and Hyperfunctional siRNA,” which contains roughly 1.6 million sequences, can be screened. Members of this library are rationally designed to silence specific gene targets from the human genome and can be useful in, for example, dissecting the roles of genes of known and unknown function. The presence of toxic sequences can be of inherent value, or detrimental, depending upon the use of such sequences. In cases where such agents are intended for therapeutic purposes, designed to, for example, kill diseased cells, presence of a toxic sequence within the siRNA can be valuable. Under these conditions, it would be desirable to screen through the library to identify sequences that contain toxic motifs. In contrast, in instances where the siRNAs are intended to be used for, for example, gene function analysis, presence of a toxic motif within the siRNA is undesirable. Under these conditions, it would be beneficial to screen through the contents of this library and identify/eliminate any library members that contain said toxic regions. Such screens may be performed with or without a computer program that allows for the cross-referencing of the toxic sequence with each of the library's siRNA. In the case where computer programs are used, the program may, for example, be accessible from a local terminal or personal computer, over an internal network or over the Internet. The computer program that may be used may be developed in any computer language that is known to be useful for scoring nucleotide sequences, or it may be developed with the assistance of a commercially available product.
Knowledge of toxic sequences will also enable modification of random and/or random-biased nucleic acid library synthesis strategies so as to minimize (or maximize) the possibility of introducing toxic sequences into one or more library members. Nucleic acid library design can be an important factor in obtaining, for example, functional siRNA, ribozymes, and/or antisense molecules that are capable of inducing high levels of, for example, (1) gene silencing, or (2) cell death. The occurrence of toxic sequences within such agents can be detrimental or beneficial, depending upon the intended use of such sequences. For example, in a research setting where a single, non-essential gene or gene product is under investigation, knockdown studies that employ siRNA that contain toxic sequences may induce phenotypes that mislead an investigator into believing that the gene is essential. Thus, under these circumstances, foreknowledge of a toxic sequence will enable the applicants to employ biases during, for example, random siRNA library design procedures that would minimize the chances of introducing toxic sequences into library members.
In another example, existing algorithms for designing polynucleotides such as, for example, siRNAs, or libraries of siRNAs, can be improved using the compositions and methods of the present invention. In one non-limiting example, algorithms for the selection of siRNAs of varying functionalities can be improved by incorporating into such algorithms, scripts (i.e., software code) that eliminate all sequences that contain the toxic motifs disclosed herein. Alternatively, algorithms can be modified to specifically select siRNAs that contain one or more toxic sequences.
According to a sixth embodiment, the invention provides a transfection control method, comprising: (a) transfecting a first group of cells with one or more polynucleotides or double-stranded polynucleotides; (b) transfecting a second group of cells with a duplex RNA, wherein said duplex RNA comprises at least one toxicity region comprising a sequence selected from the group consisting of GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, or a complement of any of the foregoing, wherein said duplex RNA is 5 or more base pairs in length, and wherein said duplex RNA comprises a sense region and an antisense region that are at least substantially complementary, and wherein said first and said second cells are transfected under similar conditions; (c) maintaining said first and said second groups of cells under conditions sufficient for cell growth; and (d) determining the level of cell viability in said second group of cells. Preferably, the RNA duplex is greater than 40 base pairs in length. More preferably, the RNA duplex is greater than 64 base pairs in length. In a preferred embodiment, the RNA duplex is an siRNA and targets an essential gene.
Applicants have found that of the toxic sequences described herein, the sequences GUUU (SEQ. ID NO.1) or UGGC (SEQ. ID NO.6), when present together or independently in a duplex of 17 basepairs or less, do not induce a toxic response, or induce a toxic effect. However, a duplex of 19 basepairs or more that comprises the sequences GUUU (SEQ. ID NO.1) or UGGC (SEQ. ID NO.6), independently or together, does induce a toxic effect. Thus, preferably, in order for an siRNA having the sequences GUUU (SEQ. ID NO.1) or UGGC (SEQ. ID NO.6), independently or together, to induce a toxic effect, the siRNA is preferably at least 18 basepairs in length.
The methods and compositions described herein can be used, for example, in the design and uses of molecules for control purposes. For example, toxic sequences can be incorporated into the design of molecules intended for use as, for example, transfection controls. Control transfections that use the methods and compositions described herein can be run alongside experimental transfections to verify, for example, the efficiency of transfection and the quality of the transfection reagents. In another non-limiting example, transfection controls in accordance with the methods and compositions described herein can help identify optimum conditions for transfection of any cell lines.
Typical RNAi silencing experiments benefit from controls that allow the experimenter to assess the fraction of cells that have been successfully transfected with a given test siRNA. In some instances, these controls can target specific genes and transfection efficiency can be determined by assessing the level of silencing of the targeted gene (e.g., by Northern or Western blot). In other instances, transfection efficiency can be assessed by labeling an siRNA with, for example, a fluorescent label, and subsequently identifying the fraction of cells that fluoresce using fluorescence microscopy or fluorescence-activated cell sorting (FACS). However, measuring transfection efficiency by assessing the level of silencing of a control gene is labor intensive and expensive. Similarly, instruments for measuring cellular fluorescence are expensive, and many laboratories may lack the resources and/or training necessary to access and operate such instruments. The present invention offers a novel alternative, whereby transfection efficiency is more easily measured using toxic polynucleotide sequences.
In one non-limiting example, the level of transfection in a given experiment can be assessed by measuring the level of cell death induced by transfection of a sequence that induces cell death. For instance, cells in control wells can be transfected with a duplex RNA that comprises one or more of the toxic sequences disclosed herein, for example, GUUU (SEQ. ID NO.1), AGCA (SEQ. ID NO.2), GCAC (SEQ. ID NO.3), CUGG (SEQ. ID NO.4), AGAC (SEQ. ID NO.5), UGGC (SEQ. ID NO.6), NUUU (SEQ. ID NO.7), wherein N is any nucleotide, motifs. Subsequently, after a period of, for example, 24, 48, or 72 hours the number of dead and/or dying cells can be determined using any suitable assay, including but not limited to Alamar Blue assays. If the total number of living cells represents only a small fraction (for instance 10%) of those present in wells that were not transfected with the duplex RNA containing the toxic motif, then this would indicate that 90% of the cells were successfully transfected.
In cases where duplex RNA comprising a toxic motif is used as a positive control for transfection efficiency, the number of motifs can be greater than one and the size of the duplex can vary greatly. In general, larger RNA duplexes that carry a toxic motif and target an essential gene via the RNAi pathway are preferred, since such molecules induce cytotoxicity by at least three mechanisms: (1) toxicity due to the toxic motif; (2) toxicity due to introduction of a large double stranded RNA that induces the interferon response; and (3) toxicity due to the introduction of a double stranded RNA that also targets an essential gene. Where duplex RNA comprising a toxic motif is used as a positive control for transfection efficiency, the size of the duplex carrying the toxic motif can be between 19-30 base pairs. More preferably, the size of the duplex carrying the toxic motif is between 19 and 42 base pairs. Even more preferably, the size of duplex carrying the toxic motif is between 19 and 64 base pairs. Even more preferably, the size of the RNA duplexes containing the toxic motif is greater than 64 base pairs. Most preferably, the RNA duplex carries one or more toxic motifs, is greater than 64 base pairs, and targets an essential gene by the RNAi pathway.
The methods of the embodiments of the invention are not limited by the cell type used, the methods of transfection, or the assay utilized to assess the cell stress, apoptosis, or cell death. Thus the present invention may use a diverse set of cell types, including primary cells, germ cell lines and somatic cell lines. The cells may be stem cells or differentiated cells. For example, the cell types may be embryonic cells, oocytes, sperm cells, adipocytes, fibroblasts, myocytes, cardiomyocytes, endothelium, neurons, glia, blood cells, megakaryocytes, lymphocytes, macrophages, neutrophils, eosinophils, basophils, mast cells, leukocytes, granulocytes, keratinocytes, chondrocytes, osteoblasts, osteoclasts, hepatocytes and cells of the endocrine or exocrine glands. Said cells can be plated at a variety of densities and. cultured in a variety of formats (e.g. 96- or 384-well plates). To induce a toxic response, cells are preferably plated at a high density (>90% confluency). More preferably, the cells are plated at a moderate density (65-90% confluency). Most preferably, the cells are plated at a low density (<50% confluency).
Many techniques are known in the art for introducing nucleic acids into cells. Any suitable technique can be used for introducing toxic sequences into cells. These techniques include, but are not limited to, electroporation, lipid-mediated transfection, chemical-mediated transfection, viral-mediated transduction and others. In the case of viral mediated expression of toxic sequences, vectors that are based on lentiviral or retroviral systems are preferred. Any method of introducing polynucleotides, or double stranded polynucleotides, that are known in the art, or that come to be known, can be used with the methods and compositions herein.
Many techniques for assessing cell stress, apoptosis, and cell death are known in the art. Any suitable technique can be used in combination with the toxic motifs reported in this document. These techniques include, but are not limited to, one or more kits that utilize dyes (e.g., Alamar Blue), antibodies (e.g., Apo2.7), enzymatic activities (e.g., caspases), gene expression (e.g., microarrays), or other parameters to quantitate the level or degree of cell death. Any method of assessing cell stress, apoptosis, or cell death that is known in the art, or that comes to be known, can be used with the methods and compositions herein.
Regarding each of the embodiments of the invention, the toxic sequence(s) or their complement(s) are not specifically designed to target a specific gene. As shown in Example 3, there is no necessary correlation between the toxicity of an siRNA carrying said motif, and target knockdown. Instead, toxic siRNA carrying the motif(s) of the invention appear to be acting by inducing off-target gene knockdown. Without wishing to be bound by any particular theory, the toxic effects associated with exposing a cell to a toxic sequence may be the result of the toxic sequence interacting with a specific and essential target in the cell. Without limitation, such a target may be proteinaceous in nature. Alternatively, and without limitation, the target of the toxic sequence can be nucleic acid, lipid, or carbohydrate in nature.
Any of the compositions and methods of the invention can be used in combination with other compositions and methods known in the art. For example, toxic sequences can be used in conjunction with therapeutic small molecules, other therapeutic nucleic acids, small peptides, proteins, lipids, combinations thereof, or other agents that alter or affect the functionality of one or more targets within the cell. Introduction of such agents can precede the introduction of a toxic sequence, follow the introduction of a toxic sequence, or be applied simultaneously with a toxic sequence. Thus, for instance, a toxic sequence can be delivered along with a second agent that is a unimolecular polynucleotide or double stranded polynucleotide, wherein the second agent recognizes and down regulates a transcript responsible for a specific disease (for example, a cancer). In this non-limiting example, conditions are selected so as to introduce either agent individually, inducing minimal levels of cell death. Yet the combination of both the toxic sequence and the second agent that is a unimolecular polynucleotide or double stranded polynucleotide is sufficiently toxic to induce cell death in diseased cells.
Potential benefits can be realized by including knowledge of toxic sequences in, for example, siRNA design. For instance, when trying to kill or disable cells that contain, for example, a disease-related SNP (single nucleotide polymorphism), identification of an siRNA or antisense molecule that (in addition to recognizing the SNP) also contains a toxic sequence, would be beneficial. The methods and compositions of the invention can be useful in the design of therapeutics and therapies that employ agents comprising nucleic acids, including agents that comprise polynucleotides, to eliminate one or more cells or cell types related to a disease or an undesirable phenotype.
Another benefit of the methods and compositions of the invention is the ability to target certain cells, such as, for example, deleterious cells, for destruction. For example, the present invention may be used in RNA interference applications, wherein an siRNA having a toxic sequence is designed to be directed against a specific gene in a specific cell type. For these applications, an organism suspected of having a disease or disorder that is amenable to modulation by manipulation of a particular target nucleic acid of interest is treated by administering siRNA. The organism can be a mammal, such as, for example, a mouse, rat, sheep, cow, or human. Results of the siRNA treatment may be ameliorative, palliative, prophylactic, and/or diagnostic of a particular disease or disorder. Preferably, the siRNA is administered in a pharmaceutically acceptable manner with a pharmaceutically acceptable carrier or diluent.
Therapeutic applications of the present invention can be performed with a variety of therapeutic compositions and methods of administration. Pharmaceutically acceptable carriers and diluents are known to persons skilled in the art. Methods of administration to cells and organisms are also known to persons skilled in the art. Dosing regimens, for example, are known to depend on the severity and degree of responsiveness of the disease or disorder to be treated, with a course of treatment spanning from days to months, or until the desired effect on the disorder or disease state is achieved. Chronic administration of siRNAs may be required for lasting desired effects with some diseases or disorders. Suitable dosing regimens can be determined by, for example, administering varying amounts of one or more siRNAs in a pharmaceutically acceptable carrier or diluent, by a pharmaceutically acceptable delivery route, and amount of drug accumulated in the body of the recipient organism can be determined at various times following administration. Similarly, the desired effect (for example, degree of suppression of expression of a gene product or gene activity) can be measured at various times following administration of the siRNA, and this data can be correlated with other pharmacokinetic data, such as body or organ accumulation. Those of ordinary skill can determine optimum dosages, dosing regimens, and the like. Those of ordinary skill may employ EC50 data from in vivo and in vitro animal models as guides for human studies.
Further, the polynucleotides can be administered in a cream or ointment topically, an oral preparation such as a capsule or tablet or suspension or solution, and the like. The route of administration may be intravenous, intramuscular, dermal, subdermal, cutaneous, subcutaneous, intranasal, oral, rectal, by eye drops, by tissue implantation of a device that releases the siRNA at an advantageous location, such as near an organ or tissue or cell type harboring a target nucleic acid of interest.
Still further, the present invention may be used in RNA interference applications, such as diagnostics, prophylactics, and therapeutics including use of the composition in the manufacture of a medicament in animals, preferably mammals, more preferably humans, in the treatment of diseases, or over or under expression of a target. Preferably, the disease or disorder is one that arises from the malfunction of one or more genes, the disease or disorder of which is relate to the expression of the gene product of the one or more genes. For example, it is widely recognized that certain cancers of the human breast are related to the malfunction of a protein expressed from a gene commonly known as the “bcl-2” gene. A medicament can be manufactured in accordance with the compositions and teachings of the present invention, employing one or more siRNAs directed against the bcl-2 gene, and optionally combined with a pharmaceutically acceptable carrier, diluent and/or adjuvant, which medicament can be used for the treatment of breast cancer. Applicants have established the use of the methods and compositions of the present invention in cellular models. Methods of delivery of polynucleotides to cells within animals, including humans, are well known in the art. Any delivery vehicle now known in the art, or that comes to be known, and has utility for introducing polynucleotides to animals, including humans, is expected to be useful in the manufacture of a medicament in accordance with the present invention, so long as the delivery vehicle is not incompatible with any modifications that may be present in a composition made according to the present invention. A delivery vehicle that is not compatible with a composition made according to the present invention is one that reduces the efficacy of the composition by greater than 95% as measured against efficacy in cell culture.
Animal models exist for many disorders, including, for example, cancers, diseases of the vascular system, inborn errors or metabolism, and the like. It is within ordinary skill in the art to administer nucleic acids to animals in dosing regimens to arrive at an optimal dosing regimen for a particular disease or disorder in an animal such as a mammal, for example, a mouse, rat or non-human primate. Once efficacy is established in the mammal by routine experimentation by one of ordinary skill, dosing regimens for the commencement of human trials can be arrived at based on data arrived at in such studies.
Dosages of medicaments manufactured in accordance with the present invention may vary from micrograms per kilogram to hundreds of milligrams per kilogram of a subject. As is known in the art, dosage will vary according to the mass of the mammal receiving the dose, the nature of the mammal receiving the dose, the severity of the disease or disorder, and the stability of the medicament in the serum of the subject, among other factors well known to persons of ordinary skill in the art.
The compositions and methods of the present invention can be employed with any suitable modifications known in the art, as long as the modifications do not substantially interfere with the efficacy of the methods or compositions of the invention. Substantial interference with the methods or compositions of the invention results when the modification(s) reduces the efficacy of the composition or method by greater than 90%, as compared to the efficacy of the composition or method in the absence of the modification. Many modifications are known in the art. Preferred modifications are disclosed in U.S. patent application Ser. No. 10/406,908, filed Apr. 2, 2003, entitled “Stabilized Polynucleotides for Use in RNA Interference,” and U.S. patent application Ser. No. 10/613,077, filed Jul. 1, 2003, entitled “Stabilized Polynucleotides for Use in RNA Interference,” each of which is incorporated by reference herein.
Having described the invention with a degree of particularity, examples will now be provided. These examples are not intended to and should not be construed to limit the scope of the claims in any way. Although the invention may be more readily understood through reference to the following examples, they are provided by way of illustration and are not intended to limit the present invention unless specified.
The following examples are intended to explain the invention further.
General Procedures
Transfection
HEK293, HeLa, MCF7 and DU145 cell lines were obtained from ATCC (Manassas, Va.). Cells were grown at 37° C. in a humidified atmosphere with 5% CO2 in cell line-specific media: HEK293-DMEM, 10% FBS (Invitrogen), HeLa, DMEM, 10% FBS, MCF7, MEM (Invitrogen), 10% FBS, PC3, RPMI, 10% FBS. All propagation media was supplemented with penicillin (100 U/ml) and streptomycin (100 ug/ml). For transfection experiments, cells were seeded at 5×103 cells/well in 96 well plates 24 h before the experiment in antibiotic free media. The cell density described for these experiments was critical for observing siRNA-induced toxicity. Cells were transfected with siRNA (10 nM, 1 pmole/well) using Lipofectamine 2000 (0.1 ul/well, Invitrogen) according to manufacturer instruction. For gene expression analysis in HEK293 (LUC walk) cells were transfected as described earlier [Reynolds, 2004 #1081]. Presented graphs represent the average values obtained from three independent experiments, each performed in triplicate. Error bars represent standard deviation.
Cell Viability Assay
The survival of cells after treatment was determined by Alamar Blue (BioSource Int.) cytotoxicity assay according to manufacturers instructions. Briefly, 72 h (HeLa) or 144 h (MCF7, PC3) after transfection, 25 ul of Alamar Blue dye were added to wells containing cells in 100 ul of media. Cells were then incubated (0.5 hrs (HeLa) or 2 hrs (MCF7 and DU145) at 37° C. in a humidified atmosphere with 5% CO2. The fluorescence was subsequently measured on a Perkin Elmer Wallac Vector2 1420 multi-label counter with excitation at 540 nm and emission at 590 nm. The data presented are an average of nine data points coming from three independent experiments performed on different days. For the purpose of this study, siRNAs were defined as toxic when the average from nine different experiments (taking into account standard deviations) showed cell viability below 75%.
Gene Expression Analysis
mRNA expression levels were determined using Quantigene® Kits (Genospectra, Fremont, Calif.) for branched DNA (bDNA) assay [Collins, 1997 #1018] according to manufacturer instructions. Level of mRNA of GAPDH (a housekeeping gene) was used as a reference.
Microscopy
Regular and fluorescent microscopy was used to obtain data on cellular and nuclei morphology. Live cells were stained with cell-permeable nuclear fluorescent dye Hoechst 33342 (2 ug/ml, 15 min at 37° C., Molecular Probes). Pictures were taken using Leica DML fluorescent microscope InSight CCD camera and SPOT 3.5 software.
Down-Regulation of Gene Silencing
HeLa cells were transfected (T1) with a pool of siRNA directed against eIF2C2 (1 pmole/well) or with a control siRNA (1 pmole/well). Cells were then replated at 48 hrs (5×103 cells/well in 96 well plates) and co-transfected a second time (T2) with an EGFP expressing plasmid (20 ng/well) and (1) a control siRNA (0.1 pmole/well) or (2) EGFP siRNA (0.1 pmole/well). Twenty-four hours later cells were assayed for EGFP knockdown at mRNA level (branched DNA) and protein level (fluorescent microscopy). For toxicity analysis, cells were pre-transfected (T1) with control or eIF2C2 siRNA pool, replated and and then transfected with a set of toxic siRNAs.
To identify toxic sequences, HeLa cells were plated (5,000 cells/well) in a 96 well plate and cultured overnight. On the following day, cells (35-50% confluent) were transfected with one of 48 different siRNAs directed against one of 12 different targets (4 siRNA directed against each of the following genes: raf1, mek1 (MAP2K1), mek2 (MAP2K2), mapk1, mapk3, PI3k-Ca, PI3k-Cb, Bcl2, Bcl3, SRD5A1, SRD5A2, AR see Table 1). For transfection, siRNA concentrations were 10 nanomolar and the siRNA:lipid (Lipofectamine 2000) ratio was 1 picomole per 0.1 microgram. Twenty-four hours after transfection, cells received an additional 100 microliters of media (+serum). Subsequently, at t=72 hours, cell survival was assayed using the Alamar Blue cytotoxicity assay (Alamar Biosciences, Inc).
The results of these experiments, presented in
Again, the conclusion of this study was two-fold: 1) a fraction of siRNA induce toxicity and 2) the toxicity was target knockdown independent.
In a separate experiment 90 separate siRNAs covering a region of the DBI gene (NM—020548, one base pair shift) were analyzed. The sequences used in this study are listed below.
Results of these studies are presented in
Toxic and non-toxic sequences were sorted into separate groups and analyzed to identify one or more motifs that were present in high frequencies in the toxic collection, but absent or rarely observed in the non-toxic group. The analysis of this data set identified three motifs, A/G UUU A/G/U, G/C AAA G/C and GCCA (or their complements), that exhibited the desired distribution (
siRNAs that contained either the UUU/AAA, GCCA/UGGC, or neither motif were randomly chosen and assessed using the toxicity assay. Sequences of the siRNA used in this study include the following:
Results of these studies are shown in
To further elucidate the identity and mechanism of toxic sequences, the following procedures were performed: first, 297 siRNA were collected and the RISC-entry strand bias was determined using standard thermodynamic calculation. A more detailed description of thermodynamic calculations can be found in the following patent applications: U.S. Provisional Patent Application Ser. No. 60/426,137, filed Nov. 14, 2002; U.S. Provisional Patent Application Ser. No. 60/502,050, filed Sep. 10, 2003; U.S. patent application Ser. No. 10/714,333, filed Nov. 14, 2003; International Patent Application No. PCT/US2003/036787, filed Nov. 14, 2003 and published as WO 2004/045543 A2 on Jun. 3, 2004; U.S. patent application Ser. No. 10/940,892, filed Sep. 14, 2004; and International Patent Application No. PCT/US04/14885, filed May 12, 2004; each of the foregoing applications are incorporated herein by reference. Subsequently the toxicity of each siRNA was assessed using the previously described toxicity assay. Lastly, a statistical analysis was performed whereby the frequency at which all-possible 4mers was determined. This data was then sorted based on toxicity (i.e., toxic vs. non-toxic), and the P-value was determined using Standard T-Tests to identify sequences that were present (at high frequencies) in the RISC-entering strand of toxic sequences, but not non-toxic sequences.
The identity of the RISC-entering strand (preferred strand, listed 5′ 3′) for the sequences used in this study are identified below:
The results of these studies are as follows:
Dose Dependence Toxicity
To determine whether the observed toxicity induced by Mek2-3 (MAP2K2-3, m23) represented a titratable event, the toxic Mek2-3 sequence (5′ UCCAGGAGUUUGUCAAUAA, sense strand (SEQ. ID NO.638)) s transfected (Lipofectamine 2000) into HeLa cells at concentrations that varied between 1.25-10 nanomolar. The total concentration of siRNA in all of the experiments was 10 nanomolar with a non-toxic mek2-1 siRNA (5′ CAAAGACGAUGACUUCGAA, sense strand (SEQ. ID NO.639)) ing up the difference at lower Mek2-3 concentrations. The results of these experiments are shown in
To further determine whether a correlation existed between the level of silencing induced by a given siRNA and the amount of toxicity brought on by that same sequence, the four siRNAs directed against either Mek1 (MAP2K1) or Mek2 (MAP2K2) were simultaneously tested for toxicity and gene knockdown (sequences for all of the duplexes are listed in Table 1). In these experiments, the method of siRNA transfection and the means of determining toxicity were performed as previously described. To determine the level of gene knockdown, cells transfected with each siRNA were harvested and Mek1 or Mek2 expression levels were compared with those of a control gene (GAPDH) using Quantigene® Kits (Genospectra, Fremont, Calif.) that make use of art-recognized branched DNA technology. The results of these experiments are shown in
Three different approaches were used to evaluate the contributions of the RNAi pathway (and specifically siRNA off-targeting activity) to the observed siRNA-induced toxicity. First, the ability of toxic motif containing siRNA to induce cell death was investigated under circumstances where the RNAi mechanism was severely compromised. Previous studies have shown that eIF2C2(hAgo2) is responsible for RNAi-mediated mRNA cleavage and that knockdown of this gene product severely cripples the pathway (Meister, G. et al. (2004) Human Argonaute2 mediates RNA cleavage targeted by miRNAs and siRNAs. Mol Cell 15, 185-97; Tabara, H. et al. (1999) The rde-1 gene, RNA interference, and transposon silencing in C. elegans, Cell 99, 123-32. To confirm this, cells transfected with an eIF2C2-siRNA pool that induced more than 70% silencing or control siRNA were subsequently co-transfected with an EGFP expression plasmid plus either a) a control siRNA or b) an siRNA directed against EGFP (second transfection, see
As expected, eIF2C2 knockdown disabled subsequent siRNA induced silencing of EGFP (
When eIF2C2 minus cells (cells treated with eIF2C2 targeting siRNAs) were subsequently transfected with toxic siRNA, no effect on cell viability was observed (
As parallel experiments where cells were pre-transfected with control siRNA demonstrated levels of toxicity characteristic of these sequences, it was concluded that an uncompromised RNAi pathway was necessary for development of siRNA-induced toxicity.
In a second approach, the ability of toxic siRNA to induce cell death was tested when the size of the duplex was reduced from 19 bp to 17 bp. Previous studies have shown that duplexes that are shorter than 19 bp targeted mRNA sequences inefficiently, suggesting that Dicer and/or RISC fail to mediate RNAi when duplex sequence length drops below 19 bp (Elbashir, S. M., Martinez, J., Patkaniowska, A., Lendeckel, W. & Tuschl, T. (2001) Functional anatomy of siRNAs for mediating efficient RNAi in Drosophila melanogaster embryo lysate. EMBO Journal 20, 6877-88). Known toxic siRNA were then tested for toxicity in both the 19 bp and 17 bp format. Sequences used in this study are listed below.
When the length of known 19 bp toxic siRNA was reduced by 2 bp (17 bp total length, no disruption of the motif) the level of toxicity was reduced dramatically (
Finally, in a third approach, chemical modifications that eliminate RNAi-mediated off-target effects were tested for the ability to abolish siRNA-induced toxicity. Recent studies have shown that minimal chemical modification of both strands of siRNA dramatically limits off-target effects without altering target specific knockdown (see U.S. Provisional Patent Application Ser. No. 60/630,228, filed Nov. 22, 2004, entitled “Modified siRNAs with Enhanced Selectivity”; and U.S. patent application Ser. No. 11/019,831, filed Dec. 22, 2004, entitled “Modified Polynucleotides for Reducing Off-Target Effects in RNA Interference”; each of which is herein incorporated by reference). To test the effect of this modification pattern on siRNA induced toxicity, we applied the chemical modification pattern [sense strand: 2′-O-methyl modification on nucleotides 1 and 2 (counting from the 5′ end of the sense strand); antisense strand: 2′-O-methyl modification of positions 1 and 2 (counting from the 5′ end of the antisense strand), 5′ phosphorylation of the first nucleotide of the antisense strand (counting from the 5′ end of the antisense strand)] to known toxic siRNA and transfected these duplexes into HeLa cells. The sequences of the duplexes used in this study are listed in the table below:
As shown in
To better understand the mechanism behind the toxic effects of the AUUUA and AUUUG motifs, cells transfected with siRNA containing these sequences were examined by microscopy. Specifically, HeLa cells transfected with either GGACAUUUGUGUACUCACU (SEQ. ID NO.640) or GCUACUAUCUGAUUUACUG (sense strand) (SEQ. ID NO.641) siRNA were cultured for 48 hours and then examined by phase contrast microscopy. Additionally, cells were stained with Hoechst 33342 (2 micrograms/ml, 30 minutes, 37° C.) and examined by fluorescence microscopy.
To determine the cell specificity of siRNA containing toxic motifs, multiple cell types were transfected with siRNA containing toxic motifs. Specifically, HeLa cells, PC3 cells, MCF7 cells, LnCap cells, and BXPC3 cells (ATCC, Manassas, Va.) were plated (5,000 cells per well) and transfected (10 nanomolar, Lipofectamine 2000) with non-toxic (e.g., MAP2K2-1, SRD5A2-1, PPIB-dx8, PPIB-dx10 and Luciferase 1-2 and toxic (e.g. MAP2K2-3, SRD5A2-3, PPIB-dx5 and Luciferase 2-3 (5′ GAGUUGUGUUUGUGGACGA, sense strand (SEQ. ID NO.646)) siRNA and examined for the induction of cell death using the Alamar Blue assay. Results (
Experiments were performed to determine whether introduction of toxic motifs could sensitize cells to other agents that induced apoptosis. Specifically, HeLa cells were plated and transfected as in Example 1. At t=24 hours, the media was replaced with media that contained 200 micromolar hydrogen peroxide (H2O2). The 200 micromolar hydrogen peroxide has previously been shown to be non-toxic to HeLa cells. Subsequently, cell survival was measured 24 hours later using the Alamar Blue Assay.
Results of these experiments are reported in
Effects of Non-Specific siRNA Containing the GCCA Motif
Two non-specific sequences (i.e., sequences that are not designed to target a particular gene via the RNAi pathway), ACUCUAUCGCCAGCGUGAC and ACUCUAGCGCCAUCGUGCC, (SEQ. ID NO.647) both containing the GCCA motif were transfected into HeLa cells (10 nanomolar, Lipofectamine 2000) and tested for toxicity using the Alamar Blue assay at t=72 hours. Results showed (
To develop an siRNA that can be used as a transfection control reagent, the sequence of the Eg5 gene (NM—004523.2), also known as Kinesin family member 11 or TRIP5) was scanned to identify a sequence that contained one or more toxic motifs. A 62 base pair sequence (called Eg5-tox) containing two toxic motifs (sense, AUUUU and antisense, GCCA) was identified:
Sense strand:
Antisense strand:
To test the ability of this 62 base-pair sequence to induce cell death, HeLa cells were plated at a density of 10,000 cells per well (96-well plate) and transfected with the Eg5-tox sequence at varying concentrations (0.5-200 nanomolar) using Lipofectamine 2000. Subsequently, cells were cultured over the course of 72 hours and assessed for cell viability by staining with Hoechst 33342 dye. Results of these experiments showed that transfection of the Eg5-tox duplexes induced significant levels of cell death. Transfections at 50 nanomolar and 12 nanomolar concentrations were sufficient to induce greater than 90% cell death within 24 and 48 hours, respectively.
To further assess the effects of toxic molecules as transfection controls under these conditions, successively smaller molecules were tested for the ability to induce cell death. Specifically, Eg5-tox duplexes that were used are described in the table below:
The duplexes described in the above table contained at least one toxic motif, and were introduced into HeLa cells using the previously described conditions and assessed for the ability to induce cell death. Results of these experiment established that all of the fragments with the exception of the smallest duplex (21 bp) were capable of inducing cell death. Small siRNAs or pools of siRNA that did not contain toxic motifs, but did down regulate the Eg5 target, did not induce cell death in these time frames (144 hours).
While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departure from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth and as follows in the scope of the appended claims.
This application is a nonprovisional of, and claims the benefit of the filing date of, U.S. Provisional Patent Application Ser. No. 60/538,874, filed Jan. 23, 2004; U.S. Provisional Patent Application Ser. No. 60/548,285, filed Feb. 27, 2004; and U.S. Provisional Patent Application Ser. No. ______, filed Jan. 7, 2005, as attorney docket no. 13591PA2, entitled “Identification of Toxic Sequences,” with Express Mail Label No. EV279582915US; each of the above-mentioned applications is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60538874 | Jan 2004 | US | |
60548285 | Feb 2004 | US | |
60642059 | Jan 2005 | US |