The invention relates generally to recombinant or synthetic nucleic acid constructs that are capable of interacting with non-coding RNAs, e.g., a microRNA (miRNA or miR), such that the interaction modulates the structure of the nucleic acid construct, leading to a higher affinity binding of the non-coding RNA to the nucleic acid construct than to a natural target.
MicroRNAs (miRNAs) are an abundant class of short endogenous RNAs that act as post-transcriptional regulators of gene expression by base-pairing with their target mRNAs. The approximately 22 nucleotide (nt) mature miRNAs are processed sequentially from longer hairpin transcripts (primary miRNA or precursor miRNA) by the RNAse III ribonucleases Drosha (Lee et al. (2003) S
Since the discovery of miRNA, the creation of effective miRNA inhibitors (referred to in this application as anti-miRs) has been essential, first to study and verify miRNA interactions and secondly as a therapeutic tool. But, creating an effective anti-miR has proven challenging. Although a perfectly complimentary strand of RNA the same length as the miRNA may seem like the simplistic and ideal anti-miR, such an approach has limitations. For example, a perfectly complementary sequence can induce the RISC complex, inducing cleavage of the anti-miR and limiting the utility of this approach. To overcome this, a number of strategies have been rationally designed and tested as shown in
In addition to sequences and surrounding structures, chemical modifications to the RNA backbone can also be made to increase binding and decrease the likelihood of cleavage. This strategy may be acceptable for molecular tool use, but may cause hepatotoxicity when used as a therapeutic. Anti-miR therapies, including those targeting miR-122 (a liver-specific miRNA thought to be involved in fatty acid metabolism) and miR-155 (overexpressed in glioblastoma) for Hepatitis C Virus infection and brain cancer respectively hold promise. However, at present they require further improvements to reduce hepatotoxicity to fully enable their development and FDA approval.
Many commercial anti-miR suppliers suggest using 10× as much anti-miR as miRNA mimic (an ectopically added miRNA) before any effect can be seen, even over a broad 12 to 72 hour window. In other words, if 1× of an miRNA mimic produces an observable effect (inhibition of the miRNA target), it typically will require 10× of an anti-miR to see an observable effect (inhibition of an miRNA). The need for such large quantities for efficacy and the expansive timeframe make it clear that current commercial anti-miRs are inadequate.
In summary, none of these rationally designed approaches to creating anti-miRs are sufficient.
The methods described herein relate, in part, to a method of inhibiting a non-coding RNA (e.g., a miRNA) using a nucleotide construct, wherein the non-coding RNA is capable of preferentially binding to the RNA construct as compared to a natural target (e.g., an mRNA). In certain embodiments, the method comprises contacting a miRNA with an RNA construct that inhibits the miRNA (i.e., an anti-miRNA or anti-miR), wherein the miRNA binds preferentially to the anti-miRNA as compared to a natural target, e.g., a complementary messenger RNA (mRNA), thereby inhibiting the miRNA. In certain embodiments, the RNA constructs (1) bind with high affinity to a non-coding RNA, (2) do not induce the RISC complex, and/or (3) require little to no chemical modification. These features can provide advantages for both therapeutic and research applications.
In certain aspects, the methods described herein provide an RNA construct which is an RNA that hybridizes with all or a portion of the non-coding RNA and which is selected such that the hybridization stabilizes the interaction between the RNA construct and non-coding RNA. In some embodiments, hybridization of the non-coding RNA to the RNA construct may cause a conformational change in the RNA construct that acts as a trigger to activate, suppress, or modulate a cascade of biochemical events, leading to some prophylactic or therapeutic effect. For example, hybridization of the non-coding RNA to the RNA construct may stabilize a stem-loop structure in the RNA construct to enhance binding of a cellular protein. In certain embodiments, a cellular protein can bind to the RNA construct, e.g., to the stem-loop of the RNA construct, whereby the cellular protein is inhibited. In certain embodiments, inhibition of the cellular protein occurs because the cellular protein is sequestered by the RNA construct and therefore unable to function.
Thus, one aspect of the invention involves an RNA construct that includes a non-naturally occurring, continuous sequence of ribonucleotide bases. These bases define a stem-loop structure. The RNA construct has simulated, three way junction joining regions 3′ and 5′ of the stem-loop structure. In addition, the RNA construct has a first region 5′ of the 5′ joining region, including bases complementary to a 3′ region of the non-coding RNA, and a second region 3′ of the 3′ joining region, including bases complementary to a 5′ region of the non-coding RNA. The non-coding RNA binds preferentially to the RNA construct compared to a natural target.
In certain embodiments, the base sequence of the first and second regions are selected to hybridize with complementary bases on the non-coding RNA and are spaced apart by an intermediate region on the non-coding RNA defining another three way junction joining region.
In certain embodiments, the non-coding RNA is an miRNA, an siRNA, a piRNA, an snoRNA, or an lncRNA. In certain embodiments, the RNA construct comprises a modified nucleic acid base. In certain embodiments, the RNA construct further comprises a detectable marker, for example, a fluorescent moiety or biotin.
In certain embodiments, the method further comprises transfecting a DNA encoding the RNA construct into a cell.
In certain embodiments, binding of the non-coding RNA to the RNA construct is detectable on a non-denaturing gel. In certain embodiments, binding of the non-coding RNA to the RNA construct is detectable on a denaturing gel. In certain embodiments, binding of the non-coding RNA to the RNA construct is detectable on a 1 M, 3 M, 7 M and/or 8 M denaturing urea gel.
In certain embodiments, the non-coding RNA is an miRNA and the RNA construct inhibits the function of the miRNA as determined by (a) measuring the quantity of a downstream target of the miRNA in the presence of the RNA construct; (b) measuring the quantity of the downstream target of the miRNA in the absence of the RNA construct; and (c) determining that the quantity of the downstream target in the presence of the RNA construct is greater that the quantity of the downstream target the absence of the RNA construct. The downstream target can be, for example, an mRNA or a protein.
In certain embodiments, the reduction in free energy that occurs when an RNA construct binds a non-coding RNA is at least about 10 kCal, at least about 25 kCal, at least about 50 kCal, at least about 100 kCal, at least about 1,000 kCal. In certain embodiments, the reduction in free energy that occurs when a non-coding RNA binds an RNA construct is at least 1% more, at least 2% more, at least 5% more, at least 10% more, at least 25% more, at least 50% more at least 75% more, at least 95% more, at least 100% more, at least 200% more, at least 500% more than the reduction in free energy that occurs when a non-coding RNA binds its natural target.
In certain embodiments, the non-coding RNA is an miRNA, and at least 50% of the miRNA remains uncleaved by the RISC complex.
In certain embodiments, the RNA construct inhibits the non-coding RNA and in the presence of the non-coding RNA, the construct assumes a stem-loop conformation promoting association with a cellular protein, whereby association of the stem-loop with the cellular protein inhibits the cellular protein. In certain embodiments, the RNA construct inhibits the non-coding RNA by sequestering the non-coding RNA, and association of the stem-loop with the cellular protein inhibits the cellular protein by sequestering the cellular protein. In certain embodiments, inhibition of the non-coding RNA and cellular protein is lethal to the cell.
In certain embodiments, a joining region comprises no more than about 10 nucleotides. In certain embodiments, at least one three-way junction joining region comprises at least one unpaired nucleotide. In certain embodiments, each three-way junction joining region comprises at least one unpaired nucleotide. In certain embodiments, at least one three-way junction joining region comprises at least one unpaired nucleotide, at least 2 unpaired nucleotides, at least 3 unpaired nucleotides, at least 4 unpaired nucleotides, at least 5 unpaired nucleotides, at least 6 unpaired nucleotides, at least 7 unpaired nucleotides, at least 8 unpaired nucleotides, at least 9 unpaired nucleotides, or at least 10 unpaired nucleotides. In certain embodiments, each three-way junction joining region comprises at least one unpaired nucleotide, at least 2 unpaired nucleotides, at least 3 unpaired nucleotides, at least 4 unpaired nucleotides, at least 5 unpaired nucleotides, at least 6 unpaired nucleotides, at least 7 unpaired nucleotides, at least 8 unpaired nucleotides, at least 9 unpaired nucleotides, or at least 10 unpaired nucleotides.
In certain embodiments, at least one base in the stem of the stem-loop structure is mismatched with its potential binding partner so as to reduce the stability of the stem-loop conformation.
In certain embodiments, the non-coding RNA is characteristic of a pathogen. In certain embodiments, the pathogen is a virus or a single-celled microorganism. In certain embodiments, the non-coding RNA is expressed preferentially in a cell type of a multicellular organism. In certain embodiments, the cell type is an infected cell or a neoplastic cell, and the non-coding RNA is expressed by an organism infecting the cell or by the neoplastic cell.
In certain embodiments, the RNA construct comprises more than one said stem-loop structure.
The foregoing and other objects, features, and advantages of the present invention, as well as the invention itself, may be more fully understood from the following description of preferred embodiments, when read together with the accompanying drawings.
The present invention relates to the use of interactions between a nucleic acid construct, such as an RNA construct, and a non-coding RNA. As disclosed herein, an RNA construct can be designed to bind to a non-coding RNA, wherein the non-coding RNA binds to the RNA construct preferentially as compared to a natural target. In certain embodiments, the stem-loop of the RNA construct binds to a cellular protein (e.g., eIF3; eIF4E, G, or A; aconitase 1; the 40S subunit) thereby inhibiting the cellular protein. In certain embodiments, the stem-loop of the RNA construct is stabilized by binding to a non-coding RNA. Stabilization of the RNA construct by the non-coding RNA can promote the interaction of the stem-loop structure with a cellular protein thereby inhibiting the cellular protein. In certain embodiments, the cellular protein is an RNA binding protein (RBP), a protein that binds single or double stranded RNA and contains one or more characteristic structural motifs. Examples of RBPs include stem-loop binding protein (SLBP), iron-responsive element binding protein-2 (IREB-2), and SECIS binding protein (SBP2).
In certain embodiments, the invention described herein relates to a post-transcriptional regulatory mechanism called “structurally interacting RNA” (or sxRNA). sxRNA is a trans-mRNA/miRNA interaction that forms a three way junction (3WJ). Certain features of this interaction can be used to design highly stable RNA constructs that inhibit non-coding RNAs (e.g., that act as anti-miRs), for example, by using a stem-loop instead of a bulge in the middle of an anti-miR RNA construct. This approach has three advantages for use as an anti-miR: (1) the presence of a stem-loop prevents cleavage of the anti-miR by the RISC complex, (2) many of the interactions we have informatically and experimentally observed have flanks of a stem-loop that are perfectly complementary to a miRNA without recruiting the RISC complex, and (3) importantly, three way junctions appear to be very stable structures.
The stability of three way junctions might not be readily apparent. However, integral functional RNA based molecules such as ribosomal RNA would not be effective unless three way junctions were stable enough to hold them in a particular conformation the vast majority of the time. In studying three way junctions Zhang et al. ((2013) “Crystal structure of 3WJ core revealing divalent ion-promoted thermostability and assembly of the Phi29 hexameric motor pRNA,” RNA 19:1226-1237) found that DNA-DNA or DNA-RNA based three way junctions were moderately stable, but RNA-RNA based three way junctions were so strong that they remained bound, even in an 8 M urea denaturing gel. Similar results, shown in
Making RNA constructs (e.g., anti-miRs) against specific non-coding RNAs (e.g., miRNAs) is described herein. RNA constructs comprise at least one stem-loop structure, but no specific stem-loop is preferred. Examples described herein predominantly use a histone stem-loop, but any stem-loop can be used to create an RNA construct. The stem can be long or short, and the sequence can be varied as long as the sequences will form a stem in the proper confirmation. The loop itself also can be varied, and can range from a hairpin structure to a complex loop comprising a variety of secondary structure including additional loops, cloverleafs, etc., comprising up to thousands of bases, as long as a non-coding RNA can preferentially bind the RNA construct as compared to a natural target. In certain embodiments, the stem-loop in its final conformation optionally can be a binding site for a cellular protein (e.g., an RBP), depending upon the desired effect. The stem-loop can also be incorporated anywhere between the start and end of the nucleotides complementary to the non-coding RNA with at least one complementary nucleotide on both sides of the stem-loop. The flanks of the stem-loop need to be generally complementary to the non-coding RNA (e.g., a miRNA target, however, as can be seen in
Use of an RNA construct described herein can be used in combination with other strategies known in the art for inhibiting non-coding RNAs to increase efficacy. For example, use of modified nucleic acid bases can be used to achieve high affinity and/or stability by inhibiting catalytic cleavage of an RNA construct bound by the non-coding RNA (e.g., a miRNA) or by inhibiting other cellular nucleases. Examples of modified nucleic acids include phosphate backbone modification (e.g., phosphodiester or phosphothiorate internucleotide linkages), sugar modifications (e.g., locked nucleic acids (LNAs), 2′OMe, 2′-fluoro RNA (2′F) and 2′MOE), and non-ribose backbones (e.g., phosphoramidate morpholino oligo (PMO) and peptide nucleic acid (PNA)). See, Lennox et al. (2001) “Chemical modification and design of anti-miRNA oligonucleotides,” G
Other modifications or additions can be made to the non-coding RNA such as the addition of a biotin tag or fluorescent tag, or other changes generally used in the art.
A. General
The RNA construct comprises structural elements that aid in the binding of the non-coding RNA to the RNA construct such that the non-coding RNA binds preferentially to the RNA construct as compared to a natural target (e.g. an mRNA). Specifically, the RNA construct typically contains nucleotides that will lack binding partners at joining regions formed when bound to a non-coding RNA, providing flexibility which enhances the affinity with which the RNA construct binds to a non-coding RNA and/or the stability of the RNA construct-non-coding RNA complex. Modified nucleic acid bases, described above, can be used in the constructs described herein.
B. Uses
1. Inhibition of miRNA
An RNA construct can be designed to bind to a non-coding RNA, wherein the non-coding RNA binds to the RNA construct preferentially as compared to a natural target. Inhibition of the non-coding RNA typically results in the opposing effect of the function of the non-coding RNA. For example, inhibition of an miRNA can lead to increased production of the miRNA's natural target. In certain embodiments, an miRNA that normally promotes cellular functioning (e.g., by suppressing translation of an apoptotic protein) can be inhibited to kill the cell (e.g., an infected cell). This strategy can be used to target any pathogen, including viruses, bacteria, fungi or protozoa. Thus, the strategy can kill the cell harboring a pathogen and/or the pathogen itself. Beyond pathogens, this strategy can be used to eliminate diseased cells, such as neoplastic cells (e.g., cancer cells). Alternatively, an miRNA that normally inhibits cellular functioning (e.g., by suppressing translation of an inhibitor of apoptosis) can be inhibited to promote survival of the cell (e.g., a neurodegenerating cell).
2. Inhibition of miRNA and a Cellular Protein
In certain embodiments, RNA constructs comprise a binding site for a cellular protein (e.g., an RBP). The binding site may be present in the presence or absence of a non-coding RNA, or may form only when a non-coding RNA binds the RNA construct and stabilizes the binding site for the cellular protein. Binding of the cellular protein can inhibit the function of the cellular protein. In certain embodiments, inhibition of the cellular protein occurs because the cellular protein is sequestered by the RNA construct and therefore unable to function because it cannot make the necessary binding contacts or because it is mis-localized.
In certain embodiments, an RNA construct inhibits both a non-coding RNA and a cellular binding protein, leading to a certain functional output Inhibition of a non-coding RNA and a cellular protein can, for example, lead to the death of the cell. In certain embodiments, an RNA construct binds to and inhibits a non-coding RNA in an infected cell, and binds to and inhibits a cellular protein necessary for any essential cellular function (e.g., transcription and/or translation) Inhibition of both the non-coding RNA and the cellular protein leads to the death of the infected cell. In certain embodiments, inhibition of both the non-coding RNA and the cellular protein leads to the synergistic inhibition of a cellular function than inhibition of either the non-coding RNA or the cellular protein alone.
C. Construct Design
The following section describes how to design an RNA construct of the invention. RNA constructs comprise at least one stem-loop structure, but no specific stem-loop is preferred. Examples described herein predominantly use a histone stem-loop, but any stem-loop can be used to create an RNA construct. The stem can be long or short, and sequence can be varied as long as the sequences will form a stem in the proper confirmation. The loop itself also can be varied, and can range from a hairpin structure to a complex loop comprising a variety of secondary structure including additional loops, cloverleafs, etc., comprising up to thousands of bases, as long as a non-coding RNA can preferentially bind the RNA construct as compared to a natural target. In certain embodiments, the stem-loop in its final conformation optionally can be a binding site for a cellular protein (e.g., an RBP), depending upon the desired effect. The stem-loop can also be incorporated anywhere between the start and end of the nucleotides complementary to the non-coding RNA with at least one complementary nucleotide on both sides of the stem-loop.
The flanks of the stem-loop need to be generally complementary to the non-coding RNA (e.g., a miRNA target), however, as can be seen in
While some details of design will depend upon the stem-loop structure, and its corresponding mode of action, typically design of RNA constructs will involve the following steps (which need not be performed in the order listed).
Three-way junctions are found in nature. (See, e.g., Lescoute et al. (2006) RNA 12:83-93.) The present invention simulates these natural structures and utilizes these simulated three-way junctions as a pattern upon which to build an RNA construct.
When developing an RNA construct, the joining regions of the construct as well as the non-coding RNA are chosen according to
If an RNA construct patterned on Family A is desired, J3 is selected to include any number of unpaired nucleotides from 0 to about 10 and J2 is selected to include any number of unpaired nucleotides, from 0 to about 10, such that the number of unpaired nucleotides selected for J3 is less that the number selected for J2. J1 includes any number of unpaired nucleotides from 0 to about 10. In certain embodiments, the number of unpaired nucleotides selected for J3 is between 1 and 3, the number of unpaired nucleotides in region J2 is between 3 and 9, and the number of unpaired nucleotides in region J1 is between 0 and 4. If an RNA construct patterned on the Family B conformation is desired, J3 is selected to include any number of unpaired nucleotides from 0 to about 10 and should be approximately equal (e.g., equal to 0 to about 10) to the number selected for J2. J1 includes any number of unpaired nucleotides from 0 to about 10. In certain embodiments, the number of unpaired nucleotides in each region (J1, J2 and J3) is between 2 and 6. If an RNA construct patterned on Family C is desired, J3 is selected to include any number of unpaired nucleotides from 0 to about 10 and J2 is selected to include any number of unpaired nucleotides, from 0 to about 10, such that the number of unpaired nucleotides in J3 is greater than the number selected for J2. J1 includes any number of unpaired nucleotides from 0 to about 10. In certain embodiments, the number of unpaired nucleotides selected for J3 is between 3 and 9, the number of unpaired nucleotides in region J2 is between 1 and 5, and the number of unpaired nucleotides in region J1 is between 0 and 5.
Tertiary interactions of bases in the joining regions with bases in the stem regions, as identified by crystallography in natural forms of three way junctions, may be engineered. On this basis, one can change the conformation to improve a given activity or increase affinity and/or stability.
To design a weakened stem-loop, point mutations are engineered in the wild-type (including a consensus motif sequence) by introducing non-canonical base pairs (i.e., mismatches) so that structure formation, without a trans-acting support, is no longer energetically favorable. The resulting weakened structure typically should not form bonds strong enough to spontaneously establish an active form of the structure, and when structure is re-formed via trans-acting stabilization, the bond distances typically should not interfere with function. The positions and identity of nucleotides important for activity typically are identified so that these bases are not changed. While vital nucleotides typically are not altered, the nucleotide with which a given vital nucleotide pairs may be able to be altered to weaken the stem without sacrificing binding to the reconstituted structure. Stem nucleotides that only provide structural integrity can be altered to reduce thermodynamic favorability of stem formation. In this case, both bases of a pair can be altered if desired. Canonical base pairs are defined as A-U, U-A, G-C, C-G. There are a number of alternative pairings such as G-U and U-U with varying energy levels and bond distance. The choice as to which bases should be substituted is made with the awareness that if relative bond distance and/or conformation of certain bases in the stem is important to activity, appropriate non-canonical pairings should be chosen (See, for example, Leontis, et al. (2002) N
In addition to bond distance, bond angles and their effects on helix twist and stem “kink” may be considered.
In the case of a cellular protein binding site, a consensus structure, which can be generated by binding assays, typically is the starting point, but any wild-type sequence (including any consensus sequence) can be used. Although the discussion of RNA construct design principles herein typically refers to an RNA sequence as the starting point, it should be understood that not only RNA sequences but also equivalent DNA sequences can be used. As the skilled artisan understands, an RNA sequence is equivalent to the sequence of a coding strand of genomic DNA or cDNA, because RNA that is transcribed from genomic DNA or cDNA has an identical sequence to the coding strand sequence of the genomic DNA or cDNA (with “T” nucleotides in the DNA sequence and “U” nucleotides in the RNA sequence). Thus, the design principles described herein are applicable when using RNA as a starting point or DNA as a starting point.
In order to demonstrate the concepts of the present invention, a generalized, mock stem-loop consensus sequence is depicted in
Flank regions are the regions of nucleotides adjacent to the stem on the 5′ and 3′ sides of the stem-loop structure that form two of the three joining regions and that bind to the non-coding RNA.
As shown in
Design of an RNA construct can be aided by use of any number of existing informatic RNA folding programs such as mfold, Sfold and the Vienna software. In addition, RNAFold can be used to examine optimal folds of a single RNA molecule; RNACofold can be used to examine optimal folds of two interacting RNA molecules; MiRanda or RNAHybrid can be used as part of high throughput screens to find naturally occurring instances of RNA construct-non-coding RNA interactions; and JMol can be used for visualizing existing crystal structures. These tools predict the probable structure of RNA and can be used in the design of complementary RNA constructs with predictable structural influences upon contact with the non-coding RNA. Depending on the non-coding RNA sequence, structural aspects may be exploited in the design of the RNA construct, especially with respect to areas of uniqueness in the non-coding RNA. Examples of this strategy include hairpin, stem-loops, cloverleafs, kissing complexes and other conformational structures known to result from traditional RNA-RNA interactions.
Generally, SELEX (Systematic Evolution of Ligands by Exponential Enrichment) is a method for generating a ligand of interest by taking a population of randomly-generated ligands (e.g., small molecules, polynucleotides or polypeptides) and systematically selecting and amplifying ligands that meet chosen criteria. To select for a ligand of interest, a very large polynucleotide library of random sequences is synthesized. Next, the sequences are exposed to a target binding partner (e.g., a protein or small molecule). Unbound sequences are removed, and the bound sequences are eluted and amplified. The amplified population is again exposed to the target binding partner, and the process is repeated. The stringency of binding and elution conditions can be increased to preferentially enrich only the tightest binding sequences. After a number of cycles, the remaining sequences are identified.
An RNA construct can be produced using a combination of the design approach outlined above and a SELEX-like approach. Use of a SELEX-like approach allows efficient screening of a large range of all possible RNA construct sequences for the best functional candidates. Designing an RNA construct with a SELEX-like approach has several steps in common with the above-described design approach, including choosing a functional motif for the RNA construct (e.g., histone stem-loop); identifying nucleotides important for desired activity (e.g., binding to a cellular protein); choosing a non-coding RNA sequence, such as an miRNA (e.g., HCMV-miR-US4 or miR-373), which is specific to a desired environment (e.g., HCMV-infected cells or breast cancer cells); and designing the flank sequences of the RNA construct by choosing the region of the non-coding RNA sequence to which the RNA construct will bind.
A SELEX-like approach may be taken by keeping the flank sequences and nucleotides critical to the functioning of the motif constant and randomly generating sequences for the remaining positions. Specifically, a library of nucleic acids is generated having sequences in which the constant positions are the same in each sequence, and random nucleotides are placed into the variable positions.
A starting library of potential RNA constructs having sequences in which constant positions are kept constant, and variable regions containing random sequences, is initially screened. In the initial screen, nucleic acids having sequences that bind to a protein of interest (e.g., a cellular binding protein) in the absence of the non-coding RNA are removed. Then, remaining nucleic acids are exposed to the protein of interest in the presence of the non-coding RNA, and unbound nucleic acids are removed. Bound nucleic acids are eluted, amplified, and the process is repeated under desired stringency conditions. Nucleic acids that remain after multiple rounds of selection are identified and further tested for desired activity. For example, identified nucleic acids may be attached to a polypeptide coding region that includes one or more reporter genes, placed in an expression environment in the presence or absence of the non-coding RNA and assayed to determine whether the reporter gene is expressed. Desired RNA constructs will be those that express the reporter gene only in the presence of the non-coding RNA.
An expression environment is any environment in which a polypeptide coding region can be expressed (e.g., a naturally-occurring cell, such as a neoplastic cell or infected cell; a non-naturally occurring cell; or a cell-free expression system (e.g., available from Life Technologies, Foster City, Calif.), such as rabbit reticulocyte lysate, wheat germ extract or E. coli cell-free system).
D. Histone Stem-Loop (HSL)
The principles of RNA construct design can be applied to a histone stem-loop starting point. For example, a wild-type histone stem-loop (HSL) motif (including a consensus sequence of a HSL motif) can be used as the starting point for further design of an RNA construct in accordance with the principles described above.
Metazoan cell cycle-regulated histone mRNAs are the only known cellular mRNAs that do not terminate in a poly(A) tail. Instead, these messages terminate in a conserved HSL motif, which functions to increase translation. The HSL motif is recognized and bound by “Stem-loop Binding Protein” (SLBP), an RBP, which is believed to upregulate the translation of one or more adjacent polypeptide coding regions through the action of a protein complex, shown in
The HSL motif has a conserved size and structure. The visual depiction of this is shown in the following HSL consensus sequence.
Table 3 shows the FASTA nucleic acid codes that are used throughout the description.
In the HSL, one or more of the guanines at the base of the 5′ side stem, the adenine on the 3′ side stem, and the first and third uridines in the loop are typically needed for binding to a SLBP. Accordingly, if SLBP binding is desired, care should be taken not to alter these bases when designing an RNA construct. However, in some instances the bases that are typically needed for binding to a SLBP may be altered even if some amount of activity is lost. The remaining nucleotides are more variable. Examples of naturally occurring HSL sequences are identified by GenBank number in FIG. 11 of U.S. Pat. No. 8,841,438, incorporated by reference herein, and can be accessed at www.ncbi.nlm.nih.gov/. “Start” and “end” indicate the location of the HSL sequence within the entire nucleotide sequence. Accordingly, any HSL consensus sequence or sequence from FIG. 11 of U.S. Pat. No. 8,841,438 can be used as the starting point to construct an RNA construct according to the invention.
While the HSL consensus sequence can be described visually, it can also be described using the logic search pattern (“PatSearch description”) described in Grillo, et al. (2003) N
The PatSearch description for the HSL described visually above is:
“r1” indicates the rules for pairing to be applied to the consensus sequence. For example, the HSL stem can include “AU” base pairings, “UA” base pairings, “GC” base pairings, “CG” base pairings, “GU” base pairings, and “UG” base pairings. The next line shows the consensus sequence, beginning at the 5′ end with three “M” nucleotides. Next, “p1” indicates the first pattern of the consensus sequence, “GGYYY,” which represents the 5′ side of the stem. The next nucleotides in the sequence are UHHUHA, which represent the loop. Next, “r1˜p1” indicates that the pairing rules “r1” must be applied to the “p1” sequence to furnish the 3′ side of the stem. Finally, the 3′ end of the consensus sequence has two amino nucleotides, “MM.”
Other examples of HSL consensus sequences are shown below.
It should be understood that any wild-type HSL sequence (including a consensus sequence) can be used as the starting point to make any number of RNA constructs. From this wild-type HSL, at least one stem-weakening mutation is incorporated into the construct to decrease the formation of the stem-loop structure in the absence of a non-coding RNA. Consensus sequences that include non-canonical base pairings (i.e., mismatches) are considered to be weakened when such a pairing (e.g., a GU or a UG pairing) is chosen, even if the consensus sequence is broad enough to include non-canonical pairings at those same positions (e.g., because some wild-type sequences falling within the consensus sequence have canonical base pairing, rather than non-canonical base pairing, at the location). Alternatively, an RNA construct can contain a wild-type stem-loop if the stem-loop, in the context of the RNA construct, has an increased Minimum Free Energy (MFE) compared to the stem-loop in a wild-type context, such that the stem-loop of the RNA construct will be less stable than will the stem-loop in a wild-type context.
2. Design of an Example RNA Construct when Starting from an HSL Motif
This section illustrates the design of an RNA construct, shown in
a. Joining Regions
When designing an RNA construct based on an HSL motif, the size of the J2 joining region can be chosen so that the RNA construct-non-coding RNA structure forms a Family A, Family B or Family C structure. In the example depicted in
HSL Consensus Sequence I begins with three “M” nucleotides. In
The HSL consensus sequence ends with two “M” nucleotides. In
b. Optional Weakening of the Stem
The HSL consensus sequence has the sequence “GGYYY” on the 5′ side of the stem, corresponding to “GGUUC” in
c. Flank Regions
Flank regions can be designed according to the principles discussed above. The flank regions of the RNA construct include two of the three joining regions as well as sequence that is complementary to the non-coding RNA. In the example shown in
As described above, the selected HSL sequence is then further modified to attach the polypeptide coding region using standard molecular biological protocols.
E. Iron Response Element (IRE)
Alternatively, a wild-type Iron Response Element (IRE), including a consensus sequence of an IRE motif, can be used as the starting point for functional design of an RNA construct in accordance with the principles described above. The IRE is an stem-loop motif with a binding site for RBPs such as iron-responsive binding proteins, for example, aconitase 1 (ACO1, also known as iron-responsive element binding protein 1 or IRP1) and iron-responsive element binding protein 2 (IREB-2, also known as IRP2). (See Hentze et al. (1996) P
The structure of the IRE motif is conserved but the exact composition of nucleotides can be variable, providing a useful platform for design of an RNA construct. Three subtypes of IRE consensus sequences can be used as a starting point from which to design an RNA construct.
The PatSearch pattern for IRE subtype 1 is as follows:
1.) IRE1
“r1” indicates the rules for pairing to be applied to the consensus sequence. For example, the IRE1 consensus sequence can include “AU” base pairings, “UA” base pairings, “GC” base pairings, “CG” base pairings, “GU” base pairings, and “UG” base pairings. The IRE1 consensus sequence begins at the 5′ end with the first pattern, “p1,” which forms the first part of the 5′ side of the stem. “p1” defines a string of 2 to 8 nucleotides of any composition (A, G, C, or U). The next part of the 5′ side of the stem is an unpaired “C” nucleotide, followed by the second pattern “p2,” the last part of the 5′ stem. “p2” defines a string of exactly 5 nucleotides of any composition (A, G, C, or U). Following “p2” is the loop, which consists of the sequence “CAGWGH.” Following the loop is the first part of the 3′ side of the stem, which can be any string of nucleotides that pairs to the “p2” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p2”). The final part of the 3′ stem can be any string of nucleotides that pairs to the “p1” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p1”). Examples of specific naturally-occurring IRE1 sequences are identified by GenBank number in FIG. 14 of U.S. Pat. No. 8,841,438, incorporated by reference herein, and can be accessed at www.ncbi.nlm.nih.gov/. “Start” and “end” indicate the location of the IRE1 sequence within the entire nucleotide sequence.
The PatSearch pattern for IRE subtype 2 is as follows:
2.) IRE2
“r1” indicates the rules for pairing to be applied to the consensus sequence. For example, the IRE2 consensus sequence can include “AU” base pairings, “UA” base pairings, “GC” base pairings, “CG” base pairings, “GU” base pairings, and “UG” base pairings. The IRE2 consensus sequence begins at the 5′ end with the first pattern, “p3,” which forms the first part of the 5′ side of the stem. “p3” defines a string of 2 to 8 nucleotides of any composition (A, G, C, or U). The next part of the 5′ side of the stem is the sequence “NNC,” which will be unpaired. Next, the second pattern, “p4,” forms the last part of the 5′ stem. “p4” defines a string of exactly 5 nucleotides of any composition (A, G, C, or U). Following “p4” is the loop, which consists of the sequence “CAGWGH.” Following the loop is the first part of the 3′ side of the stem, which can be any string of nucleotides that pairs to the “p4” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p4”). The second part of the 3′ stem can be any single unpaired nucleotide “N.” The final part of the 3′ stem can be any string of nucleotides that pairs to the “p3” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p3”). Examples of specific naturally-occurring IRE2 sequences are identified in by GenBank number in FIG. 15 of U.S. Pat. No. 8,841,438, incorporated by reference herein, and can be accessed at www.ncbi.nlm.nih.gov/. “Start” and “end” indicate the location of the IRE2 sequence within the entire nucleotide sequence.
The PatSearch pattern for IRE subtype 3 is as follows:
3.) IRE3
“r1” indicates the rules for pairing to be applied to the consensus sequence. For example, the IRE3 consensus sequence can include “AU” base pairings, “UA” base pairings, “GC” base pairings, “CG” base pairings, “GU” base pairings, and “UG” base pairings. The IRE3 consensus sequence begins at the 5′ end with the first pattern, “p5,” which forms the first part of the 5′ side of the stem. “p5” defines a string of 6 to 8 nucleotides of any composition (A, G, C, or U). The next part of the 5′ side of the stem is an unpaired “C” nucleotide, followed by the second pattern “p6,” which makes up the third part of the 5′ stem. “p6” defines a string of exactly 2 nucleotides of any composition (A, G, C, or U). The final part of the 5′ stem is the pattern “p7,” which consists of exactly 3 nucleotides. Following “p7” is the loop, which consists of the sequence “CAGWGH.” Following the loop is the first part of the 3′ side of the stem, which can be any string of nucleotides that pairs to the “p7” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p7”). The second part of the 3′ stem is the unpaired nucleotide “U.” The next part of the 3′ stem can be any string of nucleotides that pairs to the “p6” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p6”). The final part of the 3′ stem can be any string of nucleotides that pairs to the “p5” pattern, provided that the nucleotides follow the “r1” rules for pairing (as indicated by the PatSearch description “r1˜p5”). Examples of specific naturally-occurring IRE3 sequences are identified by GenBank number in FIG. 16 of U.S. Pat. No. 8,841,438, incorporated by reference herein, and can be accessed at www.ncbi.nlm.nih.gov/. “Start” and “end” indicate the location of the IRE3 sequence within the entire nucleotide sequence.
F. SECIS
Alternatively, a wild type Selenocysteine Insertion Element (SECIS), including a consensus sequence of a SECIS motif, can be used as the starting point for functional design of an RNA construct in accordance with the principles described above. Two varieties of SECIS elements exist, type 1 and type 2 (SECIS1, SECIS2). An active SECIS element present in the 3′UTR of an mRNA causes the translation machinery to reinterpret a UGA stop codon as a selenocysteine residue. Selenocysteine incorporation at UGA codons also uses specialized trans-acting factors. They include a selenocysteine-specific tRNA, an elongation factor specific for this tRNA and a SECIS-binding protein, (e.g., SBP2, which is an RBP), which recruits the elongation factor to the selenoprotein mRNA. Ribosomal Protein L30 also binds the SECIS element. It is believed that a complex of SELB (the specialized elongation factor), Sec-tRNASec (selenocysteine tRNA), GTP (guanosine triphosphate), SBP2 and SECIS form and then associate with the ribosome via L30. At this point, a conformational change in the SECIS element triggers the release of Sec-tRNASec and GTP hydrolysis, which allows incorporation of the selenocysteine residue into the polypeptide. (See, e.g., Chavatte et al. (2005) N
The PatSearch description for the SECIS type 1 consensus sequence is as follows:
A.) SECIS1
The PatSearch description for the SECIS type 2 consensus sequence is as follows:
B.) SECIS2
“r1” indicates the rules for pairing to be applied to the consensus sequence. For example, the SECIS2 consensus sequence can include “AU” base pairings, “UA” base pairings, “GC” base pairings, “CG” base pairings, “GU” base pairings, and “UG” base pairings. The SECIS2 consensus sequence begins at the 5′ end with the first pattern, “p1,” which forms the first part of the 5′ side of the stem. “p1” defines a string of 4 to 19 nucleotides of any composition (A, G, C, or U). The next part of the 5′ side of the stem is the second pattern “p2,” which defines a string of 2 to 9 unpaired nucleotides of any composition (A, G, C, or U), followed by an “A.” The next part of the 5′ side of the stem is the string of nucleotides “UGAN.” The next part of the 5′ side of the stem is defined by the pattern “p3,” which consists of 8-12 nucleotides. “p3” is further defined by the PatSearch description, “p3:(((̂rr|̂mc)|̂su) 6 . . . 10),” which indicates that “p3 must start with either “RR,” “MC,” or “SU” followed by 6-10 nucleotides of any composition (making up 8-12 nucleotides total). Pattern “p4” defines the 5′ end of the loop, and can be 0 to 3 nucleotides of any composition (A, G, C, or U). The loop continues with pattern “p5,” defined by the string of nucleotides “AAV.” The loop continues with pattern “p6,” which consists of 11 to 14 nucleotides which begins with 0 to 1 nucleotide of any compositions (A, G, C, or U), followed by the pattern p7, which can be 2 to 7 nucleotides of any composition (A, G, C, or U) followed by 3 to 6 nucleotides, provided that the last 3 nucleotides, which will form the first part of the 3′ side of the stem, follow the “r1” rules for pairing. The next part of the 3′ side of the stem can be any string of nucleotides that pairs to the “p3” pattern, provided that the nucleotides follow the “r1” rules for pairing. The PatSearch description, “(((r1˜p3[1,0,0]|r1˜p3[1,1,0])|r1˜p3[1,0,1])|r1˜p3[0,1,1])” indicates that the nucleotides in this section can (but are not required to) incorporate 1 mismatch, 0 insertions, and 0 deletions; 1 mismatch, 1 insertion, and 0 deletions; 1 mismatch, 0 insertions, and 1 deletion; or 0 mismatches, 1 insertion, and 1 deletion. The second part of the 3′ stem is the string of nucleotides “NGAN.” Pattern 10 defines the next part of the 3′ stem, which consists of 2 to 9 unpaired nucleotides of any composition (A, G, C, or U). The final part of the 3′ stem can be any string of nucleotides that pairs to the “p1” pattern, provided that the nucleotides follow the “r1” rules for pairing. Examples of specific naturally-occurring SECIS type 2 sequences are identified in by GenBank number in FIG. 19 of U.S. Pat. No. 8,841,438, incorporated by reference herein, and can be accessed at www.ncbi.nlm.nih.gov/. “Start” and “end” indicate the location of the SECIS2 sequence within the entire nucleotide sequence.
G. Internal Ribosome Entry Site (IRES)
Alternatively, a wild-type Internal Ribosomal Entry Site (IRES), including a consensus sequence of a IRES motif, can be used as the starting point for functional design of an RNA construct in accordance with the principles described above. The IRES is located in the 5′ UTR of mRNA and allows for 5′ cap-independent initiation of translation. Multiple forms have been identified and some interact with different factors to begin translation. For example, some IRESes bind eIF4G to initiate translation, while some act by directly binding to the 40S subunit. However, IRESes are not known to bind to an RBP. Instead, proper formation of the IRES facilitates binding of proteins that make up the translation machinery itself (e.g., eIF4G or the 40S subunit).
IRESes typically are large and more complex than a simple stem-loop. A few flaviviruses have an IRES with similar structure, shown generally in
Non-coding RNAs can be any endogenous or exogenously supplied RNA that does not normally lead to the expression of an encoded polypeptide and that binds to an RNA construct. Non-coding RNAs bind to the flank regions of an RNA construct, such that the non-coding RNA preferentially binds to the RNA construct as compared to a natural target (e.g., an mRNA). A non-coding RNA can be, but is not limited to, a viral polynucleotide, a bacterial polynucleotide, an polynucleotide expressed in a neoplastic cell, or a polynucleotide characteristic of a protein deficiency. Non-coding RNA can be found in, for example, but not limited to, a virally- or bacterially-infected cell, a neoplastic cell, a diseased cell, a tissue, a cell culture, or a sample containing polynucleotides (e.g., a blood sample).
The sequence of the non-coding RNA also should allow for the RNA construct to base pair with a favorable energy minimization to promote preferential binding, i.e., the non-coding RNA should bind preferentially to the RNA construct as compared to a natural target. For example, the calculated MFE (minimum free energy) structure should be such that the non-coding RNA is predicted to hybridize the RNA construct's flank regions as desired (rather than the non-coding RNA binding to a natural target, or the RNA construct folding with itself and the non-coding RNA folding with itself or the RNA construct and the non-coding RNA hybridizing in a manner other than what is desired).
DNA encoding the RNA construct (and, optionally, a polypeptide coding region) of the present invention can be prepared in any number of ways known in the art. DNA fragments encoding portions of the RNA construct can be obtained from any source, for example, from a cDNA library or amplified from genomic DNA using polymerase chain reaction (PCR). Methods for isolating nucleic acids, synthesizing nucleic acids, hybridizing nucleic acids, making and screening cDNA libraries, and performing PCR are well known in the art (see, e.g., Gubler et al. (1983), G
In an exemplary embodiment, at least a portion of a contemplated RNA construct is chemically synthesized. The single stranded molecules that comprise RNA constructs may be synthesized using any of a variety of techniques known in the art, such as those described in Usman et al. (1987) J. A
Alternative methods of chemical synthesis well known to the skilled artisan (see, for example, Engels et al. (1989) A
DNA encoding the RNA construct, or portions thereof, (and, optionally, a polypeptide coding region) can be inserted into an appropriate expression or amplification vector using standard ligation techniques. The vector is typically selected to be functional in the particular expression environment employed (i.e., the vector is compatible with the host cell machinery such that amplification of the RNA construct and/or expression of the RNA construct can occur). DNA encoding the RNA construct may be amplified/expressed in prokaryotic, yeast, insect (baculovirus systems) and/or eukaryotic host cells. These transgenes can be introduced as a linear construct, a circular plasmid, or a viral vector, which can be incorporated and inherited as a transgene integrated into the host genome. The transgene can also be constructed to permit it to be inherited as an extrachromosomal plasmid (Gassmann et al. (1995) P
RNA constructs and, optionally, polypeptide coding regions can be expressed and purified using common molecular biology and biochemistry techniques. For example, recombinant expression vectors can be used which can be engineered to carry an encoded RNA construct and, optionally, polypeptide coding region into a host cell to provide for expression of the RNA construct and attached polypeptide coding region. Such vectors, for example, can be introduced into a host cell by transfection means including, but not limited to, heat shock, calcium phosphate, DEAE-dextran, electroporation or liposome-mediated transfer. Recombinant expression vectors include, but are not limited to, Escherichia coli based expression vectors such as BL21 (DE3) pLysS, COS cell-based expression vectors such as CDM8 or pDC201, or CHO cell-based expression vectors such as pED vectors. An RNA construct and, optionally, polypeptide coding region can be linked to one of any number of promoters in an expression vector that can be activated in the chosen cell line. In an embodiment, a cassette (RNA construct and promoter) is carried by a vector that contains a selectable marker such that cells receiving the vector can be identified.
For example, promoters to express the RNA construct within a cell line can be drawn from those that are functionally active within the host cell. Such promoters can include, but are not limited to, a T7 promoter, a CMV promoter, a SV40 early promoter, a herpes TK promoter, and others known in recombinant DNA technology. Inducible promoters can be used, and include promoters such as metallothionine promoter (MT), mouse mammary tumor virus promoter (MMTV), and others known to those skilled in the art. Exemplary selectable markers and their attendant selection agents can be drawn, for example, from the group including, but not limited to, ampicillin, kanamycin, aminoglycoside phosphotransferase/G418, hygromycin-B phosphotransferase/hygromycin-B, and amplifiable selection markers such as dihydrofolate reductase/methotrexate and others known to skilled practitioners.
Additional elements for directing the replication and transcription of an RNA construct (and, optionally, polypeptide coding region) can be included in a vector to express RNA constructs in a variety of cell types, including but not limited to, eukaryotic, prokaryotic, insect, plant and yeast. For example, microorganisms such as bacteria can be transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing the RNA construct coding sequences; yeast can be transformed with recombinant yeast expression vectors containing the RNA construct coding sequences; insect cell systems can be infected with recombinant virus expression vectors (e.g., baculovirus) containing the RNA construct coding sequences; plant cell systems can be infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing the RNA construct coding sequences.
Typically, the vectors used in any of the host cells will contain at least a 5′ flanking sequence (also referred to as a promoter) and, optionally, other regulatory elements, such as an enhancer(s), an origin of replication element, a transcriptional termination element, a complete intron sequence containing a donor and acceptor splice site, a signal peptide sequence, a ribosome binding site element, a polyadenylation sequence, a polylinker region for inserting the DNA encoding the polypeptide coding region, and a selectable marker element. Suitable modifications may be made to the 5′ cap in order to direct translation of the encoded polypeptide using cap-independent mechanisms. Such modifications are well-known in the art (Kowalska, J. et al. (2008) N
The recombinant RNA construct expression vectors can be DNA plasmids or viral vectors. RNA construct-expressing viral vectors can be constructed based on, but not limited to, adeno-associated virus (for a review, see Muzyczka et al. (1992) C
a. Assays to Test Preferential Binding and/or Binding Strength
A DNA vector encoding an RNA construct, and optionally a polypeptide coding region can be transfected into desired cell culture lines or cell-free expression systems with a non-coding RNA, and tested to determine whether the non-coding RNA binds preferentially to the RNA construct as compared to a natural target. Preferential binding of the non-coding RNA to the RNA construct can be measured using any assay known in the art.
In certain embodiments, binding is measured by determining the reduction in free energy (kCal) that occurs when an RNA construct binds a non-coding RNA and comparing that value to the reduction in free energy that occurs when the non-coding RNA binds a natural target (e.g., an mRNA or portion thereof). Reduction in free energy can range from about 1-10,000 kCal and, as is known in the art, can vary depending upon the overall length of the RNA construct and/or non-coding RNA. In certain embodiments, the reduction in free energy that occurs when an RNA construct binds a non-coding RNA is at least about 10 kCal, at least about 25 kCal, at least about 50 kCal, at least about 100 kCal, at least about 1,000 kCal. In certain embodiments, the reduction in free energy that occurs when a non-coding RNA binds an RNA construct is at least 1% more, at least 2% more, at least 5% more, at least 10% more, at least 25% more, at least 50% more at least 75% more, at least 95% more, at least 100% more, at least 200% more, at least 500% more than the reduction in free energy that occurs when a non-coding RNA binds its natural target.
In one example, a native binding assay can be used to determine whether a non-coding RNA preferentially binds an RNA construct. In a native binding assay, RNA is dissolved in a buffer containing 10 mM Tris-HCl (pH 8.0) and 100 mM NaCl. The RNA construct and non-coding RNA are mixed in a 1:2 ratio and annealed at 85° C. for 3 minutes and gradually cooled to room temperature for about an hour. The resulting mixture is run on a native PAGE (15% TBE-PAGE) gel with appropriate controls (RNA construct alone and non-coding RNA alone) and MW standards. Bands with a lower electrophoretic mobility (i.e., a higher molecular weight) than either the RNA construct or non-coding RNA alone are most likely the RNA construct complexed with the non-coding RNA, indicating that the RNA construct is capable of binding the non-coding RNA.
Denaturing gels also can be used to determine whether a non-coding RNA preferentially binds an RNA construct. For example, a 1 M, 3 M, 7 M or 8 M denaturing urea gel can be used to test the strength of binding of an RNA construct to a non-coding RNA. Strength of the binding interaction between RNA construct and non-coding RNA correlates to the ability to detect an interaction between an RNA construct and non-coding RNA at increasingly high urea concentrations. For example, very strong binding between an RNA construct and non-coding RNA will be detectable at higher urea molarity (e.g., 7 M, 8 M) while relatively weaker binding between an RNA construct and non-coding RNA will be detectable at higher urea concentrations (e.g., 1 M, 3 M). For non-coding RNA that binds preferentially to an RNA construct as compared to a natural target, binding between non-coding RNA and RNA construct may be detectible at higher urea concentrations (e.g., 7 M, 8 M) than binding between the non-coding RNA and its natural target, which may, for example, be detectible at lower urea concentrations (e.g., 1 M or 3 M) or on a native gel.
In certain embodiments, RNA immunoprecipitation (RIP) can be used to determine whether a non-coding RNA preferentially binds an RNA construct. RIP can be performed by tagging any component of the interaction (RNA construct, non-coding RNA, or cellular protein), for example, with a His tag, allowing the components to combine (e.g., in a tube or in a cell), isolating the tagged component (e.g., with an anti-His antibody), precipitating any bound RNA, and analyzing bound RNA on, for example, a TBE-gel. Various RIP protocols are known in the art. See, e.g., Jain et al. (2011) “RIP-Chip analysis: RNA-Binding Protein Immunoprecipitation-Microarray (Chip) Profiling,” M
In certain embodiments, when an RNA construct with a weakened stem is used, binding of the non-coding RNA can be tested by determining whether the RNA construct forms a stem-loop and/or binds a cellular protein (e.g., an RBP). The ability of an RNA construct to form a stem-loop and bind an RBP in the presence of a target sequence can be tested by incorporating a label, for example, a radiolabel, fluorescent label, or luminescent label, into the RNA construct. The RNA construct, non-coding RNA and any additional desired reagents, such as buffers, may be added to the appropriate RBP. If a non-coding RNA is present, the RNA construct assumes a structure to which the RBP can bind. The RBP is then isolated and the presence of label is assayed. Detection of the label indicates that the RNA construct has attained the structure required for binding to the RBP, which further indicates that the non-coding RNA is present. Alternatively, assays useful for detecting the binding of an RNA construct to an RBP, but that do not require labeled RNA construct, include, but are not limited to, absorbance assays, immunoassays, assays for enzymatic activity of the RBP and Western blots.
Examples of suitable fluorescent labels include fluorescein isothiocyanate (FITC), 5,6-carboxymethyl fluorescein, Texas red, nitrobenz-2-oxa-1,3-diazol-4-yl (NBD), coumarin, dansyl chloride, rhodamine, amino-methyl coumarin (AMCA), Eosin, Erythrosin, BODIPY®, Cascade Blue®, Oregon Green®, pyrene, lissamine, xanthenes, acridines, oxazines, phycoerythrin, macrocyclic chelates of lanthanide ions such as quantum Dye®, fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. Examples of other specific fluorescent labels include 3-Hydroxypyrene 5,8,10-Tri Sulfonic acid, 5-Hydroxy Tryptamine (5-HT), Acid Fuchsin, Alizarin Complexon, Alizarin Red, Allophycocyanin, Aminocoumarin, Anthroyl Stearate, Astrazon Brilliant Red 4G, Astrazon Orange R, Astrazon Red 6B, Astrazon Yellow 7 GLL, Atabrine, Auramine, Aurophosphine, Aurophosphine G, BAO 9 (Bisaminophenyloxadiazole), BCECF, Berberine Sulphate, Bisbenzamide, Blancophor FFG Solution, Blancophor SV, Bodipy F1, Brilliant Sulphoflavin FF, Calcien Blue, Calcium Green, Calcofluor RW Solution, Calcofluor White, Calcophor White ABT Solution, Calcophor White Standard Solution, Carbostyryl, Cascade Yellow, Catecholamine, Chinacrine, Coriphosphine O, Coumarin-Phalloidin, CY3.1 8, CY5.1 8, CY7, Dans (1-Dimethyl Amino Naphaline 5 Sulphonic Acid), Dansa (Diamino Naphtyl Sulphonic Acid), Dansyl NH-CH3, Diamino Phenyl Oxydiazole (DAO), Dimethylamino-5-Sulphonic acid, Dipyrrometheneboron Difluoride, Diphenyl Brilliant Flavine 7GFF, Dopamine, Erythrosin ITC, Euchrysin, FIF (Formaldehyde Induced Fluorescence), Flazo Orange, Fluo 3, Fluorescamine, Fura-2, Genacryl Brilliant Red B, Genacryl Brilliant Yellow 10GF, Genacryl Pink 3G, Genacryl Yellow 5GF, Gloxalic Acid, Granular Blue, Haematoporphyrin, Indo-1, Intrawhite Cf Liquid, Leucophor PAF, Leucophor SF, Leucophor WS, Lissamine Rhodamine B200 (RD200), Lucifer Yellow CH, Lucifer Yellow VS, Magdala Red, Marina Blue, Maxilon Brilliant Flavin 10 GFF, Maxilon Brilliant Flavin 8 GFF, MPS (Methyl Green Pyronine Stilbene), Mithramycin, NBD Amine, Nitrobenzoxadidole, Noradrenaline, Nuclear Fast Red, Nuclear Yellow, Nylosan Brilliant Flavin EBG, Oxadiazole, Pacific Blue, Pararosaniline (Feulgen), Phorwite AR Solution, Phorwite BKL, Phorwite Rev, Phorwite RPA, Phosphine 3R, Phthalocyanine, Phycoerythrin R, Polyazaindacene Pontochrome Blue Black, Porphyrin, Primuline, Procion Yellow, Pyronine, Pyronine B, Pyrozal Brilliant Flavin 7GF, Quinacrine Mustard, Rhodamine 123, Rhodamine 5 GLD, Rhodamine 6G, Rhodamine B, Rhodamine B 200, Rhodamine B Extra, Rhodamine BB, Rhodamine BG, Rhodamine WT, Serotonin, Sevron Brilliant Red 2B, Sevron Brilliant Red 4G, Sevron Brilliant Red B, Sevron Orange, Sevron Yellow L, SITS (Primuline), SITS (Stilbene Isothiosulphonic acid), Stilbene, Snarf 1, sulpho Rhodamine B Can C, Sulpho Rhodamine G Extra, Tetracycline, Thiazine Red R, Thioflavin S, Thioflavin TCN, Thioflavin 5, Thiolyte, Thiozol Orange, Tinopol CBS, True Blue, Ultralite, Uranine B, Uvitex SFC, Xylene Orange, and XRITC.
Other useful fluorescent labels include fluorescein (5-carboxyfluorescein-N-hydroxysuccinimide ester), rhodamine (5,6-tetramethyl rhodamine), and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. The absorption and emission maxima, respectively, for these fluors are: FITC (490 nm; 520 nm), Cy3 (554 nm; 568 nm), Cy3.5 (581 nm; 588 nm), Cy5 (652 nm: 672 nm), Cy5.5 (682 nm; 703 nm) and Cy7 (755 nm; 778 nm), thus allowing their simultaneous detection. Other examples of fluorescein dyes include 6-carboxyfluorescein (6-FAM), 2′,4′,1,4,-tetrachlorofluorescein (TET), 2′,4′,5′,7′,1,4-hexachlorofluorescein (HEX), 2′,7′-dimethoxy-4′,5′-dichlo-ro-6-carboxyrhodamine (JOE), 2′-chloro-5′-fluoro-7′,8′-fused phenyl-1,4-dichloro-6-carboxyfluorescein (NED), and 2′-chloro-7′-phenyl-1,4-dichloro-6-carboxyfluorescein (VIC). Fluorescent labels can be obtained from a variety of commercial sources, including GE Healthcare Bio-Sciences, Piscataway, N.J.; Life Technologies, Grand Island, N.Y.; and Sigma Aldrich, St. Louis, Mo.
It is further contemplated that labels that detect a structural change in a polynucleotide can be incorporated into an RNA construct or directly into a nucleotide of the RNA construct to detect the conformational change that results upon binding to a non-coding RNA. Many such labels are known to those of skill in the art. Examples of labels suitable for use in the disclosed method are radioactive isotopes, fluorescent molecules, phosphorescent molecules, enzymes, antibodies, and ligands. More specifically, molecular beacons, Amplifluors®, FRET probes, cleavable FRET probes, TagMan® probes, scorpion primers, fluorescent triplex oligonucleotides including but not limited to triplex molecular beacons or triplex FRET probes, fluorescent water-soluble conjugated polymers, PNA probes or QPNA probes, for example, can be used to activate or quench a given label based on stem formation.
Stem activated labels are labels or pairs of labels where fluorescence is increased or altered by formation of a stem structure. Stem activated labels can include an acceptor fluorescent label and a donor moiety such that, when the acceptor and donor are in proximity (when the nucleic acid strands containing the labels form a stem structure), fluorescence resonance energy transfer from the donor to the acceptor causes the acceptor to fluoresce. Stem activated labels are typically pairs of labels positioned on nucleic acid molecules, such as the sides of the stem-loop, such that the acceptor and donor are brought into proximity when a stem structure is formed in the nucleic acid molecule. If the donor moiety of a stem activated label is itself a fluorescent label, it can release energy as fluorescence (typically at a different wavelength than the fluorescence of the acceptor) when not in proximity to an acceptor (that is, when a stem structure is not formed). When the stem structure forms, the overall effect would then be a reduction of donor fluorescence and an increase in acceptor fluorescence. FRET probes are an example of the use of stem activated labels, the operational principles of which can be adapted for use with RNA constructs.
Stem-quenched labels are fluorescent labels positioned on a nucleic acid such that when a stem structure forms a quenching moiety is brought into proximity such that fluorescence from the label is quenched. When the stem is disrupted, the quenching moiety is no longer in proximity to the fluorescent label and fluorescence increases. Examples of this effect can be found in molecular beacons, fluorescent triplex oligonucleotides, triplex molecular beacons, triplex FRET probes, and QPNA probes, the operational principles of which can be adapted for use with RNA constructs.
Examples of labels that can be incorporated into nucleic acids include nucleotide analogs such as BrdUrd (5-bromodeoxyuridine, Hoy et al., (1993) M
Labels that are incorporated into nucleic acid, such as biotin, can be subsequently detected using sensitive methods well-known in the art. For example, biotin can be detected using streptavidin-alkaline phosphatase conjugate (Tropix, Inc.), which is bound to the biotin and subsequently detected by chemiluminescence of suitable substrates (for example, chemiluminescent substrate CSPD: disodium, 3-(4-methoxyspiro-[1,2,-dioxetane-3-2′-(5′-chloro)tricyclo [3.3.1.13,7]decane]-4-yl) phenyl phosphate; Tropix, Inc.). Labels can also be enzymes, such as alkaline phosphatase, soybean peroxidase, horseradish peroxidase and polymerases, that can be detected, for example, with chemical signal amplification or by using a substrate to the enzyme which produces light (for example, a chemiluminescent 1,2-dioxetane substrate) or fluorescent signal.
Methods for detecting and measuring signals generated by labels are also known to those of skill in the art. For example, radioactive isotopes can be detected by scintillation counting or direct visualization; fluorescent molecules can be detected with fluorescent spectrophotometers; phosphorescent molecules can be detected with a spectrophotometer or directly visualized with a camera; enzymes can be detected by detection or visualization of the product of a reaction catalyzed by the enzyme; antibodies can be detected by detecting a secondary label coupled to the antibody.
b. Assays to Test Inhibition of Non-Coding RNA
In certain embodiments, an RNA construct can inhibit the function of a non-coding RNA to which it binds. Inhibition may result from sequestration (i.e., the non-coding RNA is bound to the RNA construct and therefore not available for binding to its natural target) or Tests of inhibition of function can vary based upon the known function of the non-coding RNA. Generally, however, inhibition of a non-coding RNA by an RNA construct can be determined by (a) measuring the quantity of a downstream target of the non-coding RNA in the presence of the RNA construct; (b) measuring the quantity of the downstream target of the non-coding RNA in the absence of the RNA construct; and (c) determining that the quantity of the downstream target in the presence of the RNA construct is greater that the quantity of the downstream target the absence of the RNA construct. The downstream target can be, for example, an mRNA or a protein.
In certain embodiments, the non-coding RNA is an miRNA and the RNA construct inhibits the function of the miRNA as determined by (a) measuring the quantity of a downstream target of the miRNA in the presence of the RNA construct; (b) measuring the quantity of the downstream target of the miRNA in the absence of the RNA construct; and (c) determining that the quantity of the downstream target in the presence of the RNA construct is greater that the quantity of the downstream target the absence of the RNA construct. The downstream target can be, for example, an mRNA or a protein. In certain embodiments, an RNA construct reduces the function of a non-coding RNA by 1%, 2%, 5%, 10%, 15%, 25%, 50%, 75% 95% or 100% (completely eliminates function). For example, an RNA construct may, in an expression environment, reduce the quantity (e.g., concentration) of a downstream target (e.g., an mRNA or protein) of a non-coding RNA (e.g., an miRNA) by 1%, 2%, 5%, 10%, 15%, 25%, 50%, 75% 95% or 100% (as compared to the quantity of the downstream target in the expression environment in the absence of the RNA construct).
Any of the delivery methods known in the art suitable for use with RNA-based drugs are contemplated to work for RNA constructs according to the invention. For example, recombinant vectors capable of expressing RNA constructs are delivered as described above, and persist in target cells. Alternatively, viral vectors can be used that provide for transient expression of RNA constructs. Such vectors can be repeatedly administered as necessary. Once expressed, the RNA constructs bind to non-coding RNAs and undergo a conformational change, for example, strengthening of a stem-loop structure. Delivery of RNA construct-expressing vectors can be systemic, such as by intravenous or intramuscular administration, by administration to target cells ex-planted from a multicellular organism followed by reintroduction into the multicellular organism, or by any other means that allows for introduction into a desired target cell.
DNA plasmids carrying RNA constructs can be transfected into target cells as a complex with cationic lipid carriers (e.g. Oligofectamine) or non-cationic lipid-based carriers (e.g. Transit-TKO™). Multiple lipid transfections for RNA constructs targeting different regions of a single non-coding RNA or multiple non-coding RNAs over a period of a week or more are also contemplated by the present invention. Successful introduction of the vectors of the invention into host cells can be monitored using various known methods. For example, transient transfection can be signaled with a reporter, such as a fluorescent marker, such as Green Fluorescent Protein (GFP). Stable transfection of ex vivo cells can be ensured using markers that provide the transfected cell with resistance to specific environmental factors (e.g., antibiotics and drugs), such as hygromycin B resistance.
Alternatively, RNA constructs can be produced in cell culture and isolated, or can be chemically synthesized, and administered. Absorption or uptake of an RNA construct can occur through unaided diffusive or active cellular processes, or by auxiliary agents or devices. An RNA construct may be introduced into a cell, either in vitro or where the cell is part of a living organism. If the cell is part of an organism, introduction into the cell will include delivery to the organism. For example, for in vivo delivery, an RNA construct can be injected into a tissue site or administered systemically. In vitro introduction into a cell includes methods known in the art such as electroporation and lipofection.
An RNA construct can be chemically modified to enhance stability. The nucleic acids of the invention may be synthesized and/or modified by methods well established in the art, such as those described in C
In yet another embodiment, nucleotides may be modified to prevent or inhibit the activation of cellular enzymes, such as, for example, without limitation, certain nucleases. Techniques for inhibiting the activation of cellular enzymes are known in the art including, but not limited to, 2′-amino modifications, 2′-amino sugar modifications, 2′-F sugar modifications, 2′-F modifications, 2′-alkyl sugar modifications, uncharged backbone modifications, morpholino modifications, 2′-O-methyl modifications, and phosphoramidate (see, e.g., Wagner (1995) NAT. MED. 1:1116-8). Thus, at least one 2′-hydroxyl group of the nucleotides on an RNA construct can be replaced by a chemical group, for example, by a 2′-amino or a 2′-methyl group.
Conjugating a ligand to an RNA construct can enhance its cellular absorption. In certain instances, a hydrophobic ligand is conjugated to the RNA construct to facilitate direct permeation of the cellular membrane. Alternatively, the ligand conjugated to the RNA construct is a substrate for receptor-mediated endocytosis. These approaches have been used to facilitate cell permeation of antisense oligonucleotides. For example, cholesterol has been conjugated to various antisense oligonucleotides resulting in compounds that are substantially more active compared to their non-conjugated analogs. See Manoharan et al., (2002) A
In certain instances, conjugation of a cationic ligand to oligonucleotides results in improved resistance to nucleases. Representative examples of cationic ligands are propylammonium and dimethylpropylammonium.
A composition that includes an RNA construct (it should be understood that the RNA construct compositions and administration methods referenced below include both RNA constructs and DNA vectors encoding an RNA construct) can be delivered to a subject by a variety of routes. Exemplary routes include intrathecal, parenchymal, intravenous, nasal, oral, parenteral and ocular delivery. An RNA construct can be incorporated into pharmaceutical compositions suitable for administration. For example, compositions can include one or more species of an RNA construct and a pharmaceutically acceptable carrier. As used herein the language “pharmaceutically acceptable carrier” is intended to include any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.
The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic, intranasal, transdermal), oral or parenteral. Parenteral administration includes intravenous drip, subcutaneous, intraperitoneal or intramuscular injection, site-specific injection (e.g. delivery directly to a tumor or other site of disease), or intrathecal or intraventricular administration.
In general, an RNA construct can be administered by any suitable method. As used herein, topical delivery can refer to the direct application of an RNA construct to any surface of the body, including the eye, a mucous membrane, surfaces of a body cavity, or to any internal surface. Formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, sprays, and liquids. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Topical administration can also be used as a means to selectively deliver the RNA construct to the epidermis or dermis of a subject, or to specific strata thereof, or to an underlying tissue.
Compositions for intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives.
Formulations for parenteral administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives. Intraventricular injection may be facilitated by an intraventricular catheter, for example, attached to a reservoir. For intravenous use, the total concentration of solutes should be controlled to render the preparation isotonic.
An RNA construct can be administered to a subject by pulmonary delivery. Pulmonary delivery compositions can be delivered by inhalation by the patient of a dispersion so that the RNA construct within the dispersion can reach the lung where it can be readily absorbed through the alveolar region directly into blood circulation. Pulmonary delivery can be effective both for systemic delivery and for localized delivery to treat diseases of the lungs.
Pulmonary delivery can be achieved by different approaches, including the use of nebulized, aerosolized, micellular and dry powder-based formulations. Delivery can be achieved with liquid nebulizers, aerosol-based inhalers, metered-dose devices, and dry powder dispersion devices. One of the benefits of using an atomizer or inhaler is that the potential for contamination is minimized because the devices are self-contained. Dry powder dispersion devices, for example, deliver drugs that may be readily formulated as dry powders. An RNA construct may be stably stored as lyophilized or spray-dried powders by itself or in combination with suitable powder carriers. The delivery of a composition for inhalation can be mediated by a dosing timing element which can include a timer, a dose counter, time measuring device, or a time indicator which when incorporated into the device enables dose tracking, compliance monitoring, and/or dose triggering to a patient during administration of the aerosol medicament.
An RNA construct can be modified such that it is capable of traversing the blood brain barrier. For example, the RNA construct can be conjugated to a molecule that enables the agent to traverse the barrier. Such modified RNA construct can be administered by any desired method, such as by intraventricular or intramuscular injection, or by pulmonary delivery, for example.
An RNA construct can be administered ocularly, such as to treat a retinal disorder, e.g., a retinopathy. For example, the pharmaceutical compositions can be applied to the surface of the eye or nearby tissue, e.g., the inside of the eyelid. They can be applied topically, e.g., by spraying, in drops, as an eyewash, or an ointment. Ointments or droppable liquids may be delivered by ocular delivery systems known in the art such as applicators or eye droppers. Such compositions can include mucomimetics such as hyaluronic acid, chondroitin sulfate, hydroxypropyl methylcellulose or poly(vinyl alcohol), preservatives such as sorbic acid, EDTA or benzylchronium chloride, and the usual quantities of diluents and/or carriers. The pharmaceutical composition can also be administered to the interior of the eye, and can be introduced by a needle or other delivery device which can introduce it to a selected area or structure. The composition containing the RNA construct can also be applied via an ocular patch.
An RNA construct can be administered by an oral or nasal delivery. For example, drugs administered through these membranes have a rapid onset of action, provide therapeutic plasma levels, avoid first pass effect of hepatic metabolism, and avoid exposure of the drug to the hostile gastrointestinal (GI) environment. Additional advantages include easy access to the membrane sites so that the drug can be applied, localized and removed easily.
The types of pharmaceutical excipients that are useful as carrier include stabilizers such as human serum albumin (HSA), bulking agents such as carbohydrates, amino acids and polypeptides; pH adjusters or buffers; salts such as sodium chloride; and the like. These carriers may be in a crystalline or amorphous form or may be a mixture of the two.
To determine whether RNA constructs comprising a stem-loop could stably bind and inhibit non-coding RNAs, a search was performed for miRNAs that complex with mRNAs bound by SLBP (i.e., RNA molecules containing a stem-loop bound by SLBP). 600 miRNAs were found to be present in human chronic myelogenous lymphoblastoid K562 cell lysate using microfluidics miRNA array technology optimized to interrogate the comprehensive miRBase repertoire of identified miRNAs (LC Sciences, Houston, Tex.) Then, an RNA immunoprecipitation (RIP) targeting stem-loop binding protein (SLBP) was performed (Jain et al. (2011) supra; Jayaseelan et al. (2011) supra) and the results of that RIP were similarly analyzed. Approximately 60 miRNAs out of 600 miRNAs present in K562 lysate showed enrichment in the SLBP-RIP pool. In other words, approximately 60 miRNAs were found to complex with mRNAs bound by SLBP. Of these, 10 miRNA could not be accounted for using conventional miRNA targeting algorithms as predicted by informatic tools such as miRanda or PicTar. However, these 10 miRNA are predicted to bind through non-conventional three-way junction interactions as depicted in
One of these 10 miRNA, miR-4298 (5′-CUGGGACAGGAGGAGGAGGCAG-3′ (SEQ ID NO:23)) and the portion of the 3′ UTR for HIST1H3c to which it binds (5′-CUGCCCGUUUCUUCCUCAUUGAAAAGGCUCUUUUCAGAGCCACUCA-3′ (SEQ ID NO:24)), was identified for further study. An RNA construct was created that has the naturally occurring 3′ UTR for HIST1H3c and a 5′ UTR that contained multiple copies of the MS2 Bacteriophage RBP RNA-target motif. This routinely used molecular tag is a small RNA stem-loop structure that is readily bound (typically as a duplex) by the bacteriophage MS2 coat-protein. The MS2-protein/RNA tagging combination has been widely used for capturing a targeted RNA and is naturally used by the virus to encapsidate its RNA genome. As depicted in
The RNA construct (i.e., sxRNA switch) shown in
The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are intended to be embraced therein.
The entire disclosure of each of the patent documents, including U.S. Pat. No. 8,841,438, scientific publications, and other database references disclosed hereinabove is expressly incorporated herein by reference for all purposes.
This application claims priority to and the benefit of U.S. Provisional Patent Application No. 61/878,711, filed Sep. 17, 2013, the disclosure of which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US14/56153 | 9/17/2014 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61878711 | Sep 2013 | US |