The present invention relates to a nucleic acid containing complementary sequences which may form multiple double stranded regions. The present invention also relates to sequences and constructs encoding such a nucleic acid and the uses of such a nucleic acid or construct to modify gene expression, particularly to reduce or inhibit gene expression.
Certain single stranded nucleic acid molecules are able to form a self-complementary double stranded region where part of the nucleotide sequence is able to interact with another part of the sequence by Watson-Crick base pairing between inverted repeats of the sequence. Where the repeated regions are adjacent or in close proximity to each other, the double stranded regions may form structures known as hairpin structures. The hairpin structure forms with an unpaired “loop” of nucleotides at one end of the hairpin structure, with the inverted repeat sequence annealed. The loop may also facilitate the folding of the nucleic acid chain.
Hairpin RNA sequences have become a powerful tool for basic and applied research. In particular these sequences have been used in interfering RNA and gene silencing technologies. Such techniques are described in the specification of PCT/AU99/00195 (U.S. patent application Ser. No. 09/646,807 and U.S. Pat. No. 6,573,099) and PCT/AU01/00297, the contents of which are herein incorporated by reference. In summary, RNA interference (RNAi) hairpin RNA sequences may be synthesised within a cell from DNA constructs coding these sequences, hereafter termed “hairpin DNA constructs”.
While many hairpin DNA constructs have proved effective in gene silencing, other DNA constructs only show partial gene silencing activity. Increasing the degree of gene inactivation produced by RNAi hairpin RNA would be advantageous, for example in gene therapy. Furthermore, in many situations, it would be advantageous to be able to silence two or more separate genes or gene regions simultaneously, particularly in respect of gene therapy applications.
Reference to any prior art in this specification is not, and should not be taken as, an acknowledgment or any form of suggestion that this prior art forms part of the common general knowledge of one skilled in the art.
There is a need for improved RNA hairpin sequences to be used in interfering RNA and gene silencing technology. Furthermore, there is a need for a DNA construct that is capable of producing hairpin RNA transcripts with an improved gene silencing activity and a need for a DNA construct encoding hairpin RNA capable of inactivating two or more separate genes. There is further a need for improved methods for the synthesis of such DNA constructs. It is an object of the present invention to overcome, or at least alleviate, one or more of these needs in light of the prior art.
In one aspect, the present invention provides a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques comprising in a 5′ to 3′ direction at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, the complementary sequences capable of forming double stranded regions with their respective effector sequences and further including one or more spacing sequences of one or more nucleotides.
In one embodiment, the first effector sequence is spaced from the second effector sequence by a first spacing sequence. In another embodiment, the sequence substantially complementary to the second effector sequence is spaced from the sequence substantially complementary to the first effector sequence by a second spacing sequence. Accordingly, RNA according to this aspect of the present invention can fold so that at least one double stranded RNA region is spaced from an adjacent double stranded RNA region by spacing sequences, the spacing sequences being non-annealing and forming a so-called bubble. The terms “hybridising” and “annealing” refer to nucleotide sequences capable of forming Watson-Crick base pairs between complementary bases, as discussed further below.
In a further aspect the present invention provides a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques comprising at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, the complementary sequences capable of forming double stranded regions with their respective effector sequences. Accordingly, at least one double stranded RNA region is directly adjacent to at least one other double stranded RNA region thereby producing at least two effector regions suitable for use in producing interfering RNA in the gene silencing technique, without intervening spacing sequences. In one preferred embodiment, the RNA further includes a spacing sequence between the second effector sequence and the sequence substantially complementary to it, the spacing sequence forming a loop about which the RNA folds to form the double-stranded regions.
In another aspect the present invention provides a ribonucleic acid (RNA) for use as interfering RNA in gene silencing techniques to silence a target gene comprising in a 5′ to 3′ direction at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, wherein the complementary sequences are capable of forming double stranded regions with their respective effector sequences and wherein at least one of these sequences is substantially identical to the predicted transcript of a region of the target gene. Preferably, the RNA further comprises a spacer sequence of one or more nucleotides, wherein any two of the sequences are spaced by the spacing sequence. More preferably, the RNA further comprises an additional spacer sequence of one or more nucleotides.
In another aspect the present invention provides a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques comprising in a 5′ to 3′ direction at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, the complementary sequences capable of forming double stranded regions with their respective effector sequences, the sequence substantially complementary to the second effector sequence being spaced from the sequence substantially complementary to the first effector sequence by one spacing sequence of one or more nucleotides, and the first effector sequence being spaced from the second effector sequence by another spacing sequence of one or more nucleotides. In one embodiment of this aspect of the present invention, both spacing sequences are included and do not anneal.
In a further aspect the present invention provides a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques comprising in a 5′ to 3′ direction at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, the complementary sequences capable of forming double stranded regions with their respective effector sequences, the first effector sequence being spaced from the second effector sequence by a first spacing sequence of one or more nucleotides. In one embodiment, the sequence substantially complementary to the second effector sequence is spaced from the sequence substantially complementary to the first effector sequence by a second spacing sequence of one or more nucleotides, the second spacing sequence not being hybridisable with the first spacing sequence. Accordingly, the RNA according to this aspect of the present invention can fold so that at least one strand of at least one double stranded RNA region is spaced from an adjacent double stranded RNA region by a spacing (non-pairing) sequence, the spacing sequence forming a so-called bubble.
By an RNA “suitable for use as interfering RNA” is meant an RNA that may directly act as interfering RNA or that may be processed to produce RNA molecules that are active in RNA interference. Such RNA is suitable for genetic silencing techniques.
In another embodiment, there is provided a nucleic acid construct comprising at least a first effector sequence, a first complementary sequence that is substantially complementary to the first effector sequence, a second effector sequence and a second complementary sequence that is substantially complementary to the second effector sequence, wherein both first and second effector sequences form double stranded portions with their corresponding complementary sequences, the double stranded regions being spaced by a spacer sequence, usually a shorter sequence than the first effector sequence.
In preferred embodiments, one double stranded portion will have its two strands connected by a loop sequence forming the bend in the so-called hairpin structure. In this embodiment, the double stranded portion has this loop at one end, i.e. the loop is formed by a spacing sequence between one of the effector sequences and its substantially complementary sequence. Preferably, the nucleic acid also has a pair of spacing sequences between the double stranded portions, forming a “bubble”.
Preferably, the spacer sequence is shorter than either effector sequence. The spacer sequence is preferably 1 to 20, more preferably 1 to 10, more preferably 1 to 7 and most preferably 2 to 7 nucleotides long. Even more preferably, in one embodiment one spacer sequence is two nucleotides long and another spacer sequence is four nucleotides long.
As the ribonucleic acid or nucleic acid construct contains at least two effector sequences, the invention extends to such constructs containing three or more effector sequences, each with corresponding complementary sequences. The effector sequences and corresponding complementary sequences may be spaced from each other by spacing (non-pairing) sequences with the spacing sequence forming a bubble when the effector sequences base pair with the complementary sequences. In preferred embodiments, the ribonucleic acid or nucleic acid construct contains three effector sequences and three corresponding complementary sequences, each separated by a spacing sequence forming a bubble; four effector sequences and four corresponding complementary sequences, each separated by a spacing sequence forming a bubble; or five effector sequences and five corresponding complementary sequences, each separated by a spacing sequence forming a bubble. In further preferred embodiments, the ribonucleic acid or nucleic acid construct contains three effector sequences and three corresponding complementary sequences; four effector sequences and four corresponding complementary sequences; or five effector sequences and five corresponding complementary sequences without intervening spacing sequences between adjacent effector and complementary sequences. There may similarly be six, seven, eight, nine, ten or more effector sequences and complementary sequences in an RNA or nucleic acid construct of the invention. The effector sequences may be the same or different and directed to the same or different target genes, different regions of the same target gene or a combination of these.
In another embodiment, there is provided a ribonucleic acid suitable for use as interfering RNA in gene silencing techniques comprising in a 5′ to 3′ direction at least a first effector sequence, a second effector sequence, a sequence substantially complementary to the second effector sequence and a sequence substantially complementary to the first effector sequence, the complementary sequences capable of forming double stranded regions with their respective effector sequences, the second effector sequence being spaced from the sequence substantially complementary to the second effector sequence by a spacing sequence of one or more nucleotides.
In the context of the present invention, “target gene” refers to a gene which is targeted for silencing by RNA interference techniques. The RNA product of the gene may be a messenger RNA (mRNA) capable of being translated to form an amino acid sequence, or it may be a non-translated RNA, such as a ribosomal RNA, small uracil-rich RNA, or ribozyme.
Reference herein to a “gene” or “genes” is to be taken in its broadest context and includes:
(i) a classical genomic gene consisting of transcription and/or translational regulatory sequences and/or coding region and/or non-translated sequences (i.e. introns, 5′- and 3′-untranslated sequences); and/or
(ii) DNA and RNA viral genes; and/or
(iii) cDNA corresponding to the coding regions (i.e. exons) and/or 5′- and 3′-untranslated sequences,
whether naturally occurring or synthesised. Furthermore, “gene” includes within its scope both a nucleic acid coding for an amino-acid encoding RNA (i.e. mRNA) as well as a nucleic acid encoding a RNA that does not code for an amino acid sequence.
By “substantially identical” is meant about 70% identical to a portion of the target gene. Preferably, it is at least 80-90%, more preferably at least 95-100% identical, and includes 100% identity. Thus a sequence substantially identical to a region of a target gene has this degree of sequence similarity. Generally, a double-stranded RNA region of the invention may be subjected to mutagenesis to produce single or several nucleotide substitutions, deletions or additions without substantially affecting its ability to modify gene expression.
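By way of illustration only, the identity thresholds stated above can be expressed as a simple position-by-position comparison. The following is a minimal sketch, assuming equal-length, ungapped sequences; the function names and the default threshold argument are hypothetical, and the specification does not prescribe any particular alignment method.

```python
def percent_identity(candidate: str, target_region: str) -> float:
    """Percentage of positions at which two equal-length sequences match.

    Assumption: an ungapped, position-by-position comparison. The
    specification does not prescribe how identity is to be computed.
    """
    if not candidate or len(candidate) != len(target_region):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(candidate.upper(), target_region.upper()))
    return 100.0 * matches / len(candidate)


def is_substantially_identical(candidate: str, target_region: str,
                               threshold: float = 70.0) -> bool:
    """True when identity meets the threshold (70% by default, per the text)."""
    return percent_identity(candidate, target_region) >= threshold
```

The threshold argument can be raised to 80, 90 or 95 to reflect the more preferred ranges.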
It is known that RNAi is generally optimised by identical sequences between the target and the RNAi construct, but that the RNA interference phenomenon can be observed with less than 100% homology. As is understood by those skilled in the art, the strands comprising the double-stranded regions must be sufficiently homologous to each other to form the specific double stranded regions. The precise structural rules to achieve a double-stranded region effective to result in RNA interference have not been fully identified, but approximately 70% identity is generally sufficient. Greater identity in the central portion of the effector sequence as opposed to the end portions is required as explained below. Another consideration is that base-pairing in RNA is subtly different from DNA in that G will pair with U, although not as strongly as it does with C, in RNA duplexes.
By “substantially complementary” is meant that the sequences are hybridisable or annealable. Moreover, it is known that hybridisation is affected by the conditions of the solution. In general, substantially complementary sequences will have at least 70% Watson-Crick base pairing.
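The 70% Watson-Crick figure can be illustrated by counting paired positions between two RNA strands. This is a sketch only: it assumes both strands are written 5′ to 3′, are of equal length and anneal in antiparallel orientation without gaps. The function name is hypothetical, and solution conditions, which the text notes affect hybridisation, are not modelled.

```python
# Watson-Crick pairs in RNA (wobble pairs such as G-U are deliberately excluded).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def watson_crick_fraction(strand: str, partner: str) -> float:
    """Fraction of positions forming Watson-Crick pairs.

    Both strands are given 5'->3'; the partner is read in reverse so that
    the two strands are compared in antiparallel orientation.
    """
    if not strand or len(strand) != len(partner):
        raise ValueError("strands must be non-empty and of equal length")
    opposed = zip(strand.upper(), reversed(partner.upper()))
    paired = sum(pair in WATSON_CRICK for pair in opposed)
    return paired / len(strand)


def is_substantially_complementary(strand: str, partner: str,
                                   threshold: float = 0.70) -> bool:
    """True when at least `threshold` of the positions are Watson-Crick paired."""
    return watson_crick_fraction(strand, partner) >= threshold
```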
The two sequences of an RNA duplex or double-stranded region are referred to as the “sense” strand and “antisense” strand, even though they may be different portions of one polynucleotide (eg. where it forms a hairpin). The “sense” strand is the one where the sequence is broadly related to the relevant region of the target gene (ie, one that is substantially the predicted transcription product), and the sequence annealing to the sense strand sequence is termed “antisense”. For RNAi efficacy, it is more important that the antisense strand be homologous (ie, exactly complementary) to the target sequence. In some circumstances, it is known that 17 out of 21 nucleotides is sufficient to initiate RNAi, but in other circumstances, identity of 19 or 20 nucleotides out of 21 is required. It is believed, at a general level, that greater homology is required in the central part of a double stranded region (i.e. duplex) than at its ends. Some predetermined degree of lack of perfect homology may be designed into a particular construct so as to reduce its RNAi activity which would result in a partial silencing or repression of the target gene's product, in circumstances in which only a degree of silencing was sought. In such a case, it is envisaged that only one or two bases of the antisense strand of the RNA construct would be changed. On the other hand, the other, sense strand of the RNA construct is more tolerant of mutations. It is believed this is due to the antisense strand being the one that is catalytically active. Thus, less identity between the sense strand and the transcript of a region of a target gene will not necessarily reduce RNAi activity, particularly where the antisense strand perfectly hybridises with that transcript. 
Mutations in the sense strand (such that it is not identical to the transcript of the region of the target gene) may be useful to assist sequencing of hairpin constructs and potentially for other purposes, such as modulating dicer processing of a hairpin transcript or other aspects of the RNAi pathway.
The terms “hybridising” and “annealing” (and grammatical equivalents) are used interchangeably in this specification in respect of nucleotide sequences and refer to nucleotide sequences that are capable of forming Watson-Crick base pairs due to their complementarity. The person skilled in the art would understand that non-Watson-Crick base-pairing is also possible, especially in the context of RNA sequences. For example a so-called “wobble pair” can form between guanosine and uracil residues in RNA. “Complementary” is used herein in its usual way to indicate Watson-Crick base pairing, and “non-complementary” is used to mean non-Watson-Crick base pairing, even though such non-complementary sequences may form wobble pairs or other interactions. However, in the context of the present invention, reference to “non-pairing” sequences relates specifically to sequences between which Watson-Crick base pairs do not form. Accordingly, embodiments of spacing or bubble sequences according to the present invention are described and illustrated herein as non-pairing sequences, regardless of whether non-Watson-Crick base pairing could theoretically or does in practice occur.
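The distinction drawn above between Watson-Crick pairs, wobble pairs and “non-pairing” sequences can be sketched as follows. The names are hypothetical, and the caller is assumed to supply the bases that actually oppose one another in the folded molecule, since the sketch makes no attempt to predict folding.

```python
WATSON_CRICK = {frozenset("AU"), frozenset("GC")}
WOBBLE = {frozenset("GU")}


def classify_pair(base_a: str, base_b: str) -> str:
    """Classify two opposed RNA bases as 'watson-crick', 'wobble' or 'none'."""
    pair = frozenset((base_a.upper(), base_b.upper()))
    if pair in WATSON_CRICK:
        return "watson-crick"
    if pair in WOBBLE:
        return "wobble"
    return "none"


def is_non_pairing(opposed_bases) -> bool:
    """True when no opposed pair is a Watson-Crick pair.

    Wobble pairs do not count as pairing here, mirroring the sense in which
    the text uses "non-pairing".
    """
    return all(classify_pair(a, b) != "watson-crick" for a, b in opposed_bases)
```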
The terms “effector sequence” and “effector” in the context of this specification relate to either DNA or RNA, depending on the context, and are used to denote a sequence that anneals to form a double-stranded region, due to complementarity of bases in the annealed region. The double-stranded region may determine the region of the target gene to which the construct is directed where the effector sequence, or the sequence substantially complementary to the effector sequence, is substantially identical to a region of the target gene.
In several preferred embodiments, the double stranded regions are interfering RNA (RNAi) sequences. Preferably, at least one of the effector sequences is substantially identical to at least a region of a target gene in the case of an RNA gene, or substantially identical to the predicted transcript of at least a region of a target gene in the case of a DNA gene. Preferably, the first effector sequence has this characteristic. In another preferred embodiment, the effector sequences are each separately substantially identical to different regions of a single target gene, or their predicted transcripts, as the case may be. In another preferred embodiment, the effector sequences are each separately substantially identical to regions of different target genes. In this context, “transcript” includes RNA which could theoretically be encoded by a DNA sequence, also called a “predicted transcript” regardless of the actual method of generation of that RNA sequence. In the DNA described in the embodiments below, at least one of the effector sequences is substantially identical or complementary to a region of the target gene (where the target gene is DNA). In this context, such a sequence may be called the “targeting sequence” where it is directed to a region of the gene to be silenced. Such a sequence may also be referred to structurally as an “intramolecular self-complementary targeting sequence”.
Alternatively, a double-stranded region may form a so-called “stem” sequence. In some embodiments, one or more of the effector sequences will have a different length to the sequence substantially complementary to it. In such a case, the unpaired portion may function as a spacer sequence. For example, where the effector sequence is generated by identity (or substantial identity) to a region of a target gene and the sequence substantially complementary to it is longer or shorter, the unpaired sequence will still be substantially identical to the corresponding region of the target gene, but may function as a spacer (e.g. loop or bubble) in the RNA, rather than as part of the effector sequence. In one embodiment, the effector sequence and the sequence substantially complementary to it are adjacent on the polynucleotide, in which case the region between these two sequences forms a loop comprised by either:
(i) the 3′ end of the effector sequence and the 5′ end of the complementary sequence; or
(ii) an unpaired sequence.
Similarly, where the effector and complementary sequences are not adjacent, but separated by one or more other double-stranded regions, the unpaired sequence may form a bubble.
The effector sequences may be of the same or different lengths. Preferably, effector sequences are at least 10 nucleotides in length, preferably 10-200 nucleotides in length. More preferably, they are 17 to 30 and most preferably 21 to 23 nucleotides in length. In different embodiments, the effector sequences are 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides in length, respectively, or any combination of two or more of these lengths.
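The nested length preferences stated above may be summarised, purely as an illustration, by a small helper that reports the narrowest stated range into which a given effector length falls. The function name and the returned labels are hypothetical.

```python
def effector_length_preference(length: int) -> str:
    """Return the narrowest stated preference tier for an effector length."""
    if length < 10:
        return "below stated minimum"
    if 21 <= length <= 23:
        return "most preferred (21-23)"
    if 17 <= length <= 30:
        return "more preferred (17-30)"
    if length <= 200:
        return "preferred (10-200)"
    # Longer than 200 nt still satisfies "at least 10 nucleotides".
    return "at least 10 nucleotides"
```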
It will also be understood that the term “comprises” (or its grammatical variants) as used in this specification is equivalent to the term “includes” and should not be taken as excluding the presence of other elements or features.
Where the first effector sequence is longer than a second effector sequence, it has been found that the activity of the double-stranded sequence may be enhanced. In such a situation, the second effector sequence (which usually is not designed to be substantially identical to any particular target) can be called a “stem”. Preferably, the stem sequence is 1 to 50 nucleotides in length. A suitable stem sequence is GACUGAA and its complement.
Bubbles are formed by two unpaired, or partially unpaired, strands (which may also be spacing sequences) containing at least a single unpaired base that bridge or link the double stranded regions on the nucleic acid. Further, a bubble may form where one strand of the nucleic acid includes one or more spacer nucleotides between the double stranded regions and the other strand includes no such spacer nucleotides. In this case, the end nucleotides on that other strand near the junction of the double-stranded regions form the bubble with the one or more spacer nucleotides. Preferably the RNA according to this aspect of the present invention includes one loop region and one or more bubble regions. Preferably the bubble regions comprise 1 to 20 unpaired nucleotides per RNA strand. More preferably, the bubble regions comprise 2 to 10 unpaired nucleotides. In a preferred embodiment the bubble region includes the nucleotide sequence AA, UU, UUA, UUAG, UUACAA or N1AAN2, where N1 and N2 are any of C, G, U and A and may be the same or different. In a further preferred embodiment, the opposing sequence to each of these to form the bubble is AA, UU, UUG, UUGA, UUGUUG, and N1AAN2 respectively, where N1 and N2 are any of C, G, U or A and may be the same or different.
In a preferred embodiment, a nucleic acid according to the present invention comprises two double stranded RNA regions separated by a bubble region and a loop at one end of the double stranded RNA region. In another preferred embodiment, the nucleic acid according to the present invention comprises five double stranded RNA regions, with the first and second, second and third, third and fourth and fourth and fifth double stranded regions, respectively, being separated by a bubble region and with a loop at one end of the fifth double stranded RNA region.
In another preferred embodiment, there is provided a construct including sequence -X-A-Y-L-Y′-B-X′-, wherein:
X is a nucleotide sequence substantially identical to a first region, or a transcript of a region, of a target gene;
Y is a nucleotide sequence of one or more nucleotides;
A is a nucleotide sequence shorter than X;
B is a nucleotide sequence shorter than X and non-complementary to A;
L is a loop sequence;
X′ is substantially complementary to X; and
Y′ is substantially complementary to Y.
Additional effector sequences, with complementary sequences to form duplexes, and with or without spacer sequences like A in this embodiment, may be added.
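The arrangement -X-A-Y-L-Y′-B-X′- can be sketched as a string assembly in which X′ and Y′ are derived as exact reverse complements of X and Y. This is an illustration under stated assumptions: all sequences are RNA written 5′ to 3′, the complements are perfect rather than merely “substantially” complementary, and the function names are hypothetical. The sketch checks only that A and B are shorter than X; it does not verify that B is non-complementary to A.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(seq: str) -> str:
    """Exact reverse complement of an RNA sequence written 5'->3'."""
    seq = seq.upper()
    if set(seq) - set("ACGU"):
        raise ValueError("RNA sequences may contain only A, C, G and U")
    return seq.translate(_COMPLEMENT)[::-1]


def build_hairpin(x: str, a: str, y: str, loop: str, b: str) -> str:
    """Assemble X-A-Y-L-Y'-B-X' in the 5'->3' direction."""
    if len(a) >= len(x) or len(b) >= len(x):
        raise ValueError("spacers A and B must each be shorter than X")
    y_prime = reverse_complement_rna(y)  # anneals to Y across the loop
    x_prime = reverse_complement_rna(x)  # anneals to X beyond the bubble
    return x.upper() + a.upper() + y.upper() + loop.upper() + y_prime + b.upper() + x_prime
```

For example, `build_hairpin(x, "UUA", y, "UUCAAGAGA", "UUG")` uses a bubble pair and a loop sequence named elsewhere in this specification, with `x` and `y` standing for effector sequences chosen by the user.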
In another preferred embodiment, there is provided a construct including sequence -X-A-Y-L-Y′-X′-, wherein:
X is a nucleotide sequence substantially identical to a first region, or a transcript of a region, of a target gene;
Y is a nucleotide sequence of one or more nucleotides;
A is a nucleotide sequence shorter than X;
L is a loop sequence;
X′ is substantially complementary to X; and
Y′ is substantially complementary to Y.
In another preferred embodiment there is provided a construct including sequence -X-Y-L-Y′-X′-, wherein:
X is a nucleotide sequence substantially identical to a first region, or a transcript of a region, of a target gene;
Y is a nucleotide sequence of one or more nucleotides;
L is a loop sequence;
X′ is substantially complementary to X; and
Y′ is substantially complementary to Y.
In a further embodiment, L comprises -P-Q-R-S-T-, wherein P, Q, R, S and T each represent a nucleotide sequence of one or more nucleotides, Q and S are hybridisable with each other, P and T do not hybridise with each other and so form a bubble, and R is an unpaired loop region. P is preferably one of UU, UUA, UUAG or UUACAA. Preferably, the opposing sequence to each of these to form the bubble is UU, UUG, UUGA and UUGUUG respectively, or vice versa. In one preferred embodiment, R is UUCAAGAGA.
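The composite loop -P-Q-R-S-T- can be sketched in the same manner, with S derived as the exact reverse complement of Q so that Q and S anneal. The function names are hypothetical, and the use of the stem sequence GACUGAA as Q in the usage note is an illustrative assumption rather than a combination stated in the text.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(seq: str) -> str:
    """Exact reverse complement of an RNA sequence written 5'->3'."""
    seq = seq.upper()
    if set(seq) - set("ACGU"):
        raise ValueError("RNA sequences may contain only A, C, G and U")
    return seq.translate(_COMPLEMENT)[::-1]


def build_composite_loop(p: str, q: str, r: str, t: str) -> str:
    """Assemble P-Q-R-S-T, where S is generated to hybridise with Q.

    P and T are the two sides of the bubble and are supplied by the caller;
    R is the unpaired loop region.
    """
    s = reverse_complement_rna(q)
    return p.upper() + q.upper() + r.upper() + s + t.upper()
```

Calling `build_composite_loop("UU", "GACUGAA", "UUCAAGAGA", "UU")` yields a loop element whose internal stem is GACUGAA paired with UUCAGUC.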
In one embodiment, Y is substantially identical to a second region, or a transcript of a region, of a target gene, the target gene being the same or different from the gene referred to in the definition of X. Where the target genes are the same, typically different regions will be targeted by X and Y.
In another preferred embodiment, there is provided a construct further including the sequences C and D in the form -C-X-A-Y-L-Y′-B-X′-D-, wherein:
C is a nucleotide sequence shorter than X; and
D is a nucleotide sequence shorter than X and non-complementary to C.
In another preferred embodiment, there is provided a construct including sequence -S-A-T-A-U-A-V-A-W-L-W′-B-V′-B-U′-B-T′-B-S′-, wherein:
S, T, U, V and W are nucleotide sequences each substantially identical to a region, or a transcript of a region, of a target gene;
A is a nucleotide sequence shorter than S, T, U, V and W (each A may be the same or different);
B is a nucleotide sequence shorter than S, T, U, V and W and non-complementary to A (each B may be the same or different, but each B is non-complementary to its opposed A sequence when a double-stranded construct is formed about sequence L by annealing of S, T, U, V and W with their respective complements);
L is a loop sequence;
S′, T′, U′, V′ and W′ are nucleotide sequences substantially complementary to S, T, U, V and W.
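The five-effector arrangement generalises to any number of effectors. The following sketch, with hypothetical function and parameter names, assembles the effectors and their A spacers on the 5′ arm, then the loop, then the complements in reverse order with their B spacers on the 3′ arm. It assumes exact reverse complements and does not check that each B is non-complementary to its opposed A.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(seq: str) -> str:
    """Exact reverse complement of an RNA sequence written 5'->3'."""
    seq = seq.upper()
    if set(seq) - set("ACGU"):
        raise ValueError("RNA sequences may contain only A, C, G and U")
    return seq.translate(_COMPLEMENT)[::-1]


def build_multi_effector_hairpin(effectors, spacers_a, loop, spacers_b):
    """Assemble e.g. S-A-T-A-U-A-V-A-W-L-W'-B-V'-B-U'-B-T'-B-S' (5'->3').

    effectors : effector sequences in 5'->3' order (S, T, U, V, W)
    spacers_a : spacers between successive effectors (one fewer than effectors)
    spacers_b : spacers between successive complements, in 5'->3' order
    """
    n = len(effectors)
    if n < 2 or len(spacers_a) != n - 1 or len(spacers_b) != n - 1:
        raise ValueError("need at least two effectors and n-1 spacers per arm")
    five_prime_arm = effectors[0].upper()
    for spacer, effector in zip(spacers_a, effectors[1:]):
        five_prime_arm += spacer.upper() + effector.upper()
    # The complements appear in reverse order: W', V', U', T', S'.
    complements = [reverse_complement_rna(e) for e in reversed(effectors)]
    three_prime_arm = complements[0]
    for spacer, complement in zip(spacers_b, complements[1:]):
        three_prime_arm += spacer.upper() + complement
    return five_prime_arm + loop.upper() + three_prime_arm
```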
As will be appreciated by one skilled in the art, it is not necessary that the entire construct is generated as one sequence. For example, in one embodiment of the invention, the at least first and second effector sequences, together with any spacing sequence, are generated (eg, transcribed from one DNA sequence), and the sequences substantially complementary to the effector sequences, together with any spacing sequence, are generated (eg, transcribed from a separate DNA sequence). The two or more DNA sequences may be under the control of separate promoters. Any loop sequence may be attached to either transcript or part of the loop attached to the 3′ end of one transcript and the 5′ end of the other transcript, and a ligation performed. In circumstances where the RNA construct is to be delivered by a DNA construct to a cell, in this embodiment, the two transcripts would be separately generated, and then would hybridise through annealing between the at least first and second effector sequences and their complements.
In a further aspect of the present invention there is provided a nucleic acid construct encoding any of the ribonucleic acids described above. In a preferred embodiment, this construct is a deoxyribonucleic acid (DNA) construct. In one embodiment, the DNA construct includes a sequence encoding a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques, the construct comprising in a 5′ to 3′ direction at least a first effector-encoding sequence, a second effector-encoding sequence, a sequence substantially complementary to the second effector-encoding sequence and a sequence substantially complementary to the first effector-encoding sequence, the complementary sequences' transcripts capable of forming double stranded regions with the respective effector-encoding sequences' transcripts. In an embodiment of this aspect of the invention, the first effector-encoding sequence is spaced from the second effector-encoding sequence by a first spacing sequence of one or more nucleotides. Preferably, the sequence substantially complementary to the second effector-encoding sequence is spaced from the sequence substantially complementary to the first effector-encoding sequence by a second spacing sequence of one or more nucleotides. Preferably, the second spacing sequence does not anneal with the first spacing sequence. Accordingly, the RNA of, or encoded by, the nucleic acid construct according to this embodiment can fold so that at least one double stranded RNA region is spaced from an adjacent double stranded RNA region by a spacing (non-pairing) sequence, the spacing sequence forming a so-called bubble. 
Preferably, the nucleic acid construct further includes a spacing sequence between the second effector sequence and the sequence substantially complementary to it, wherein the RNA of, or encoded by, the nucleic acid construct according to this embodiment forms a loop about which the RNA folds to form the double-stranded region between the second effector sequence and the sequence substantially complementary to the second effector sequence.
In a further aspect the present invention provides a nucleic acid construct including a sequence encoding a ribonucleic acid (RNA) suitable for use as interfering RNA in gene silencing techniques to silence a target gene, the construct comprising in a 5′ to 3′ direction at least a first effector-encoding sequence, a second effector-encoding sequence, a sequence substantially complementary to the second effector-encoding sequence and a sequence substantially complementary to the first effector-encoding sequence, wherein the transcripts of the complementary sequences are capable of forming double stranded regions with the transcripts of their respective effector-encoding sequences and wherein at least one of these sequences is substantially identical to a region of the target gene.
Preferably, the nucleic acid construct further comprises a spacing sequence of one or more nucleotides wherein any two of the encoding sequences are spaced by a spacing sequence. In preferred embodiments, the first effector-encoding sequence is spaced from the second effector-encoding sequence by the spacing sequence and/or the sequence substantially complementary to the second effector-encoding sequence is spaced from the sequence substantially complementary to the first effector-encoding sequence by the spacing sequence.
In a further preferred embodiment the nucleic acid construct further comprises an additional spacing sequence. In a preferred embodiment, the first effector-encoding sequence is spaced from the second effector-encoding sequence or the sequence substantially complementary to the second effector-encoding sequence is spaced from the sequence substantially complementary to the first effector-encoding sequence by the additional spacing sequence and the transcript of the first spacing sequence is not annealable with the transcript of the additional spacing sequence.
The nucleic acid construct or an RNA according to the invention will usually be a recombinant or isolated molecule.
In a further preferred embodiment, the nucleic acid construct comprises a spacing sequence of one or more nucleotides between the second effector encoding sequence and the sequence substantially complementary to the second effector-encoding sequence.
Preferably, the nucleic acid construct further includes a loop coding sequence between the second effector-encoding sequence and the sequence substantially complementary to the second effector-encoding sequence. The loop forms the “hinge” of the hairpin. In one embodiment, the loop's sequence is 5′TTCAAGAGA3′. In a further embodiment, the loop sequence is 5′TTTGTGTAG3′.
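By way of illustration only, the 5′ to 3′ arrangement described above may be sketched in Python as follows. The function names, the seven-nucleotide effector sequences and the bubble sequences are hypothetical placeholders chosen for brevity and are not taken from any embodiment; only the loop sequence 5′TTCAAGAGA3′ comes from the text. The check that the two spacing sequences are not exact reverse complements is a simplified stand-in for the requirement that they do not anneal.

```python
# Illustrative sketch only: sequences and names below are hypothetical.

# Watson-Crick complements for the DNA (coding strand) alphabet.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def build_construct(effector1: str, effector2: str,
                    loop: str = "TTCAAGAGA",
                    bubble1: str = "", bubble2: str = "") -> str:
    """Assemble, 5' to 3': effector 1, first spacing sequence, effector 2,
    loop, complement of effector 2, second spacing sequence,
    complement of effector 1."""
    # Simplified non-annealing test: reject spacers that are exact
    # reverse complements of one another (an assumption of this sketch).
    if bubble1 and bubble2.upper() == reverse_complement(bubble1):
        raise ValueError("spacing sequences would anneal with each other")
    return (effector1.upper() + bubble1.upper() + effector2.upper()
            + loop.upper()
            + reverse_complement(effector2) + bubble2.upper()
            + reverse_complement(effector1))


if __name__ == "__main__":
    # Placeholder effector sequences, far shorter than would be used in practice.
    print(build_construct("GATTACA", "CCGGAAT", bubble1="AAAA", bubble2="CCCC"))
```

In this sketch the second effector and its complement flank the loop, so that on folding the innermost double stranded region forms first about the hinge, and the two spacing sequences lie opposite each other between the two double stranded regions.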
Preferably the construct is derived from a DNA vector selected from the group consisting of a plasmid, a bacteriophage and a viral-based vector. Preferably the DNA construct is suitable for producing RNA suitable for use as interfering RNA in gene silencing technologies. More preferably, the construct can be introduced into a cell where gene silencing is to take place and interfering RNA can be transcribed within this cell.
Preferably the first effector sequence or its complementary sequence is substantially identical or substantially complementary to a region of a target gene. In one embodiment, the second effector sequence or its complementary sequence is substantially identical to the same or a different region of the same or a different target gene. In another embodiment, the second effector sequence or its complementary sequence is substantially identical to a region of a different target gene.
In another embodiment, the DNA construct comprises up to five effector-encoding sequences. Each of the encoded effector sequences or their complementary sequences is substantially identical to a region of a target gene. The encoded effector sequences or their complementary sequences may be substantially identical to regions of different target genes, or to different regions in the same target gene.
The construct according to the present invention may further contain one or more regulatory elements to allow transcription of the RNA to take place. Preferably at least one of the regulatory elements is a promoter, which is operably linked with the portion of the construct encoding the nucleic acid according to the present invention. A variety of promoters may be included in the polynucleotide vector. Factors influencing the choice of promoter include the desire for inducible transcription of the oligonucleotide or oligonucleotide and polynucleotide sequences, the strength of the promoter and the suitability of the promoter to induce expression in the in vivo or in vitro environment in which the transcription is to take place. In a preferred embodiment the promoter is an RNA polymerase III (pol III) promoter such as U6 or H1 promoters.
One or more of the regulatory elements of the construct according to the present invention may be a terminator sequence. Such a terminator sequence may be operably linked with the portion of the construct encoding the nucleic acid of the present invention in order to determine the sequence of the 3′ end of the transcribed nucleic acid. Terminators for the various classes of RNA polymerase are known to those skilled in the art. In one embodiment, the terminator is a pol II terminator. In another embodiment, the terminator is a pol III terminator. Preferably, the pol III terminator includes the sequence TTTTT or TTTTTT.
As will be appreciated, such constructs will often also include selection markers or sequences (eg, Ampicillin resistance) and/or restriction enzyme sites.
In a preferred embodiment, the nucleic acid construct includes a transcriptional unit comprising a promoter; at least a first effector-encoding sequence; a second effector-encoding sequence; a sequence substantially complementary to the second effector-encoding sequence; a sequence substantially complementary to the first effector-encoding sequence and a terminator sequence, the promoter, effector sequences, sequences complementary to the effector sequences and terminator being operably linked. The nucleic acid construct may include in addition to the transcriptional unit described above at least one further transcriptional unit encoding RNA suitable for use as interfering RNA for use in gene silencing techniques. “Operably linked” in the context of the present invention means that the transcription of a nucleic acid is modulated by the regulatory element with which it is connected. Preferably these are incorporated within a vector.
The DNA construct may have regulatory and other elements inserted by methods known in the art so as to optimise the transcription of the RNA suitable for use as interfering RNA in gene silencing techniques.
It will be apparent to the person skilled in the art that deoxyribonucleic acids (DNA) and ribonucleic acids (RNA) may include modified nucleotides. Thus RNA in the context of the present invention includes nucleic acid containing principally any or all of the ribonucleotides uracil (U), guanosine (G), cytosine (C) and adenosine (A); however, modified or otherwise altered nucleotides and nucleotide analogues may also be included within an RNA sequence. Likewise, DNA contains principally any or all of the deoxyribonucleotides thymidine (T), guanosine (G), cytosine (C) and adenosine (A); however, modified or otherwise altered nucleotides and nucleotide analogues may also be included within a DNA sequence.
In another aspect of the present invention there is provided a method of producing RNA from the construct according to the present invention. The RNA is preferably RNAi for use in gene silencing techniques. The RNA may be produced from the construct according to the present invention in vitro, or by in vivo techniques after introduction of the construct into a cell. In this specification, “silence” means reduced expression, but is not limited to prevention of expression.
In another aspect of the present invention there is provided a method of inhibiting the expression of a target gene by introducing the nucleic acid or construct of the present invention into a cell or other system or environment permitting expression of a target gene (including for example a cell lysate, tissue, in vitro system etc) containing a target gene to be silenced using RNAi techniques. In a preferred embodiment, multiple target genes or multiple gene targets are silenced.
A variety of vectors may be used to introduce the nucleic acid or construct encoding the nucleic acid of the present invention into a cell. Virus-based vectors, such as those related to adenovirus, lentivirus or retrovirus, may be used. The expression of the nucleic acid according to the present invention may be in vitro, ex vivo or in vivo. The expression of the nucleic acid after introduction of the construct according to the present invention into a cell may be stable (that is, long-term) or transient. Adeno-associated virus is one preferred vector. Other preferred vectors are retroviral and lentiviral vectors.
The use of the method of this aspect of the present invention has applications in gene therapy strategies where multiple gene inactivation and/or complete inactivation of a gene (for example, an oncogene) would be advantageous. For example, viruses may be controlled by targeting two or more regions of a viral genome, or genes of a virus, thereby decreasing the likelihood that the virus might mutate to become resistant to the effect of a particular DNA construct. Furthermore, multiple sites in a single viral gene may be targeted using the nucleic acid or construct according to the present invention. Another potential use in viral control might be to design a single construct inactivating both viral genes and also host genes involved in viral replication. Such uses and methods are within the scope of the invention. Accordingly, the method of the present invention may be used to inactivate two or more genes of the human immunodeficiency virus (HIV) or to inactivate one or more HIV genes and one or more HIV receptors on the host cell, for example the CCR4 receptor.
In cancers, mutations frequently occur in multiple genes. For gene therapy approaches, inactivation of two or more critical genes involved in tumour development is likely to prove more effective in controlling cancer cell proliferation than DNA constructs inactivating a single gene. For example, the development of a particular type of tumour may be accelerated by the cumulative effect of two signalling pathways controlled by two different genes. The simultaneous inactivation of the two genes may result in more immediate control of tumour growth. Furthermore, the tumour development may involve two alternative pathways controlled by different genes, whereby the inhibition of both pathways would be a requirement for the effective inhibition of tumour development.
The method according to this aspect of the present invention may be useful for the treatment and/or prevention of disease in plants and animals, including humans. This method has the advantage over many other treatments in that the gene can be targeted with high specificity, reducing the possibility for side-effects.
Multiple gene inactivation strategies are also likely to have uses in target definition and gene function studies. For example, DNA constructs according to the present invention may be designed whereby the construct can inactivate a single gene A by possessing a target sequence for that gene. In order to establish the phenotypic effects of inhibiting the expression of a particular gene in the environment where gene A is not expressed, other sequences can be included in the multiple target construct. For example, random shotgun library sequences can be cloned into the DNA construct already possessing the target sequence for gene A. Therefore, such a library can be used to screen for genes of unknown functions in a background where the first gene is also inactivated.
Regions of target genes targeted by RNAi techniques may be predicted, including empirically or by various algorithms. Where there is more than one optimal target sequence, all such target sequences may be included in one construct.
Different non-complementary bubble-forming or bubble-encoding sequences in the constructs or nucleic acids of the present invention may have different activity in respect of gene silencing. Accordingly, random libraries of bubble sequences may be generated to determine the optimal sequences required for gene silencing activity for any given application or system. Such a method may involve inserting one or more randomised nucleotides into specific defined positions along a bubble sequence in a DNA construct and testing the activity of the interfering RNA encoded by the adjacent double-strand forming region. Such bubble sequences may be up to ten nucleotides in length or more. Preferably the bubble sequence is four or six nucleotides in length.
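As a non-limiting sketch of how such a random library might be enumerated, the following Python function expands every randomised position in a bubble template into each of the four nucleotides. The use of the letter N to mark a randomised position, the function name and the example template are assumptions made for illustration and are not prescribed by the method.

```python
from itertools import product
from typing import List


def bubble_library(template: str) -> List[str]:
    """Enumerate every bubble sequence obtained by replacing each 'N'
    in the template with A, C, G or T; other positions stay fixed."""
    template = template.upper()
    positions = [i for i, base in enumerate(template) if base == "N"]
    variants = []
    for combo in product("ACGT", repeat=len(positions)):
        seq = list(template)
        for pos, base in zip(positions, combo):
            seq[pos] = base
        variants.append("".join(seq))
    return variants


if __name__ == "__main__":
    # Hypothetical four-nucleotide bubble with two randomised positions.
    library = bubble_library("ANNT")
    print(len(library), library[:4])
```

A fully randomised four-nucleotide bubble gives 4^4 = 256 variants and a six-nucleotide bubble 4^6 = 4096, so randomising only selected positions keeps the library small enough for each encoded RNA to be tested individually.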
Constructs inactivating multiple target genes may also be used in transgenic systems to screen directly for the effects of inactivating two known genes. Such an approach may circumvent the requirement of complex breeding programs to generate individual animals possessing multiple gene inactivation.
The nucleic acid or construct according to the present invention may be introduced into a cell in a suitable context. The carriers, excipients and/or diluents utilised in delivering the subject nucleic acid or constructs to a host cell should be acceptable for human or veterinary applications. Such carriers, excipients and/or diluents are well-known to those skilled in the art. Carriers and/or diluents suitable for veterinary use include any and all solvents, dispersion media, aqueous solutions, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like. Except insofar as any conventional media or agent is incompatible with the active ingredient, use thereof in the composition is contemplated. Supplementary active ingredients can also be incorporated into the compositions.
In another aspect of the present invention there is provided a method of inhibiting the expression of a target gene by introducing RNA produced from the construct of the present invention into a cell containing a target gene to be silenced using RNAi techniques.
A viral delivery system based on any appropriate virus may be used to deliver the RNA or nucleic acid construct of the present invention. In addition, hybrid viral systems may be of use. The choice of viral delivery system will depend on various parameters, such as the tissue targeted for delivery, transduction efficiency of the system, pathogenicity, immunological and toxicity concerns, and the like. Given the diversity of infections, diseases and other conditions that are amenable to interference by the RNA and RNA encoded by the nucleic acid constructs of the present invention, it is clear that there is no single viral system that is suitable for all applications. When selecting a viral delivery system to use in the present invention, it is important to choose a system where the interfering RNA-containing viral particles are preferably: 1) reproducibly and stably propagated; 2) able to be purified to high titres; and 3) able to mediate targeted delivery (delivery of the interfering RNA to the tissue or organ of interest without widespread dissemination).
In general, the five most commonly used classes of viral systems used in gene therapy can be categorized into two groups according to whether their genomes integrate into host cellular chromatin (oncoretroviruses and lentiviruses) or persist in the cell nucleus predominantly as extrachromosomal episomes (adeno-associated virus, adenoviruses and herpes viruses). This distinction is an important determinant of the suitability of each vector for particular applications; non-integrating vectors can, under certain circumstances, mediate persistent gene expression in non-proliferating cells, but integrating vectors are the tools of choice if stable genetic alteration needs to be maintained in dividing cells, for example where the target cells are rapidly proliferating cancer cells.
For example, in one embodiment of the present invention, viruses from the Parvoviridae family are utilized. The Parvoviridae is a family of small single-stranded, non-enveloped DNA viruses with genomes approximately 5000 nucleotides long. Included among the family members is adeno-associated virus (AAV), a dependent parvovirus that by definition requires co-infection with another virus (typically an adenovirus or herpes virus) to initiate and sustain a productive infectious cycle. In the absence of such a helper virus, AAV is still competent to infect or transduce a target cell by receptor-mediated binding and internalization, penetrating the nucleus in both non-dividing and dividing cells.
Once in the nucleus, the virus uncoats and the transgene is expressed from a number of different forms—the most persistent of which are circular monomers. AAV will integrate into the genome of 1-5% of cells that are stably transduced (Nakai, et al., J. Virol. 76:11343-349 (2002)). Expression of the transgene can be exceptionally stable and in one study with AAV delivery of Factor IX, a dog model continues to express therapeutic levels of the protein 4.5 years after a single direct infusion with the virus. Because progeny virus is not produced from AAV infection in the absence of helper virus, the extent of transduction is restricted only to the initial cells that are infected with the virus. However, unlike retrovirus, adenovirus, and herpes simplex virus, AAV appears to lack human pathogenicity and toxicity (Kay, et al., Nature 424: 251 (2003) and Thomas, et al., Nature Reviews Genetics 4:346-58 (2003)).
Typically, the genome of AAV contains only two genes. The “rep” gene codes for at least four separate proteins utilized in DNA replication. The “cap” gene product is spliced differentially to generate the three proteins that comprise the capsid of the virus. When packaging the genome into nascent virus, only the Inverted Terminal Repeats (ITRs) are obligate sequences; rep and cap can be deleted from the genome and be replaced with heterologous sequences of choice. However, in order to produce the proteins needed to replicate and package the AAV-based heterologous construct into nascent virions, the rep and cap proteins must be provided in trans. The helper functions normally provided by co-infection with the helper virus, such as adenovirus or herpes virus mentioned above, also can be provided in trans in the form of one or more DNA expression plasmids. Since the genome normally encodes only two genes it is not surprising that, as a delivery vehicle, AAV is limited by a packaging capacity of 4.5 single stranded kilobases (kb). However, although this size restriction may limit the genes that can be delivered for replacement gene therapies, it does not adversely affect the packaging and expression of shorter sequences such as RNAi nucleic acids.
However, technical hurdles must be addressed when using AAV as a vehicle for nucleic acid constructs. For example, various percentages of the human population may possess neutralizing antibodies against certain AAV serotypes. However, since there are several AAV serotypes, for some of which the percentage of individuals harbouring neutralizing antibodies is vastly reduced, other serotypes can be used or pseudotyping may be employed. There are at least eight different serotypes that have been characterized, with dozens of others which have been isolated but have been less well described. Another limitation is that as a result of a possible immune response to AAV, AAV-based therapy may only be administered once; however, use of alternate, non-human derived serotypes may allow for repeat administrations. Administration route, serotype, and composition of the delivered genome all influence tissue specificity.
Another limitation in using unmodified AAV systems with a nucleic acid construct is that transduction can be inefficient. Stable transduction in vivo may be limited to 5-10% of cells. Yet, different methods are known in the art to boost stable transduction levels. One approach is utilizing pseudotyping, where AAV-2 genomes are packaged using cap proteins derived from other serotypes. One group of investigators exhaustively pseudotyped AAV-2 with AAV-1, AAV-3B, AAV-4, AAV-5, and AAV-6 for tissue culture studies. The highest levels of transgene expression were induced by virions which had been pseudotyped with AAV-6, producing nearly 2000% higher transgene expression than AAV-2. Thus, the present invention contemplates use of a pseudotyped AAV virus to achieve high transduction levels, with a corresponding increase in the expression of the interfering RNA.
Another viral delivery system useful with the nucleic acid construct of the present invention is a system based on viruses from the family Retroviridae. Retroviruses comprise single-stranded RNA animal viruses that are characterized by two unique features. First, the genome of a retrovirus is diploid, consisting of two copies of the RNA. Second, this RNA is transcribed by the virion-associated enzyme reverse transcriptase into double-stranded DNA. This double-stranded DNA or provirus can then integrate into the host genome and be passed from parent cell to progeny cells as a stably-integrated component of the host genome.
In some embodiments, lentiviruses are the preferred members of the retrovirus family for use in the present invention. Lentivirus vectors are often pseudotyped with vesicular stomatitis virus glycoprotein (VSV-G), and have been derived from the human immunodeficiency virus (HIV), the etiologic agent of the human acquired immunodeficiency syndrome (AIDS); visna-maedi, which causes encephalitis (visna) or pneumonia in sheep; equine infectious anemia virus (EIAV), which causes autoimmune hemolytic anemia and encephalopathy in horses; feline immunodeficiency virus (FIV), which causes immune deficiency in cats; bovine immunodeficiency virus (BIV) which causes lymphadenopathy and lymphocytosis in cattle; and simian immunodeficiency virus (SIV), which causes immune deficiency and encephalopathy in non-human primates. Vectors that are based on HIV generally retain <5% of the parental genome, and <25% of the genome is incorporated into packaging constructs, which minimizes the possibility of the generation of reverting replication-competent HIV. Biosafety has been further increased by the development of self-inactivating vectors that contain deletions of the regulatory elements in the downstream long-terminal-repeat sequence, eliminating transcription of the packaging signal that is required for vector mobilization.
Reverse transcription of the retroviral RNA genome occurs in the cytoplasm. Unlike C-type retroviruses, the lentiviral cDNA complexed with other viral factors (known as the pre-initiation complex) is able to translocate across the nuclear membrane and transduce non-dividing cells. A structural feature of the viral cDNA, a DNA flap, seems to contribute to efficient nuclear import. This flap is dependent on the integrity of a central polypurine tract (cPPT) that is located in the viral polymerase gene, so most lentiviral-derived vectors retain this sequence. Lentiviruses have broad tropism, low inflammatory potential, and result in an integrated vector. The main limitation is that integration might induce oncogenesis in some applications. The main advantage to the use of lentiviral vectors is that gene transfer is persistent in most tissues or cell types.
A lentiviral-based construct that may be used to express the RNA according to the present invention preferably comprises sequences from the 5′ and 3′ LTRs of a lentivirus. More preferably the viral construct comprises an inactivated or self-inactivating 3′ LTR from a lentivirus. The 3′ LTR may be made self-inactivating by any method known in the art. In a preferred embodiment, the U3 element of the 3′ LTR contains a deletion of its enhancer sequence, preferably the TATA box, Sp1 and NF-kappa B sites. As a result of the self-inactivating 3′ LTR, the provirus that is integrated into the host cell genome will comprise an inactivated 5′ LTR. The LTR sequences may be LTR sequences from any lentivirus from any species. The lentiviral-based construct may also incorporate sequences for MMLV or MSCV, RSV or mammalian genes. In addition, the U3 sequence from the lentiviral 5′ LTR may be replaced with a promoter sequence in the viral construct. This may increase the titre of virus recovered from the packaging cell line. An enhancer sequence may also be included.
Adenoviruses are non-enveloped viruses containing a linear double-stranded DNA genome. While there are over 40 serotype strains of adenovirus—most of which cause benign respiratory tract infections in humans—subgroup C serotypes 2 or 5 are predominantly used as vectors. The adenovirus life cycle normally does not involve integration into the host genome, rather it replicates as episomal elements in the nucleus of the host cell and consequently there is no risk of insertional mutagenesis. The wild type adenovirus genome is approximately 35 kb of which up to 30 kb can be replaced with foreign DNA. There are four early transcriptional units (E1, E2, E3 and E4), which have regulatory functions, and a late transcript, which codes for structural proteins. Progenitor vectors have either the E1 or E3 gene inactivated, with the missing gene being supplied in trans either by a helper virus, plasmid or by an integrated gene in a helper cell genome. Second generation vectors additionally use an E2a temperature sensitive mutant or an E4 deletion. The most recent “gutless” vectors contain only the inverted terminal repeats (ITRs) and a packaging sequence around the transgene, all the necessary viral genes being provided in trans by a helper virus.
Adenoviral vectors are very efficient at transducing target cells in vitro and in vivo, and can be produced at high titres (>10^11/ml). With the exception of one study that showed prolonged transgene expression in rat brains using an E1 deletion vector, transgene expression in vivo from progenitor vectors tends to be transient. Following intravenous injection, 90% of the administered vector is degraded in the liver by a non-immune mediated mechanism. Thereafter, an MHC class I restricted immune response occurs, using CD8+ CTLs to eliminate virus infected cells and CD4+ cells to secrete IFN-alpha which results in anti-adenoviral antibody. Alteration of the adenoviral vector can remove some CTL epitopes; however, the epitopes recognized differ with the host MHC haplotype. The remaining vectors, in those cells that are not destroyed, have their promoter inactivated and persisting antibody prevents subsequent administration of the vector.
Approaches to avoid the immune response involving transient immunosuppressive therapies have been successful in prolonging transgene expression and achieving secondary gene transfer. A less interventionist method has been to induce oral tolerance by feeding the host UV inactivated vector. However, it is more desirable to manipulate the vector rather than it is to manipulate the host through immunosuppression. Although only replication deficient vectors are used, viral proteins are expressed at a very low level, which are then presented to the immune system. The development of vectors containing fewer genes—culminating in the “gutless” vectors which contain no viral coding sequences—has resulted in prolonged in vivo transgene expression in liver tissue. However, the initial delivery of DNA packaged within adenovirus proteins—the majority of which will be degraded and presented to the immune system—may still cause problems for clinical trials.
Until recently, the mechanism by which the adenovirus targeted the host cell was poorly understood. Tissue-specific expression was therefore only possible by using cellular promoter/enhancers, e.g., the myosin light chain 1 promoter or the smooth muscle cell SM22a promoter, or by direct delivery to a local area. Uptake of the adenovirus particle has been shown to be a two-stage process involving an initial interaction of a fibre coat protein in the adenovirus with a cellular receptor or receptors, which include the MHC class I molecule and the coxsackievirus-adenovirus receptor. The penton base protein of the adenovirus particle then binds to the integrin family of cell surface heterodimers allowing internalization via receptor mediated endocytosis. Most cells express primary receptors for the adenovirus fibre coat protein, however internalization is more selective. Methods of increasing viral uptake include stimulating the target cells to express an appropriate integrin and conjugating an antibody with specificity for the target cell type to the adenovirus. However, the use of antibodies increases the production difficulties of the vector and the potential risk of activating the complement system.
Another virus that may be used as a basis for a viral delivery vector in the present invention is the Herpes simplex virus-1. HSV-1 is a double-stranded DNA virus with a packaging capacity of 40 kb, or up to 150 kb (helper dependent). HSV-1 has strong tropism for neurons, but also has a high inflammatory potential. HSV-1 is maintained episomally. Replication defective HSV-1 vectors generally are produced by deleting all, or a combination, of the five immediate-early genes (ICP0, ICP4, ICP22, ICP27 and ICP47), which are required for lytic infection and expression of all other viral proteins. Unfortunately, the ICP0 gene product is both cytotoxic and required for high level and sustained transgene expression. As such, the production of non-toxic quintuple immediate-early mutant vectors is a trade-off against efficient and persistent transgene expression. An HSV-1 protein that is activated during latency has recently been shown to complement mutations in ICP0 and overcome the repression of transgene expression that occurs in the absence of ICP0. Substitution of this protein in place of ICP0 might facilitate efficient transgene expression without cytotoxicity in non-neuronal cells. Long-term expression can be achieved in the nervous system by using one of the HSV-1 neuron-specific latency-activated promoters to drive transgene expression.
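The selection considerations discussed above for the five commonly used classes of viral system may be summarised, purely as an illustrative sketch, in a small Python lookup. The class and function names are hypothetical. The capacity figures are those stated in this specification (4.5 kb for AAV, up to 30 kb of replaceable genome for adenovirus, 40 kb for HSV-1 without helper dependence); where the text gives no figure the capacity is recorded as unknown rather than assumed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ViralVector:
    name: str
    persistence: str                # "integrating" or "episomal" (predominant form)
    capacity_kb: Optional[float]    # None where the text states no capacity


# Figures taken from the description; None marks values not stated there.
VECTORS = [
    ViralVector("oncoretrovirus", "integrating", None),
    ViralVector("lentivirus", "integrating", None),
    ViralVector("adeno-associated virus", "episomal", 4.5),
    ViralVector("adenovirus", "episomal", 30.0),
    ViralVector("herpes simplex virus-1", "episomal", 40.0),
]


def candidates(insert_kb: float,
               persistence: Optional[str] = None,
               include_unknown: bool = False) -> List[str]:
    """Return names of vectors whose stated capacity accommodates the insert,
    optionally restricted to one persistence class. Vectors with no stated
    capacity are listed only when include_unknown is True."""
    names = []
    for vector in VECTORS:
        if persistence is not None and vector.persistence != persistence:
            continue
        if vector.capacity_kb is None:
            if include_unknown:
                names.append(vector.name)
        elif vector.capacity_kb >= insert_kb:
            names.append(vector.name)
    return names


if __name__ == "__main__":
    # A hypothetical 1 kb interfering-RNA expression cassette.
    print(candidates(1.0, persistence="episomal"))
```

Such a table captures only capacity and persistence; tropism, immunogenicity and the other parameters discussed above would still need to be weighed for any particular application.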
Other viral or non-viral systems known to those skilled in the art may be used to deliver the RNA or nucleic acid constructs of the present invention to cells of interest, including but not limited to gene-deleted adenovirus-transposon vectors that stably maintain virus-encoded transgenes in vivo through integration into host cells (see Yant, et al., Nature Biotech. 20:999-1004 (2002)); systems derived from Sindbis virus or Semliki forest virus (see Perri, et al., J. Virol. 74(20):9802-07 (2002)); systems derived from Newcastle disease virus or Sendai virus; or mini-circle DNA vectors devoid of bacterial DNA sequences (see Chen, et al., Molecular Therapy. 8(3):495-500 (2003)). In addition, hybrid viral systems may be used to combine useful properties of two or more viral systems.
To deliver a viral-based nucleic acid construct into target cells, the nucleic acid construct first must be packaged into viral particles. Any method known in the art may be used to produce infectious viral particles whose genome comprises a copy of the viral construct. For example, certain methods utilize packaging cells that stably express in trans the viral proteins that are required for the incorporation of the nucleic acid construct into viral particles, as well as other sequences necessary or preferred for a particular viral delivery system (for example, sequences needed for replication, structural proteins and viral assembly) and either viral-derived or artificial ligands for tissue entry. In such a method, a nucleic acid construct is ligated to a viral delivery vector and the resulting viral nucleic acid construct is used to transfect packaging cells. The packaging cells then replicate viral sequences, express viral proteins and package the viral nucleic acid constructs into infectious viral particles. The packaging cell line may be any cell line that is capable of expressing viral proteins, including but not limited to 293, HeLa, A549, PerC6, D17, MDCK, BHK, Bing cherry, phoenix, Cf2Th, or any other line known to or developed by those skilled in the art. One packaging cell line is described, for example, in U.S. Pat. No. 6,218,181.
Alternatively, a cell line that does not stably express necessary viral proteins may be co-transfected with two or more constructs to achieve efficient production of functional particles. One of the constructs comprises the nucleic acid construct of the present invention, and the other plasmid(s) comprises nucleic acids encoding the proteins necessary to allow the cells to produce functional virus (replication and packaging construct) as well as other helper functions. This method utilizes cells for packaging that do not stably express viral replication and packaging genes. In this case, the nucleic acid construct is ligated to the viral delivery vector and then co-transfected with one or more vectors that express the viral sequences necessary for replication and production of infectious viral particles. The cells replicate viral sequences, express viral proteins and package the viral nucleic acid constructs into infectious viral particles.
The packaging cell line or replication and packaging construct may not express envelope gene products. In these embodiments, the gene encoding the envelope protein can be provided on a separate construct that is co-transfected with the viral nucleic acid construct. As the envelope protein is responsible, in part, for the host range of the viral particles, the viruses may be pseudotyped. As described supra, a “pseudotyped” virus is a viral particle having an envelope protein that is from a virus other than the virus from which the genome is derived. One with skill in the art can choose an appropriate pseudotype for the viral delivery system used and cell to be targeted. In addition to conferring a specific host range, a chosen pseudotype may permit the virus to be concentrated to a very high titre. Viruses alternatively can be pseudotyped with ecotropic envelope proteins that limit infection to a specific species (e.g., ecotropic envelopes allow infection of, e.g., murine cells only, whereas amphotropic envelopes allow infection of, e.g., both human and murine cells). In addition, genetically-modified ligands can be used for cell-specific targeting.
After production in a packaging cell line, the viral particles containing the nucleic acid constructs are purified and quantified (titred). Purification strategies include density gradient centrifugation, or, preferably, column chromatographic methods.
In another aspect of the present invention there is provided a method of testing nucleic acid sequences for efficacy in RNAi comprising the steps of inserting DNA encoding RNAi regions to be tested into the construct according to the present invention; introducing the construct into a cell containing the target gene corresponding to the RNAi region; allowing RNA to be produced from the construct and evaluating the effect on the expression of the target gene.
In a further aspect of the present invention there is provided a method for the production of a construct according to the present invention using long range PCR techniques. In one embodiment there is provided a method of adding a predetermined oligonucleotide to a polynucleotide, the oligonucleotide being divided into a first sub-sequence and a second sub-sequence, by a polymerase chain reaction process including:
providing a first primer having at its 3′ end a fixing part hybridizable under polymerase chain reaction conditions with at least a first part of the polynucleotide and at its 5′ end an effector part identical to the first sub-sequence, and a second primer having at its 3′ end a fixing part hybridizable with at least a second part of the polynucleotide that is adjacent the first part of the polynucleotide and at its 5′ end an effector part identical to the second sub-sequence,
introducing the primers to the polynucleotide under polymerase chain reaction conditions such that the fixing part of each primer hybridizes with the polynucleotide;
conducting a multiple polymerase chain reaction to produce an amplification product which includes the effector parts of the primers at the ends of a double-stranded sequence; and
ligating the ends of the effector parts together to form a combined polynucleotide and oligonucleotide sequence.
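By way of illustration only, the primer arrangement recited in the steps above can be sketched in Python. The function names, the 18-nucleotide fixing length and the choice of which primer anneals to which strand are assumptions made for this sketch and are not taken from the description. Each effector part is written on the strand its own primer extends, so the effector of the first primer appears as the reverse complement of the first sub-sequence as read on the top strand.

```python
# Illustrative sketch: function names and the 18-nt fixing length are
# assumptions, not values taken from the description.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(template, pos, oligo, fix_len=18):
    """Primers for inserting `oligo` between template[pos - 1] and template[pos].

    `template` is the top strand of a circular polynucleotide.
    Returns (first_primer, second_primer), each written 5'->3'.
    """
    if not 0 < fix_len <= len(template):
        raise ValueError("fixing part must fit within the template")
    half = len(oligo) // 2
    first_sub, second_sub = oligo[:half], oligo[half:]
    n = len(template)
    doubled = template + template  # lets slices wrap around the origin
    upstream = doubled[pos + n - fix_len:pos + n]  # ends at the insertion site
    downstream = doubled[pos:pos + fix_len]        # starts at the insertion site
    # First primer: bottom-strand primer extending away from the site.
    first_primer = revcomp(first_sub) + revcomp(upstream)
    # Second primer: top-strand primer extending away from the site.
    second_primer = second_sub + downstream
    return first_primer, second_primer


def amplify_and_ligate(template, pos, first_primer, second_primer, fix_len=18):
    """Top strand of the ligated product, written starting at the insert."""
    first_sub = revcomp(first_primer[:len(first_primer) - fix_len])
    second_sub = second_primer[:len(second_primer) - fix_len]
    backbone = template[pos:] + template[:pos]  # one full lap of the circle
    # Linear amplicon: second_sub + backbone + first_sub. Ligation joins the
    # 3' end (first_sub) to the 5' end (second_sub), restoring the oligo.
    return first_sub + second_sub + backbone
```

In this model the two fixing parts abut at the insertion site and point away from each other, so that amplification proceeds around the whole template and the effector parts end up at the two ends of the linear product.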
For additional clarification, in this description of this embodiment of the invention directed towards production of a construct, the term “effector” is used for convenience and as an appropriate term, but in a different context from that in which it is used in describing the RNA and DNA constructs themselves above. It is thus used in a different context from the way in which it is described in the paragraph above that commences “The term ‘effector sequence’ and ‘effector’ in the context of . . . ”. The term “effector” in this embodiment and the related claims should be construed in context without importing the limitations of the meaning of “effector” described above. It may also be referred to as the “variable” sequence as it largely contains the sequence that will vary from construct to construct.
By “oligonucleotide” in this process is meant a nucleic acid sequence of 40 to 100, preferably less than 100 nucleotides in length. The oligonucleotide may be single or double-stranded. Preferably the oligonucleotide is DNA.
By “polynucleotide” in this process is meant a nucleic acid sequence of at least about 1000 nucleotides in length. The polynucleotide may be single or double-stranded depending on the stage of the process according to the present invention. The polynucleotide may have a double-stranded circular conformation or a linear form, or may be the linearized form of a previously circular double stranded sequence. Preferably the polynucleotide is DNA. In a preferred embodiment of the present invention, the polynucleotide is a DNA vector selected from the group consisting of a plasmid, a bacteriophage and a viral-based vector.
It will be appreciated by a person skilled in the art that the efficiency of the polymerase chain reaction (PCR) can be modified, for example by altering the denaturation, annealing and polymerisation temperatures, the timing of the cycles and the salt concentration in the reaction mixture. Variations of these and other conditions that allow the PCR reaction to take place are encompassed in the term “polymerase chain reaction conditions”. It will be further appreciated by a person skilled in the art that a range of products may be produced from a given PCR reaction. These products may be separated by size or weight by methods known in the art, such as gel electrophoresis. In a preferred embodiment of the present invention the desired PCR product is isolated from solution.
The long range PCR method of this aspect of the present invention can be used to insert a DNA oligonucleotide into a DNA polynucleotide that is a vector in order to form a construct which enables the oligonucleotide to be transcribed into a ribonucleic acid sequence (RNA). The transcription may take place from the oligonucleotide only or the RNA transcript may be the result of the transcription of a combination of oligonucleotide and polynucleotide sequences. The transcribed RNA may further be translated into protein, or may also remain as untranslated RNA. In a preferred embodiment of this aspect of the present invention the primers have a homology with a restriction enzyme site in the polynucleotide sequence. In a further preferred embodiment of this aspect of the present invention, the primers are phosphorylated and the ligation of the amplification product is catalysed by T4 DNA ligase.
The polynucleotide used in the methods according to this long-range PCR process may contain one or more regulatory elements to allow transcription to take place. Preferably at least one of the regulatory elements is a promoter. A variety of promoters may be included in the polynucleotide vector. Factors influencing the choice of promoter include the desire for inducible transcription of the oligonucleotide or oligonucleotide and polynucleotide sequences, the strength of the promoter and the suitability of the promoter to induce expression in the in vivo or in vitro environment in which the transcription is to take place. In a preferred embodiment the promoter is an RNA polymerase III (pol III) promoter such as a U6 or H1 promoter.
In a preferred embodiment of this aspect of this process, the oligonucleotide codes for an RNA sequence capable of forming a double-stranded hairpin structure due to the presence of an inverted repeat sequence. Preferably, the first primer contains approximately one half of the inverted repeat sequence in its effector part and the second primer contains approximately the other half of the inverted repeat sequence in its effector part. More preferably, the first and second primers further contain at least one nucleotide at their 5′ ends that forms the loop region of the hairpin-loop RNA structure.
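As a non-limiting sketch, the division of a hairpin-encoding insert between the two effector parts may be modelled as follows. The stem and loop sequences used in the example call, and all function names, are hypothetical; the only features taken from the description are that each primer carries about half of the inverted repeat and at least one loop nucleotide at its 5′ end.

```python
# Illustrative sketch: the sequences and names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def hairpin_insert(stem, loop):
    """DNA encoding a hairpin: sense arm, loop, then antisense arm."""
    return stem + loop + revcomp(stem)


def effector_parts(stem, loop):
    """Split the insert inside the loop and return both effector parts.

    Each part is written 5'->3' as it would appear on its own primer, so
    the part for the bottom-strand primer is reverse complemented.
    """
    if len(loop) < 2:
        raise ValueError("need at least one loop nucleotide per primer")
    insert = hairpin_insert(stem, loop)
    cut = len(stem) + len(loop) // 2  # cut point lies within the loop
    left, right = insert[:cut], insert[cut:]
    return revcomp(left), right


first_effector, second_effector = effector_parts(
    "GATTACAGGCTTAACCGGTA", "TTCAAGAGA"
)
```

Because the insert is an inverted repeat, both effector parts in this model begin with loop-derived nucleotides at the 5′ end and finish with the same arm sequence, which then adjoins the 3′ fixing part.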
In another embodiment of this aspect of the present invention, the effector parts are at least partially complementary, such that upon transcription (following transfection of a cell by a vector which incorporates a polynucleotide as described above) their respective RNA transcripts may hybridise with each other due to the complementarity of their sequences.
In a further preferred embodiment the oligonucleotide used in the method according to this aspect of the present invention is capable of coding RNA suitable for use as interfering RNA in gene silencing techniques. Such techniques are described in the specification of PCT/AU99/00195. Preferably the RNA has a hairpin-loop structure.
In another embodiment the oligonucleotide encodes a restriction site and the addition of the oligonucleotide to the polynucleotide results in a restriction site being inserted into the combined oligonucleotide and polynucleotide sequence. It will be appreciated by a person skilled in the art that where the polynucleotide is a vector, such as a plasmid, the insertion of a restriction site would have many advantages in the subsequent use of the plasmid, particularly for subcloning purposes.
In a further embodiment the oligonucleotide includes an intron, or non-coding, sequence of a gene. The polynucleotide may include the coding sequence of the gene. Accordingly, the addition of the oligonucleotide to the polynucleotide using the method of the present invention may allow the insertion of the intron at the appropriate site in the coding sequence of the gene. Insertion of an intron into a coding sequence of a gene has a number of practical applications. For example, insertion of introns into DNA constructs has been shown to increase transgene expression. Another possible application is to use introns as a means of delivering double stranded RNA to induce gene silencing.
In another embodiment of this aspect of the present invention there is provided a DNA construct produced by the addition of an oligonucleotide to a polynucleotide according to the method of the present invention. The DNA construct may be useful for further subcloning purposes whereby a second oligonucleotide of interest may be introduced by, for example, known subcloning techniques. The DNA construct may also be an expression construct for the further production of RNA and/or protein. Preferably the DNA construct is suitable for producing RNA suitable for use as interfering RNA in gene silencing technologies. More preferably, the construct can be introduced into a cell where gene silencing is to take place and interfering RNA can be transcribed within this cell.
In another aspect of the present invention there is provided primers suitable for use in the method according to the present invention. In a further aspect of the present invention there is provided a kit comprising a polynucleotide and a primer pair for producing a polynucleotide containing an additional oligonucleotide.
In a further embodiment of this aspect of the invention there is provided a method for the large scale production of large numbers of hairpin DNA plasmids using the long range PCR method of the present invention with automation procedures. The simplicity of the long range PCR method lends itself to automation, using a robotics system to amplify DNA templates and ligate these to prepare DNA vectors. Such vectors can also be used to transform bacteria to grow substantial copy numbers of the vectors. In this way large numbers of plasmids, for example plasmids targeting different regions of a single gene, could be rapidly prepared.
In a further aspect of the invention, a method for preparing libraries of sequences using long range PCR techniques is provided. In this instance, portions of one or both of the forward and reverse primers are synthesised using redundant oligonucleotides. Following amplification, ligation and transformation of bacteria, individual colonies contain unique hairpin DNA constructs reflecting the particular redundancies incorporated into individual plasmids by individual amplification primers. In this way libraries with, for example, random loop sequences are prepared and individual plasmids from the library are analysed for gene silencing activity in order to define loop sequences that enhance the activity of hairpin DNA constructs.
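The theoretical complexity of such a library can be estimated from the redundant positions in the primers. The following sketch assumes that the redundancies are written with the standard IUPAC nucleotide ambiguity codes; the function names are hypothetical and the description does not prescribe any particular notation.

```python
# Illustrative sketch: assumes redundancies are written as IUPAC codes.
from itertools import product

IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def library_size(degenerate):
    """Number of distinct sequences encoded by a degenerate oligonucleotide."""
    size = 1
    for code in degenerate:
        size *= len(IUPAC[code])
    return size


def expand(degenerate):
    """Yield every concrete sequence encoded by a degenerate oligonucleotide."""
    for combo in product(*(IUPAC[code] for code in degenerate)):
        yield "".join(combo)
```

For example, a fully random loop of nine positions written as `"N" * 9` corresponds to 4^9 possible loop sequences, which indicates how many colonies would need to be screened to sample the library thoroughly.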
In another aspect of the present invention there is provided a kit for constructing a nucleic acid construct using the long range PCR method of the present invention comprising the polynucleotide, a polymerase, a first primer, a second primer and a ligating enzyme in proportions suitable for the long range PCR method according to the present invention.
In another aspect of the present invention there is provided a kit for inhibiting the expression of a target gene, including a vector suitable for use in producing a construct according to the present invention. Such a vector may include regulatory elements and facility for insertion of a cassette encoding a nucleic acid designed according to the present invention.
Without being bound by any theory or mode of action, it is believed that the invention is mediated by enzymes including Dicer and Drosha. At least these two ribonucleases, both members of the RNase III class, play a central role in the processing of double stranded RNA into siRNAs.
Dicer is the best characterized component. Dicer is thought to be a cytoplasmic protein. It can cleave double-stranded RNA to produce approximately 21 nucleotide (nt) dsRNAs with a 2 nt 3′ overhang; this overhang is a characteristic of RNase III-type enzymes. The precise requirements that allow dsRNA to act as an efficient substrate for Dicer remain unclear. miRNA precursors are one such substrate; they naturally form a hpRNA structure, but typically contain regions of mismatch, i.e., they do not form perfect double stranded structures, in contrast to hpRNAs designed to produce siRNAs from expression constructs. Dicer appears normally to process hpRNAs from the base of the hairpin, but definitive proof of this is not yet available. Dicer probably plays other roles in the RNAi process. It has recently been shown that the enzyme plays a role in RISC, i.e., it might play a role in cleavage of the target mRNA.
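Purely to illustrate the geometry of the product described above, the following sketch derives the two strands of one such duplex from a perfectly paired region. It is a simplified model, not a description of how Dicer selects its cleavage sites; the function names and default lengths (21 nt strands, 2 nt overhangs) are assumptions based on the approximate figures given above.

```python
# Illustrative sketch of duplex geometry only; not a model of Dicer itself.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def rna_revcomp(seq):
    """Reverse complement of an RNA sequence written 5'->3'."""
    return seq.translate(RNA_COMPLEMENT)[::-1]


def sirna_duplex(window, length=21, overhang=2):
    """Return (sense, antisense), both 5'->3', for one excised duplex.

    `window` is the top strand of a perfectly paired region spanning
    length + overhang nucleotides.
    """
    if len(window) != length + overhang:
        raise ValueError("window must span length + overhang nucleotides")
    sense = window[overhang:]                 # top strand, offset by the overhang
    antisense = rna_revcomp(window[:length])  # pairs with the first `length` nt
    return sense, antisense
```

With the default values the two 21 nt strands pair over 19 nt, and each strand carries an unpaired 2 nt extension at its 3′ end.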
Drosha is another RNase III enzyme implicated in RNA interference. Much less is known about its function compared to Dicer. The enzyme is nuclear and may be nucleolar, since Drosha is known to play a role in rRNA maturation, which is a nucleolar process. The precise role of Drosha in RNAi is unknown. It is known to play a role in processing of miRNAs and may play a role in processing longer dsRNAs in RNAi. Current models suggest that Drosha may recognize loop structures in RNA, bind to these, then cut hpRNAs about 19-21 nt downstream of the loop. Most RNase IIIs are thought to act by recognising loop structures, although it is recognised that the model described above for Dicer processing contradicts this view.
A hp RNA expressed from a pol III promoter thus may have 2 potential pathways by which it might enter RISC, namely:
These models are currently incomplete and are possibly not mutually exclusive, i.e., a longer hpRNA might be processed by both pathways, with some processed by Drosha then Dicer and some only by Dicer. Moreover some hpRNAs are expressed with a 5′ leader sequence (“U6+27”) which may target the hpRNA to the nucleolus, i.e., it is preferentially processed by Drosha before Dicer.
Without being bound by any theory or mode of action it is believed that improved therapeutic efficacy and safety of RNAi constructs can be achieved by optimising the length of effector sequences. This may assist the cleavage enzymes, such as Dicer and Drosha, cleaving at the same, predictable position, thereby providing predictability of result and reduction of side effects and/or variability of efficacy within and between patients.
The present invention will now be more fully described with reference to the accompanying examples and drawings. It should be understood, however, that the description following is illustrative only and should not be taken in any way as a restriction on the generality of the invention described above.
DNA constructs were prepared which were targeted to inactivate a number of genes, principally the Renilla luciferase gene because of the availability of simple rapid assays (see below). The base plasmid for all constructs was pU6.cass shown in
The resulting fragment was A-tailed using Taq polymerase and cloned into the vector pZero Blunt (pZB) using the manufacturer's protocols (Invitrogen). The human U6 promoter region was excised from this plasmid as an Eco RI/Hind III fragment and cloned into the vector pBluescript II SK+ (Stratagene), using the restriction sites introduced into the fragment by the above oligonucleotides. The resulting plasmid pU6.cass (
The plasmid pU6.ACTB-A hp (
The partially complementary oligonucleotide pairs, ACTB-A-hp-U6-5 and ACTB-A-hp-U6-6 and ACTB-A-hp-H1-3 and ACTB-A-hp-H1-4 were annealed, and the annealed pairs themselves subsequently annealed to form a double-stranded DNA structure compatible with cloning into BsmB I/Hind III digested pU6.cass. The annealed oligonucleotides were phosphorylated with T4 polynucleotide kinase using the manufacturer's (Promega) protocol and then cloned into the cut vector which had been dephosphorylated using Shrimp Alkaline Phosphatase (SAP) using the manufacturer's (Promega) protocol. This plasmid was expected to express a hairpin RNA, with transcription initiating in the human U6 promoter and terminating at the poly T tract in the 3′ region of the annealed sequences (SEQ ID NO: 226) as shown in
The general strategy of the long-range PCR method is shown in
A. Step 1: Long-range PCR (LPCR) primers are used to extend and amplify circular or linear templates. DNA templates are shown as two lines, denoting double stranded DNA, although single stranded DNA could be used as a template. The LPCR primers are shown as bent lines above and below the templates; thin regions represent 3′ fixing parts of primers, thick lines represent 5′ effector parts of primers.
B. Step 2: Amplify DNA molecule. PCR amplification of either of the templates in A will result in the production of linear DNA molecules, where the effector parts of the two LPCR oligonucleotides, denoted as thick lines, are incorporated into both ends of the linear DNA molecule.
C. Step 3: Circularized DNA molecule. The linear DNA can be readily recircularised using T4 DNA ligase or a similar enzyme. Note 5′ phosphorylation of at least one end of the DNA molecule is required to achieve this. This can be done by either synthesising 5′ phosphorylated oligonucleotides, or treating the linear DNA molecule with an enzyme such as T4 polynucleotide kinase; the former method is simplest.
2.1 Insertion of a Restriction Site into a Plasmid
An Asc I restriction site was introduced into a plasmid as shown in
The forward and reverse primers used in this reaction were:
The substrate plasmid was amplified and ligated, and bacterial colonies were obtained and analysed as described above. In this fashion an Asc I restriction site was introduced in a single step.
The procedure is shown in
A circular plasmid template is shown at the top, the two lines denote the positions at which the forward and reverse primers can anneal to the template at the point of sequence insertion. In this instance one primer contains only a 3′ fixing part, the other primer contains a 3′ fixing part as well as a 5′ effector part. The double stranded sequence (SEQ ID NO: 228) of the plasmid surrounding the point of insertion is shown below this. Above this, the sequence of the forward primer (SEQ ID NO: 227) is shown, the 3′ fixing part is shown directly above the sequence, the primer binding site is indicated by the arrow. The sequence of the 5′ effector region, which in this instance contains an Asc I restriction site, is indicated by the inclined letters. The sequence of the reverse primer (SEQ ID NO: 229) is shown below this and its primer binding site is also indicated by an arrow.
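One simple way to check the outcome of such an insertion in silico is to count occurrences of the recognition sequence in the circular plasmid before and after the step. The sketch below assumes the Asc I recognition sequence GGCGCGCC, which is palindromic so that scanning one strand suffices; the function name and the short test sequences are hypothetical and are not the plasmid sequences of this example.

```python
# Illustrative sketch: sequences used with this helper are hypothetical.
ASC_I_SITE = "GGCGCGCC"  # palindromic, so the top strand alone is scanned


def count_sites_circular(plasmid, site=ASC_I_SITE):
    """Count occurrences of `site` in a circular sequence.

    Overlapping matches are counted, as are matches that span the
    arbitrary start/end point of the written sequence.
    """
    # Append the first len(site) - 1 nucleotides so wrap-around sites match.
    doubled = plasmid + plasmid[:len(site) - 1]
    return sum(doubled.startswith(site, i) for i in range(len(plasmid)))
```

A plasmid that initially returns 0 and returns 1 after the insertion step would be consistent with the introduction of a single, unique site.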
This example describes the optimised approach for generating hairpin DNA constructs using long range PCR as outlined in
The reverse and forward primers are designed to contain a 3′ U6 fixing part and a 3′ terminator fixing part, respectively. The 5′ sequences of each primer contain approximately half of the hairpin and loop sequences, in this instance 30 nucleotides homologous to a region of the murine GLUT4 gene separated by a 9 nucleotide loop.
The general design of the primers is shown below. U6 and terminator fixing parts are shown in bold.
TTTCCACA 3′
TTTTTGGAAAAGCTTATCGATACCGTC3′
In this example the sequences of the reverse and forward primers were:
To produce the linear amplification product, PCR reactions are assembled as follows:
a: 10× cloned Pfu DNA polymerase reaction buffer (Stratagene): 200 mM Tris-HCl (pH 8.8), 20 mM MgSO4, 100 mM KCl, 100 mM (NH4)2SO4, 1% Triton X-100, 1 mg/ml BSA (nuclease free).
b: Pfu is added last, preferably just prior to running the reaction, to minimise primer degradation.
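For convenience, the working (1×) concentrations implied by the 10× buffer composition given in the note above can be calculated as in the following sketch. The dictionary keys and function name are hypothetical; the stock values are those listed for the 10× buffer.

```python
# Illustrative sketch: stock values are those listed for the 10x buffer.
PFU_BUFFER_10X = {
    "Tris-HCl pH 8.8 (mM)": 200,
    "MgSO4 (mM)": 20,
    "KCl (mM)": 100,
    "(NH4)2SO4 (mM)": 100,
    "Triton X-100 (%)": 1,
    "BSA (mg/ml)": 1,
}


def working_concentrations(stock, stock_strength=10, working_strength=1):
    """Scale each component from the stock strength to the working strength."""
    return {
        name: amount * working_strength / stock_strength
        for name, amount in stock.items()
    }


one_x = working_concentrations(PFU_BUFFER_10X)
```

On this basis a reaction at 1× strength contains 20 mM Tris-HCl and 2 mM MgSO4.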
Reactions are undertaken using a “touchdown” protocol as follows:
Touch-down PCR reaction consists of 30 cycles as follows:
These reaction conditions are robust. If necessary individual reactions can be optimised by:
Altering touch down and annealing conditions, e.g., use temperature ranges of 65° C./60° C., 60° C./55° C. and 55° C./50° C.
Adding MgCl2, e.g., an extra 0.5 mM MgCl2 can dramatically affect PCR yields.
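The annealing temperatures of a touchdown programme spanning one of the temperature ranges mentioned above can be tabulated as in the following sketch. The number of cycles over which the temperature is lowered (ten here) and the linear step size are assumptions chosen for illustration; only the 30-cycle total and the example temperature ranges are taken from the text.

```python
# Illustrative sketch: the 10-cycle ramp and linear step are assumptions.
def touchdown_schedule(start_c, end_c, total_cycles=30, touchdown_cycles=10):
    """Annealing temperature (deg C) for each cycle of a touchdown PCR.

    The temperature falls linearly from `start_c` during the touchdown
    cycles and is then held at `end_c` for the remaining cycles.
    """
    if not 0 < touchdown_cycles <= total_cycles:
        raise ValueError("touchdown_cycles must be between 1 and total_cycles")
    step = (start_c - end_c) / touchdown_cycles
    ramp = [round(start_c - step * i, 2) for i in range(touchdown_cycles)]
    hold = [float(end_c)] * (total_cycles - touchdown_cycles)
    return ramp + hold


schedule = touchdown_schedule(65, 60)  # the 65 deg C / 60 deg C range
```

The same function can be called with 60 and 55, or 55 and 50, to tabulate the alternative ranges when optimising an individual reaction.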
PCR products are circularised using T4 DNA ligase, using a quick ligation kit according to the manufacturer's (New England Biolabs) instructions.
For Quick Ligation:
Bacteria are then transformed using standard protocols and transformed cells selected on ampicillin, since the pU6.EGFP-A hp construct encodes ampicillin resistance.
Transformed colonies were analysed using a standard “colony cracking” procedure, in which plasmids in individual colonies were amplified using M13 Forward and Reverse primers. The resultant reactions were analysed using agarose gel electrophoresis. In this instance plasmids containing the correct insert gave a larger product, since the GLUT4 hairpin was longer than the hairpin sequence in the substrate plasmid. In this example 8 colonies were analysed by colony cracking and 6 gave the correct size band. Plasmids from 3 colonies were sequenced and one gave the correct product, which was designated pU6.GA.
Both covalently closed circular and linearised templates can be used to construct hairpin plasmids in this fashion. Background levels are lower when linear templates are used. For U6 constructs the preferred template is pU6.GA hp cut with Bsm BI, which linearises within the loop region of the construct. Treatment with shrimp alkaline phosphatase (SAP) further reduces background.
The length of an inverted repeat within a plasmid was increased as shown in
The relative positions and sequences of the forward and reverse primers (SEQ ID NOS: 233 and 235) are indicated as in
2.4 Insertion of an Intron into a Cloned Sequence
A mouse IgE3 intron was inserted into a cloned sequence of the EGFP gene.
The reverse and forward primers were designed to contain a 3′ fixing part homologous to sequential sequences located in the EGFP gene. The 5′ effector sequences of each primer contained approximately half of the sequence of intron 3 from the mouse IgE3 gene.
The forward and reverse primers used in this reaction were:
The substrate plasmid was amplified and ligated, and bacterial colonies were obtained and analysed as described above. In this fashion a functional intron (intron 3 from the mouse IgE gene) was inserted into the coding sequences of the EGFP gene in a single step.
This procedure is shown in
The relative positions and sequences of the forward and reverse primers (SEQ ID NOS: 236 and 238) are indicated as in
Most constructs described in this application were prepared using the long range PCR strategy described above. The plasmid pU6.cass lin (
To prepare pU6.cass lin, Bsm BI linearised pU6.GA was amplified using the following primers:
A clone of the predicted sequence was isolated; it contains a polylinker with three unique restriction sites (BmgB I, Bgl II and Bsm I) which can be used to linearise the vector prior to long range PCR amplification to reduce background (
The constructs used in these experiments are described in Table 1. Conventional single hp DNA constructs were used as controls. Double hairpin constructs were prepared and their activity was compared to the control constructs. The control constructs targeted a single gene, the test constructs (“double hairpin” constructs) targeted two genes, using one sequence at the “base” of the hairpin sequence (furthest from the loop), and a second sequence near the loop of the hairpin structure (the “top” of the hairpin). This terminology can extend to triple, quadruple, etc. hairpins with 3, 4, etc. duplex sequences. The activity of constructs where sequences targeting the Renilla luciferase gene were located at the base or the top of a double hairpin RNA was compared with the activity of a single construct targeting only Renilla luciferase. Using this method, the ability of a construct to target two genes can be reliably inferred. This can optionally be confirmed by determining the activity of a single construct against both target genes.
Renilla luciferase (positive control
a: mRNA targeted for inactivation
The test constructs were prepared as follows.
pU6.Rluc hp
This construct was designed to target Renilla luciferase mRNA, present in HeLa cells stably transformed with a construct designed to express Renilla luciferase. The construct was prepared using the long range PCR strategy described above using Bgl II linearised pU6.cass lin as a substrate; this was amplified with Pfu Turbo polymerase (Stratagene) using the primers:
A map of this construct is shown in
pU6.Rluc/ACTB TTA
This construct tested whether a construct carrying a UUA bubble sequence was capable of inactivating two mRNAs, namely a Renilla luciferase transgene and β actin. The construct was prepared using the plasmid pU6.cass lin (
A map of this construct is shown in
pU6.Rluc/ACTB TTAG hp
This construct tested whether a construct carrying a UUAG bubble sequence could inactivate two mRNAs, namely a Renilla luciferase transgene and β actin. The construct was prepared by annealing the following nucleotides:
To prepare this construct an oligo assembly strategy was used. Each oligonucleotide was resuspended at 1 µg/ml in water and 1 µl of each was added together, to create a final volume of 100 µl, containing 0.5× strength Buffer M (Roche; 10× Buffer M is 100 mM Tris-HCl (pH 7.5), 100 mM MgCl2, 500 mM NaCl, 10 mM DTE). The mixture was heated to 95° C., then the oligonucleotides were annealed by cooling to 30° C. at 1° C. per minute; these manipulations were performed in a Corbett Palm-Cycler PCR machine (Corbett Research). 20 µl of annealed oligonucleotides were then treated with T4 polynucleotide kinase according to the manufacturer's (Promega) protocol. The annealed oligonucleotides were then purified using a Qiagen PCR purification column, according to the manufacturer's (Qiagen) protocol. 2 µl of eluted oligonucleotides (from 28 µl of eluted material) were then ligated to approximately 100 ng of BsmB I/Hind III Shrimp Alkaline Phosphatase (SAP: Promega) treated pU6.cass, prepared using procedures well known to those familiar with the art. Colonies containing the appropriate sequences were then isolated, and the sequence of the construct was confirmed using well known sequencing protocols.
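The quantities implied by the annealing procedure above can be worked through as in the following sketch. The function names are hypothetical; the input values (95° C. to 30° C. at 1° C. per minute, 0.5× buffer in a 100 µl volume from a 10× stock, and 2 µl taken from 28 µl of eluate) are those stated in the preceding paragraph.

```python
# Illustrative sketch: inputs are the values stated in the procedure.
def ramp_minutes(start_c, end_c, rate_c_per_min):
    """Time needed to move between two temperatures at a constant rate."""
    return abs(start_c - end_c) / rate_c_per_min


def stock_volume_ul(final_volume_ul, final_strength, stock_strength):
    """Volume of concentrated stock needed for the stated final strength."""
    return final_volume_ul * final_strength / stock_strength


cooling_time = ramp_minutes(95, 30, 1)          # minutes spent cooling
buffer_m_volume = stock_volume_ul(100, 0.5, 10)  # microlitres of 10x Buffer M
fraction_ligated = 2 / 28                        # share of the eluate ligated
```

Under these figures the cooling step takes 65 minutes, 5 µl of 10× Buffer M is required, and roughly 7% of the purified material is used in the ligation.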
A map of this construct is shown in
pU6.ACTB/Rluc TTA
This construct tests whether a construct carrying a UUA bubble sequence inactivates two mRNAs, namely β actin and a Renilla luciferase transgene. The construct is prepared using the plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.ACTB/Rluc TTAG
This construct tests whether a construct carrying a UUAG bubble sequence is capable of inactivating two mRNAs, namely β actin and a Renilla luciferase transgene. The construct is prepared using the plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
Four constructs were prepared targeting Renilla luciferase and a variety of other genes. Three constructs contain Renilla luciferase-targeting sequences at three different positions, respectively, within the hairpin RNA, namely the base, middle and top of the hairpin RNAs and contain the UUAG bubble sequence. The constructs are outlined in Table 2. A fifth construct acted as a negative control.
Renilla luciferase
Renilla luciferase
Renilla luciferase
Renilla luciferase
a: mRNA targeted for inactivation
The constructs were prepared mainly using the long range PCR strategy described above.
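The construct names used in these examples list the targeted sequences in order from the base of the predicted hairpin RNA to the top. The following sketch shows one way of reading the position of a given target from a construct name. The parsing rules (a plasmid prefix ending at the first full stop, targets separated by “/”, and any trailing annotation such as “hp” ignored) are assumptions inferred from the names in this section, and the function names are hypothetical.

```python
# Illustrative sketch: parsing rules are inferred from the construct names.
def targets_base_to_top(construct_name):
    """Return the targets named in a construct, ordered from base to top."""
    body = construct_name.split(".", 1)[1]  # drop the plasmid prefix, e.g. pU6
    return body.split()[0].split("/")       # ignore trailing notes such as hp


def position_of(construct_name, target):
    """Describe where `target` sits within the predicted hairpin RNA."""
    targets = targets_base_to_top(construct_name)
    index = targets.index(target)
    if len(targets) == 1:
        return "only duplex"
    if index == 0:
        return "base"
    if index == len(targets) - 1:
        return "top"
    return "position %d of %d from the base" % (index + 1, len(targets))
```

For example, `position_of("pU6.ACTB/AD1/Rluc hp", "Rluc")` returns `"top"`, matching the placement of the Renilla luciferase-targeting sequence in that construct.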
pU6.Rluc/ACTB/AD1
This construct tested whether a construct carrying sequences targeting a Renilla luciferase transgene in the base of the predicted hairpin RNA inactivated Renilla luciferase. The construct was prepared using linearised plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.ACTB/Rluc/AD1 hp
This construct tested whether a construct carrying sequences targeting a Renilla luciferase transgene in the middle of the predicted hairpin RNA inactivated Renilla luciferase. The construct was prepared using linearised plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.ACTB/AD1/Rluc hp
This construct tested whether a construct carrying sequences targeting a Renilla luciferase transgene at the top of the predicted hairpin RNA inactivated Renilla luciferase. The construct was prepared using the plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.ACTB/AD1/GFP hp
This construct acted as a negative control for the three previous constructs. The construct was prepared using linearised plasmid pU6.cass lin as a substrate by amplifying with the two primers:
A map of this construct is shown in
To test constructs targeting four separate genes, five constructs may be prepared targeting Renilla luciferase and a variety of other genes. The four constructs each contain a sequence targeting Renilla luciferase at one of four possible positions within the predicted hairpin RNA, namely the base, next to the base, next to the top and top of the hairpin RNAs. The hairpin RNAs further contain the UUAG bubble sequence separating the various components. The constructs are outlined in Table 3.
Renilla luciferase
Renilla luciferase
Renilla luciferase
Renilla luciferase
a: mRNA targeted for inactivation
The constructs may be prepared using the long range PCR strategy described above.
pU6.Rluc/ACTB/AD1/GFP hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in the base of the predicted hairpin RNA inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.Rluc/ACTB/AD1 hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/Rluc/AD1/GFP hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in the second position from the base of the predicted hairpin RNA inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.ACTB/Rluc/AD1 hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/AD1/Rluc/GFP hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in the third position from the base of the predicted hairpin RNA inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.ACTB/AD1/Rluc hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/AD1/GFP/Rluc hp
This construct tested whether a construct carrying sequences targeting a Renilla luciferase transgene adjacent to the loop of the predicted hairpin RNA inactivated Renilla luciferase. The construct was prepared by annealing the following oligonucleotides:
The oligonucleotides were annealed together and treated with T4 PNK according to the manufacturer's (Promega) protocol, and the resultant mixture was cloned into BsmB I/Hind III cleaved pU6.cass that had been treated with SAP as described above.
A map of the construct is shown in
pU6.ACTB/AD1/GFP/HER2 hp
This construct acts as a negative control for the four previous constructs. The construct is prepared using the plasmid pU6.ACTB/AD1/GFP hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
To test constructs targeting five separate genes, six constructs may be prepared targeting Renilla luciferase and a variety of other genes. Each of the five constructs contains a sequence targeting Renilla luciferase at one of five possible positions within the predicted hairpin RNA, namely the base, all positions from next to the base to next to the top and the top of the hairpin RNAs. The hairpin RNAs further contain the UUAG bubble sequence separating the various components. The constructs are outlined in Table 4.
Renilla luciferase
Renilla luciferase
Renilla luciferase
Renilla luciferase
Renilla luciferase
ᵃmRNA targeted for inactivation
The constructs may be prepared using the long range PCR strategy described above.
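The positional series described for the four-target and five-target sets follows a simple pattern: the Renilla luciferase (Rluc) sequence is moved through each position from the base to the top of the hairpin, while the order of the remaining targets is unchanged. The following Python sketch enumerates the construct names for such a series. The function names `positional_series` and `control_name` are hypothetical and introduced only for illustration; the naming pattern is inferred from the construct names listed in this section.

```python
def positional_series(other_targets, reporter="Rluc", prefix="pU6."):
    """Name one construct per reporter position, listed from hairpin base to top.

    other_targets: remaining targets in base-to-top order, e.g. ["ACTB", "AD1", "GFP"].
    The naming pattern is inferred from the construct names in the text.
    """
    others = list(other_targets)
    names = []
    for position in range(len(others) + 1):
        # Insert the reporter at this position, keeping the other targets in order.
        order = others[:position] + [reporter] + others[position:]
        names.append(prefix + "/".join(order) + " hp")
    return names


def control_name(other_targets, control_target, prefix="pU6."):
    """Name the control construct, where an unrelated target takes the top position."""
    return prefix + "/".join(list(other_targets) + [control_target]) + " hp"


if __name__ == "__main__":
    for name in positional_series(["ACTB", "AD1", "GFP"]):
        print(name)
    print(control_name(["ACTB", "AD1", "GFP"], "HER2"))
```

Calling `positional_series(["ACTB", "AD1", "GFP", "HER2"])` gives the corresponding five-target series, and `control_name(["ACTB", "AD1", "GFP", "HER2"], "Lam")` gives the name of its control.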
pU6.Rluc/ACTB/AD1/GFP/HER2 hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in the base of the predicted hairpin RNA inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.Rluc/ACTB/AD1/GFP hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/Rluc/AD1/GFP/HER2 hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in position two, one position up from the base of the predicted hairpin RNA, inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.ACTB/Rluc/AD1/GFP hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/AD1/Rluc/GFP/HER2 hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in position three, in the middle of the predicted hairpin RNA, inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.ACTB/AD1/Rluc/GFP hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/AD1/GFP/Rluc/HER2 hp
This construct tests whether a construct carrying sequences targeting a Renilla luciferase transgene in position four, one back from the loop sequence of the predicted hairpin RNA, inactivates Renilla luciferase. The construct is prepared using the plasmid pU6.ACTB/AD1/GFP/Rluc hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
pU6.ACTB/AD1/GFP/HER2/Rluc hp
This construct tested whether a construct carrying sequences targeting a Renilla luciferase transgene in position five, adjacent to the loop sequence of the predicted hairpin RNA, inactivated Renilla luciferase. The construct was prepared by annealing the following oligonucleotides:
The oligonucleotides were annealed together and treated with T4 PNK according to the manufacturer's (Promega) protocol, and the resultant mixture was cloned into BsmB I/Hind III cleaved pU6.cass that had been treated with SAP as described above.
A map of the construct is shown in
pU6.ACTB/AD1/GFP/HER2/Lam hp
This construct acts as a control for the five previous constructs. The construct is prepared using the plasmid pU6.ACTB/AD1/GFP/HER2 hp as a substrate by amplifying with the two primers:
A map of the construct is shown in
To test the activity of constructs targeting Renilla luciferase, plasmids were prepared using Qiagen columns according to the manufacturer's protocol. Plasmid DNAs were then transfected into HeLa cells that had previously been stably transformed with the construct pHRLSV40 (Promega). Stable transformation was achieved by co-transfection of pHRLSV40 with a selectable marker plasmid encoding hygromycin resistance; techniques to obtain such stably transformed cells are well known to those familiar with the art.
3,000 cells (as determined by haemocytometer count) were plated into each well of 96-well tissue culture plates (Costar) and incubated overnight in 100 µl of DMEM media (Gibco) supplemented with 10% heat-inactivated FBS (Gibco). To transfect cells, each well was treated as follows:
0.2 µl or 0.3 µl LT1 transfection reagent (Mirus Corp.) was added to 25 µl serum-free media (DMEM) and incubated for 10 mins at room temperature.
100 ng of DNA was added to this mixture and complex formation was allowed to proceed for a further 10 mins at room temperature. The entire mixture was then added to a well of transgenic HeLa cells.
Cells were incubated at 37° C. overnight, then the media was removed, 100 µl fresh DMEM 10% FBS was added and incubation was continued.
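When several replicate wells receive the same plasmid, the per-well amounts given above can be scaled into a single mixture. The following Python sketch shows that arithmetic. The function name `transfection_master_mix` and the `overage` allowance for pipetting loss are assumptions introduced for illustration and are not part of the protocol described above.

```python
def transfection_master_mix(n_wells, lt1_ul_per_well=0.2, overage=0.10):
    """Scale the per-well transfection recipe to n_wells replicate wells.

    Per-well amounts taken from the text: 0.2 or 0.3 ul LT1 reagent,
    25 ul serum-free DMEM and 100 ng plasmid DNA.
    `overage` is a hypothetical fractional allowance for pipetting loss.
    """
    if n_wells < 1:
        raise ValueError("n_wells must be at least 1")
    if lt1_ul_per_well not in (0.2, 0.3):
        raise ValueError("the text describes 0.2 ul or 0.3 ul LT1 per well")
    if overage < 0:
        raise ValueError("overage must not be negative")
    scale = n_wells * (1.0 + overage)
    return {
        "lt1_ul": lt1_ul_per_well * scale,
        "serum_free_dmem_ul": 25.0 * scale,
        "dna_ng": 100.0 * scale,
    }


if __name__ == "__main__":
    # Eight replicate wells of one construct, with the assumed 10% overage.
    for component, amount in transfection_master_mix(8).items():
        print(f"{component}: {amount:.2f}")
```

Because each construct is a different plasmid, a mixture of this kind applies only to replicate wells of the same construct.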
To determine Renilla luciferase activity, media was removed and fresh media containing EnduRen was added according to the manufacturer's (Promega) protocol; cells were then incubated for 5 hrs. Renilla luciferase activity was determined using a Veritas Microplate Luminometer according to the manufacturer's (Turner Biosystems) protocols. These values were then corrected for relative cell numbers, which were determined using CellTiter-Glo reagent according to the manufacturer's (Promega) protocols, again using the Veritas Microplate Luminometer.
In this fashion the relative activity of individual constructs could be easily and accurately determined; moreover, these activities were then corrected using appropriate negative controls, typically pU6.cass, to determine the relative activities of constructs.
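The two corrections described above, first for relative cell number and then against a negative control, can be expressed as a ratio of ratios. The following Python sketch is one way to write this, assuming that both readings are raw luminometer values from the same well and that the correction is a simple division; the function name `relative_rluc_activity` and the example readings are hypothetical.

```python
def relative_rluc_activity(rluc, cell_signal, control_rluc, control_cell_signal):
    """Return Renilla luciferase activity relative to a negative control.

    rluc, control_rluc: Renilla luciferase luminescence readings.
    cell_signal, control_cell_signal: CellTiter-Glo readings used as a
    proxy for relative cell number (assumed proportional to cell number).
    A result of 1.0 means no change relative to the control (e.g. pU6.cass).
    """
    if cell_signal <= 0 or control_cell_signal <= 0 or control_rluc <= 0:
        raise ValueError("cell signals and control luciferase reading must be positive")
    corrected = rluc / cell_signal                      # correct for cell number
    control_corrected = control_rluc / control_cell_signal
    return corrected / control_corrected                # express relative to control


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    print(relative_rluc_activity(500.0, 1000.0, 2000.0, 1000.0))
```

With these illustrative readings the test construct retains a quarter of the control activity.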
8. Inactivating Two Genes with a Single Construct
To demonstrate that the above strategy can be used to inactivate two endogenous genes a single construct was prepared targeting the Akt1 (site a) and Akt2 (sites a and b) genes. This construct, pU6.GF-2, was designed to inactivate two genes, Akt1 and Akt2; sequences were designed based on the data of Jiang Z Y, Zhou Q L, Coleman K A, Chouinard M, Boese Q, Czech M P (2003). Insulin signalling through Akt/protein kinase B analysed by small interfering RNA-mediated gene silencing. Proc Natl Acad Sci USA. 100(13):7569-74. pU6.GF-2 was prepared using the long range PCR strategy described above using Bgl II linearised pU6.cass lin as a substrate; this was amplified with Pfu Turbo polymerase (Stratagene) using the primers:
A map of the construct is shown in
To assay the activity of this construct C2C12 cells were transfected with the plasmid using Lipofectamine 2000 according to the manufacturer's (Invitrogen) protocols. After 48 hrs total proteins were isolated and Akt1 and Akt2 protein levels were determined using Western blots; blots were also probed with a control antibody to ensure even loading. Procedures for these experiments are well known to those familiar with the art.
To demonstrate that this approach can be used to increase the activity of constructs, the plasmid pU6.GG-4 was prepared. This construct targets the Akt2 gene; target site selection was based on the data of Jiang et al (2003) cited above. Two sites (“a” and “b”) within the Akt2 gene were targeted, and the activity of pU6.GG-4 was compared to that of the two single hp constructs targeting each site; these single hp constructs were named pU6.GG-2 and pU6.GG-3.
The construct pU6.GG-2 was prepared using the long range PCR strategy described above using Bgl II linearised pU6.cass lin as a substrate; this was amplified with Pfu Turbo polymerase (Stratagene) using the primers:
A map of the construct is shown in
The construct pU6.GG-3 was prepared using the long range PCR strategy described above using Bgl II linearised pU6.cass lin as a substrate; this was amplified with Pfu Turbo polymerase (Stratagene) using the primers:
A map of the construct is shown in
The construct pU6.GG-4 was prepared using the long range PCR strategy as described above using Bgl II linearised pU6.cass lin as a substrate; this was amplified with Pfu Turbo polymerase (Stratagene) using the primers:
A map of the construct is shown in
To assay the activity of these constructs C2C12 myoblasts were transfected with the constructs and Akt2 protein levels were determined using quantitative Western blots as described above.
The results of these experiments are shown in
In this example DNA constructs were prepared which were targeted to inactivating the ADAR 1 and ADAR 2 genes (these are also sometimes known as ADARA and ADARB respectively).
Two plasmids, pU6.ACTB-A hp (
A similar strategy to that described above for pU6.ACTB-Ahp was used to prepare the construct pU6.ACTB-A48 hp. In this instance eight oligonucleotides were annealed, namely
Four partially complementary pairs of oligonucleotides (ACTB48-9 and ACTB48-10, ACTB48-3 and ACTB48-4, ACTB48-5 and ACTB48-6, and ACTB48-11 and ACTB48-12) were annealed, and the annealed pairs were themselves annealed through two further cycles of annealing to produce a double-stranded DNA structure (SEQ ID NO: 264) compatible with cloning into BsmB I/Hind III digested pU6.cass as shown diagrammatically in
The constructs used in these experiments are described in Table 5. Conventional single hp DNA constructs were used as controls. Double hairpin constructs were prepared and their activity was compared to the control constructs. The control constructs targeted a single gene, whereas the test constructs (“double hairpin” constructs) targeted two genes, one gene targeted by the base of the hairpin sequence and the second by sequences near the loop of the hairpin structure.
ᵃmRNA targeted for inactivation
The test constructs were prepared as follows.
pU6.AD1-A
This construct was designed to target ADAR 1 mRNA for inactivation at the ADAR 1 A site and acted as a control for the double hairpin constructs which all targeted ADAR1 mRNA at the A site with sequences located at the base of the hairpin (SEQ ID NO: 265). The construct was prepared using the long range PCR strategy described above. The plasmid pU6.ACTB-A hp was used as a substrate, this was amplified using Pfu Turbo polymerase (Stratagene) with the primers:
A map of this construct is shown in
pU6.AD2-C
This construct was designed to target ADAR 2 mRNA for inactivation at the ADAR 2 C site and acted as a control for the double hairpin constructs (SEQ ID NO: 266), which all targeted ADAR 2 mRNA at a different site. The construct was prepared using the long range PCR strategy described above. The plasmid pU6.ACTB-A hp was used as a substrate; this was amplified using Pfu Turbo polymerase (Stratagene) with the primers:
A map of this construct is shown in
pU6.AD2-A
This construct was designed to target ADAR 2 mRNA for inactivation at the ADAR 2 A site, and acted as a control for the double hairpin constructs which all targeted ADAR 2 mRNA at the A site with sequences located near the loop of the hairpin structure (SEQ ID NO: 267). The construct was prepared using the long range PCR strategy described above. The plasmid pU6.ACTB-A hp was used as a substrate, this was amplified using Pfu Turbo polymerase (Stratagene) with the primers:
A map of this construct is shown in
pU6.AD1/2-B
This construct was designed to target both ADAR 1 and ADAR 2 mRNA for inactivation at the ADAR 1 B site and the ADAR 2 B site. Both ADAR 1 mRNA and ADAR 2 mRNA contain this site, both mRNAs were therefore potentially inactivated by a single hairpin element within the construct (SEQ ID NO: 268). This construct acted as a control for the double hairpin constructs which all targeted ADAR 1 and/or ADAR 2 mRNAs at different sites. The construct was prepared using the long range PCR strategy described above. The plasmid pU6.ACTB-A hp was used as a substrate, this was amplified using Pfu Turbo polymerase (Stratagene) using the primers:
A map of this construct is shown in
pU6.AD1&2-A/UU
This double hairpin construct was designed to inactivate ADAR 1 mRNA at the ADAR 1 A site with sequences at the base of the hairpin DNA construct, and ADAR 2 mRNA at the ADAR 2 A site with sequences near the loop of the double hairpin structure (SEQ ID NO: 269). The two structural elements were separated by a two nucleotide “bubble” sequence UU (
A map of this construct is shown in
pU6.AD1&2-A/UUA
This double hairpin construct was designed to inactivate ADAR 1 mRNA at the ADAR 1 A site with sequences at the base of the hairpin DNA construct, and ADAR 2 mRNA at the ADAR 2 A site with sequences near the loop of the double hairpin structure (SEQ ID NO: 270). The two structural elements were separated by a three nucleotide “bubble” sequence UUA (
A map of this construct is shown in
pU6.AD1&2-A/UUACAA
This double hairpin construct was designed to inactivate ADAR 1 mRNA at the ADAR 1 A site with sequences at the base of the hairpin DNA construct, and ADAR 2 mRNA at the ADAR 2 A site with sequences near the loop of the double hairpin structure (SEQ ID NO: 271). The two structural elements were separated by a six nucleotide “bubble” sequence UUACAA (
A map of this construct is shown in
The two hairpin DNA constructs, pU6.ACTB-A hp (
As an additional control, the effect of an siRNA targeting the ACTB-A site was tested using RNA transcribed from T7 promoters. The siRNA was termed siACTB-A and the DNA encoding this siRNA was prepared using the oligonucleotides:
Five further constructs, targeting both β actin and ADAR 1, were prepared as outlined in Table 6, illustrating other embodiments of the invention.
ᵃACTB-A site corresponds to positions 1045-1065 of NM_001101.
The constructs were prepared using the long range PCR strategy described above.
pU6.ACTB-A/UUA
This construct was designed to test whether a UUA bubble sequence (SEQ ID NO: 272) will enhance the activity of a single hairpin DNA construct having the sequence of ACTB-A (β actin). The construct is prepared using the plasmid pU6.ACTB-A hp as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.AD1-A&ACTB-A/UU
This construct was designed to test whether a construct carrying a UU bubble sequence (SEQ ID NO: 273) was capable of inactivating two mRNAs, namely ADAR 1 and β actin. The construct was prepared using the plasmid pU6.AD1&2-A/UU as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.AD1-A&ACTB-A/UUA
This construct was designed to test whether a construct carrying a UUA bubble sequence (SEQ ID NO: 274) was capable of inactivating two mRNAs, namely ADAR 1 and β actin. The construct was prepared using the plasmid pU6.AD1&2-A/UU as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.AD1-A&ACTB-A/UUAG
This construct was designed to test whether a construct carrying a UUAG bubble sequence (SEQ ID NO: 275) was capable of inactivating two mRNAs, namely ADAR 1 and β actin. The construct was prepared using the plasmid pU6.AD1&2-A/UU as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.AD1-A&ACTB-A/UUACAA
This construct was designed to test whether a construct carrying a UUACAA bubble sequence (SEQ ID NO: 276) was capable of inactivating two mRNAs, namely ADAR 1 and β actin. The construct was prepared using the plasmid pU6.AD1&2-A/UU as a substrate by amplifying with the two primers:
A map of this construct is shown in
pU6.ACTB-A&AD1-A/UUA
This construct was designed to test whether a construct carrying a UUA bubble sequence (SEQ ID NO: 277) was capable of inactivating two mRNAs, namely β actin and ADAR 1. The construct differed from the construct pU6.AD1-A&ACTB-A/UUA in that the relative positions of the AD1-A and ACTB-A differed. The construct was prepared using the plasmid pU6.ACTB-A hp as a substrate by amplifying with the two primers:
A map of this construct is shown in
HeLa cells were grown and maintained in tissue culture using known procedures. To transfect HeLa cells, 200,000 cells were plated in each well of a 6 well tissue culture plate. After overnight incubation, cells were transfected with either siRNAs or plasmid DNAs. siRNAs were transfected using Oligofectamine according to the manufacturer's (Invitrogen) protocol. Plasmid DNAs were transfected into cells using PolyFect according to the manufacturer's (Qiagen) protocol. Cells were incubated for 48 hrs following transfection and total RNAs were isolated for analysis of ADAR 1, ADAR 2 and/or β actin mRNA levels.
Total RNAs were prepared using QIAGEN RNeasy mini columns according to the manufacturer's protocol. To remove DNA contamination, samples were treated with DNase according to the manufacturer's (Qiagen) protocol. Poly A+ RNA was prepared using DYNAL Dynabeads® mRNA DIRECT™ Micro Kit according to the manufacturer's (DYNAL) protocol. Levels of ADAR 1 and ADAR 2 mRNAs were determined using Quantitative Real Time PCR assays. Three duplicate assays were performed for each RNA sample using SYBR green incorporation to determine relative mRNA levels. The reactions and analyses were performed using procedures widely known to those skilled in the art.
Quantitative Northern blot analyses were used to determine levels of β actin mRNA, and thereby quantify β actin inactivation. Northern blots of total RNAs isolated from cells were probed with a fragment specific to the 3′ UTR of β actin mRNA, prepared using PCR of total HeLa cell RNA, and the degree of hybridization quantified using a phosphoimager. To correct for unequal loading, Northern filters were stripped then reprobed with a PCR fragment corresponding to human GAPDH and the degree of hybridization also quantified using a phosphoimager. β actin mRNA levels in individual RNA samples were then normalized to GAPDH levels and the relative levels of β actin between experimental treatments were determined. The methodologies and procedures used for these analyses are widely known to those skilled in the art.
To define sequences that may increase the activity of double and higher order hairpin constructs, a series of libraries are prepared containing randomised sequences in regions of the hp RNAs that might be predicted to be sites for Dicer processing.
The base construct for these experiments is the construct pU6.GR-21. This was prepared using the oligonucleotide annealing strategy described above, using the primers:
A map of the construct is shown in
The library constructs described below all contain identical sequences targeting Renilla luciferase at the top position of the double hairpin construct. By comparing the activity of individual clones from the library against Renilla luciferase, determined as described above, with the activity of pU6.GR-21, bubble sequences showing enhanced activity might be identified. Based on such data, generalised design rules to enhance the activity of double, and higher order, hairpin constructs may be developed.
pU6.GR-21-1-2N
This construct series was prepared using the oligonucleotide assembly strategy described above. Libraries were prepared using the oligonucleotides:
In this instance N denotes any nucleotide. A map of such constructs is shown in
pU6.GR-21-4-2N
This construct series was prepared using the oligonucleotide assembly strategy described above. Libraries were prepared using the oligonucleotides:
In this instance N denotes any nucleotide. A map of such constructs is shown in
pU6.GR-21-1&4-2N
This construct series may be prepared using the oligonucleotide assembly strategy described above. Libraries may be prepared using the oligonucleotides:
In this instance N denotes any nucleotide. A map of such constructs is shown in
pU6.GR22-1-4N
This construct series may be prepared using the oligonucleotide assembly strategy described above. In this instance random oligonucleotides are not used, rather three nucleotides which are incapable of base pairing in the predicted hpRNA are incorporated synthetically. To generate the constructs, the oligonucleotides GR5-22, GR6-22, GR7 and GR8 are annealed together with:
In this instance, D denotes A,G or T; B denotes C,G or T; H denotes A,C or T and V denotes A,C or G. A map of such constructs is shown in
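The degenerate symbols used in these libraries (N, D, B, H and V) each stand for a defined set of nucleotides, so the number of distinct sequences in a library follows directly from the oligonucleotide design. The following Python sketch expands a degenerate DNA oligonucleotide and also picks the three-base symbol that excludes the Watson-Crick partner of a given opposing base. The function names are hypothetical, only the symbols defined in the text are handled, and G:U wobble pairing in the transcribed RNA is not considered.

```python
from itertools import product

# Degenerate symbols as defined in the text (DNA alphabet).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT", "D": "AGT", "B": "CGT", "H": "ACT", "V": "ACG",
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def expand(oligo):
    """List every concrete sequence encoded by a degenerate oligonucleotide."""
    pools = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combo) for combo in product(*pools)]


def library_size(oligo):
    """Count the concrete sequences without enumerating them."""
    size = 1
    for base in oligo.upper():
        size *= len(IUPAC[base])
    return size


def non_pairing_code(opposite_base):
    """Return the symbol for the three bases that cannot form a
    Watson-Crick pair with the base on the opposite strand."""
    excluded = COMPLEMENT[opposite_base.upper()]
    allowed = "".join(b for b in "ACGT" if b != excluded)
    for code, bases in IUPAC.items():
        if bases == allowed:
            return code
    raise ValueError("no three-base symbol found")


if __name__ == "__main__":
    print(library_size("NN"), expand("D"), non_pairing_code("G"))
```

Under these definitions a two-nucleotide random region (NN) encodes 16 possible sequences, and a four-position design such as DBHV encodes 81.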
pU6.GR22-1-4N
This construct series may be prepared using the oligonucleotide assembly strategy described above. In this instance random oligonucleotides are not used, rather three nucleotides which are incapable of base pairing in the predicted hpRNA are incorporated synthetically. To generate the constructs, the oligonucleotides GR1, GR2-22, GR3-22, GR4, GR7 and GR8 are annealed together with:
In this instance, D denotes A,G or T; B denotes C,G or T; H denotes A,C or T and V denotes A,C or G. A map of such constructs is shown in
This construct series may be prepared using the oligonucleotide assembly strategy described above. In this instance random oligonucleotides may be incorporated to screen for sequences that may augment the optimal AA sequence identified previously. To generate the constructs, the oligonucleotides GR22-1-4N-1, GR2-1-4N-4, GR5-22, GR6-22, GR7 and GR8 are annealed together with:
In this instance N denotes any nucleotide. A map of such constructs is shown in
This construct series may be prepared using the oligonucleotide assembly strategy described above. In this instance random oligonucleotides may be incorporated to screen for sequences that might potentially augment the optimal AA sequence defined previously.
To generate the constructs, the oligonucleotides GR1, GR2-22, GR3-22, GR4, GR7 and GR8 are annealed together with:
In this instance N denotes any nucleotide. A map of such constructs is shown in
This construct series may be prepared using the long range PCR strategy described above. In this instance random oligonucleotides are not used, rather three nucleotides which are incapable of conventional base pairing in the predicted hp RNA are incorporated. The oligonucleotides suitable for use in these experiments are:
In this instance, D denotes A,G or T; B denotes C,G or T, H denotes A,C or T and V denotes A,C or G. A map of such constructs is shown in
Based on the model whereby Dicer processes from the base of an expressed hpRNA, the actual distance (in nucleotides) between Dicer cuts becomes a critical factor in designing multi-constructs to obtain maximum activity, since this “phasing” of Dicer processing will be critical in precisely defining the sequence of effector siRNAs produced from a hpRNA. To determine the optimal phasing, a series of constructs was prepared which were designed to express variable lengths of EGFP effector sequences at the base of a double hairpin construct and constant sequences at the top, targeting Rluc.
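The phasing argument can be illustrated with a small calculation: if cuts are assumed to occur at a fixed interval measured from the base of the stem, changing the length of the base effector shifts where each later cut falls within the top effector. The following Python sketch maps such cuts onto stem segments. The function name `dicer_cut_map`, the fixed 21 nucleotide default interval and the example segment lengths are all assumptions made for illustration and are not values taken from the experiments described here.

```python
from itertools import accumulate


def dicer_cut_map(segment_lengths, step=21):
    """Map evenly spaced cuts, counted from the hairpin base, onto stem segments.

    segment_lengths: lengths (nt) of consecutive stem segments from base to
    top, e.g. [base effector, bubble, top effector].
    step: assumed spacing between successive cuts (hypothetical default).
    Returns a list of (cut_position, segment_index, offset_within_segment).
    """
    if step <= 0:
        raise ValueError("step must be positive")
    if any(length <= 0 for length in segment_lengths):
        raise ValueError("segment lengths must be positive")
    ends = list(accumulate(segment_lengths))  # cumulative end of each segment
    total = ends[-1] if ends else 0
    cuts = []
    for pos in range(step, total + 1, step):
        # First segment whose end lies at or beyond the cut position.
        idx = next(i for i, end in enumerate(ends) if pos <= end)
        start = ends[idx] - segment_lengths[idx]
        cuts.append((pos, idx, pos - start))
    return cuts


if __name__ == "__main__":
    # Hypothetical stem: 19 nt base effector, 4 nt bubble, 21 nt top effector.
    for cut in dicer_cut_map([19, 4, 21]):
        print(cut)
```

In this illustrative case the first assumed cut falls inside the bubble rather than at the end of the base effector, which is the kind of offset a series of variable-length base effectors is designed to explore.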
The constructs were prepared using the oligonucleotide assembly strategy and cloned into BsmBI/Hind III digested pU6.cass as described above. The constructs and oligonucleotides used to prepare the constructs were:
Examples of phasing constructs are shown in
These constructs were transformed into transgenic Rluc-expressing HeLa cells and Rluc activity determined as described above. Results of these experiments are shown in
Plasmid DNAs from randomly picked clones from the pU6.GR-21-1-2N and pU6.GR-21-4-2N libraries were prepared and screened for activity against Rluc in transgenic HeLa cells as described above.
A total of 22 clones from the pU6.GR-21-4-2N library were screened in this fashion. None of these clones showed increased activity (data not shown).
A total of 38 clones from the pU6.GR-21-1-2N library were screened in this fashion. Data from 22 clones are shown in
Upon sequencing it was shown that pU6.GR-21-1-2N-18 had the sequence AA between positions 21 and 22 of the predicted hpRNA.
An alternative approach to inactivating multiple genes is to express multiple transcripts from a single construct. An example of such a construct is shown in
This construct pU6.GF-3 (
A map of a portion of pU6.GL is shown in FIG. 71A; the positions of Sma I and Kpn I cloning sites in the plasmid are also shown. The predicted transcript produced (SEQ ID NO: 297) from this plasmid is shown in
One disease state that may be treated with the multiple target interfering RNA nucleic acid constructs of the present invention is hepatitis C virus (HCV) infection. Based on statistics compiled from the Centers for Disease Control and Prevention, almost 2% of the American population (nearly 4 million people) is currently infected with HCV. Initially, the majority of the individuals infected with HCV exhibit no symptoms; however, greater than 80% will develop chronic and progressive liver disease eventually leading to cirrhosis or hepatocellular carcinomas. HCV is the leading indication for liver transplantation within the United States and results in the death of 8,000 to 10,000 Americans every year. On a global level, the World Health Organization estimates that there are more than 170 million affected individuals, with infection rates as high as 10-30% of the general population in some countries.
HCV is a positive-sense single stranded enveloped RNA virus belonging to the Flaviviridae family. The infectious cycle of HCV typically begins with the entry of the viral particle into the cell by receptor-mediated binding and internalization. After uncoating in the cytoplasm, the positive strand of RNA that comprises the genome can interact directly with the host cell translational machinery. Lacking 5′ cap methylation, the RNA forms an extensive secondary structure in the 5′ untranslated region (UTR) that serves as an internal ribosomal entry site (IRES) and permits the direct binding of the 40S subunit as the initiating step of the translation process.
The HCV genome, approximately 9600 nucleotides in length, encodes a single long open reading frame termed the polyprotein. Viral proteins are produced as linked precursors from the polyprotein which is subsequently cleaved into mature products by a wide variety of viral and cellular enzymes. Encoded amongst the genes are the structural proteins, including the core and envelope glycoproteins, so named because they are integral structural components in progeny virions. Non-structural proteins, which provide indispensable functions such as the RNA dependent RNA polymerase, are also produced. The viral replication machinery is established within the cytoplasm of infected cells that transcribe the positive-sense RNA into a negative strand intermediate. Thus, the HCV genomic RNA serves as both a template for its own replication and as a messenger RNA for translation of the virally encoded proteins. The negative strand is transcribed back into a positive strand of RNA, thereby amplifying the number of positive strand copies within the cell. At this stage, the positive strand can interact with the host cell translational machinery once again or, if there have been enough structural proteins accumulated, be packaged into virions. Following egress from the cell, the virus repeats its infectious cycle.
Although many of the individual steps of HCV replication are understood, until recently there was no tissue culture system that propagated the viral life cycle, making studies of the virus difficult. However, an in vitro replicon system has been developed (see, e.g., U.S. Pat. Nos. 5,585,258; 6,472,180; and 6,127,116 to Rice, et al.). A replicon is an autonomously replicating portion of HCV genomic RNA containing a marker gene for selection and verification of replication. HCV-RNA constructs are transfected into cell lines that are amenable to support continuous propagation. Following the steps of the infectious cycle, the RNA is translated by the cellular machinery, and the viral proteins required for replication of the genome are produced, as is the selectable marker. Full-length and sub-genomic replicons have been generated and shown to be functional, although only the non-structural proteins are obligate. The autonomously replicating properties of the RNA remain independent of expression of the structural genes. Even when present in replicons expressing the full length HCV genome, the core and envelope proteins fail to effectively package the genome into infectious particles, resulting in the loss of a model system to study the packaging, egress and re-entry steps of the virus. Regardless, the replicon is able to recreate a portion of the biology and mechanisms utilized by HCV.
Before the delivery of interfering RNA nucleic acid constructs according to the present invention by infectious particles is tested, the appropriate expression plasmid is constructed and validated. AAV-2 vectors which have been gutted of rep and cap provide the backbone (hereinafter referred to as the rAAV vector) for the viral interfering RNA nucleic acid construct. This vector has been extensively employed in AAV studies and the requirements for efficient packaging are well understood. The U6 and H1 promoters may be used for the expression of interfering RNA according to the present invention, though there have been reports of vastly different levels of inhibition of an identical interfering RNA driven independently by each promoter. However, vector construction is such that promoters can be easily swapped if such variation is seen.
As with virtually any viral delivery system, the rAAV vector must meet certain size criteria in order to be packaged efficiently. In general, an rAAV vector must be 4300-4900 nucleotides in length (McCarty, et al. Gene Ther. 8: 1248-1254 (2001)). When the rAAV vector falls below the limit, a ‘stuffer’ fragment must be added (Muzyczka, et al. Curr. Top. Microbiol. Immunol. 158: 97-129 (1992)). In the AAV vector embodiment described here, one or more selectable marker genes may be engineered into the rAAV interfering RNA nucleic acid construct in order to assess the transfection efficiency of the rAAV interfering RNA nucleic acid construct as well as allow for quantification of transduction efficiency of target cells by the rAAV interfering RNA nucleic acid construct delivered via infectious particles.
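The size criterion above lends itself to a simple check when designing a vector. The following Python sketch takes the 4300-4900 nucleotide window stated in the text and returns the range of stuffer lengths that would bring a shorter vector into that window. The function name `stuffer_range` and the example lengths are hypothetical.

```python
RAAV_MIN_NT = 4300  # lower packaging limit stated in the text
RAAV_MAX_NT = 4900  # upper packaging limit stated in the text


def stuffer_range(vector_length_nt):
    """Return (shortest, longest) stuffer length in nucleotides that places
    the vector inside the packaging window.

    A shortest value of 0 means the vector already meets the lower limit.
    """
    if vector_length_nt < 0:
        raise ValueError("vector length must not be negative")
    if vector_length_nt > RAAV_MAX_NT:
        raise ValueError("vector already exceeds the upper packaging limit")
    shortest = max(0, RAAV_MIN_NT - vector_length_nt)
    longest = RAAV_MAX_NT - vector_length_nt
    return shortest, longest


if __name__ == "__main__":
    # Hypothetical 3000 nt vector (promoter, hairpin cassette and marker).
    print(stuffer_range(3000))
```

For a hypothetical 3000 nucleotide vector the sketch reports that a stuffer of between 1300 and 1900 nucleotides would be needed.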
The initial test expression construct drives expression of interfering RNAs designed from sequences with demonstrated ability to inhibit luciferase activity from a reporter construct (see, Elbashir, et al. Embo. J. 20(23): 6877-6888 (2001)). A commercially available expression plasmid that encodes for the production of luciferase functions as the reporter to verify the ability of the various interfering RNAs to downregulate the target sequences.
Although the interfering RNAs against luciferase have been previously validated, the efficacy of rAAV-delivered interfering RNAs is assessed in vitro prior to testing the construct in vivo. The test and reporter constructs are transfected into permissive cells utilizing standard techniques. An rAAV expression construct in which the luciferase-specific RNAi agent has been replaced by an unrelated RNA sequence is utilized as a negative control in the experiments. The relative percentage of transfection efficiency is estimated directly by assessing the levels of the selective marker using fluorescence microscopy. For assessing inhibitory activity of each different RNAi agent, luciferase activity is measured utilizing standard commercial kits. Alternatively, quantitative real time PCR analysis (Q-PCR) is run on RNA that is harvested and purified from parallel experimental plates. Activity decreases greater than about 70%, relative to the activity recovered in lysates from cells treated with the unrelated RNA species, are an indication that the RNAi agent is functional.
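The functional criterion described above compares activity in treated cells with activity in cells that received the unrelated RNA species. The following Python sketch computes the percentage decrease and applies the threshold of about 70%. The function names and the example readings are hypothetical, and the threshold is exposed as a parameter because the text gives it only approximately.

```python
def percent_decrease(treated_activity, control_activity):
    """Percentage decrease in activity relative to the unrelated-RNA control."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (control_activity - treated_activity) / control_activity


def is_functional(treated_activity, control_activity, threshold_percent=70.0):
    """Apply the 'greater than about 70%' criterion from the text."""
    return percent_decrease(treated_activity, control_activity) > threshold_percent


if __name__ == "__main__":
    # Hypothetical luciferase readings, for illustration only.
    print(percent_decrease(25.0, 100.0), is_functional(25.0, 100.0))
```

The same calculation applies whether the readout is luciferase activity or the Q-PCR measurement described as an alternative.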
Subsequent experiments are performed in order to assess the effects of interfering RNAs on a luciferase reporter system that is transfected into the livers of mice, similar to the work of McCaffrey et al. in Nature, 418: 38-39 (2002). Nucleic acids delivered to mice by hydrodynamic transfection methods (high pressure tail vein injection) localize primarily to the liver. Much like the principle which governs co-transfection in cell culture, simultaneous injection of multiple plasmids from a mixture often permits the penetrance of all of the expression constructs into the same cell. Thus, even though the tail vein injection procedures are well documented to only transfect 5-40% of the hepatocytes within the liver (McCaffrey, et al. Nature Biotech. 21(6): 639-644 (2003)), co-injection permits delivery of the reporter system and the expression construct into the same cells.
The rAAV nucleic acid construct bearing the interfering RNA targeted against luciferase is co-injected with the reporter construct that encodes for the luciferase gene. In animals receiving the negative control, an expression construct bearing an unrelated RNA is co-injected with the reporter construct. After seven days, the mice are sacrificed and the livers harvested. Luciferase activity is measured on lysates generated from a portion of the liver. Remaining portions of the liver are utilized for Q-PCR measurements as well as histological analysis to determine marker protein expression for normalization of the data. Alternative methods to assess transfection efficiency may include ELISA measurements of serum from mice that have been co-injected with a third marker plasmid for a secreted protein such as human α1-antitrypsin (hAAT) (Yant, et al. Nature Genetics. 25: 35-41 (2000), see also McCaffrey, et al. Nature Biotech. 21(6): 639-644 (2003)).
Once it is established that the nucleic acid construct is functional in both in vitro cell culture systems as well as in vivo mouse models by utilizing co-transfection of the naked DNA plasmids, testing is initiated on the rAAV expression construct packaged into infectious particles. The infectious particles are produced from a commercially available AAV helper-free system that requires the co-transfection of three separate expression constructs containing 1) the rAAV nucleic acid construct expressing the interfering RNA against luciferase (flanked by the AAV ITRs); 2) the construct encoding the AAV rep and cap genes; and 3) an expression construct comprising the helper adenovirus genes required for the production of high titer virus. Following standard purification procedures, the viral particles are ready for use in experiments.
Before mice can be infused with the rAAV particles, a reporter system is established in the mouse livers. Hydrodynamic transfection is employed to deliver the luciferase reporter construct as well as an expression plasmid for hAAT to control for differences in transfection efficiencies from animal to animal. The mice are permitted to recover for several days in order to establish sufficient levels of reporter activity. After luciferase reporter activity has been established in the livers, AAV particles are infused into normal C57BL/6 mice either through portal vein or tail vein injection. AAV particles bearing the expression construct of an unrelated RNA are used as a negative control. Initially, the mice are infused with relatively high doses (2×10¹² vector genomes (vg)), which are reduced in follow-up experiments performed to generate dose-response curves. After seven to ten days, the mice are sacrificed, the livers harvested and samples of serum collected. The relative levels of hepatic luciferase activity and RNA are determined from the isolated livers utilizing the luciferase assay and Q-PCR procedures previously described. Additionally, the efficiency of transduction is assessed by measurement of the marker protein in serial slices of the hepatic tissues.
It has been estimated that hydrodynamic transfection procedures may result in the transfection of 5-40% of hepatocytes. Transduction of liver cells by AAV-2 delivery procedures has been shown to result in 5-10% transduction efficiencies. Although AAV may preferentially transduce the same pool of hepatocytes that were transfected by the initial tail vein injection procedure, it is possible that the subsets of cells that each technique affects are non-overlapping. If the former occurs, a reduction in luciferase activity relative to mice transduced with an unrelated interfering RNA is seen. If the latter occurs, then no decrease in luciferase activity is seen.
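The two outcomes described above can be bracketed numerically. The sketch below is illustrative only: it assumes every reporter-bearing cell expresses the same amount of luciferase, it adds an intermediate "independent" scenario that is not discussed in the text, and the per-cell knockdown value is a hypothetical placeholder.

```python
def reporter_cells_hit(transfected: float, transduced: float, scenario: str) -> float:
    """Fraction of reporter-bearing hepatocytes that also receive the AAV vector.

    transfected: fraction of hepatocytes carrying the reporter (0.05-0.40 in the text)
    transduced:  fraction of hepatocytes transduced by AAV (0.05-0.10 in the text)
    """
    for name, value in (("transfected", transfected), ("transduced", transduced)):
        if not 0.0 < value <= 1.0:
            raise ValueError(f"{name} must be in (0, 1]")
    if scenario == "same_pool":        # AAV preferentially enters reporter cells
        return min(1.0, transduced / transfected)
    if scenario == "independent":      # assumption: the two events are unrelated
        return transduced
    if scenario == "non_overlapping":  # the two techniques affect disjoint subsets
        return 0.0
    raise ValueError(f"unknown scenario: {scenario}")


def expected_reduction(transfected: float, transduced: float, scenario: str,
                       per_cell_knockdown: float) -> float:
    """Expected fractional drop in total hepatic luciferase activity."""
    return reporter_cells_hit(transfected, transduced, scenario) * per_cell_knockdown


# Hypothetical per-cell knockdown of 90% in any cell that receives the vector.
for scenario in ("same_pool", "independent", "non_overlapping"):
    print(scenario, round(expected_reduction(0.40, 0.10, scenario, 0.9), 3))
```

Under these assumptions, even a potent interfering RNA produces only a small change in bulk luciferase activity unless most reporter-bearing cells are transduced.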
Although it has been demonstrated that AAV-based vectors can deliver desired sequences to hepatocytes, the relative level of transduction that occurs within those tissues has been rather poor. For current clinical hemophilia studies which employ AAV-2 to deliver and express blood factor IX, this is not a significant issue. For treatment of hemophilia, it is critical only to replenish levels of secreted protein to therapeutic levels. Such replenishment may occur from a small number of transduced cells able to express significant levels of the desired protein. However, because the mechanism of interfering RNA action is intracellular and the effect is not transmitted directly from cell to cell, the transduction efficiency must be increased in order for AAV expressing interfering RNAs to be utilized as a therapeutic.
McCarty et al. were able to generate a self-complementary AAV vector (scAAV) that has both a plus and a minus strand of the same expression cassette within its capsid (Gene Ther. 8: 1248-1254 (2001)). This was achieved by mutating the 5′ ITR and leaving the 3′ ITR intact. By mutating or deleting the terminal resolution site and other non-essential AAV sequences, thus eliminating possible recombination between wild type AAV and this construct, a DNA template is created where replication starts at the 3′ ITR. Once the replication machinery reaches the 5′ ITR, no resolution takes place and replication continues to the 3′ ITR. The resulting product has both a plus and complementary minus strand, yet is efficiently packaged. Employing the scAAV vectors, transduction of liver cells was increased to 30% of the total hepatocytes (Fu, et al. Molec Therapy. 8(6):911-917 (2003)). When delivered intracisternally, more than 50% of the Purkinje cells in the cerebellum were transduced by the scAAV particles. Thomas et al. showed that self-complementary vectors could produce 50-fold higher luciferase transgene expression levels in mouse livers than their corresponding single-stranded AAV counterparts when infused at equivalent doses (Thomas et al., J. Virol (2004), 78:6 p. 3110-3122). Though dropping slightly, the relative difference of expression between the vectors persisted at 20-fold nearly one year after injection.
Other modifications of AAV-delivery systems also have been used to dramatically enhance transduction efficiencies, including the production of pseudotyped viral particles by packaging rAAV-2 vector genomes with the Cap protein from other serotypes. Because they have been among the best characterized of all of the serotypes, the Cap proteins from AAV-1 through AAV-6 are used most commonly to pseudotype the AAV-2 vectors. Even with the advantages gained by employing these pseudotyping strategies, the threshold of transduction efficiency of hepatocytes may be increased only to 15% of the total population. However, dozens of other serotypes of AAV have been isolated and identified, but have not been characterized to any appreciable degree. For example, one of these is AAV-8, which was isolated originally from the heart tissue of a rhesus monkey. In an effort to determine the effects of novel cap proteins on transduction, pseudotyped virus in which the single stranded AAV-2 genome was pseudotyped with AAV-8 cap was created. The vectors carried the LacZ gene to assess the relative efficiency of transduction of mouse livers after infusion with increasing doses of infectious particles. A summary of the results (Thomas, et al. (2004)) is shown below in Table 1:
As the dose of infused control AAV-2/2 particles is increased, there is a modest increase in transduction of hepatocytes; however, the upper threshold of transduction remains entrenched near the 10% limit. Surprisingly, pseudotyped AAV-2/8 particles transduced 8% of hepatocytes at the lowest dose of particles administered, a dose 30- to 80-fold lower than those used for their AAV-2/2 counterparts. Additionally, the transduction efficiency for AAV-2/8 increased in a dose-dependent manner, surpassing that of AAV-2/2 and reaching greater than 97% at the highest dose. Transduction efficiencies within this range enable the efficient delivery of interfering RNA to cells within tissues.
Similar modifications of AAV are engineered into the rAAV interfering RNA nucleic acid constructs. Following incorporation of these simple modifications, stocks of virus are generated for testing in the mouse model system. The following rAAV RNAi experimental virus stocks are tested: single-strand AAV-2/2; single-strand AAV-2/8; self-complementary AAV-2/2; and self-complementary AAV-2/8.
Corresponding viral particles that harbor rAAV vectors expressing unrelated RNA sequences are produced and used as negative controls. Large decreases in relative levels of luciferase activity correlate with increases in transduction efficiency.
Construction of a nucleic acid construct according to the present invention includes two or more individual interfering RNAs under the influence of a single promoter. Initially, assessment of promoter strength of various promoter sequences is conducted in vectors containing the single, individual promoters, driving expression of the same interfering RNA with demonstrated functional inhibition of luciferase activity (Elbashir, et al. Nature. 411: 494-498 (2001a)). Since there is a wealth of data demonstrating the successful utilization of the U6 promoter for the expression of interfering RNAs, it is used as the standard for assessing the relative strength of other promoters. The majority of the promoters that are tested are quite short, most in the range of 200-300 nucleotides in length. Long, overlapping oligonucleotides may be used to assemble the promoters and terminators de novo and are then cloned into multiple cloning sites that flank the sequence encoding the interfering RNA. The promoter is paired with the termination signal that occurs naturally downstream of the gene from which the promoter is taken.
The relative strength of each promoter is assessed in vitro by the decrease in activity of a co-transfected luciferase reporter. The test and reporter constructs are transfected into permissive cells utilizing standard techniques. Controls consist of a test promoter construct in which the sequence encoding the functional interfering RNA against luciferase is replaced by an unrelated RNA sequence. A third marker construct encoding for the secreted protein human α1-antitrypsin (hAAT) is co-transfected into the cells in order to assess for variations in transfection efficiencies. For assessing inhibitory activity of the interfering RNA, luciferase activity is measured utilizing standard commercial kits. The interfering RNA-mediated decrease in luciferase expression, normalized to hAAT levels, is an indirect measurement of promoter strength. Alternatively or in addition, quantitative real time PCR analysis (Q-PCR) on luciferase RNA levels is performed on RNA that is harvested and purified from parallel experimental plates.
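The normalization to hAAT and the comparison against the U6 standard can be sketched as follows. This is an illustrative calculation only: the function names, the promoter label, and all readings are hypothetical, and the ratio to U6 is treated as an indirect indicator rather than a direct measure of transcriptional output.

```python
def normalized_activity(luciferase_rlu: float, haat_level: float) -> float:
    """Luciferase signal corrected for transfection efficiency using the
    co-transfected hAAT marker (any consistent units)."""
    if haat_level <= 0:
        raise ValueError("hAAT level must be positive")
    return luciferase_rlu / haat_level


def normalized_knockdown(test: tuple, control: tuple) -> float:
    """Fractional decrease in hAAT-normalized luciferase activity.
    Each argument is a (luciferase_rlu, haat_level) pair; 'control' is the
    same promoter driving the unrelated RNA sequence."""
    return 1.0 - normalized_activity(*test) / normalized_activity(*control)


def strength_relative_to_u6(promoter_knockdown: float, u6_knockdown: float) -> float:
    """Knockdown achieved by a test promoter expressed as a ratio to U6."""
    if u6_knockdown <= 0:
        raise ValueError("U6 knockdown must be positive")
    return promoter_knockdown / u6_knockdown


# Hypothetical readings: (luciferase RLU, hAAT level).
u6_kd = normalized_knockdown(test=(1.0e4, 50.0), control=(1.0e5, 50.0))
px_kd = normalized_knockdown(test=(4.0e4, 100.0), control=(1.0e5, 50.0))
print(round(u6_kd, 3), round(px_kd, 3),
      round(strength_relative_to_u6(px_kd, u6_kd), 3))
```

In the second example the raw luciferase reading alone would suggest a 60% decrease; dividing by the hAAT level accounts for the higher transfection efficiency in that well.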
It must be verified that the interfering RNA nucleic acid construct of the present invention, when delivered by AAV particles, inhibits the luciferase-HCV fusion reporter in vitro. Permissive tissue culture cells are transfected with one of the reporter constructs described supra. In addition, each co-transfection mixture is supplemented with a plasmid coding for hAAT. Following 48 hours of incubation, cells are dosed with infectious particles harboring the interfering RNA nucleic acid construct against HCV. AAV particles containing a triple promoter construct expressing three unrelated RNAs serve as the negative control. Measurement of luciferase activity is used to verify that the AAV-delivered interfering RNAs are highly functional.
Nucleic acids delivered to mice by hydrodynamic transfection methods (high pressure tail vein injection) localize primarily to the liver; thus, this technique is used to deliver the luciferase-HCV fusions to mouse livers. In order to assess the differences in transfection efficiency from animal to animal, a hAAT expression plasmid is included in the transfection mixture.
Infectious AAV particles containing constructs that express the interfering RNAs targeted against HCV sequences are delivered to normal C57BL/6 mice either by tail vein or hepatic portal vein injection. Infectious AAV particles expressing three unrelated RNAs serve as the negative control. Initially, a fairly high dose of virus, e.g. 2×10¹² vector genomes, is used, though subsequent experiments are performed to establish dose-response curves. After 48-72 hours, the mice are sacrificed, the livers harvested and samples of serum collected. Luciferase activity is used as a benchmark to assess efficacy of the AAV-delivered RNA agents. In addition to monitoring the levels of hAAT, serum levels of the liver enzymes alanine aminotransferase and aspartate aminotransferase, as well as of tumor necrosis factor alpha, are measured by ELISA to ensure general hepatic toxicity is not induced by the treatment.
Hepatitis C virus (HCV) is a small single stranded RNA virus that shows a high degree of sequence variation. The use of multiple constructs targeting HCV and other variable viruses, such as HIV, offers considerable advantages. Specifically, the use of multiple constructs may act to greatly reduce or eliminate the development of ddRNAi-resistant HCV strains. Moreover, as demonstrated by examples above, more active constructs may be obtained.
The construct pU6.HCVx3 hp (
pU6.HCVx3 may be prepared using the oligonucleotide assembly strategy with the following oligonucleotides:
The activity of this construct against HCV may be determined using the assays described above.
The interfering RNA of the present invention may be produced by two constructs in vivo.
pU6.GR22-Antisense is prepared using the oligonucleotides:
These constructs are designed to produce RNAs that are the reverse complement of each other (SEQ ID NOS: 305 and 306). They are predicted to (spontaneously) form double stranded RNA as shown in
The constructs might be tested by co-transfecting the two plasmids into HeLa cells expressing hRluc and hRluc inactivation assayed as described above. Alternatively the two transcriptional units might be combined into a single construct as shown in
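The reverse-complement relationship between the two transcripts described above can be checked computationally before the constructs are assembled. The sketch below is illustrative only; the example sequence is a hypothetical placeholder and is not SEQ ID NO: 305 or 306.

```python
# Watson-Crick pairing table for RNA.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5' to 3'."""
    rna = rna.upper()
    unexpected = set(rna) - set("ACGU")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return rna.translate(RNA_COMPLEMENT)[::-1]


def are_reverse_complements(strand_a: str, strand_b: str) -> bool:
    """True if the two strands can pair along their full length."""
    return reverse_complement(strand_a) == strand_b.upper()


# Hypothetical 19-nt sense strand, used only to demonstrate the check.
sense = "GGAUCCAUGACUUCGAAAG"
antisense = reverse_complement(sense)
print(antisense, are_reverse_complements(sense, antisense))
```

The check covers only perfect, full-length Watson-Crick pairing; it does not model G-U wobble pairs or predict whether the strands anneal efficiently in cells.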
The interfering RNA of the present invention may be produced by two constructs in vitro.
T7 GR22-sense is prepared using the oligonucleotide:
T7 GR22-antisense is prepared using the oligonucleotide:
The two transcripts (SEQ ID NOS: 307 and 308) are predicted to anneal and following the appropriate RNase treatment they will produce the dsRNA shown in
It will be understood that the invention disclosed and defined in this specification extends to all alternative combinations of two or more of the individual features mentioned or evident from the text or drawings. All of these different combinations constitute various alternative aspects of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2003906281 | Nov 2003 | AU | national |
2004901258 | Mar 2004 | AU | national |
2004902279 | Apr 2004 | AU | national |
This application claims benefit of U.S. application Ser. No. 10/861,191, filed Jun. 3, 2004, which claims benefit to U.S. provisional patent application Ser. No. 60/475,827, filed Jun. 3, 2003; U.S. provisional patent application Ser. No. 60/479,616, filed Jun. 17, 2003; Australian patent application serial number 2003906281, filed Nov. 14, 2003; U.S. provisional patent application Ser. No. 60/550,504, filed Mar. 5, 2004; Australian patent application serial number 2004901258, filed Mar. 10, 2004; U.S. provisional patent application Ser. No. 60/553,920, filed Mar. 17, 2004; and Australian patent application serial number 2004902279, filed Apr. 30, 2004; which are herein incorporated by reference.
Number | Date | Country | |
---|---|---|---|
60475827 | Jun 2003 | US | |
60479616 | Jun 2003 | US | |
60550504 | Mar 2004 | US | |
60553920 | Mar 2004 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 10861191 | Jun 2004 | US |
Child | 12914893 | | US |