In some embodiments, therapeutic genome editing tools are provided comprising modified guide RNAs. The modified guide RNAs described herein may improve the stability of the guide RNA and the guide RNA/Cas9 complex and improve the activity of Cas9 (e.g., SpyCas9 and equivalents) to cleave target DNA. In some embodiments, the guide RNA is an sgRNA. In some embodiments, the guide RNA is a dgRNA. In some embodiments, the guide RNA is a tracrRNA. In some embodiments, the guide RNA is a crRNA.
The guide RNAs described herein comprise at least one modified nucleotide. Modifications may include 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), 2′-fluoro (2′-F), phosphorothioate (PS) bond between nucleotides, G-C substitutions, and inverted abasic linkages between nucleotides and equivalents thereof. Embodiments of the invention include:
In some embodiments, a single guide RNA (sgRNA) is encompassed comprising a 5′ end modification and one or more modification in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region, wherein the 5′ end modification comprises at least two phosphorothioate linkages within the first seven nucleotides at the 5′ end of the 5′ terminus. In some instances, the modification is a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification is a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the sgRNA comprises modifications at US1 to US12 and/or a modification at H1-1 and/or a modification in H2-1. In some embodiments, the sgRNA comprises modifications at H1-1 to H1-12 and/or H2-1 to H2-15. In some embodiments, the sgRNA comprises one or more modifications in each of the upper stem region, the hairpin 1 region, and the hairpin 2 region. In some embodiments, the sgRNA comprises a modified nucleotide between hairpin 1 and hairpin 2 regions. In some embodiments, the sgRNA comprises a modification in the lower stem region.
In some embodiments, the sgRNA comprises a modification at the 5′ terminus and/or the 3′ terminus. In some embodiments, the sgRNA comprises a 3′ end modification in the 3′ terminus. In some embodiments, the sgRNA comprises modifications on at least two of the last four nucleotides at the 3′ end of the 3′ terminus. In some embodiments, the sgRNA comprises a 5′ end modification in the 5′ terminus. In some embodiments, the sgRNA comprises modifications on at least two of the first four nucleotides at the 5′ end of the 5′ terminus. In some embodiments, the sgRNA comprises a 3′ end modification in the 3′ terminus and a 5′ end modification in the 5′ terminus. In some embodiments, the sgRNA comprises modifications on at least two of the last four nucleotides at the 3′ end of the 3′ terminus and on at least two of the first four nucleotides at the 5′ end of the 5′ terminus. In some instances, these modifications are 2′-O-Me, 2′-F, 2′-O-moe, or phosphorothioate (PS) bonds linking the nucleotides. In some embodiments, the sgRNA comprises PS bonds between at least two of the last four nucleotides at the 3′ end of the 3′ terminus and/or at least two of the first four nucleotides at the 5′ end of the 5′ terminus. In some instances, the sgRNA comprises 5′ terminus and 3′ terminus with more than one modification as described herein, such as, with PS bonds and 2′-O-Me modifications.
In some embodiments, the sgRNA comprises a modification in the bulge region. In some embodiments, 50% of the nucleotides in the bulge region are modified, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises a modification in the nexus region. In some embodiments, the sgRNA comprises modifications at N15, N16, N17, and/or N18 in the nexus region, wherein the modification is 2′-O-Me or 2′-F. In some instances, N16, N17, and N18 are linked with PS bonds.
In some embodiments, the sgRNA comprises at least the first three nucleotides at the 5′ end of the 5′ terminus, and the last three nucleotides at the 3′ end of the 3′ terminus are modified.
In some embodiments, the sgRNA comprises modifications at the 3′ terminus and/or 5′ terminus. In some instances, the first four nucleotides at the 5′ end of the 5′ terminus, and the last four nucleotides at the 3′ end of the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the 5′ and 3′ modification comprises 2′-O-Me or 2′-O-moe. In some embodiments, the 5′ and 3′ modification comprises 2′-F. In some embodiments, the 5′ and/or 3′ modification comprises PS bonds linking nucleotides. In some embodiments, the 5′ and/or 3′ modification comprises one or more of 2′-O-Me, 2′-O-moe, 2′-F, and PS bonds linking nucleotides.
In some embodiments, the sgRNA comprises modifications at the first four nucleotides at the 5′ end of the 5′ terminus and the last four nucleotides at the 3′ end of the 3′terminus. In some instances, these modifications are linking PS bond (i.e., PS bonds that link the first four and last four nucleotides). In some embodiments, the sgRNA further comprises 2′-O-Me modifications at the first three nucleotides at the 5′ end of the 5′ terminus and the last three nucleotides at the 3′ end of the 3′ terminus.
In some embodiments, the sgRNA comprises modifications at the first four nucleotides at the 5′ end of the 5′ terminus and the last four nucleotides at the 3′ end of the 3′ terminus, wherein the modifications are at least PS bonds linking the four nucleotides, and further wherein the first three nucleotides at the 5′ end of the 5′ terminus and the last three nucleotides at the 3′ end of the 3′ terminus comprise 2′-O-Me, 2′-O-moe, or 2′-F modifications.
In some embodiments, the sgRNA comprises modifications LS1, LS6, LS7, LS8, LS11, and LS12, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises modifications at each of the nucleotides in the bulge region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises modifications at each of the nucleotides in the upper stem region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises modifications at each of the nucleotides in the hairpin 1 region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises modifications at each of the nucleotides in the hairpin 2 region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, an sgRNA is encompassed comprising 2′-O-Me modified nucleotides at the following positions:
In some embodiments, an sgRNA is encompassed comprising 2′-F modified nucleotides at the following positions:
In some embodiments, an sgRNA is encompassed comprising
In some embodiments, an sgRNA is encompassed comprising:
In some embodiments, an sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, an sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising:
In some embodiments, a sgRNA is encompassed comprising the nucleic acids of any of SEQ ID Nos: 228-353, including the modifications of Table 4. In some embodiments, a sgRNA is encompassed comprising any of SEQ ID Nos: 228-332, including the modifications of Table 4. In some embodiments, an sgRNA is encompassed comprising any of SEQ ID Nos: 235-240, 265-285, and 309-329, including the modifications of Table 4. In some embodiments, an sgRNA is encompassed comprising SEQ ID No: 240. In some embodiments, a sgRNA is encompassed comprising SEQ ID No. 240, including the modifications of Table 4. In some embodiments, a sgRNA is encompassed comprising SEQ ID No: 242. In some embodiments, a sgRNA is encompassed comprising SEQ ID No: 358. In additional embodiments, a sgRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the modification at each nucleotide of the sgRNA that corresponds to a nucleotide of the reference sequence identifier in Table 4, is identical to or equivalent to the modification shown in the reference sequence identifier in Table 4, optionally further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ end of the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ end of the 3′ terminus. In some embodiments, the sgRNA further comprises at least three PS bonds linking the nucleotides in the hairpin 1 region. In some embodiments, the sgRNA further comprises at least three PS bonds linking the nucleotides in the hairpin 2 region. In some embodiments, the sgRNA further comprises at least three PS bonds linking the nucleotides in the upper stem region. In some embodiments, the sgRNA forms a ribonucleoprotein complex with S. pyogenes Cas9.
Provided herein are modified guide RNAs, including dual and single guide RNAs for use in gene editing methods. The modified guides are more stable and show improved in vitro and in vivo efficacy as compared to their non-modified counterparts. Sequences of engineered and tested guide RNAs are shown in Table 4.
“Guide RNA” and “gRNA” are used herein interchangeably to refer collectively to either an sgRNA, a trRNA (also known as tracrRNA), or a crRNA (also known as a CRISPR RNA). The crRNA and trRNA may be associated on one RNA molecule (single guide RNA [sgRNA]) or in two separate RNA molecules (dual guide RNA [dgRNA]). “Guide RNA” or “gRNA” refers to each type.
The trRNA sequences may be naturally-occurring, or the trRNA sequence may include modifications or variations compared to naturally-occurring sequences.
“Editing efficiency” or “editing percentage” or “percent editing” as used herein is the total number of sequence reads with insertions or deletions of nucleotides into the target region of interest over the total number of sequence reads following cleavage by a Cas RNP.
“Hairpin” as used herein describes a loop of nucleic acids that is created when a nucleic acid strand folds and forms base pairs with another section of the same strand. A hairpin may form a structure that comprises a loop or a U-shape. In some embodiments, a hairpin may be comprised of a RNA loop. Hairpins can be formed with two complementary sequences in a single nucleic acid molecule bind together, with a folding or wrinkling of the molecule. In some embodiments, hairpins comprise stem or stem loop structures.
“Regions” as used herein describes conserved groups of nucleic acids. Regions may also be referred to as “modules” or “domains.” Regions of a gRNA may perform particular functions, e.g., in directing endonuclease activity of the RNP, for example as described in Briner A E et al., Molecular Cell 56: 333-339 (2014). Regions of a gRNA are described in Tables 1-3.
“Ribonucleoprotein” (RNP) or “RNP complex” as used herein describes a gRNA, for example, together with a nuclease, such as a Cas protein. In some embodiments, the RNP comprises Cas9 and gRNA.
“Stem loop” as used herein describes a secondary structure of nucleotides that form a base-paired “stem” that ends in a loop of unpaired nucleic acids. A stem may be formed when two regions of the same nucleic acid strand are at least partially complementary in sequence when read in opposite directions. “Loop” as used herein describes a region of nucleotides that do not base pair (i.e., are not complementary) that may cap a stem. A “tetraloop” describes a loop of 4 nucleotides. As used herein, the upper stem of a sgRNA may comprise a tetraloop.
In certain embodiments involving dgRNA, a “stem” region as used herein describes a secondary structure of nucleotides that forms a base-paired region between certain regions of a crRNA and trRNA (e.g., the lower and upper stem regions of each RNA). The “stem” region of a dgRNA may also be referred to in the art as a “flagpole” region.
“Treatment” as used herein covers any administration or application of a therapeutic for disease in a subject, and includes inhibiting the disease, arresting its development, relieving one or more symptoms of the disease, curing the disease, or preventing reoccurrence of one or more symptoms of the disease.
1. Types of Modifications
A. 2′-O-methyl Modifications
Modified sugars are believed to control the puckering of nucleotide sugar rings, a physical property that influences oligonucleotide binding affinity for complementary strands, duplex formation, and interaction with nucleases. Substitutions on sugar rings can therefore alter the confirmation and puckering of these sugars. For example, 2′-O-methyl (2′-O-Me) modifications can increase binding affinity and nuclease stability of oligonucleotides, though as shown in the Examples, the effect of any modification at a given position in an oligonucleotide needs to be empirically determined.
The terms “mA,” “mC,” “mU,” or “mG” may be used to denote a nucleotide that has been modified with 2′-O-Me.
Modification of a ribonucleotide as 2′-O-methyl ribonucleotide can be depicted as follows:
B. 2′-O-(2-methoxyethyl) Modifications
In some embodiments, the modification may be 2′-O-(2-methoxyethyl) (2′-O-moe). Modification of a ribonucleotide as a 2′-O-moe ribonucleotide can be depicted as follows:
The terms “moeA,” “moeC,” “moeU,” or “moeG” may be used to denote a nucleotide that has been modified with 2′-O-moe.
C. 2′-fluoro Modifications
Another chemical modification that has been shown to influence nucleotide sugar rings is halogen substitution. For example, 2′-fluoro (2′-F) substitution on nucleotide sugar rings can increase oligonucleotide binding affinity and nuclease stability.
In this application, the terms “fA,” “fC,” “fU,” or “fG” may be used to denote a nucleotide that has been substituted with 2′-F.
Substitution of 2′-F can be depicted as follows:
D. Phosphorothioate Modifications
Phosphorothioate (PS) linkage or bond refers to a bond where a sulfur is substituted for one nonbridging phosphate oxygen in a phosphodiester linkage, for example in the bonds between nucleotides bases. When phosphorothioates are used to generate oligonucleotides, the modified oligonucleotides may also be referred to as S-oligos.
A “*” may be used to depict a PS modification. In this application, the terms A*, C*, U*, or G* may be used to denote a nucleotide that is linked to the next (e.g., 3′) nucleotide with a PS bond.
In this application, the terms “mA*,” “mC*,” “mU*,” or “mG*” may be used to denote a nucleotide that has been substituted with 2′-O-Me and that is linked to the next (e.g., 3′) nucleotide with a PS bond. Similarly, the terms “fA*,” “fC*,” “fU*,” or “fG*” may be used to denote a nucleotide that has been substituted with 2′-F and that is linked to the next (e.g., 3′) nucleotide with a PS bond. Equivalents of a PS linkage or bond are encompassed by embodiments described herein.
The diagram below shows the substitution of S- into a nonbridging phosphate oxygen, generating a PS bond in lieu of a phosphodiester bond:
E. G-C Substitutions
In some embodiments, gRNAs are modified with sequence substitutions that do not comprise chemical modifications. In some embodiments, modified gRNAs are engineered with G-C pairings (e.g., in lower and/or upper stem regions) that are not found in the parental gRNA sequence. In some embodiments, modified gRNAs are engineered with G-U mismatches (“GU wobbles” or mismatch pairings) that are not found in the parental gRNA sequence.
F. Inverted Abasic Modifications
Abasic nucleotides refer to those which lack nitrogenous bases. The figure below depicts an oligonucleotide with an abasic (also known as apurinic) site that lacks a base:
Inverted bases refer to those with linkages that are inverted from the normal 5′ to 3′ linkage (i.e., either a 5′ to 5′ linkage or a 3′ to 3′ linkage). For example:
An abasic nucleotide can be attached with an inverted linkage. For example, an abasic nucleotide may be attached to the terminal 5′ nucleotide via a 5′ to 5′ linkage, or an abasic nucleotide may be attached to the terminal 3′ nucleotide via a 3′ to 3′ linkage. An inverted abasic nucleotide at either the terminal 5′ or 3′ nucleotide may also be called an inverted abasic end cap. In this application, the terms “invd” indicates an inverted abasic nucleotide linkage.
The above modifications and their equivalents are included within the scope of the embodiments described herein.
2. Guide RNA Compositions
Compositions comprising guide RNA are encompassed. In some embodiments, the guide RNA comprises a trRNA. In some embodiments, the guide RNA comprises a crRNA. In some embodiments, the guide RNA comprises a crRNA and trRNA. In some embodiments, the guide RNA comprises a crRNA and trRNA on one RNA molecule as a sgRNA. In some embodiments, the guide RNA comprises a crRNA and trRNA on two RNA molecules as a dgRNA. In a dgRNA, the two RNA molecules may associate via base pairing.
In some embodiments, the guide RNA comprises a 5′ terminus region. In some embodiments, the guide RNA does not comprise a 5′ terminus region. In some embodiments, the 5′ terminus region comprises a “spacer” region as described in Briner A E et al., Molecular Cell 56: 333-339 (2014) for sgRNA (but applicable herein to all guide RNAs). In some embodiments, the 5′ terminus region comprises a 5′ end modification. A 5′ terminus region with or without a spacer region may be associated with a crRNA, trRNA, sgRNA and/or dgRNA. The spacer region is also sometimes referred to herein, and by others, as a “guide region,” “guide domain” or “targeting domain.” A “target sequence” as used herein refers to a sequence of nucleic acid to which the guide region/domain directs a nuclease for cleavage. In some embodiments, a spyCas9 protein may be directed by a guide region/domain to a target sequence of a target nucleic acid molecule by the nucleotides present in the spacer region. In some embodiments, the guide RNA does not comprise a spacer region.
In some embodiments, the guide RNAs described herein comprise or consist of any of the sequences shown in Table 4. Note, however, that where a sequence shows a guide/spacer region, it should be recognized that the composition may comprise this region or not. Further, guide RNAs are encompassed that comprise the modifications of any of the sequences shown in Table 4, and identified therein by SEQ ID No. That is, the nucleotides may be the same or different, but the modification pattern shown may be the same or similar to a modification pattern of a guide sequence of Table 4. A modification pattern includes the relative position and identity of modifications of the gRNA or a region of the gRNA (e.g. 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, 3′ terminus region). In some embodiments, the modification pattern contains at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% of the modifications of any one of the sequences shown in the sequence column of Table 4, or over one or more regions of the sequence. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of any one of the sequences shown in the sequence column of Table 4. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over one or more regions of the sequence shown in Table 4, e.g., a 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, and/or 3′ terminus region. For example, in some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of a sequence over the 5′ terminus region. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the lower stem. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the bulge. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the upper stem. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the nexus. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 1. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 2. In some embodiments, a guide RNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the 3′ terminus. In some embodiments, the modification pattern differs from the modification pattern of a sequence of Table 4, or a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of such a sequence, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the gRNA comprises modifications that differ from the modifications of a sequence of Table 4, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the gRNA comprises modifications that differ from modifications of a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of a sequence of Table 4, at 0, 1, 2, 3, 4, 5, or 6 nucleotides.
In some embodiments, the gRNA comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the gRNA comprises a 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide. In some embodiments, the gRNA comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the gRNA comprises a phosphorothioate (PS) bond between nucleotides.
In some embodiments, the gRNA comprises a 5′ end modification, a 3′ end modification, or 5′ and 3′ end modifications. In some embodiments, the 5′ end modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the 5′ end modification comprises a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the 5′ end modification comprises at least one phosphorothioate (PS) bond and one or more of a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. The end modification may comprise a phosphorothioate (PS), 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modification. Equivalent end modifications are also encompassed by embodiments described herein. In some embodiments, the gRNA comprises an end modification in combination with a modification of one or more regions of the gRNA.
A. Compositions of sgRNAs
In some embodiments, the compositions and methods of the invention comprise gRNA comprising a crRNA and trRNA that direct a nuclease such as Cas9 to a target DNA sequence. In some embodiments, the gRNAs described herein may be associated on one RNA molecule (single guide RNA or sgRNA).
In some embodiments, the invention comprises a sgRNA comprising or consisting of any one of the sequences described in SEQ ID Nos: 228-332.
In some embodiments, a sgRNA comprising any one of the modified sequences of SEQ ID Nos: 235-240, 265-285, and 309-329 is provided. In some embodiments, a sgRNA comprising any one of the modified sequences of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the sgRNA further comprises a 5′ “spacer” sequence (“guide sequence”) that is complementary to a target sequence, and directs a Cas9 to its target for cleavage is encompassed. In some instances, the invention comprises sgRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier.
1. Domains of sgRNAs
Briner A E et al., Molecular Cell 56:333-339 (2014) describes functional domains of sgRNAs, referred to herein as “domains”, including the “spacer” domain responsible for targeting, the “lower stem”, the “bulge”, “upper stem” (which may include a tetraloop), the “nexus”, and the “hairpin 1” and “hairpin 2” domains. See, Briner et al. at page 334,
Table 1 and
a) 5′ Terminus Region
In some embodiments, the sgRNA comprises nucleotides at the 5′ terminus as shown in Table 1. In some embodiments, the 5′ terminus of the sgRNA comprises a spacer or guide region that functions to direct a Cas protein to a target nucleotide sequence. In some embodiments, the 5′ terminus does not comprise a spacer or guide region. In some embodiments, the 5′ terminus comprises a spacer and additional nucleotides that do not function to direct a Cas protein to a target nucleotide region.
In some embodiments, the guide region comprises the first 1-10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides at the 5′ end of the sgRNA. In some embodiments, the guide region comprises 20 nucleotides. In some embodiments, the guide region may comprise 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 or more nucleotides. In some embodiments, the guide region may comprise 17 nucleotides. In some embodiments, the guide region may comprise 18 nucleotides. In some embodiments, the guide region may comprise 19 nucleotides.
In some embodiments, the selection of the guide region is determined based on target sequences within the gene of interest for editing. For example, in some embodiments, the sgRNA comprises a guide region that is complementary to target sequences of a gene of interest.
In some embodiments, the target sequence in the gene of interest may be complementary to the guide region of the sgRNA. In some embodiments, the degree of complementarity or identity between a guide region of a sgRNA and its corresponding target sequence in the gene of interest may be about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the guide region of a sgRNA and the target region of a gene of interest may be 100% complementary or identical. In other embodiments, the guide region of a sgRNA and the target region of a gene of interest may contain at least one mismatch. For example, the guide region of a sgRNA and the target sequence of a gene of interest may contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches, where the total length of the target sequence is at least about 17, 18, 19, 20 or more base pairs. In some embodiments, the guide region of a sgRNA and the target region of a gene of interest may contain 1-6 mismatches where the guide sequence comprises at least about 17, 18, 19, 20 or more nucleotides. In some embodiments, the guide region of a sgRNA and the target region of a gene of interest may contain 1, 2, 3, 4, 5, or 6 mismatches where the guide sequence comprises about 20 nucleotides. The 5′ terminus may comprise nucleotides that are not considered guide regions (i.e., do not function to direct a cas9 protein to a target nucleic acid).
b) Lower Stem
In some embodiments, the sgRNA comprises a lower stem (LS) region that when viewed linearly, is separated by a bulge and upper stem regions. See Table 1.
In some embodiments, the lower stem regions comprise 1-12 nucleotides, e.g. in one embodiment the lower stem regions comprise LS1-LS12. In some embodiments, the lower stem region comprises fewer nucleotides than shown in Table 1 and
In some embodiments, the lower stem region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence of lower stem leads to a secondary structure of a stem in the sgRNA (e.g., the regions may base pair with one another). In some embodiments, the lower stem regions may not be perfectly complimentary to each other when read in opposite directions.
c) Bulge
In some embodiments, the sgRNA comprises a bulge region comprising six nucleotides, B1-B6. When viewed linearly, the bulge region is separated into two regions. See Table 1. In some embodiments, the bulge region comprises six nucleotides, wherein the first two nucleotides are followed by an upper stem region, followed by the last four nucleotides of the bulge. In some embodiments, the bulge region comprises fewer nucleotides than shown in Table 1 and
In some embodiments, the presence of a bulge results in a directional kink between the upper and lower stem modules in a sgRNA.
d) Upper Stem
In some embodiments, the sgRNA comprises an upper stem region comprising 12 nucleotides. In some embodiments, the upper stem region comprises a loop sequence. In some instances, the loop is a tetraloop (loop consisting of four nucleotides).
In some embodiments, the upper stem region comprises fewer nucleotides than shown in Table 1 and
In some embodiments, the upper stem region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence of upper stem leads to a secondary structure of a stem in the sgRNA (e.g., the regions may base pair with one another). In some embodiments, the upper stem regions may not be perfectly complimentary to each other when read in opposite directions.
e) Nexus
In some embodiments, the sgRNA comprises a nexus region that is located between the lower stem region and the hairpin 1 region. In some embodiments, the nexus comprises 18 nucleotides. In some embodiments, the nexus region comprises nucleotides N1 through N18 as shown in Table 1 and
In some embodiments, the nexus region comprises fewer nucleotides than shown in Table 1 and
In some embodiments, the nexus region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence leads to a secondary structure of a stem and/or stem loop in the sgRNA (e.g., certain nucleotides in the nexus region may base pair with one another). In some embodiments, the nexus regions may not be perfectly complimentary to each other when read in opposite directions.
f) Hairpin
In some embodiments, the sgRNA comprises one or more hairpin regions. In some embodiments, the hairpin region is downstream of (e.g., 3′ to) the nexus region. In some embodiments, the region of nucleotides immediately downstream of the nexus region is termed “hairpin 1” or “H1”. In some embodiments, the region of nucleotides 3′ to hairpin 1 is termed “hairpin 2” or “H2”. In some embodiments, the hairpin region comprises hairpin 1 and hairpin 2. In some embodiments, the sgRNA comprises only hairpin 1 or hairpin 2.
In some embodiments, the hairpin 1 region comprises 12 nucleic acids immediately downstream of the nexus region. In some embodiments, the hairpin 1 region comprises nucleotides H1-1 through H1-12 as shown in Table 1 and
In some embodiments, the hairpin 2 region comprises 15 nucleic acids downstream of the hairpin 1 region. In some embodiments, the hairpin 2 region comprises nucleotides H2-1 through H2-15 as shown in Table 1 and
In some embodiments, one or more nucleotides is present between the hairpin 1 and the hairpin 2 regions. The one or more nucleotides between the hairpin 1 and hairpin 2 region may be modified or unmodified. In some embodiments, hairpin 1 and hairpin 2 are separated by one nucleotide. In some embodiments, the hairpin regions comprise fewer nucleotides than shown in Table 1 and
In some embodiments, a hairpin region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the hairpin regions may not be perfectly complimentary to each other when read in opposite directions (e.g., the top or loop of the hairpin comprises unpaired nucleotides).
In some embodiments, the sgRNA comprises replacement of hairpin 1 with nucleotides “n”, wherein “n” is an integer between 1 and 50, 40, 30, 20, 15, 10, 5, 4, 3, and 2. In some embodiments, the hairpin 1 region of a sgRNA is replaced by 2 nucleotides.
g) 3′ Terminus Region
In some embodiments, the sgRNA comprises nucleotides after the hairpin region(s). In some embodiments, the 3′ terminus region comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15 or 20 or more nucleotides, e.g. that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 1, 2, 3, or 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 1, 2, or 3 nucleotides that are not associated with the secondary structure of a hairpin.
2. Modifications of sgRNAs
In some embodiments, the invention comprises a sgRNA comprising one or more modifications within one or more of the following regions: the nucleotides at the 5′ terminus; the lower stem region; the bulge region; the upper stem region; the nexus region; the hairpin 1 region; the hairpin 2 region; and the nucleotides at the 3′ terminus.
In some embodiments, the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification comprises a 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) bond between nucleotides.
In some embodiments, the sgRNA comprises modifications at 1, 2, 3, or 4 of the first 4 nucleotides at its 5′ end. In some embodiments, the first three or four nucleotides at the 5′ terminus, and the last three or four nucleotides at the 3′ terminus are modified. In some embodiments, the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the modification comprises 2′-O-Me. In some embodiments, the modification comprises 2′-F. In some embodiments, the modification comprises 2′-O-moe.
In some embodiments, the sgRNA comprises modifications at 1, 2, 3, or 4 of the first 4 nucleotides at the 5′ end. In some embodiments, the sgRNA comprises modifications at 1, 2, 3, or 4 of the first 4 nucleotides at the 3′ end. In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me or 2′-O-moe modifications.
In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
In some embodiments, a sgRNA is provided wherein LS1, LS6, LS7, LS8, LS11, and LS12 are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the bulge region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the upper stem region of the sgRNA are modified with 2′-O-Me. In some embodiments, N16, N17, and N18 in the nexus region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the hairpin 1 region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the hairpin 2 region of the sgRNA are modified with 2′-O-Me.
In some embodiments, the sgRNA comprises 2′-O-Me modified nucleotides at the following nucleotides: the first three nucleotides at the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12; B1 and B2 in the bulge region; each of the nucleotides in the upper stem region of the sgRNA; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; each of the nucleotides in the hairpin 2 region; and last four nucleotides at the 3′ terminus.
In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last four nucleotides at the 3′ terminus. In some embodiments, LS9 and LS10 are modified with 2′-F. In some embodiments, N15, N16, N17, and N18 are modified with 2′-F. In some embodiments, H2-9, H2-10, H2-11, H2-12, H2-13, HS-14, and H2-15 are modified with 2′-F. In some embodiments, the second to last, third to last, and fourth to last nucleotides at the 3′ terminus are modified with 2′-F
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-F modified nucleic acids at the following nucleotides: LS9 and LS10 in the lower stem region; N15, N16, N17, and N18 in the nexus region; and H2-9, H2-10, H2-11, H2-12, H2-13, HS-14, and H2-15 in the hairpin 2 region. In some embodiments, the sgRNA further comprises 2′-F modified nucleotides at the second to last, third to last, and fourth to last nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at three of the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1 and LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-F modified nucleotides at LS1-LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-F modified nucleotides at LS2-LS5; 2′-O-Me modified nucleotides at LS1 and LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at LS7, LS8, LS11, and LS12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at LS8, LS10, and LS12; 2′-O-F modified nucleotides at LS7, LS9, and LS11; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1, LS6, LS7, LS8, LS11, and LS12; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1, LS6, LS7, LS8, LS11, and LS12; 2′-F modified nucleotides at LS9 and LS10; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-8; 2′-F modified nucleotides at H2-9-H2-15; 2′-F modified nucleotides at the second from last, third from last, and fourth from last nucleotide at the 3′ terminus; and a 2′-O-Me modified nucleotide at the last nucleotide at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-2, H1-4, H1-6, H1-8, H1-10, and H1-12; 2′-F modified nucleotides at H1-1, H1-3, H1-5, H1-7, H1-9, and H1-11; a 2′-F modified nucleotide between Hairpin 1 and Hairpin 2; 2′-F modified nucleotides at H2-2, H2-4, H2-6, H2-8, H2-10, H2-12; and H2-14; 2′-O-Me modified nucleotides at H2-1, H2-3, H2-5, H2-7, H2-9, H2-11; H2-13, and H2-15; 2′-F modified nucleotides at the second from last, and fourth from last nucleotide at the 3′ terminus; and 2′-O-Me modified nucleotide at the third from last, and last nucleotide at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
Disclosed herein, in some embodiments, is a single guide RNA (sgRNA) comprising 2′-O-Me modifications at nucleotides LS8, LS10, LS12, H1-2, H1-4, H1-6, H1-8, H1-10, H1-12, H2-1, H2-3, H2-5, H2-7, H2-9, H2-11, H2-13, and H2-15; and 2′-F modifications at LS7, LS9, LS11; H1-1, H1-3, H1-5, H1-7, H1-9, H1-11, H1-13, H2-2, H2-4, H2-6, H2-8, H2-10, H2-12, and H2-14. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me modified nucleotides at the last and third to last nucleotide at the 3′ terminus; and 2′-F modified nucleotides at the second to last and third to last nucleotide at the 3′ terminus.
Disclosed herein, in some embodiments, is a sgRNA comprising the nucleic acids of any one of SEQ ID Nos: 228-232. Disclosed herein, in some embodiments, is a sgRNA comprising the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329. Disclosed herein, in some embodiments, is a sgRNA comprises nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, a sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises at least two phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, a sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages at the 5′ end of the RNA. In some embodiments, one or more phorphorothioate bonds link the 5′ terminal nucleotides.
In some embodiments, a sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, a sgRNA comprising any one of the modified sgRNA sequences of SEQ ID Nos: 228-332 is provided.
In some embodiments, a sgRNA comprising or consisting of any one of the modified sgRNA sequences of SEQ ID Nos: 235-240, 265-285, and 309-329 is provided.
In some embodiments, the invention comprises a sgRNA comprising any one of the modified sequences of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the sgRNA further comprises a 5′ spacer sequence that is at least partially complementary to a target sequence, and directs a Cas9 to its target for cleavage.
In some embodiments, the invention comprises a sgRNA comprising nucleotides having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleotides of any one of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. That is, the nucleotides A, U, C, and G may differ by 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% compared to what is shown in in the sequences, but the modification remains unchanged.
In some embodiments, the invention comprises a sgRNA comprising one or more modifications within one or more of the following regions: the nucleotides at the 5′ terminus; the lower stem region; the bulge region; the upper stem region; the; the nexus region; the hairpin 1 region; the hairpin 2 region; and the nucleotides at the 3′ terminus.
In some embodiments, the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the modification comprises an inverted abasic nucleotide.
In some embodiments, a sgRNA is provided comprising 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1 and B2 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In one embodiment, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides at the 3′ terminus.
In some embodiments, a sgRNA is provided comprising 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1-B6 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In one embodiment, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides at the 3′ terminus.
In some embodiments, a sgRNA is provided comprising 2′-F modified nucleotides at: LS9 and LS10 in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, a sgRNA is provided comprising 2′-F modified nucleotides at: each nucleotide in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, a single guide RNA (sgRNA) is provided comprising 2′-O-Me modified nucleotides at LS8, LS10, LS12, H1-2, H1-4, H1-6, H1-8, H1-10, H1-12, H2-1, H2-3, H2-5, H2-7, H2-9, H2-11, H2-13, H2-15, and the last and third to last nucleotides at the 3′ terminus; and 2′-F modifications at LS7, LS9, LS11; H1-1, H1-3, H1-5, H1-7, H1-9, H1-11, H1-13, H2-2, H2-4, H2-6, H2-8, H2-10, H2-12, H2-14, and the second to last and fourth to last nucleotide at the 3′ terminus.
Each of the following embodiments are encompassed:
A single guide RNA (sgRNA) comprising one or more modifications in one or more of the following regions:
The sgRNA of embodiment 1, wherein the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
The sgRNA of embodiment 1, wherein the modification comprises a 2′-fluoro (2′-F) modified nucleotide.
The sgRNA of embodiment 1, wherein the modification comprises a phosphorothioate (PS) bond between nucleotides.
The sgRNA of any one of embodiments 1-3, wherein the first three or four nucleotides at the 5′ terminus, and the last three or four nucleotides at the 3′ terminus are modified.
The sgRNA of any one of embodiments 1-5, wherein the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds.
The sgRNA of embodiment 5, wherein the modification comprises 2′-O-Me.
The sgRNA of embodiment 5, wherein the modification comprises 2′-F.
The sgRNA of any one of embodiments 1-7, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me modifications.
The sgRNA of any one of embodiments 1-8, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
The sgRNA of any one of embodiments 1-10, wherein LS1, LS6, LS7, LS8, LS11, and LS12 are modified with 2′-O-Me.
The sgRNA of any one of embodiments 1-11, wherein each of the nucleotides in the bulge region are modified with 2′-O-Me.
The sgRNA of any one of embodiments 1-12, wherein each of the nucleotides in the upper stem region are modified with 2′-O-Me.
The sgRNA of any one of embodiments 1-13, wherein N16, N17, and N18 in the nexus region are modified with 2′-O-Me.
The sgRNA of any one of embodiments 1-14, wherein each of the nucleotides in the hairpin 1 region are modified with 2′-O-Me.
The sgRNA of any one of embodiments 1-15, wherein each of the nucleotides in the hairpin 2 region are modified with 2′-O-Me.
A single guide RNA (sgRNA) comprising 2′-O-Me modified nucleic acids at the following nucleotides:
The sgRNA of embodiment 17, wherein B3-B6 are modified with 2′-O-Me.
The sgRNA of embodiment 17, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The sgRNA of any one of embodiments 1-10, wherein LS9 and LS10 are modified with 2′-F.
The sgRNA of any one of embodiments 1-10 and 20, wherein N15, N16, N17, and N18 are modified with 2′-F.
The sgRNA of any one of embodiments 1-10 and 20-21, wherein H2-9, H2-10, H2-11, H2-12, H2-13, H2-14, and H2-15 are modified with 2′-F.
The sgRNA of any one of embodiments 1-10 and 21-22, wherein the second to last, third to last, and fourth to last nucleotides at the 3′ terminus are modified with 2′-F.
A single guide RNA (sgRNA) comprising 2′-F modified nucleotides at the following positions:
The sgRNA of embodiment 24, further comprising 2′-F modified nucleotides at the second to last, third to last, and fourth to last nucleotides at the 3′ terminus.
The sgRNA of any one of embodiments 24 or 25, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The sgRNA of any one of embodiments 24-26, further comprising 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleotides at the three of the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 28 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 30 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 32 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 34 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 36 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 32 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 40 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus Embodiment 42. A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 43 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 44 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 46 further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A single guide RNA (sgRNA) comprising
The sgRNA of embodiment 48, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The sgRNA of any one of embodiments 48-49, further comprising
A sgRNA comprising the nucleic acids of any one of SEQ ID Nos: 228-332.
A sgRNA comprising the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329.
A sgRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 235-240, 265-285, and 309-329, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier.
The sgRNA of any one of embodiments 51-53, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
B. Compositions of dgRNAs
In some embodiments, the compositions and methods of the invention comprise gRNA comprising a crRNA and trRNA that direct a nuclease such as Cas9 to a target DNA sequence. In some embodiments, the gRNAs are associated, but on two separate RNA molecules (dual guide RNA or dgRNA).
Table 2 and
Table 3 and
1. Domains of dgRNAs
As described in Briner 2014, dgRNAs can be developed based on specific functional domains, referred to herein as “domains”, including the spacer responsible for targeting, the lower stem, the bulge, the upper stem, the nexus, and the hairpin domains. In dgRNAs, the crRNA comprises some components of the gRNA and the trRNA comprises some components of the gRNA.
Regions of crRNAs are provided in Table 2 and
a) 5′ Terminus Region
In some embodiments, the dgRNA comprises nucleotides at the 5′ terminus of the crRNA and trRNA as shown in Tables 2-3 and
In some embodiments, the 5′ terminus of the crRNA comprises a spacer or guide region that functions to direct a Cas protein to a target nucleotide sequence. In some embodiments, the 5′ terminus does not comprise a spacer or guide region. In some embodiments, the 5′ terminus comprises a spacer and additional nucleotides that do not function to direct a Cas protein to a target nucleotide region.
In some embodiments, the guide region comprises the first 1-10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides at the 5′ end of the crRNA. In some embodiments, the guide region comprises 20 nucleotides. In some embodiments, the guide region may comprise 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 or more nucleotides. In some embodiments, the guide region may comprise 17 nucleotides. In some embodiments, the guide region may comprise 18 nucleotides. In some embodiments, the guide region may comprise 19 nucleotides.
In some embodiments, the selection of the guide region is determined based on target sequences within the gene of interest for editing. For example, in some embodiments, the crRNA comprises a guide region that is complementary to target sequences of a gene of interest.
In some embodiments, the target sequence in the gene of interest may be complementary to the guide region of the crRNA. In some embodiments, the degree of complementarity or identity between a guide region of a crRNA and its corresponding target sequence in the gene of interest may be about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the guide region of a crRNA and the target region of a gene of interest may be 100% complementary or identical. In other embodiments, the guide region of a crRNA and the target region of a gene of interest may contain at least one mismatch. For example, the guide region of a crRNA and the target sequence of a gene of interest may contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches, where the total length of the target sequence is at least about 17, 18, 19, 20 or more base pairs. In some embodiments, the guide region of a crRNA and the target region of a gene of interest may contain 1-6 mismatches where the guide sequence comprises at least about 17, 18, 19, 20 or more nucleotides. In some embodiments, the guide region of a crRNA and the target region of a gene of interest may contain 1, 2, 3, 4, 5, or 6 mismatches where the guide sequence comprises about 20 nucleotides.
In some embodiments, the trRNA comprises a 5′ terminus. In some embodiments, the trRNA comprises a 5′ terminus which forms, in part, the upper stem of a dgRNA. The 5′ terminus of the trRNA is not complementary to a region of the target gene.
b) Lower Stem
In some embodiments, the dgRNA comprises a lower stem (LS) region. The lower stem region comprises a crRNA lower stem region and a trRNA lower stem region that associate as depicted in
In some embodiments, the lower stem region of the crRNA and trRNA each comprise 6 nucleotides. In some embodiments, the lower stem region of the crRNA and trRNA each comprise fewer nucleotides than shown in Tables 2 and 3 and
c) Bulge
In some embodiments, the dgRNA comprises a bulge (B) region. In some embodiments, the crRNA comprises one bulge region and the trRNA comprises one bulge region. In some embodiments, each bulge region comprises 1-4 nucleotides. In some embodiments, the bulge region of the crRNA comprises two nucleotides, and the bulge region of the trRNA comprises four nucleotides.
In some embodiments, the crRNA bulge region is located between the lower stem region and the upper stem region of the crRNA. In some embodiments, the bulge region of the crRNA comprises two nucleotides. In some embodiments, the bulge region of the crRNA comprises nucleotides B1 and B2 as shown Table 2 and
In some embodiments, the trRNA bulge region is located between the upper stem region and the lower stem region of the trRNA. In some embodiments, the bulge region of the trRNA comprises four nucleotides. In some embodiments, the bulge region of the trRNA comprises nucleotides B1 through B4 as shown Table 3 and
In some embodiments, the presence of a bulge results in a directional kink between the upper and lower stems modules in a dgRNA. The crRNA bulge and trRNA bulge may be partially complementary. The crRNA bulge and trRNA bulge may have no complementary.
In some embodiments, the bulge regions of the crRNA and trRNA comprise more nucleotides than shown in Tables 2 and 3 and
d) Upper Stem
In some embodiments, the dgRNA comprises an upper stem (US) region. The upper stem region comprises a crRNA upper stem region and a trRNA upper stem region that associate as depicted in
In some embodiments, the upper stem region of the crRNA comprises fourteen nucleotides. In some embodiments, the upper stem region of the trRNA comprises eleven nucleotides. In some embodiments, the upper stem regions of the crRNA and trRNA each comprise fewer nucleotides than shown in Tables 2 and 3 and
In some embodiments, the upper stem of the crRNA comprises nucleotides US1 through US14 as shown in Table 2 and
In some embodiments, the upper stem of the trRNA comprises nucleotides US1 through US11 as shown in Table 3 and
e) Nexus
In some embodiments, the dgRNA comprises a trRNA comprising a nexus region. In some embodiments, the nexus is between the lower stem region and the hairpin 1 region of the trRNA. In some embodiments, the nexus is located immediately downstream of the lower stem of the trRNA. In some embodiments, the nexus comprises eighteen nucleotides. In some embodiments, the nexus region of the trRNA comprises nucleotides N1-N18 as shown in Table 3 and
In some embodiments, the nexus region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence leads to a secondary structure of a stem and/or stem loop in the sgRNA (e.g., certain nucleotides in the nexus region may base pair with one another). In some embodiments, the nexus regions may not be perfectly complimentary to each other when read in opposite directions.
f) Hairpin
In some embodiments, the hairpin region of the trRNA is downstream of the nexus region. In some embodiments, the region of nucleotides immediately downstream of the nexus region is termed “hairpin 1.” In some embodiments, the region of nucleotides immediately downstream of the hairpin 1 region is termed “hairpin 2.” In some embodiments, the hairpin region comprises hairpin 1 and hairpin 2. In some instances, hairpin 1 and hairpin 2 are separated by one or more nucleotide “n.” In some embodiments, n=1. In some embodiments, the trRNA comprises only hairpin 1 or hairpin 2.
Replacement of the hairpin 1 region of a trRNA with 2 nucleotides has been shown to allow editing activity of a Cas RNP (see US20150376586,
In some embodiments, hairpin 1 of the trRNA comprises twelve nucleotides immediately downstream of the nexus region. In some embodiments, the hairpin 1 region of the trRNA comprises nucleotides H1-1 through H1-12 as shown in Table 3 and
In some embodiments, non-hairpin nucleotides are present between the hairpin 1 and the hairpin 2 regions of the trRNA. In some embodiments, one to two non-hairpin nucleotides reside between hairpin 1 and hairpin 2.
In some embodiments, hairpin 2 of the trRNA comprises fifteen nucleotides after (3′ to) hairpin 1. In some embodiments, the hairpin 2 region of the trRNA comprises nucleotides H2-1 through H2-15 as shown in Table 3 and
In some embodiments, a hairpin region of the trRNA comprises more nucleotides than shown in Table 3 and
In some embodiments, a hairpin region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the hairpin regions may not be perfectly complimentary to each other when read in opposite directions (e.g., the top or loop of the hairpin comprises unpaired nucleotides).
In some embodiments, the trRNA comprises replacement of hairpin 1 with nucleotides “n”, wherein “n” is an integer between 1 and 50, 40, 30, 20, 15, 10, 5, 4, 3, and 2. In some embodiments, the hairpin 1 region of a trRNA is replaced by 2 nucleotides.
g) 3′ Terminus
In some embodiments, the dgRNA comprises a trRNA comprising a 3′ terminus region comprising additional nucleotides after (3′ to) the hairpin region(s). In some embodiments, the 3′ terminus region comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15 or 20 or more nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 1, 2, 3, or 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ terminus region comprises 1, 2, or 3 nucleotides that are not associated with the secondary structure of a hairpin.
2. Modifications of dgRNAs
In some embodiments, a dgRNA comprises a modified crRNA and an unmodified trRNA. In some embodiments, a dgRNA comprises an unmodified crRNA and a modified trRNA. In some embodiments, both the crRNA and trRNA of a dgRNA comprise modifications.
In some embodiments, the gRNAs described herein are in two separate RNA molecules (dual guide or dgRNA). See, Tables 2, 3, and
In some embodiments, the invention comprises a dgRNA comprising or consisting of a) any one of the crRNA sequences of SEQ ID Nos: 1-187; and b) any one of the trRNA sequences described in SEQ ID Nos: 188-227.
In some embodiments, a dgRNA comprising any one of the modified crRNA sequences of 1-187 is provided.
In some embodiments, a dgRNA comprising any one of the modified trRNA sequences of 188-227 is provided.
In some embodiments, a dgRNA comprising any one of the modified crRNA sequences of SEQ ID Nos: 19-31, 53-73, and 104-130 is provided. In some embodiments, the invention comprises a dgRNA comprising any one of the modified sequences of SEQ ID Nos: 19-31, 53-73, and 104-130, wherein the crRNA further comprises a 5′ spacer sequence that is at least partially complementary to a target sequence, and directs a Cas9 to its target for cleavage.
In some embodiments, the invention comprises a crRNA comprising any one of the sequences described in SEQ ID Nos: 1-187. In some embodiments, the invention comprises a crRNA comprising or consisting of any one of the sequences described in SEQ ID Nos: 19-31, 53-73, and 104-130. In some embodiments, the invention comprises a crRNA comprising any one of the sequences described in SEQ ID Nos: 19-31, 53-73, and 104-130 and a spacer region.
In some embodiments, the invention comprises a trRNA comprising or consisting of any one of the sequences described in SEQ ID Nos:188-277.
In some embodiments, the invention comprises a crRNA comprising nucleotides having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleotides of any one of SEQ ID Nos: 1-187, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. That is, the nucleotides A, U, C, and G may differ by 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% compared to what is shown in in the sequences, but the modification remains unchanged.
In some embodiments, the invention comprises a trRNA comprising nucleotides having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleotides of any one of SEQ ID Nos: 188-277, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. That is, the nucleotides A, U, C, and G may differ by 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% compared to what is shown in the sequences, but the modification on each nucleotide remains unchanged.
3. crRNAs, trRNAs, and dgRNAs with Modifications
In some embodiments, the crRNA comprises one or more modified nucleotides within one or more of the 5′ terminus, lower stem, bulge, upper stem, and 3′ terminus.
In some embodiments, the modification comprises 2′-O-Me.
In some embodiments, the modification comprises 2′-F.
In some embodiments, the modification comprises a phosphorothioate (PS) bond linking one or more nucleotides. In some embodiments, the modification is three PS bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, the modification comprises an inverted abasic nucleotide.
In some embodiments, a crRNA is provided comprising 2′-O-Me modified nucleotides at each nucleotide in the upper stem. In some embodiments, US-1 through US-14 of the crRNA are each modified with 2′-O-Me. In some embodiments, LS1 and LS6 of the crRNA are modified with 2′-O-Me. In some embodiments, LS5 of the crRNA is modified with 2′-O-Me.
In some embodiments, a crRNA comprising 2′-O-Me modified nucleotides at each of the nucleotides in the upper stem, and LS1 and LS6 in the lower stem is provided. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, a crRNA comprising 2′-O-Me modified nucleotides at each of the nucleotides in the upper stem, LS1, LS5, and LS6 in the lower stem is provided. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the invention comprises a crRNA comprising 2′-F modified nucleotides at LS1, LS2, and LS6 in the lower stem. In some embodiments, the crRNA further comprises 2′-F modified nucleotides at each of B1 and B2 in the bulge region. In some embodiments, the invention comprises a crRNA comprising 2′-F modified nucleotides at LS1, LS2, and LS6 in the lower stem, and at each of B1 and B2 in the bulge region. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the crRNA comprises 2′-O-Me modified nucleotides at nucleotides LS1 and LS6 in the lower stem region; each of the nucleic acids in the bulge region; and each of the nucleic acids in the upper stem region. In some embodiments, the LS5 nucleotide of the crRNA is also modified with 2′-O-Me. In some embodiments, LS2, LS3, and LS4 of the crRNA are not modified. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the crRNA comprises 2′-fluoro (2′-F) modified nucleotides at LS1, LS2, and LS6 in the lower stem region, and each of the nucleotides in the bulge region. In some embodiments, the crRNA comprises 2′-fluoro (2′-F) modified nucleotides at LS1, LS2, and LS6 in the lower stem region, and at B2 and B2 in the bulge region. In some embodiments, the crRNA comprises 2′-fluoro (2′-F) modified nucleotides at LS1-LS6 in the lower stem region, and each of the nucleotides in the bulge region. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the invention comprises a trRNA comprising one or more modified nucleotides within one or more of the following regions: the 5′ terminus, the upper stem region; the bulge region; the lower stem region; the nexus region; the hairpin 1 region; the intervening region between the hairpin 1 and hairpin 2 regions; the hairpin 2 region; and the 3′ terminus region.
In some embodiments, the modification comprises 2′-O-Me.
In some embodiments, the modification comprises 2′-F.
In some embodiments, the modification comprises a phosphorothioate (PS) bond linking one or more nucleotides. In some embodiments, the modification is three PS bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, the modification comprises an inverted abasic nucleotide.
In some embodiments, the trRNA comprises 2′-O-Me modified nucleotides at each nucleic acid in the upper stem; B1 and B2 in the bulge region; LS1 and LS2 in the lower stem region; N3, N4, N5, N15, N16, N17, and N18 in the nexus region; each nucleotide in the hairpin 1 region; one nucleotide between the hairpin 1 and hairpin 2 region; and each nucleotide in the hairpin 2 region. In some embodiments, the trRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the trRNA comprises 2′-O-Me modified nucleotides at each nucleic acid in the upper stem; each nucleotide in the bulge region; LS1, LS2, LS5, and LS6 in the lower stem region; N3-N5, N10-N18 in the nexus region; each nucleotide in the hairpin 1 region; one nucleotide between the hairpin 1 and hairpin 2 region; and each nucleotide in the hairpin 2 region. In some embodiments, the crRNA further comprises one or more 2′-O-Me or 2′-O-moe modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the trRNA comprises 2′-F modified nucleotides at N15 through N18 in the nexus region. In some embodiments, the trRNA further comprises one or more 2′-F modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the trRNA comprises 2′-F modified nucleotides at LS4 and LS5 in the lower stem region, and N13-N18 in the nexus region. In some embodiments, the trRNA further comprises one or more 2′-F modified nucleotides in the 5′ and/or 3′ terminus region, e.g. in a 5′ and/or 3′ end modification.
In some embodiments, the trRNA comprises 2′-F modified nucleotides at LS1, LS3, and LS5 in the lower stem, and 2′-O-Me modified nucleotides at LS2, LS4, and LS6 in the lower stem.
Disclosed herein, in some embodiments, is a crispr RNA (crRNA) comprising one or more modifications within one or more of the following regions: the first five nucleotides at the 5′ terminus; the lower stem region; the bulge region; the upper stem region; and the last five nucleotides at the 3′ terminus. In some embodiments, the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the first three nucleotides at the 5′ terminus, and the last three nucleotides at the 3′ terminus are modified. In some embodiments, the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the modification comprises 2′-O-Me. In some embodiments, the modification comprises 2′-F. In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me modifications. In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications. In some embodiments, LS1 and LS6 are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the upper stem region are modified with 2′-O-Me.
In some embodiments, the invention comprises a crispr RNA (crRNA) comprising 2′-O-Me modified nucleic acids at the following nucleotides: LS1 and LS6 in the lower stem region; and each nucleotide in the upper stem region. In some embodiments, the crRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the crRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last three nucleotides at the 3′ terminus. In some embodiments, LS1, LS2, and LS6 are modified with 2′-F. In some embodiments, each nucleotide in the bulge region is modified with 2′-F.
Disclosed herein, in some embodiments, is a crispr RNA (crRNA) comprising 2′-F modified nucleic acids at the following nucleotides: LS1, LS2, and LS6 in the lower stem region; and each nucleotide in the bulge region. In some embodiments, the crRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the crRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last three nucleotides at the 3′ terminus.
In some embodiments, a crRNA comprising the nucleic acids of any one of SEQ ID Nos: 1-187 is provided. In some embodiments, a crRNA comprising the nucleic acids of any one of SEQ ID Nos: 19-31, 53-73, 104-130, and 161-187 is provided. In some embodiments, a crRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 19-31, 53-73, 104-130, and 161-187, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier, is provided. In some embodiments, the crRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
Also encompassed is a tracr RNA (trRNA) comprising one or more modifications within one or more of the following regions: the first five nucleotides at the 5′ terminus; the upper stem region; the bulge region; the lower stem region; the nexus region; the hairpin 1 region; the hairpin 2 region; and the last five nucleotides at the 3′ terminus. In some embodiments, the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the first three nucleotides at the 5′ terminus, and the last three nucleotides at the 3′ terminus are modified. In some embodiments, the modification comprises 2′-O-Me. In some embodiments, the modification comprises 2′-F. In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me modifications. In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications. In some embodiments, each nucleotide in the upper stem region is modified with 2′-O-Me. In some embodiments, B1 and B2 within the bulge region are modified with 2′-O-Me. In some embodiments, N3, N4, N5, N15, N16, N17, and N18 in the nexus region are modified with 2′-O-Me. In some embodiments, each nucleotide in the hairpin 1 region is modified with 2′-O-Me. In some embodiments, each nucleotide in the hairpin 2 region is modified with 2′-O-Me.
In some embodiments, the invention comprises a tracr RNA (trRNA) comprising 2′-O-Me modified nucleic acids at the following nucleotides: each nucleotide in the upper stem; B1 and B2 within the bulge region; N3, N4, N5, N15, N16, N17, and N18 in the nexus region; each nucleotide in the hairpin 1 region; and each nucleotide in the hairpin 2 region. In some embodiments, the trRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the trRNA further comprises 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last three nucleotides at the 3′ terminus. In some embodiments, N15, N16, N17, and N18 are modified with 2′-F. In some embodiments, LS1, LS3, and LS5 are modified with 2′-F, and LS2, LS4, and LS6 are modified with 2′-O-Me. In some embodiments, the trRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the trRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last three nucleotides at the 3′ terminus.
In some embodiments, a trRNA comprising the nucleic acids of any one of SEQ ID Nos: 188-227 is provided. In some embodiments, a trRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 188-227, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier, is provided. In some embodiments, the trRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some instances, a dual guide comprising a crRNA and a trRNA is provided, wherein the crRNA comprises the nucleic acids of any one of SEQ ID Nos: 1-187, and wherein the trRNA comprises the nucleic acids of any one of SEQ ID Nos: 188-227.
A dual guide comprising a crRNA disclosed herein and a trRNA disclosed herein is encompassed, as is a dual guide comprising a crRNA disclosed herein and an unmodified trRNA. In some embodiments, a dual guide comprising an unmodified crRNA and a modified trRNA disclosed herein is provided.
In some embodiments, and of the following are encompassed:
A crispr RNA (crRNA) comprising one or more modifications within one or more of the following regions:
The crRNA of embodiment 55, wherein the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
The crRNA of embodiment 55, wherein the modification comprises a 2′-fluoro (2′-F) modified nucleotide.
The crRNA of embodiment 55, wherein the modification comprises a phosphorothioate (PS) bond between nucleotides.
The crRNA of any one of embodiments 55-58, wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus are modified.
The crRNA of any one of embodiments 55-58, wherein the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds.
The crRNA of embodiment 59, wherein the modification comprises 2′-O-Me.
The crRNA of embodiment 59, wherein the modification comprises 2′-F.
The crRNA of any one of embodiments 55-62, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me modifications.
The crRNA of any one of embodiments 55-62, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
The crRNA of any one of embodiments 55-60, wherein LS1 and LS6 are modified with 2′-O-Me.
The crRNA of any one of embodiments 55-60 and 65, wherein each of the nucleotides in the upper stem region are modified with 2′-O-Me.
A crispr RNA (crRNA) comprising 2′-O-Me modified nucleotides at:
The crRNA of embodiment 67, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The crRNA of embodiment 67 or 68, further comprising 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleotides at the last three nucleotides at the 3′ terminus.
The crRNA of any of embodiments 55-60, wherein LS1, LS2, and LS6 are modified with 2′-F.
The crRNA of any of embodiments 55-60 and 70, wherein each nucleotide in the bulge region is modified with 2′-F.
A crispr RNA (crRNA) comprising 2′-F modified nucleotides at:
The crRNA of any one of embodiments 70-72, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The crRNA of embodiment 72 or 73, further comprising 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleotides at the last three nucleotides at the 3′ terminus.
A crRNA comprising the nucleic acids of any one of SEQ ID Nos: 1-187.
A crRNA comprising the nucleic acids of any one of SEQ ID Nos: 19-31, 53-73, 104-130, and 161-187.
A crRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 19-31, 53-73, 104-130, and 161-187, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier.
The crRNA of any one of embodiments 75-77, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A tracr RNA (trRNA) comprising one or more modifications within one or more of the following regions:
The trRNA of embodiment 79, wherein the modification comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
The trRNA of embodiment 79, wherein the modification comprises a 2′-fluoro (2′-F) modified nucleotide.
The trRNA of embodiment 79, wherein the modification comprises a phosphorothioate (PS) bond between nucleotides.
The trRNA of any one of embodiments 79-82, wherein the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds.
The trRNA of any one of embodiments 79-82, wherein the first three nucleotides at the 5′ terminus, and the last three nucleotides at the 3′ terminus are modified.
The trRNA of embodiment 84, wherein the modification comprises 2′-O-Me.
The trRNA of embodiment 84, wherein the modification comprises 2′-F.
The trRNA of any one of embodiments 79-86, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me modifications.
The trRNA of any one of embodiments 79-86, wherein the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′terminus are linked with a PS bond, and wherein the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
The trRNA of any one of embodiments 79-84, wherein each nucleotide in the upper stem region is modified with 2′-O-Me.
The trRNA of any one of embodiments 79-84 and 89, wherein B1 and B2 within the bulge region are modified with 2′-O-Me.
The trRNA of any one of embodiments 79-84 and 89-90, wherein N3, N4, N5, N15, N16, N17, and N18 in the nexus region are modified with 2′-O-Me.
The trRNA of any one of embodiments 79-84 and 89-91, wherein each nucleotide in the hairpin 1 region is modified with 2′-O-Me.
The trRNA of any one of embodiments 79-84 and 89-92, wherein each nucleotide in the hairpin 2 region is modified with 2′-O-Me.
A tracr RNA (trRNA) comprising 2′-O-Me modified nucleotides at:
The trRNA of embodiment 94, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The crRNA of embodiment 94 or 95, further comprising 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last three nucleotides at the 3′ terminus.
The trRNA of any of embodiments 79-84, wherein N15, N16, N17, and N18 are modified with 2′-F.
The trRNA of any of embodiments 79-84 and 97, wherein LS1, LS3, and LS5 are modified with 2′-F, and LS2, LS4, and LS6 are modified with 2′-O-Me.
The trRNA of any one of embodiments 87-98, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
The trRNA of embodiment 98 or 99, further comprising 2′-O-Me or 2′-F modified nucleotides at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleotides at the last three nucleotides at the 3′ terminus.
A trRNA comprising the nucleic acids of any one of SEQ ID Nos: 188-227.
A trRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 188-227, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier.
The trRNA of any one of embodiments 101-102, further comprising three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
A dual guide comprising a crRNA and a trRNA, wherein the crRNA comprises the nucleotides of any one of SEQ ID Nos: 1-187, and wherein the trRNA comprises the nucleic acids of any one of SEQ ID Nos: 188-227.
A dual guide comprising a crRNA of any one of embodiments 55-78 and a trRNA of any one of embodiments 79-103.
A dual guide comprising a crRNA of any one of embodiments 55-78 and an unmodified trRNA.
A dual guide comprising an unmodified crRNA and a trRNA of any one of embodiments 79-103.
C. Modifications to Terminal Nucleotides
In some embodiments, the 5′ or 3′ terminal nucleotides of any of the guide RNAs described herein are modified. In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in 3′ terminus region of guide RNA, including, for example, the sgRNA, the dgRNA, the crRNA, trRNA, or both crRNA and trRNA are modified. In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in 3′ terminus region of guide RNA comprise more than one modification. In some embodiments, at least one of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 3′ terminus region are modified. In some embodiments, at least two of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in 3′ terminus region are modified. In some embodiments, at least three of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in 3′ terminus region are modified. some embodiments, the modification comprises a PS linkage.
In some embodiments, the 5′ end of the 5′ terminus region is modified, for example, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides of the sgRNA, the dgRNA, crRNA, trRNA, or both crRNA and trRNA are modified. In some embodiments, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides in 3′ terminus region of guide RNA comprise more than one modification. In some embodiments, at least one of the terminal (i.e., first) 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ end are modified. In some embodiments, at least two of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ end are modified. In some embodiments, at least three of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ end are modified. some embodiments, the modification comprises a PS linkage.
In some embodiments, both the 5′ and 3′ termini (e.g., ends) of the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA are modified. In some embodiments, only the 5′ terminus of the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA is modified. In some embodiments, only the 3′ terminus of the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA is modified.
In some embodiments, the gRNA comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the first 7 nucleotides at a 5′ end of the gRNA. In some embodiments, the gRNA comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the 7 terminal nucleotides at a 3′ end. In some embodiments, 2, 3, or 4 of the first 4 nucleotides at the 5′ end, and/or 2, 3, or 4 of the terminal 4 nucleotides at the 3′ end are modified. In some embodiments, 2, 3, or 4 of the first 4 nucleotides at the 5′ end are linked with phosphorothioate (PS) bonds.
In some embodiments, the modification to the 5′ terminus and/or 3′ terminus comprises a 2′-O-methyl (2′-O-Me) or 2′-O-(2-methoxyethyl) (2′-O-moe) modification to a nucleotide. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modification to a nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) linkage between nucleotides. In some embodiments, the modification comprises an inverted abasic nucleotide. In some embodiments, the modification comprises a more than one modification selected from 2′-O-Me, 2′-O-moe, 2′-fluoro (2′-F), a phosphorothioate (PS) linkage between nucleotides, and an inverted abasic nucleotide. In some embodiments, an equivalent modification is encompassed.
In some embodiments, the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA comprises one or more phosphorothioate (PS) linkages between the first one, two, three, four, five, six, or seven nucleotides at the 5′ terminus. In some embodiments, the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA comprises one or more PS linkages between the last one, two, three, four, five, six, or seven nucleotides at the 3′ terminus. In some embodiments, the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA comprises one or more PS linkages between the last one, two, three, four, five, six, or seven nucleotides at both the 5′ terminus and the 3′ terminus. In some embodiments, in addition to PS linkages, the 5′ and 3′ terminal nucleotides may comprise 2′-O-Me, 2′-O-moe, or 2′-F modified nucleotides.
In some embodiments, the guide RNA, e.g., sgRNA, dgRNA, crRNA, trRNA, or both crRNA and trRNA comprises modified nucleotides at the 5′ and 3′ terminus, and modified nucleotides in one or more other regions described in Tables 1-3 and
In some embodiments, the crRNA, trRNA, or both crRNA and trRNA comprises modified nucleotides that are not at the 5′ or 3′ ends. Specific patterns of modifications are described below and in Table 4.
3. Delivery of gRNAs and Cas Protein
In some embodiments, in addition to the at least one gRNA, the compositions provided herein further comprise a nuclease. In some embodiments, the nuclease is a Cas protein. In some embodiments, the gRNA together with a Cas protein is called a Cas RNP. In some embodiments, the Cas protein is from the Type-II CRISPR/Cas system. In some embodiments, the Cas protein is Cas9. In some embodiments, the Cas9 protein is a wild type Cas9. In some embodiments, the Cas9 protein is derived from the Streptococcus pyogenes Cas9 protein, e.g., a S. pyogenes Cas9. In some embodiments, the Cas9 protein is not derived from S. pyogenes, but functions in the same way as S. pyogenes Cas9 such that gRNA that is specific to S. pyogenes Cas9 will direct the non-S. pyogenes Cas9 to its target site. In some embodiments, the Cas induces a double strand break in target DNA. Equivalents of S. pyogenes Cas9 protein are encompassed by the embodiments described herein.
Cas9 encompasses modified and variants thereof. Modified versions of Cas9 having one catalytic domain, either RuvC or HNH, that is inactive are termed “nickases.” Nickases cut only one strand on the target DNA, thus creating a single-strand break. A single-strand break may also be known as a “nick.” In some embodiments, the compositions and methods comprise nickases. In some embodiments, the compositions and methods comprise a nickase Cas9 that induces a nick rather than a double strand break in the target DNA.
In some embodiments, the Cas protein may be modified to contain only one functional nuclease domain. For example, the Cas protein may be modified such that one of the nuclease domains is mutated or fully or partially deleted to reduce its nucleic acid cleavage activity. In some embodiments, a nickase Cas is used having a RuvC domain with reduced activity. In some embodiments, a nickase Cas is used having an inactive RuvC domain. In some embodiments, a nickase Cas is used having an HNH domain with reduced activity. In some embodiments, a nickase Cas is used having an inactive HNH domain.
In some embodiments, a conserved amino acid within a Cas protein nuclease domain is substituted to reduce or alter nuclease activity. In some embodiments, a Cas protein may comprise an amino acid substitution in the RuvC or RuvC-like nuclease domain. Exemplary amino acid substitutions in the RuvC or RuvC-like nuclease domain include D10A (based on the S. pyogenes Cas9 protein). In some embodiments, the Cas protein may comprise an amino acid substitution in the HNH or HNH-like nuclease domain. Exemplary amino acid substitutions in the HNH or HNH-like nuclease domain include E762A, H840A, N863A, H983A, and D986A (based on the S. pyogenes Cas9 protein).
In some embodiments, the RNP complex described herein comprises a nickase and a pair of guide RNAs that are complementary to the sense and antisense strands of the target sequence, respectively. In this embodiment, the guide RNAs direct the nickase to a target sequence and introduce a double stranded break (DSB) by generating a nick on opposite strands of the target sequence (i.e., double nicking). In some embodiments, use of double nicking may improve specificity and reduce off-target effects. In some embodiments, a nickase Cas is used together with two separate guide RNAs targeting opposite strands of DNA to produce a double nick in the target DNA. In some embodiments, a nickase Cas is used together with two separate guide RNAs that are selected to be in close proximity to produce a double nick in the target DNA.
In some embodiments, chimeric Cas proteins are used, where one domain or region of the protein is replaced by a portion of a different protein. In some embodiments, a Cas nuclease domain may be replaced with a domain from a different nuclease such as Fok1. In some embodiments, a Cas protein may be a modified nuclease.
In some embodiments, the Cas protein comprises a fusion protein comprising a catalytically inactive Cas9 linked to a heterologous functional domain (see, e.g., WO2014152432). In some embodiments, the catalytically inactive Cas9 is from S. pyogenes. In some embodiments, the catalytically inactive Cas9 comprises mutations that inactivate the Cas9. In some embodiments, the heterologous functional domain is a domain that modifies gene expression, histones, or DNA. In some embodiments, the heterologous functional domain is a transcriptional activation domain or a transcriptional repressor domain.
A. PAM
In some embodiments, the target sequence may be adjacent to the PAM. In some embodiments, the PAM may be adjacent to or within 1, 2, 3, or 4, nucleotides of the 3′ end of the target sequence. The length and the sequence of the PAM may depend on the Cas protein used. For example, the PAM may be selected from a consensus or a particular PAM sequence for a specific Cas9 protein or Cas9 ortholog, including those disclosed in
B. Delivery of Modified gRNA
Lipid nanoparticles (LNPs) are a well-known means for delivery of nucleotide and protein cargo, and may be used for delivery of the gRNA, mRNA, Cas9, and RNPs disclosed herein. In some embodiments, the LNPs deliver nucleic acid, protein, or nucleic acid together with protein.
In some embodiments, the invention comprises a method for delivering any one of the gRNAs disclosed herein to a subject, wherein the gRNA is associated with an LNP. In some embodiments, the gRNA/LNP is also associated with a Cas9 or an mRNA encoding Cas9.
In some embodiments, the invention comprises a composition comprising any one of the gRNAs disclosed and an LNP. In some embodiments, the composition further comprises a Cas9 or an mRNA encoding Cas9.
In some embodiments, the LNPs comprise cationic lipids. In some embodiments, the LNPs comprise (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)-butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate). In some embodiments, the LNPs comprise molar ratios of a cationic lipid amine to RNA phosphate (N:P) of about 4.5.
In some embodiments, LNPs associated with the gRNAs disclosed herein are for use in preparing a medicament for treating a disease or disorder.
Electroporation is a well-known means for delivery of cargo, and any electroporation methodology may be used for delivery of any one of the gRNAs disclosed herein. In some embodiments, electroporation may be used to deliver any one of the gRNAs disclosed herein and Cas9 or an mRNA encoding Cas9.
In some embodiments, the invention comprises a method for delivering any one of the gRNAs disclosed herein to an ex vivo cell, wherein the gRNA is associated with an LNP or not associated with an LNP. In some embodiments, the gRNA/LNP or gRNA is also associated with a Cas9 or an mRNA encoding Cas9.
4. Methods of Gene Modulation
In some embodiments, the invention comprises a pharmaceutical formulation comprising any one of the gRNAs disclosed herein together with a pharmaceutically acceptable carrier. In some embodiments, the invention comprises a pharmaceutical formulation comprising any one of the gRNAs disclosed herein and an LNP together with a pharmaceutically acceptable carrier. In some embodiments, the invention comprises a pharmaceutical formulation comprising any one of the gRNAs disclosed herein, a Cas9 protein or an mRNA encoding a Cas9 protein, and a LNP together with a pharmaceutically acceptable carrier. In some embodiments, the pharmaceutical formulation is for use in preparing a medicament for treating a disease or disorder. In some embodiments, the invention comprises a method of treating a human patient comprising administering any one of the gRNAs or pharmaceutical formulations described herein.
In some embodiments, the invention comprises a method or use of modifying a target DNA comprising, administering or delivering a Cas protein or Cas mRNA and any one or more of the gRNAs disclosed herein.
In some embodiments, the invention comprises a method or use for modulation of a target gene comprising, administering or delivering a Cas protein or Cas mRNA and any one or more of the gRNAs disclosed herein. In some embodiments, the modulation is editing of the target gene. In some embodiments, the modulation is a change in expression of the protein encoded by the target gene.
In some embodiments, the method or use results in gene editing. In some embodiments, the method or use results in a double-stranded break within the target gene. In some embodiments, the method or use results in formation of indel mutations during non-homologous end joining of the DSB. In some embodiments, the method or use results in an insertion or deletion of nucleotides in a target gene. In some embodiments, the insertion or deletion of nucleotides in a target gene leads to a frameshift mutation or premature stop codon that results in a non-functional protein. In some embodiments, the insertion or deletion of nucleotides in a target gene leads to a knockdown or elimination of target gene expression. In some embodiments, the method or use comprises homology directed repair of a DSB. In some embodiments, the method or use further comprises delivering to the cell a template, wherein at least a part of the template incorporates into a target DNA at or near a double strand break site induced by the Cas protein.
In some embodiments, the method or use results in gene modulation. In some embodiments, the gene modulation is an increase or decrease in gene expression, a change in methylation state of DNA, or modification of a histone subunit. In some embodiments, the method or use results in increased or decreased expression of the protein encoded by the target gene.
In some embodiments, any of the gRNAs disclosed herein may be useful in preparing a medicament for treating a disease or disorder.
A. Measures of Gene Modulation
The efficacy of modified gRNAs can be tested in vitro and in vivo. In some embodiments, the invention comprises one or more of the gRNAs disclosed herein, wherein the gRNA results in gene modulation when provided to a cell together with Cas9. In some embodiments, the efficacy of gRNA can be measured in in vitro or in vivo assays.
1. In Vitro Measurement of Cas Efficacy
In some embodiments, the activity of a Cas RNP comprising a modified sgRNA is compared to the activity of a Cas RNP comprising an unmodified sgRNA.
In some embodiments, the activity of a Cas RNP comprising a dgRNA comprising a modified trRNA is compared to the activity of a Cas RNP comprising a dgRNA comprising an unmodified trRNA.
In some embodiments, the activity of a Cas RNP comprising a dgRNA comprising a modified crRNA is compared to the activity of a Cas RNP comprising a dgRNA comprising an unmodified crRNA.
In some embodiments, the activity of a Cas RNP comprising a dgRNA comprising a modified crRNA and a modified trRNA is compared to the activity of a Cas RNP comprising an unmodified crRNA and an unmodified trRNA.
In some embodiments, the efficiency of a gRNA in increasing or decreasing target protein expression is determined by measuring the amount of target protein. In some embodiments, the invention comprises any one of the gRNAs described herein, wherein the gRNA increases or decreases the amount of protein produced from the targeted gene. In some embodiments, the invention comprises a method of modulating protein expression comprising administering any one of the gRNAs disclosed herein to a subject, wherein the gRNA directs Cas9 to the gene encoding the target protein, and the target protein expression is increased or decreased as compared to a gRNA control that does not target Cas9 to that gene.
In some embodiments, the efficiency of editing with specific gRNAs is determined by the editing present at the target location in the genome following delivery of Cas9 and the gRNA (either sgRNA or dgRNA comprising a crRNA and trRNA). In some embodiments, the efficiency of editing with specific gRNAs is measured by next-generation sequencing. In some embodiments, the editing percentage of the target region of interest is determined. In some embodiments, the total number of sequence reads with insertions or deletions of nucleotides into the target region of interest over the total number of sequence reads is measured following delivery of a gRNA and Cas9. In some embodiments, the invention comprises a method of increasing the efficiency of gene editing comprising, administering or delivering any one of the modified gRNAs described herein to a cell, wherein the percentage of gene editing is increased as compared to a control gRNA that is not similarly modified.
In some embodiments, the efficiency of editing with specific gRNAs is measured by the presence of insertions or deletions of nucleotides introduced by successful gene editing. In some embodiments, the invention comprises a method of creating insertions or deletions of nucleotides in genes comprising, administering or delivering any one of the modified gRNAs described herein to a cell, wherein the nucleotides are inserted or deleted as compared to a control gRNA that is not similarly modified. In some embodiments, activity of a Cas9 and gRNAs is tested in biochemical assays. In some embodiments, activity of a Cas9 and gRNAs is tested in a cell-free cleavage assay. In some embodiments, activity of a Cas9 and gRNAs is tested in Neuro2A cells.
In some embodiments, Cas 9 and sgRNA or dgRNA comprising modified crRNA and/or trRNA shows similar, greater, or reduced activity compared to the unmodified sgRNA or dgRNA comprising unmodified crRNA and trRNA. In some embodiments, Cas9 and modified sgRNA or dgRNA comprising modified crRNA and/or trRNA shows enhanced activity compared to the unmodified sgRNA or dgRNA comprising unmodified crRNA and trRNA.
2. In Vivo Measurement of Cas Efficacy
In some embodiments, the activity of modified gRNAs is measured after in vivo dosing of LNPs comprising modified gRNAs and Cas protein or mRNA encoding Cas protein.
In some embodiments, in vivo efficacy of a gRNA or composition provided herein is determined by editing efficacy measured in DNA extracted from tissue (e.g., liver tissue) after administration of gRNA and Cas9.
3. In Vivo Measurement of Immune System Activation
Modifications to gRNA as disclosed herein may reduce the subject's immune response to in vivo dosing of gRNAs. In some embodiments, activation of the subject's immune response is measured by serum concentrations of cytokine(s) following in vivo dosing of sgRNA or dgRNA comprising trRNA and crRNA together with Cas9 mRNA or protein (e.g., formulated in a LNP). In some embodiments, the cytokine is interferon-alpha (IFN-alpha), interleukin 6 (IL-6), monocyte chemotactic protein 1 (MCP-1), and/or tumor necrosis factor alpha (TNF-alpha). In some embodiments, the invention comprises a method of reducing a subject's immune response to delivery of a gRNA comprising, administering any one of the gRNAs disclosed herein, wherein the gRNA produces a reduced response by the subject's immune system following administration. In some embodiments, the invention comprises a method of reducing activation of the subject's immune system following administration as compared to a control gRNA that is not similarly modified.
In some embodiments, administration of Cas RNP or Cas9 mRNA together with the modified gRNA (e.g., sgRNA or dgRNA) produces lower serum concentration(s) of immune cytokines compared to administration of unmodified sgRNA. In some embodiments, the invention comprises a method of reducing a subject's serum concentration of immune cytokines comprising, administering any one of the gRNAs disclosed herein, wherein the gRNA produces a lower concentration of immune cytokines in a subject's serum as compared to a control gRNA that is not similarly modified.
This description and exemplary embodiments should not be taken as limiting. For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing quantities, percentages, or proportions, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about,” to the extent they are not already so modified. Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.
It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the,” and any singular use of any word, include plural referents unless expressly and unequivocally limited to one referent. As used herein, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
The following examples are provided to illustrate certain disclosed embodiments and are not to be construed as limiting the scope of this disclosure in any way.
A. Synthetic Guide RNA (gRNA)
gRNA in both dual (dgRNA, i.e., crRNA and trRNA) and single guide (sgRNA) format were chemically synthesized by commercial vendors with modified nucleotides and linkages as provided in Table 4.
B. In Vitro Transcription (“IVT”) of Cas9 mRNA
Capped and polyadenylated Cas9 mRNA containing N1-methyl pseudo-U was generated by in vitro transcription using a linearized plasmid DNA template and T7 RNA polymerase. Plasmid DNA containing a T7 promoter and a 100 nucleotide (nt) poly(A/T) region was linearized by XbaI and obtained from a commercial manufacturer. The IVT reaction to generate Cas9 modified mRNA was incubated at 37° C. for 4 hours in the following conditions: 50 ng/μL linearized plasmid; 2 mM each of GTP, ATP, CTP, and N1-methyl pseudo-UTP (Trilink); 10 mM ARCA (Trilink); 5 U/pt T7 RNA polymerase (NEB); 1 U/pt Murine RNase inhibitor (NEB); 0.004 U/μL Inorganic E. coli pyrophosphatase (NEB); and 1× reaction buffer. After the 4 hr incubation, TURBO DNase (ThermoFisher) was added to a final concentration of 0.01 U/4, and the reaction was incubated for an additional 30 minutes to remove the DNA template. The Cas9 mRNA was purified from enzyme and nucleotides using standard protocols, including silica binding columns such as a MegaClear Transcription Clean-up kit (ThermoFisher) or precipitation steps using LiCl followed by EtOH with NaOAc. The transcript concentration was determined by measuring the light absorbance at 260 nm (Nanodrop), and the transcript was analyzed by capillary electrophoresis by Bioanlayzer (Agilent).
C. Cas9 mRNA and gRNA Transfections in Neuro2A Cells
The mouse cell line Neuro2A was cultured in DMEM media supplemented with 10% fetal bovine serum and was plated at a density of 15,000 cells/well in a 96-well plate 24 hours prior to transfection. On the day of transfection, the media was aspirated from cells and replaced with fresh media. Lipofectamine-2000 (Invitrogen) was diluted 1:50 (v/v) in Opti-MEM (Invitrogen). Cas9 mRNA and single guide RNA were diluted separately in Opti-MEM. For the dual guide format, crRNA and trRNA were diluted together in 1:1 molar ratio in Opti-MEM. Both Cas9 mRNA and gRNA were mixed separately 1:1 (v/v) with diluted Lipofectamine-2000, producing two lipoplexes. After 5 minutes of incubation, lipoplexes were added in succession to cells, for a final concentration of 100 ng Cas9 mRNA/well and 0.4 μL total lipofection reagent. Guides were tested at two dose levels for each experiment, including 25 nM and 2.5 nM, 16.7 nM and 1.67 nM, 10 nM and 1 nM, 8.3 nM and 0.83 nM, and 3 nM and 0.3 nM. For dual guide, this concentration includes equimolar amounts of crRNA and trRNA, such that, for example, 25 nM crRNA and 25 nM trRNA produce 25 nM total dual guide. Cells were lysed 24 hours post transfection, and lysates were used directly in the PCR reaction that was analyzed for editing by NGS.
D. Primary Liver Hepatocytes
Primary mouse liver hepatocytes (PMH) (Gibco) were cultured per the manufacturer's protocol (Invitrogen, protocol Nov. 28, 2012). In brief, the cells were thawed and resuspended in hepatocyte thawing medium with supplements (Gibco, Cat. CM7000) followed by centrifugation at 100 g for 10 minutes. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium plus supplement pack (Invitrogen, Cat. A1217601 and CM3000). Cells were counted and plated on Bio-coat collagen I coated 96-well plates (ThermoFisher, Cat. 877272) at a density of 15,000 cells/well and incubated for 5 hours at 37° C. and 5% CO2 atmosphere to allow for monolayer formation. After 5 hours, the plating media was removed and replaced with supplemented hepatocyte culture medium (Invitrogen, Cat. A1217601 and CM4000) containing LNP formulated Cas9 mRNA and guide RNA plus 3% mouse serum. LNPs were diluted from a starting dose level of 100 ng Cas9 mRNA and approximately 30 nM guide RNA per well, carrying out serial dilutions down to 0.1 ng mRNA and 0.03 nM guide per well. Cells were incubated for approximately 48 hours at 37° C. and 5% CO2 atmosphere before cell lysis and NGS analysis as described herein.
E. Lipid Nanoparticle (“LNP”) Formulation
LNPs were formulated with a cationic lipid amine to RNA phosphate (N:P) molar ratio of about 4.5. The lipid nanoparticle components were dissolved in 100% ethanol with the following molar ratios: 45 mol-% (12.7 mM) cationic lipid (e.g., (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate); 44 mol-% (12.4 mM) helper lipid (e.g., cholesterol); 9 mol-% (2.53 mM) neutral lipid (e.g., DSPC); and 2 mol-% (0.563 mM) PEG (e.g., PEG2k-DMG). The RNA cargo were prepared in 25 mM sodium acetate buffer, pH 4.5, resulting in a concentration of RNA cargo of approximately 0.45 mg/mL.
The LNPs were formed by microfluidic mixing of the lipid and RNA solutions using a Precision Nanosystems NanoAssemblr™ Benchtop Instrument, according to the manufacturer's protocol. A 2:1 ratio of aqueous to organic solvent was maintained during mixing using differential flow rates.
LNP Formulation Procedure A:
After mixing, the LNPs were collected, diluted in phosphate buffered saline (PBS, approximately 1:1), and then remaining buffer was exchanged into PBS (100-fold excess of sample volume), overnight at 4° C. under gentle stirring using a 10 kDa Slide-a-Lyzer™ G2 Dialysis Cassette (ThermoFisher Scientific). The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve the desired concentration. The resulting mixture was then filtered using a 0.2 μm sterile filter. The resulting filtrate was stored at 2-8° C.
LNP Formulation Procedure B:
After mixing, the LNPs were collected, diluted in 50 mM Tris at pH 7.5 (approximately 1:1), and then LNPs were exchanged into 50 mM Tris at pH 7.5 (100-fold excess of sample volume), overnight at 4° C. under gentle stirring using a 10 kDa Slide-a-Lyzer™ G2 Dialysis Cassette (ThermoFisher Scientific). The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve twice the desired concentration. These concentrated LNPs were mixed 1:1 with 50 mM Tris, 90 mM NaCl, 10% sucrose at pH 7.5 (2×TSS). The resulting mixture was then filtered using a 0.2 μM sterile filter. The resulting filtrate was stored at −80° C.
LNP Formulation Procedure C:
The RNA cargo were prepared in 25 mM sodium citrate, 100 mM sodium chloride at pH 5 resulting in a concentration of RNA cargo of approximately 0.45 mg/mL. After mixing, the LNPs were collected in water at the ratio of 3:1. The LNPs were incubated for an hour at room temperature and mixed 1:1 with water. Then they were buffer-exchanged into 1×TSS (50 mM Tris, 45 mM NaCl, 5% sucrose at pH 7.5) on PD-10 columns (GE Healthcare), using manufacturer's protocol. The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve the desired concentration. The resulting mixture was then filtered using a 0.2 μm sterile filter. The resulting filtrate was stored at −80° C.
F. Next-Generation Sequencing (“NGS”) and Analysis for On-Target Cleavage Efficiency
To quantitatively determine the efficiency of editing at the target location in the genome, deep sequencing was utilized to identify the presence of insertions and deletions introduced by gene editing.
PCR primers were designed around the target site (e.g., TTR, FVII), and the genomic area of interest was amplified. Primer sequences are provided below in Table 5.
Additional PCR was performed according to the manufacturer's protocols (Illumina) to add the necessary chemistry for sequencing. The amplicons were sequenced on an Illumina MiSeq instrument. The reads were aligned to the human reference genome (e.g., hg38) after eliminating those having low quality scores. The resulting files containing the reads were mapped to the reference genome (BAM files), where reads that overlapped the target region of interest were selected and the number of wild type reads versus the number of reads which contain an insertion, substitution, or deletion was calculated.
The editing percentage (e.g., the “editing efficiency” or “percent editing”) is defined as the total number of sequence reads with insertions or deletions over the total number of sequence reads, including wild type.
G. LNP Delivery In Vivo
CD-1 female mice, ranging 6-10 weeks of age were used in each study. Animals were weighed and grouped according to body weight for preparing dosing solutions based on group average weight. LNPs were dosed via the lateral tail vein in a volume of 0.2 mL per animal (approximately 10 mL per kilogram body weight). The animals were observed at approximately 6 hours post dose for adverse effects. Body weight was measured at twenty-four hours post-administration, and animals were euthanized at various time points by exsanguination via cardiac puncture under isoflourane anesthesia. Blood was collected into serum separator tubes or into tubes containing buffered sodium citrate for plasma as described herein. For studies involving in vivo editing, liver tissue was collected from the median lobe from each animal for DNA extraction and analysis.
H. Cytokine Induction Analysis
For this analysis, approximately 50-100 μL of blood was collected by tail vein nick for serum cytokine measurements. Blood was allowed to clot at room temperature for approximately 2 hours, and then centrifuged at 1000×g for 10 minutes before collecting the serum. A Luminex based magnetic bead multiplex assay (Affymetrix ProcartaPlus, catalog number Exp040-00000-801) measuring IL-6, TNF-alpha, IFN-alpha, and MCP-1 was used for cytokine analysis in collected in samples. Kit reagents and standards were prepared as directed in the manufacturer's protocol. 25 μL of mouse serum was added to wells containing 25 μL of the diluted antibody coated magnetic beads. The plate was incubated for 2 hours at room temperature and then washed. Diluted biotin antibody (50 μL) was added to the beads and incubated for 1 hour at room temperature. The beads were washed again before adding 50 μL of diluted streptavidin-PE to each well, followed by incubation for 30 minutes. The beads were washed once again and then suspended in 100 μL of wash buffer and read on the Bio-Plex 200 instrument (Bio-Rad). The data was analyzed using Bioplex Manager ver. 6.1 analysis package with cytokine concentrations calculated off a standard curve using a five parameter logistic curve fit.
I. Genomic DNA Isolation
For the in vivo studies, genomic DNA was extracted from 10 mg of tissue using a bead based extraction kit, MagMAX-96 DNA Multi-Sample Kit (ThermoFisher, Cat #4413020) according to manufacturer's protocol, which includes homogenizing the tissue in lysis buffer (approximately 400 μL/10 mg tissue). All DNA samples were normalized to 100 ng/μL concentration for PCR and subsequent NGS analysis, as described herein.
J. Transthyretin (TTR) ELISA Analysis
Blood was collected and the serum was isolated as indicated. The total TTR serum levels were determined using a Mouse Prealbumin (Transthyretin) ELISA Kit (Aviva Systems Biology, Cat. OKIA00111). Kit reagents and standards were prepared according to the manufacture's protocol. Mouse serum was diluted to a final dilution of 10,000-fold with 1× assay diluent. This was done by carrying out two sequential 50-fold dilutions resulting in a 2500-fold dilution. A final 4-fold dilution step was carried out for a total sample dilution of 10,000-fold. Both standard curve dilutions (100 μL each) and diluted serum samples were added to each well of the ELISA plate pre-coated with capture antibody. The plate was incubated at room temperature for 30 minutes before washing. Enzyme-antibody conjugate (100 μL per well) was added for a 20-minute incubation. Unbound antibody conjugate was removed and the plate was washed again before the addition of the chromogenic substrate solution. The plate was incubated for 10 minutes before adding 100 μL of the stop solution, e.g., sulfuric acid (approximately 0.3 M). The plate was read on a SpectraMax M5 plate reader at an absorbance of 450 nm. Serum TTR levels were calculated by SoftMax Pro software ver. 6.4.2 using a four parameter logistic curve fit off the standard curve. Final serum values were adjusted for the assay dilution.
Modified gRNAs were designed in the dual guide format (dgRNA), as shown in Table 4. Accordingly, both modified crRNAs and trRNAs were designed and chemically synthesized to allow for the pairing of modified and unmodified components forming dgRNA. These pairings were transfected into Neuro2A cells at concentrations as indicated in the figures and editing efficiency (e.g., percent editing) was measured by NGS, as described in Example 1.
Certain modified crRNAs from Table 4 targeting the mouse TTR gene were transfected with Cas9 mRNA and unmodified trRNA (TR000002). Tested guides included SEQ ID Nos: 1-18. As shown in
In parallel, modified trRNAs from Table 4 were transfected with Cas9 mRNA along with an unmodified crRNA (CR000686) targeting the same sequence of the mouse TTR gene. Tested guides included SEQ ID Nos: 188-200, and 204. As shown in
In addition to substituting chemically modified nucleotides, some of the crRNA and trRNA pairings tested were also engineered with sequence substitutions, e.g., resulting in G-C pairings not found in the parental sequences. Tested guides included SEQ ID Nos: 15 and 201; 16 and 202; 1 and 188. As shown in
Next, pairings of modified crRNAs and modified trRNAs from Table 4 were tested. As shown in
Based on the dgRNA designs, corresponding single guide RNAs (sgRNAs) were engineered featuring aspects of some of the modified crRNAs and trRNAs, as depicted in Table 4 and
A similar set of experiments were conducted for additional dgRNAs guides depicted in Table 4 and
In parallel, as shown in
In addition to substituting chemically modified nucleotides, some of the crRNA and trRNA pairings tested from Table 4 were also engineered with sequence substitutions, e.g., resulting in G-C pairings or G-U mismatches (“GU wobbles”) not found in the parental sequences. As shown in
Next, select pairings of the modified crRNAs and modified trRNAs from Table 4 were tested as shown in
Some of the modified gRNAs (dgRNAs and sgRNAs) from Table 4 were also tested in a purely biochemical assay (i.e., cell free cleavage assay). Interestingly, many of the modified gRNAs that were largely inactive in the Neuro2A cells were active in the biochemical assay, indicating that such biochemical assays may not be predictive of modified gRNA activity in cells (data not shown).
Having established that certain modifications affected gRNA activity, it was next tested whether these modifications would affect the activity when targeting (1) a separate sequence in the same gene or (2) a sequence in a different gene. Accordingly, gRNAs targeting another sequence in the mouse TTR gene as well as a sequence in the mouse Factor-VII (FVII) gene were engineered and synthesized having certain modification patterns tested in Example 2 (see Table 4). These gRNAs were transfected into Neuro2A cells at the concentrations indicated in the figures and editing efficiency (e.g., percent editing) was measured by NGS, as described in Example 1.
Modified crRNAs from Table 4 targeting either the mouse TTR gene (different sequence as targeted in Example 2) or the mouse FVII gene, were transfected with Cas9 mRNA and unmodified trRNA (TR000002). Tested guides included those shown in
In parallel, modified trRNAs from Table 4 were transfected with Cas9 mRNA along with an unmodified crRNA targeting the same sequence of the mouse TTR gene (CR000705; different sequence as targeted in Example 2) or the same sequence as the mouse FVII gene (CR000657). As shown in
Based on the dgRNA designs described above, corresponding single guide RNAs (sgRNAs) were engineered featuring aspects of some of the modified crRNAs and trRNAs. See, Table 4. These sgRNAs were also tested in Neuro2A cells. Results are shown in
Following the in vitro testing, modified sgRNAs were delivered to animals in six separate studies in order to determine whether the modifications conferred any benefits for editing in vivo.
LNPs were formulated with IVT Cas9 mRNA together with chemically modified sgRNA (targeting TTR or FVII), as described in Example 1. The ratio of mRNA:sgRNA was approximately 1:1, by weight of the RNA components. Unless otherwise indicated, the Cas9 mRNA used in the studies described in this example had the sequence of SEQ ID NO: 360 and the LNPs were formulated using LNP Formulation Procedure A described above.
In one experiment, mice (n=5 per group) were administered a single dose of LNP at 2 mg/kg and blood was collected four hours post dose for serum cytokine analysis. 7 days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively. Each of the sgRNAs in this experiment targeted the same sequence in the TTR gene, the only difference between the sgRNAs being the modifications made to each (See
The results shown in
In another in vivo study, three sgRNAs targeting a separate sequence in the mouse TTR gene were tested. Mice (n=5 per group) were administered a single dose of LNP at 2 mg/kg, 1 mg/kg, or 0.3 mg/kg. Blood was collected four hours post dose for serum cytokine analysis. 7 days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively. In this study, each of the sgRNAs targeted the same sequence in the TTR gene (a different sequence from what was targeted in the previous in vivo study) with one sgRNA being completely unmodified (G000201 (SEQ ID NO: 243)), another having only end modifications (G000211 (SEQ ID NO: 241)), with 2′-O-methyl modifications and phosphorothioate linkages at and between the three terminal nucleotides at both the 5′ and 3′ termini of the sgRNA, respectively), and a third sgRNA having the same modification pattern as G000267 in the previous in vivo study (G000282 (SEQ ID NO: 242)).
As shown in
A similar study as the second in vivo study was next conducted with another set of three sgRNAs targeting yet a different TTR sequence in the mouse TTR gene (targeting a different sequence then what was targeted in the two previous in vivo studies). Mice (n=5 per group) were administered a single dose of LNP at 2 mg/kg, 1 mg/kg, or 0.3 mg/kg. Blood was collected four hours post dose for serum cytokine analysis. 7 days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively. In this study, each of the sgRNAs targeted the same sequence in the TTR gene (a different sequence from what was targeted in the previous two in vivo studies) with one sgRNA being completely unmodified (G000285; (SEQ ID NO: 332)), another having only end modifications (G000269 (SEQ ID NO: 330)), with 2′-O-methyl modifications and phosphorothioate linkages at and between the three terminal nucleotides at both the 5′ and 3′ ends of the sgRNA, respectively), and a third sgRNA having the same modification pattern as G000267 and G000282 in the previous two in vivo studies (G000283 (SEQ ID NO: 331)).
In this study, the unmodified sgRNA (G000285 (SEQ ID NO: 332)) conferred little in vivo editing, while the heavily modified sgRNA (G000283 (SEQ ID NO: 331)) conferred levels reaching ˜60% with a dose of 2 mg/kg, which was significantly greater than the levels achieved with the less modified sgRNA (G000269 (SEQ ID NO: 330)) (
In a fourth in vivo study, the effects of modifications to gRNAs was evaluated for another gene (FVII). For in-study comparison, two of the sgRNAs tested in the first in vivo study were included (G000209 and G000267). Mice (n=5 per group) were administered a single dose of LNP at 2 mg/kg, 1 mg/kg, or 0.3 mg/kg, and blood was collected four hours post dose for serum cytokine analysis. 6 days post dose at necropsy, livers were collected for NGS measurements of editing efficiency. In this study, each of the sgRNAs targeted the same sequence in the TTR or FVII genes, with one sgRNA for each having only end modifications (G000208 (SEQ ID NO: 286)) for FVII, G000209 for TTR, both having 2′-O-methyl modifications and phosphorothioate linkages at and between the three terminal nucleotides at both the 5′ and 3′ ends of the sgRNA, respectively), and a second sgRNA having the same modification patterns as G000267, G000282, and G000283 in the previous in vivo studies (G000373 (SEQ ID NO: 287) for FVII; G000267 (SEQ ID NO: 234) for TTR).
As shown in
In another in vivo study, ten additional sgRNAs targeting the same sequence in the mouse TTR gene as G000282 were tested. 0000282 was also included in the study for comparative purposes. Mice (n=5 per group) were administered a single dose of LNP at 1 mg/kg or 0.5 mg/kg. The LNPs used in this study were formulated using LNP Formulation Procedure B described above. Seven (7) days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively. In this study, each of the sgRNAs targeted the same sequence in the TTR gene. The modification pattern of each sgRNA tested varied and included 2′-OMe, 2′-F, and PS modifications in the 5′ terminus, 3′ terminus, hairpin 1, hairpin 2, nexus, lower stem, bulge, and upper stem of the sgRNA. The results of this study are shown in
In another in vivo study, thirteen sgRNAs targeting the same sequence in the mouse TTR gene as G000282 were tested. G000282 was also included in the study for comparative purposes. Mice (n=5 per group) were administered a single dose of LNP at 1 mg/kg. The LNPs used in this study were formulated using LNP Formulation Procedure C described above. The Cas9 mRNA used in this study had the sequence of SEQ ID NO: 359. Blood was collected four hours post dose for serum cytokine analysis. 7 days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively. In this study, each of the sgRNAs targeted the same sequence in the TTR gene. The sgRNAs tested include additional 2′-OMe and PS modifications in the 5′ terminus, 3′ terminus, hairpin 1, hairpin 2, and upper stem of the sgRNA. The results of this study are shown in
The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 7, 2019, is named 01155-0004-00US_SeqList.txt and is 118,923 bytes in size. This application is a Continuation of International Application No. PCT/US2017/065306, which was filed on Dec. 8, 2017, and which claims the benefit of priority to U.S. Provisional Application No. 62/431,756, which was filed on Dec. 8, 2016, both of which are incorporated by reference in their entirety. This disclosure relates to the field of gene editing using CRISPR/Cas systems, a part of the prokaryotic immune system that recognizes and cuts exogenous genetic elements. The CRISPR/Cas system relies on a single nuclease, termed CRISPR-associated protein 9 (Cas9), which induces site-specific breaks in DNA. Cas9 is guided to specific DNA sequences by small RNA molecules termed guide RNA (gRNA). Guide RNA comprises trRNA (also known as tracrRNA) and crisprRNA (crRNA). The trRNA and crRNA may be contained within a single guide RNA (sgRNA) or in two separate RNA molecules of a dual guide RNA (dgRNA). Cas9 in combination with trRNA and crRNA or an sgRNA is termed the Cas9 ribonucleoprotein complex (RNP). Oligonucleotides, and in particular RNA, are sometimes degraded in cells and in serum by endonuclease or exonuclease cleavage. Improved methods and compositions for preventing such degradation, improving stability of gRNAs and enhancing gene editing efficiency is desired, especially for therapeutic applications.
Number | Name | Date | Kind |
---|---|---|---|
20150376586 | May et al. | Dec 2015 | A1 |
20150376628 | Schoenherr | Dec 2015 | A1 |
Number | Date | Country |
---|---|---|
2014152432 | Sep 2014 | WO |
2016089433 | Jun 2016 | WO |
2016164356 | Oct 2016 | WO |
2017004279 | Jan 2017 | WO |
2017068377 | Apr 2017 | WO |
2017136794 | Aug 2017 | WO |
2017173054 | Oct 2017 | WO |
Entry |
---|
Finn, J. D. et al. “A Single Administration of CRISPR/Cas9 Lipid Nanoparticles Achieves Robust and Persistent In Vivi Genome Editing” Cell Reports 22, 2227-2235 (2018). |
Briner et al., “Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality”, Molecular Cell 56:333-339, 2014. |
Hendel, Ayal et al. “Chemically modified guide RNAs enhance CRISPR-Cas genome editing in human primary cells.” Nature biotechnology vol. 33,9 (2015): 985-989. |
Ran, F Ann et al. “In vivo genome editing using Staphylococcus aureus Cas9.” Nature vol. 520,7546 (2015): 186-91. |
Wang Ming et al., “Efficient delivery of genome-editing proteins using bioreducible lipid nanoparticles”, Proceedings National Academy of Sciences PNAS, vol. 113, No. 11, Feb. 29, 2016, p. 2868-2873. |
Yin Hao et al., “Therapeutic genome editing by combined viral and non-viral delivery of CRISPR system components in vivo”, Nature Biotechnology, vol. 34, No. 3, Feb. 1, 2016, p. 328-333. |
Yu, Xin et al. “Improved delivery of Cas9 protein/gRNA complexes using lipofectamine CRISPRMAX.” Biotechnology Letters vol. 38,6 (2016): 919-29. |
Ryan, et al. “Improving CRISPR-Cas specificity with chemical modifications in single-guide RNAs” Nucleic Acids Research, 2018, 46(2):792-803 (published online Dec. 4, 2017). |
International Search Report and Written Opinion for PCT/US2017/065306 dated Apr. 17, 2018. |
Number | Date | Country | |
---|---|---|---|
20190316121 A1 | Oct 2019 | US |
Number | Date | Country | |
---|---|---|---|
62431756 | Dec 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2017/065306 | Dec 2017 | US |
Child | 16434512 | US |