The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Dec. 4, 2020, is named 2020-12-04_01155-0022-00US_ST25.txt and is 969,230 bytes in size.
This disclosure relates to the field of gene editing using CRISPR/Cas systems, a part of the prokaryotic immune system that recognizes and cuts exogenous genetic elements. The CRISPR/Cas system relies on a single nuclease, termed CRISPR-associated protein 9 (Cas9), which induces site-specific breaks in DNA. Cas9 is guided to specific DNA sequences by small RNA molecules termed guide RNA (gRNA). A complete guide RNA comprises tracrRNA (trRNA) and crisprRNA (crRNA). A crRNA comprising a guide region may also be referred to as a gRNA, with the understanding that to form a complete gRNA it should be or become associated covalently or noncovalently with a trRNA. The trRNA and crRNA may be contained within a single guide RNA (sgRNA) or in two separate RNA molecules of a dual guide RNA (dgRNA). Cas9 in combination with trRNA and crRNA or an sgRNA is termed the Cas9 ribonucleoprotein complex (RNP).
Oligonucleotides, and in particular RNA, are sometimes degraded in cells and in serum by non-enzymatic, endonuclease or exonuclease cleavage. Improved methods and compositions for preventing such degradation, improving stability of gRNAs and enhancing gene editing efficiency is desired, especially for therapeutic applications.
In some embodiments, genome editing tools are provided comprising modified guide RNA (gRNA). The modifications of gRNAs described herein may improve the stability of the gRNA and the gRNA/Cas9 complex and improve the activity of Cas9 (e.g., SaCas9, SpyCas9, and equivalents) to cleave target DNA.
In some embodiments, modified crisprRNA (crRNA) and/or modified tracrRNA (trRNA) are provided. In some embodiments, the modified crRNA and/or modified trRNA comprise a dual guide RNA (dgRNA). In some embodiments, the modified crRNA and/or modified trRNA comprise a single guide RNA (sgRNA). The modifications of crRNA and/or trRNA described herein may improve the stability of the gRNA and the gRNA/Cas9 complex and improve the activity of Cas9 (e.g., SauCas9, SpyCas9, and equivalents) to cleave target DNA. In some embodiments, the crRNA portion of a dgRNA or an sgRNA is modified in the targeting domain.
In some embodiments, genome editing tools are provided comprising short-single guide RNA (short-sgRNA). In some embodiments, the short-sgRNA is modified. The short-sgRNAs described herein may improve the stability of the short-sgRNA and the short-sgRNA/Cas9 complex and improve the activity of Cas9 (e.g., SauCas9, SpyCas9, and equivalents) to cleave target DNA.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA) comprises a modification at one or more YA sites, e.g., as set forth in the embodiments below, Table 1, and in the Examples and associated Figures. For the avoidance of doubt, sgRNAs include but are not limited to short-sgRNAs. As discussed in the Examples section, it has been found that gRNAs can be susceptible to an RNase A-like degradation pattern, e.g., including cleavage at unmodified YA sites. It has further been found that YA site modifications can reduce or eliminate such cleavage and that many YA site modifications appear to be tolerated without adversely affecting the ability of the gRNA to direct cleavage by a nuclease such as Cas9. It has also been found that certain gRNA positions, including but not limited to YA sites, can be modified despite statements by others (see Yin et al., Nature Biotechnol. 35:1179-1187 (2017)) that they are contacted by Cas9 and should not be modified out of concern for loss of activity. Such modifications may further reduce undesirable gRNA degradation while not compromising activity.
The following embodiments are encompassed.
Provided herein are modified guide RNAs (gRNAs) for use in gene editing methods. Sequences of engineered and tested gRNAs are shown in Table 1.
Certain of the gRNAs provided herein are modified dual guide RNAs (dgRNAs) for use in gene editing methods. Sequences of engineered and tested dgRNAs are shown in Table 1. Certain of the dgRNAs have certain modifications at YA sites in the dgRNA, including modifications in the crRNA and/or the trRNA.
Certain of the gRNAs provided herein are modified single guide RNAs (sgRNAs) for use in gene editing methods. Sequences of engineered and tested sgRNAs are shown in Table 1. Certain of the sgRNAs have certain modifications at YA sites in the sgRNA, including modifications in the crRNA portion of the sgRNA and/or the trRNA portion of the sgRNA.
Also provided herein are short-single guide RNAs (short-sgRNAs), optionally modified, for use in gene editing methods. Sequences of engineered and tested short-sgRNAs are shown in Table 1. Certain of the short-sgRNAs have certain modifications at YA sites in the short-sgRNA, including modifications in the crRNA portion of the short-sgRNA and/or the trRNA portion of the short-sgRNA.
This disclosure further provides uses of these gRNAs (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA) to alter the genome of a target nucleic acid in vitro (e.g., cells cultured in vitro for use in ex vivo therapy or other uses of genetically edited cells) or in a cell in a subject such as a human (e.g., for use in in vivo therapy). The present disclosure also provides methods for preventing or treating a disease in a subject by modifying a target gene associated with a disease. The disclosed gRNAs can be used with any cell type and at any genetic locus amenable to nuclease mediated genome editing technology.
Nucleotide modifications are indicated in Table 1 as follows: m: 2′-OMe; *: PS linkage; f: 2′-fluoro; (invd): inverted abasic; moe: 2′-moe; e: ENA; d: deoxyribonucleotide (also note that T is always a deoxyribonucleotide); x: UNA. Thus, for example, mA represents 2′-O-methyl adenosine; xA represents a UNA nucleotide with an adenine nucleobase; eA represents an ENA nucleotide with an adenine nucleobase; and dA represents an adenosine deoxyribonucleotide.
sgRNA designations are sometimes provided with one or more leading zeroes immediately following the G. This does not affect the meaning of the designation. Thus, for example, G000282, G0282, G00282, and G282 refer to the same sgRNA. Similarly, crRNA and or trRNA designations are sometimes provided with one or more leading zeroes immediately following the CR or TR, respectively, which does not affect the meaning of the designation. Thus, for example, CR000100, CR00100, CR0100, and CR100 refer to the same crRNA, and TR000200, TR00200, TR0200, and TR200 refer to the same trRNA.
For SEQ ID NOs: 401-535, 1001, and 1007-1032, positions correspond to sgRNA regions as follows: 1-20, guide region; 21-26 and 45-50, lower stem; 27-28 and 41-44, bulge; 29-40, upper stem (of which 33-36 are a tetraloop); 51-68, nexus; 69-80, hairpin 1; 82-96, hairpin 2 (position 81 is a nucleotide between hairpin 1 and hairpin 2); 97-100, 3′ terminus region.
For SEQ ID NOs 601 and 607-732, no guide region is shown and the positions corresponding to the remaining regions are each decremented by 20 relative to those given for SEQ ID NOs: 401-532. For SEQ ID NOs 801 and 807-932, the spacer is the length of x and the positions corresponding to the remaining regions are each decremented by 20 and incremented by x relative to those given for SEQ ID NOs: 401-532.
“Editing efficiency” or “editing percentage” or “percent editing” as used herein is the total number of sequence reads with insertions or deletions of nucleotides into the target region of interest over the total number of sequence reads following cleavage by a Cas RNP.
“Regions” as used herein describes conserved groups of nucleic acids. Regions may also be referred to as “modules” or “domains.” Regions of an sgRNA may perform particular functions, e.g., in directing endonuclease activity of the RNP, for example as described in Briner A E et al., Molecular Cell 56:333-339 (2014). Exemplary regions of an sgRNA are described in Table 3.
“Hairpin” as used herein describes a duplex of nucleic acids that is created when a nucleic acid strand folds and forms base pairs with another section of the same strand. A hairpin may form a structure that comprises a loop or a U-shape. In some embodiments, a hairpin may be comprised of an RNA loop. Hairpins can be formed with two complementary sequences in a single nucleic acid molecule bind together, with a folding or wrinkling of the molecule. In some embodiments, hairpins comprise stem or stem loop structures. As used herein, a “hairpin region” refers to hairpin 1 and hairpin 2 and the “n” between hairpin 1 and hairpin 2 of a conserved portion of an sgRNA.
“Ribonucleoprotein” (RNP) or “RNP complex” as used herein describes an sgRNA, for example, together with a nuclease, such as a Cas protein. In some embodiments, the RNP comprises Cas9 and gRNA (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA).
“Stem loop” as used herein describes a secondary structure of nucleotides that form a base-paired “stem” that ends in a loop of unpaired nucleic acids. A stem may be formed when two regions of the same nucleic acid strand are at least partially complementary in sequence when read in opposite directions. “Loop” as used herein describes a region of nucleotides that do not base pair (i.e., are not complementary) that may cap a stem. A “tetraloop” describes a loop of 4 nucleotides. As used herein, the upper stem of an sgRNA may comprise a tetraloop.
“Guide RNA”, “gRNA”, and “guide” are used herein interchangeably to refer to either a crRNA (also known as CRISPR RNA), or the combination of a crRNA and a trRNA (also known as tracrRNA). The crRNA and trRNA may be associated as a single RNA molecule (single guide RNA, sgRNA) or in two separate RNA molecules (dual guide RNA, dgRNA). “Guide RNA” or “gRNA” refers to each type. The trRNA may be a naturally-occurring sequence, or a trRNA sequence with modifications or variations compared to naturally-occurring sequences. Guide RNAs can include modified RNAs as described herein.
In some embodiments, the gRNA (e.g., sgRNA) comprises a “guide region”, which is sometimes referred to as a “spacer” or “spacer region,” for example, in Briner A E et al., Molecular Cell 56:333-339 (2014) for sgRNA (but applicable herein to all guide RNAs). The guide region or spacer region is also sometimes referred to as a “variable region,” “guide domain” or “targeting domain.” In some embodiments, a “guide region” immediately precedes a “conserved portion of an sgRNA” at its 5′ end, and in some embodiments the sgRNA is a short-sgRNA. An exemplary “conserved portion of an sgRNA” is shown in Table 2. In some embodiments, a “guide region” comprises a series of nucleotides at the 5′ end of a crRNA. In some embodiments, the guide region comprises one or more YA sites (“guide region YA sites”). In some embodiments, the guide region comprises one or more YA sites located at positions from a given nucleotide relative to the 5′ end to the end of the guide region. Such ranges of positions are referred to as, e.g., “5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus” where the “end” in “5-end”, etc., refers to most 3′ nucleotide in the guide region. (Similarly, expressions such as “nucleotides 21-end of the gRNA” refer to the range from nucleotide 21 from the 5′ end of the 5′ terminus of the gRNA to the final nucleotide at the 3′ end of the gRNA.) Furthermore, a nucleotide that is, for example, 6 nucleotides from the 5′ end of a particular sgRNA segment is the sixth nucleotide of that segment, or “nucleotide 6” from the 5′ end, e.g., , where N is the 6th nucleotide from the 5′ end. A range of nucleotides that is located “at or after” 6 nucleotides from the 5′ end begins with the 6th nucleotide and continues down the chain toward the 3′ end. Similarly, a nucleotide that is, for example, 5 nucleotides from the 3′ end of the chain is the 5th nucleotide when counting from the 3′ end of the chain, e.g. NXXXX. A numeric position or range in the guide region refers to the position as determined from the 5′ end unless another point of reference is specified; for example, “nucleotide 5” in a guide region is the 5th nucleotide from the 5′ end.
In some embodiments, a gRNA comprises nucleotides that “match the modification pattern” at corresponding or specified nucleotides of a gRNA described herein. This means that the nucleotides matching the modification pattern have the same modifications (e.g., phosphorothioate, 2′-fluoro, 2′-OMe, etc.) as the nucleotides at the corresponding positions of the gRNA described herein, regardless of whether the nucleobases at those positions match. For example, if in a first gRNA, nucleotides 5 and 6, respectively, have 2′-OMe and phosphorothioate modifications, then this gRNA has the same modification pattern at nucleotides 5 and 6 as a second gRNA that also has 2′-OMe and phosphorothioate modifications at nucleotides 5 and 6, respectively, regardless of whether the nucleobases at positions 5 and 6 are the same or different in the first and second gRNAs. However, a 2′-OMe modification at nucleotide 6 but not nucleotide 7 is not the same modification pattern at nucleotides 6 and 7 as a 2′-OMe modification at nucleotide 7 but not nucleotide 6. Similarly, a modification pattern that matches at least 75% of the modification pattern of a gRNA described herein means that at least 75% of the nucleotides have the same modifications as the corresponding positions of the gRNA described herein. Corresponding positions may be determined by pairwise or structural alignment.
A “conserved region” of a S. pyogenes Cas9 (“spyCas9” (also referred to as “spCas9”)) sgRNA” is shown in Table 2. The first row shows the numbering of the nucleotides; the second row shows the sequence (e.g., SEQ ID NO: 400); and the third row shows the regions.
As used herein, a “short-single guide RNA” (“short-sgRNA”) is an sgRNA comprising a conserved portion of an sgRNA comprising a hairpin region, wherein the hairpin region lacks at least 5-10 or 6-10 nucleotides. In some embodiments, a short-sgRNA lacks at least nucleotides 54-58 (AAAAA) of the conserved portion of a S. pyogenes Cas9 (“spyCas9”) sgRNA, as shown in Table 2. In some embodiments, a short-sgRNA is a non-spyCas9 sgRNA that lacks nucleotides corresponding to nucleotides 54-58 (AAAAA) of the conserved portion of a spyCas9 as determined, for example, by pairwise or structural alignment. In some embodiments, a short-sgRNA lacks at least nucleotides 54-61 (AAAAAGUG) of the conserved portion of a spyCas9 sgRNA. In some embodiments, a short-sgRNA lacks at least nucleotides 53-60 (GAAAAAGU) of the conserved portion of a spyCas9 sgRNA. In some embodiments, a short-sgRNA lacks 4, 5, 6, 7, or 8 nucleotides of nucleotides 53-60 (GAAAAAGU) or nucleotides 54-61 (AAAAAGUG) of the conserved portion of a spyCas9 sgRNA, or the corresponding nucleotides of the conserved portion of a non-spyCas9 sgRNA as determined, for example, by pairwise or structural alignment.
As used herein, a “YA site” refers to a 5′-pyrimidine-adenine-3′ dinucleotide. For clarification, a “YA site” in an original sequence that is altered by modifying a base is still considered a (modified) YA site in the resulting sequence, regardless of the absence of a literal YA dinucleotide. A “conserved region YA site” is present in the conserved region of an sgRNA. A “guide region YA site” is present in the guide region of an sgRNA. An unmodified YA site in an sgRNA may be susceptible to cleavage by RNase-A like endonucleases, e.g., RNase A. In some embodiments, a short-sgRNA comprises about 10 YA sites in its conserved region. In some embodiments, an sgRNA comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 YA sites in its conserved region. Exemplary conserved region YA sites are indicated in
As discussed herein, positions of nucleotides corresponding to those described with respect to spyCas9 gRNA can be identified in another gRNA with sequence and/or structural similarity by pairwise or structural alignment. Structural alignment is useful where molecules share similar structures despite considerable sequence variation. For example, spyCas9 and Staphylococcus aureus Cas9 (“SaCas9”) have divergent sequences, but significant structural alignment. See, e.g.,
Structural alignment involves identifying corresponding residues across two (or more) sequences by (i) modeling the structure of a first sequence using the known structure of the second sequence or (ii) comparing the structures of the first and second sequences where both are known, and identifying the residue in the first sequence most similarly positioned to a residue of interest in the second sequence. Corresponding residues are identified in some algorithms based on distance minimization given position (e.g., nucleobase position 1 or the 1′ carbon of the pentose ring for polynucleotides, or alpha carbons for polypeptides) in the overlaid structures (e.g., what set of paired positions provides a minimized root-mean-square deviation for the alignment). When identifying positions in a non-spyCas9 gRNA corresponding to positions described with respect to spyCas9 gRNA, spyCas9 gRNA can be the “second” sequence. Where a non-spyCas9 gRNA of interest does not have an available known structure, but is more closely related to another non-spyCas9 gRNA that does have a known structure, it may be most effective to model the non-spyCas9 gRNA of interest using the known structure of the closely related non-spyCas9 gRNA, and then compare that model to the spyCas9 gRNA structure to identify the desired corresponding residue in the non-spyCas9 gRNA of interest. There is an extensive literature on structural modeling and alignment for proteins; representative disclosures include U.S. Pat. Nos. 6,859,736; 8,738,343; and those cited in Aslam et al., Electronic Journal of Biotechnology 20 (2016) 9-13. For discussion of modeling a structure based on a known related structure or structures, see, e.g., Bordoli et al., Nature Protocols 4 (2009) 1-13, and references cited therein. See also
A “target sequence” as used herein refers to a sequence of nucleic acid to which the guide region directs a nuclease for cleavage. In some embodiments, a spyCas9 protein may be directed by a guide region to a target sequence by the nucleotides present in the guide region. In some embodiments, the sgRNA does not comprise a spacer region.
As used herein, the “5′ end” refers to the first nucleotide of the gRNA (including a dgRNA (typically the 5′ end of the crRNA of the dgRNA), sgRNA or a short-sgRNA), in which the 5′ position is not linked to another nucleotide.
As used herein, a “5′ end modification” refers to a gRNA comprising a guide region having modifications in one or more of the one (1) to about seven (7) nucleotides at its 5′ end, optionally wherein the first nucleotide (from the 5′ end) of the gRNA is modified.
As used herein, the “3′ end” refers to the end or terminal nucleotide of a gRNA, in which the 3′ position is not linked to another nucleotide. In some embodiment, the 3′ end is in the 3′ tail. In some embodiments, the 3′ end is in the conserved portion of an gRNA.
As used herein, a “3′ end modification” refers to a gRNA having modifications in one or more of the one (1) to about seven (7) nucleotides at its 3′ end, optionally wherein the last nucleotide (i.e., the 3′ most nucleotide) of the gRNA is modified. If a 3′ tail is present, the 1 to about 7 nucleotides may be within the 3′ tail. If a 3′ tail is not present, the 1 to about 7 nucleotides may be within the conserved portion of a sgRNA.
The “last,” “second to last,” “third to last,” etc., nucleotide refers to the 3′ most, second 3′ most, third 3′ most, etc., nucleotide, respectively in a given sequence. For example, in the sequence 5′-AAACTG-3′, the last, second to last, and third to last nucleotides are G, T, and C, respectively. The phrase “last 3 nucleotides” refers to the last, second to last, and third to last nucleotides; more generally, “last N nucleotides” refers to the last to the Nth to last nucleotides, inclusive. “Third nucleotide from the 3′ end of the 3′ terminus” is equivalent to “third to last nucleotide.” Similarly, “third nucleotide from the 5′ end of the 5′ terminus” is equivalent to “third nucleotide at the 5′ terminus.”
As used herein, a “protective end modification” (such as a protective 5′ end modification or protective 3′ end modification) refers to a modification of one or more nucleotides within seven nucleotides of the end of an sgRNA that reduces degradation of the sgRNA, such as exonucleolytic degradation. In some embodiments, a protective end modification comprises modifications of at least two or at least three nucleotides within seven nucleotides of the end of the sgRNA. In some embodiments, the modifications comprise phosphorothioate linkages, 2′ modifications such as 2′-OMe or 2′-fluoro, 2′-H (DNA), ENA, UNA, or a combination thereof. In some embodiments, the modifications comprise phosphorothioate linkages and 2′-OMe modifications. In some embodiments, at least three terminal nucleotides are modified, e.g., with phosphorothioate linkages or with a combination of phosphorothioate linkages and 2′-OMe modifications. Modifications known to those of skill in the art to reduce exonucleolytic degradation are encompassed.
In some embodiments, a “3′ tail” comprising between 1 and about 20 nucleotides follows the conserved portion of a sgRNA at its 3′ end.
As used herein, an “RNA-guided DNA binding agent” means a polypeptide or complex of polypeptides having RNA and DNA binding activity, or a DNA-binding subunit of such a complex, wherein the DNA binding activity is sequence-specific and depends on the sequence of the RNA. Exemplary RNA-guided DNA binding agents include Cas cleavases/nickases and inactivated forms thereof (“dCas DNA binding agents”). “Cas nuclease”, also called “Cas protein”, as used herein, encompasses Cas cleavases, Cas nickases, and dCas DNA binding agents. Cas cleavases/nickases and dCas DNA binding agents include a Csm or Cmr complex of a type III CRISPR system, the Cas10, Csm1, or Cmr2 subunit thereof, a Cascade complex of a type I CRISPR system, the Cas3 subunit thereof, and Class 2 Cas nucleases. As used herein, a “Class 2 Cas nuclease” is a single-chain polypeptide with RNA-guided DNA binding activity, such as a Cas9 nuclease or a Cpf1 nuclease. Class 2 Cas nucleases include Class 2 Cas cleavases and Class 2 Cas nickases (e.g., H840A, D10A, or N863A variants), which further have RNA-guided DNA cleavases or nickase activity, and Class 2 dCas DNA binding agents, in which cleavase/nickase activity is inactivated. Class 2 Cas nucleases include, for example, Cas9, Cpf1, C2c1, C2c2, C2c3, HF Cas9 (e.g., N497A, R661A, Q695A, Q926A variants), HypaCas9 (e.g., N692A, M694A, Q695A, H698A variants), eSPCas9(1.0) (e.g, K810A, K1003A, R1060A variants), and eSPCas9(1.1) (e.g., K848A, K1003A, R1060A variants) proteins and modifications thereof. Cpf1 protein, Zetsche et al., Cell, 163: 1-13 (2015), is homologous to Cas9, and contains a RuvC-like nuclease domain. Cpf1 sequences of Zetsche are incorporated by reference in their entirety. See, e.g., Zetsche, Tables S1 and S3. “Cas9” encompasses Spy Cas9, the variants of Cas9 listed herein, and equivalents thereof. See, e.g., Makarova et al., Nat Rev Microbiol, 13(11): 722-36 (2015); Shmakov et al., Molecular Cell, 60:385-397 (2015).
As used herein, a first sequence is considered to “comprise a sequence with at least X % identity to” a second sequence if an alignment of the first sequence to the second sequence shows that X % or more of the positions of the second sequence in its entirety are matched by the first sequence. For example, the sequence AAGA comprises a sequence with 100% identity to the sequence AAG because an alignment would give 100% identity in that there are matches to all three positions of the second sequence. The differences between RNA and DNA (generally the exchange of uridine for thymidine or vice versa) and the presence of nucleoside analogs such as modified uridines do not contribute to differences in identity or complementarity among polynucleotides as long as the relevant nucleotides (such as thymidine, uridine, or modified uridine) have the same complement (e.g., adenosine for all of thymidine, uridine, or modified uridine; another example is cytosine and 5-methylcytosine, both of which have guanosine or modified guanosine as a complement). Thus, for example, the sequence 5′-AXG where X is any modified uridine, such as pseudouridine, N1-methyl pseudouridine, or 5-methoxyuridine, is considered 100% identical to AUG in that both are perfectly complementary to the same sequence (5′-CAU). Exemplary alignment algorithms are the Smith-Waterman and Needleman-Wunsch algorithms, which are well-known in the art. One skilled in the art will understand what choice of algorithm and parameter settings are appropriate for a given pair of sequences to be aligned; for sequences of generally similar length and expected identity >50% for amino acids or >75% for nucleotides, the Needleman-Wunsch algorithm with default settings of the Needleman-Wunsch algorithm interface provided by the EBI at the www.ebi.ac.uk web server is generally appropriate.
“mRNA” is used herein to refer to a polynucleotide that is RNA or modified RNA and comprises an open reading frame that can be translated into a polypeptide (i.e., can serve as a substrate for translation by a ribosome and amino-acylated tRNAs). mRNA can comprise a phosphate-sugar backbone including ribose residues or analogs thereof, e.g., 2′-methoxy ribose residues. In some embodiments, the sugars of a nucleic acid phosphate-sugar backbone consist essentially of ribose residues, 2′-methoxy ribose residues, or a combination thereof. In general, mRNAs do not contain a substantial quantity of thymidine residues (e.g., 0 residues or fewer than 30, 20, 10, 5, 4, 3, or 2 thymidine residues; or less than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 4%, 3%, 2%, 1%, 0.5%, 0.2%, or 0.1% thymidine content). An mRNA can contain modified uridines at some or all of its uridine positions.
As used herein, the “minimum uridine content” of a given ORF is the uridine content of an ORF that (a) uses a minimal uridine codon at every position and (b) encodes the same amino acid sequence as the given ORF. The minimal uridine codon(s) for a given amino acid is the codon(s) with the fewest uridines (usually 0 or 1 except for a codon for phenylalanine, where the minimal uridine codon has 2 uridines). Modified uridine residues are considered equivalent to uridines for the purpose of evaluating minimum uridine content.
As used herein, the “minimum uridine dinucleotide content” of a given ORF is the lowest possible uridine dinucleotide (UU) content of an ORF that (a) uses a minimal uridine codon (as discussed above) at every position and (b) encodes the same amino acid sequence as the given ORF. The uridine dinucleotide (UU) content can be expressed in absolute terms as the enumeration of UU dinucleotides in an ORF or on a rate basis as the percentage of positions occupied by the uridines of uridine dinucleotides (for example, AUUAU would have a uridine dinucleotide content of 40% because 2 of 5 positions are occupied by the uridines of a uridine dinucleotide). Modified uridine residues are considered equivalent to uridines for the purpose of evaluating minimum uridine dinucleotide content.
As used herein, the “minimum adenine content” of a given open reading frame (ORF) is the adenine content of an ORF that (a) uses a minimal adenine codon at every position and (b) encodes the same amino acid sequence as the given ORF. The minimal adenine codon(s) for a given amino acid is the codon(s) with the fewest adenines (usually 0 or 1 except for a codon for lysine and asparagine, where the minimal adenine codon has 2 adenines). Modified adenine residues are considered equivalent to adenines for the purpose of evaluating minimum adenine content.
As used herein, the “minimum adenine dinucleotide content” of a given open reading frame (ORF) is the lowest possible adenine dinucleotide (AA) content of an ORF that (a) uses a minimal adenine codon (as discussed above) at every position and (b) encodes the same amino acid sequence as the given ORF. The adenine dinucleotide (AA) content can be expressed in absolute terms as the enumeration of AA dinucleotides in an ORF or on a rate basis as the percentage of positions occupied by the adenines of adenine dinucleotides (for example, UAAUA would have an adenine dinucleotide content of 40% because 2 of 5 positions are occupied by the adenines of an adenine dinucleotide). Modified adenine residues are considered equivalent to adenines for the purpose of evaluating minimum adenine dinucleotide content.
As used herein, a “subject” refers to any member of the animal kingdom. In some embodiments, “subject” refers to humans. In some embodiments, “subject” refers to non-human animals. In some embodiments, “subject” refers to primates. In some embodiments, subjects include, but are not limited to, mammals, birds, reptiles, amphibians, fish, insects, and/or worms. In certain embodiments, the non-human subject is a mammal (e.g., a rodent, a mouse, a rat, a rabbit, a monkey, a dog, a cat, a sheep, cattle, a primate, and/or a pig). In some embodiments, a subject may be a transgenic animal, genetically-engineered animal, and/or a clone. In certain embodiments of the present invention the subject is an adult, an adolescent or an infant. In some embodiments, terms “individual” or “patient” are used and are intended to be interchangeable with “subject”.
Types of Modifications described herein
Guide RNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, and crRNAs) comprising modifications at various positions are disclosed herein. In some embodiments, a position of a gRNA that comprises a modification is modified with any one or more of the following types of modifications.
Modified sugars are believed to control the puckering of nucleotide sugar rings, a physical property that influences oligonucleotide binding affinity for complementary strands, duplex formation, and interaction with nucleases. Substitutions on sugar rings can therefore alter the conformation and puckering of these sugars. For example, 2′-O-methyl (2′-OMe) modifications can increase binding affinity and nuclease stability of oligonucleotides, though as shown in the Examples, the effect of any modification at a given position in an oligonucleotide needs to be empirically determined.
The terms “mA,” “mC,” “mU,” or “mG” may be used to denote a nucleotide that has been modified with 2′-OMe.
A ribonucleotide and a modified 2′-O-methyl ribonucleotide can be depicted as follows:
2′-O-(2-methoxyethyl) Modifications
In some embodiments, the modification may be 2′-O-(2-methoxyethyl) (2′-O-moe). A modified 2′-O-moe ribonucleotide can be depicted as follows:
The terms “moeA,” “moeC,” “moeU,” or “moeG” may be used to denote a nucleotide that has been modified with 2′-O-moe.
2′-fluoro Modifications
Another chemical modification that has been shown to influence nucleotide sugar rings is halogen substitution. For example, 2′-fluoro (2′-F) substitution on nucleotide sugar rings can increase oligonucleotide binding affinity and nuclease stability.
In this application, the terms “fA,” “fC,” “fU,” or “fG” may be used to denote a nucleotide that has been substituted with 2′-F.
A ribonucleotide without and with a 2′-F substitution can be depicted as follows:
A phosphorothioate (PS) linkage or bond refers to a bond where a sulfur is substituted for one nonbridging phosphate oxygen in a phosphodiester linkage, for example between nucleotides. When phosphorothioates are used to generate oligonucleotides, the modified oligonucleotides may also be referred to as S-oligos.
A “*” may be used to depict a PS modification. In this application, the terms A*, C*, U*, or G* may be used to denote a nucleotide that is linked to the next (e.g., 3′) nucleotide with a PS bond. Throughout this application, PS modifications are grouped with the nucleotide whose 3′ carbon is bonded to the phosphorothioate; thus, indicating that a PS modification is at position 1 means that the phosphorothioate is bonded to the 3′ carbon of nucleotide 1 and the 5′ carbon of nucleotide 2. Thus, where a YA site is indicated as being “PS modified” or the like, the PS linkage is between the Y and A or between the A and the next nucleotide.
In this application, the terms “mA*,” “mC*,” “mU*,” or “mG*” may be used to denote a nucleotide that has been substituted with 2′-OMe and that is linked to the next (e.g., 3′) nucleotide with a PS linkage, which may sometimes be referred to as a “PS bond.” Similarly, the terms “fA*,” “fC*,” “fU*,” or “fG*” may be used to denote a nucleotide that has been substituted with 2′-F and that is linked to the next (e.g., 3′) nucleotide with a PS linkage. Equivalents of a PS linkage or bond are encompassed by embodiments described herein.
The diagram below shows the substitution of S- for a nonbridging phosphate oxygen, generating a PS linkage in lieu of a phosphodiester linkage:
Abasic nucleotides refer to those which lack nitrogenous bases. The figure below depicts an oligonucleotide with an abasic (in this case, shown as apurinic; an abasic site could also be an apyrimidinic site, wherein the description of the abasic site is typically in reference to Watson-Crick base pairing—e.g., an apurinic site refers to a site that lacks a nitrogenous base and would typically base pair with a pyrimidinic site) site that lacks a base, wherein the base may be substituted by another moiety at the 1′ position of the furan ring (e.g., a hydroxyl group, as shown below, to form a ribose or deoxyribose site, as shown below, or a hydrogen):
Inverted bases refer to those with linkages that are inverted from the normal 5′ to 3′ linkage (i.e., either a 5′ to 5′ linkage or a 3′ to 3′ linkage). For example:
An abasic nucleotide can be attached with an inverted linkage. For example, an abasic nucleotide may be attached to the terminal 5′ nucleotide via a 5′ to 5′ linkage, or an abasic nucleotide may be attached to the terminal 3′ nucleotide via a 3′ to 3′ linkage. An inverted abasic nucleotide at either the terminal 5′ or 3′ nucleotide may also be called an inverted abasic end cap. In this application, the terms “invd” indicates an inverted abasic nucleotide linkage.
A deoxyribonucleotide (in which the sugar comprises a 2′-deoxy position) is considered a modification in the context of a gRNA, in that the nucleotide is modified relative to standard RNA by the substitution of a proton for a hydroxyl at the 2′ position. Unless otherwise indicated, a deoxyribonucleotide modification at a position that is U in an unmodified RNA can also comprise replacement of the U nucleobase with a T.
Exemplary bicyclic ribose analogs include locked nucleic acid (LNA), ENA, bridged nucleic acid (BNA), or another LNA-like modifications. In some instances, a bicyclic ribose analog has 2′ and 4′ positions connected through a linker. The linker can be of the formula —X—(CH2)n— where n is 1 or 2; X is O, NR, or S; and R is H or C1-3 alkyl, e.g., methyl. Examples of bicyclic ribose analogs include LNAs comprising a 2′-O—CH2-4′ bicyclic structure (oxy-LNA) (see WO 98/39352 and WO 99/14226); 2′-NH—CH2-4′ or 2′-N(CH3)—CH2-4′ (amino-LNAs) (Singh et al., J. Org. Chem. 63:10035-10039 (1998); Singh et al., J. Org. Chem. 63:6078-6079 (1998)); and 2′-S—CH2-4′ (thio-LNA) (Singh et al., J. Org. Chem. 63:6078-6079 (1998); Kumar et al., Biorg. Med. Chem. Lett. 8:2219-2222 (1998)).
An ENA modification refers to a nucleotide comprising a 2′-O,4′-C-ethylene modification. An exemplary structure of an ENA nucleotide is shown below, in which wavy lines indicate connections to the adjacent nucleotides (or terminal positions as the case may be, with the understanding that if the 3′ terminal nucleotide is an ENA nucleotide, the 3′ position may comprise a hydroxyl rather than phosphate). For further discussion of ENA nucleotides, see, e.g., Koizumi et al., Nucleic Acids Res. 31: 3267-3273 (2003).
A UNA or unlocked nucleic acid modification refers to a nucleotide comprising a 2′,3′-seco-RNA modification, in which the 2′ and 3′ carbons are not bonded directly to each other. An exemplary structure of a UNA nucleotide is shown below, in which wavy lines indicate connections to the adjacent phosphates or modifications replacing phosphates (or terminal positions as the case may be). For further discussion of UNA nucleotides, see, e.g., Snead et al., Molecular Therapy 2: e103, doi:10.1038/mtna.2013.36 (2013).
A base modification is any modification that alters the structure of a nucleobase or its bond to the backbone, including isomerization (as in pseudouridine). In some embodiments, a base modification includes inosine. In some embodiments, a modification comprises a base modification that reduces RNA endonuclease activity, e.g., by interfering with recognition of a cleavage site by an RNase and/or by stabilizing an RNA structure (e.g., secondary structure) that decreases accessibility of a cleavage site to an RNase. Exemplary base modifications that can stabilize RNA structures are pseudouridine and 5-methylcytosine. See Peacock et al., J Org Chem. 76: 7295-7300 (2011). In some embodiments, a base modification can increase or decrease the melting temperature (Tm) of a nucleic acid, e.g., by increasing the hydrogen bonding in a Watson-Crick base pair, forming non-canonical base pair, or creating a mismatched base pair.
The above modifications and their equivalents are included within the scope of the embodiments described herein.
A modification at a YA site (also referred to as a YA modification) can be a modification of the internucleoside linkage, a modification of the base (pyrimidine or adenine), e.g. by chemical modification, substitution, or otherwise, and/or a modification of the sugar (e.g. at the 2′ position, such as 2′-O-alkyl, 2′-F, 2′-moe, 2′-F arabinose, 2′-H (deoxyribose), and the like). In some embodiments, a “YA modification” is any modification that alters the structure of the dinucleotide motif to reduce RNA endonuclease activity, e.g., by interfering with recognition or cleavage of a YA site by an RNase and/or by stabilizing an RNA structure (e.g., secondary structure) that decreases accessibility of a cleavage site to an RNase. See Peacock et al., J Org Chem. 76: 7295-7300 (2011); Behlke, Oligonucleotides 18:305-320 (2008); Ku et al., Adv. Drug Delivery Reviews 104: 16-28 (2016); Ghidini et al., Chem. Commun., 2013, 49, 9036. Peacock et al., Belhke, Ku, and Ghidini provide exemplary modifications suitable as YA modifications. Modifications known to those of skill in the art to reduce endonucleolytic degradation are encompassed. Exemplary 2′ ribose modifications that affect the 2′ hydroxyl group involved in RNase cleavage are 2′-H and 2′-O-alkyl, including 2′-O-Me. Modifications such as bicyclic ribose analogs, UNA, and modified internucleoside linkages of the residues at the YA site can be YA modifications. Exemplary base modifications that can stabilize RNA structures are pseudouridine and 5-methylcytosine. In some embodiments, at least one nucleotide of the YA site is modified. In some embodiments, the pyrimidine (also called “pyrimidine position”) of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the pyrimidine, a modification of the pyrimidine base, and a modification of the ribose, e.g. at its 2′ position). In some embodiments, the adenine (also called “adenine position”) of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the pyrimidine, a modification of the pyrimidine base, and a modification of the ribose, e.g. at its 2′ position). In some embodiments, the pyrimidine and the adenine of the YA site comprise modifications. In some embodiments, the YA modification reduces RNA endonuclease activity.
The above modifications and their equivalents are included within the scope of the embodiments described herein.
Domains/regions of sgRNAs
Briner A E et al., Molecular Cell 56:333-339 (2014) describes functional domains of sgRNAs, referred to herein as “domains”, including the “spacer” domain responsible for targeting, the “lower stem”, the “bulge”, “upper stem” (which may include a tetraloop), the “nexus”, and the “hairpin 1” and “hairpin 2” domains. See, Briner et al. at page 334,
Table 3 provides a schematic of the domains of an sgRNA as used herein. In Table 3, the “n” between regions represents a variable number of nucleotides, for example, from 0 to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more. In some embodiments, n equals 0. In some embodiments, n equals 1.
5′ Terminus Region
In some embodiments, the sgRNA or short-sgRNA comprises nucleotides at the 5′ terminus as shown in Table 3. In some embodiments, the 5′ terminus of the sgRNA or short-sgRNA comprises a spacer or guide region that functions to direct a Cas protein, e.g., a Cas9 protein, to a target nucleotide sequence. In some embodiments, the 5′ terminus does not comprise a guide region. In some embodiments, the 5′ terminus comprises a spacer and additional nucleotides that do not function to direct a Cas protein to a target nucleotide region.
Lower Stem
In some embodiments, the sgRNA or short-sgRNA comprises a lower stem (LS) region that when viewed linearly, is separated by a bulge and upper stem regions. See Table 3.
In some embodiments, the lower stem regions comprise 1-12 nucleotides, e.g. in one embodiment the lower stem regions comprise LS1-LS12. In some embodiments, the lower stem region comprises fewer nucleotides than shown in Table 3. In some embodiments, the lower stem region comprises more nucleotides than shown in Table 3. When the lower stem region comprises fewer or more nucleotides than shown in the schematic of Table 3, the modification pattern, as will be apparent to the skilled artisan, should be maintained.
In some embodiments, the lower stem region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence of lower stem leads to a secondary structure of a stem in the sgRNA or short-sgRNA (e.g., the regions may base pair with one another). In some embodiments, the lower stem regions may not be perfectly complimentary to each other when read in opposite directions.
Bulge
In some embodiments, the sgRNA or short-sgRNA comprises a bulge region comprising six nucleotides, B1-B6. When viewed linearly, the bulge region is separated into two regions. See Table 3. In some embodiments, the bulge region comprises six nucleotides, wherein the first two nucleotides are followed by an upper stem region, followed by the last four nucleotides of the bulge. In some embodiments, the bulge region comprises fewer nucleotides than shown in Table 3. In some embodiments, the bulge region comprises more nucleotides than shown in Table 3. When the bulge region comprises fewer or more nucleotides than shown in the schematic of Table 3, the modification pattern, as will be apparent to the skilled artisan, should be maintained.
In some embodiments, the presence of a bulge results in a directional kink between the upper and lower stem modules in an sgRNA or short-sgRNA.
Upper Stem
In some embodiments, the sgRNA or short-sgRNA comprises an upper stem region comprising 12 nucleotides. In some embodiments, the upper stem region comprises a loop sequence. In some instances, the loop is a tetraloop (loop consisting of four nucleotides).
In some embodiments, the upper stem region comprises fewer nucleotides than shown in Table 3. In some embodiments, the upper stem region comprises more nucleotides than shown in Table 3. When the upper stem region comprises fewer or more nucleotides than shown in the schematic of Table 3, the modification pattern, as will be apparent to the skilled artisan, should be maintained.
In some embodiments, the upper stem region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence of upper stem leads to a secondary structure of a stem in the sgRNA or short-sgRNA (e.g., the regions may base pair with one another). In some embodiments, the upper stem regions may not be perfectly complimentary to each other when read in opposite directions.
Nexus
In some embodiments, the sgRNA or short-sgRNA comprises a nexus region that is located between the lower stem region and the hairpin 1 region. In some embodiments, the nexus comprises 18 nucleotides. In some embodiments, the nexus region comprises nucleotides N1 through N18 as shown in Table 3.
In some embodiments, the nexus region comprises fewer nucleotides than shown in Table 3. In some embodiments, the nexus region comprises more nucleotides than shown in Table 3. When the nexus region comprises fewer or more nucleotides than shown in the schematic of Table 3, the modification pattern, as will be apparent to the skilled artisan, should be maintained.
In some embodiments, the nexus region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the complementarity in nucleic acid sequence leads to a secondary structure of a stem and/or stem loop in the sgRNA or short-sgRNA (e.g., certain nucleotides in the nexus region may base pair with one another). In some embodiments, the nexus regions may not be perfectly complimentary to each other when read in opposite directions.
Hairpin
In some embodiments, the sgRNA or short-sgRNA comprises one or more hairpin regions. In some embodiments, the hairpin region is downstream of (e.g., 3′ to) the nexus region. In some embodiments, the region of nucleotides immediately downstream of the nexus region is termed “hairpin 1” or “H1”. In some embodiments, the region of nucleotides 3′ to hairpin 1 is termed “hairpin 2” or “H2”. In some embodiments, the hairpin region comprises both hairpin 1 and hairpin 2. In some embodiments, the sgRNA or short-sgRNA comprises hairpin 1 or hairpin 2.
In some embodiments, the hairpin 1 region comprises 12 nucleic acids immediately downstream of the nexus region. In some embodiments, the hairpin 1 region comprises nucleotides H1-1 through H1-12 as shown in Table 3.
In some embodiments, the hairpin 2 region comprises 15 nucleic acids downstream of the hairpin 1 region. In some embodiments, the hairpin 2 region comprises nucleotides H2-1 through H2-15 as shown in Table 3.
In some embodiments, one or more nucleotides is present between the hairpin 1 and the hairpin 2 regions. The one or more nucleotides between the hairpin 1 and hairpin 2 region may be modified or unmodified. In some embodiments, hairpin 1 and hairpin 2 are separated by one nucleotide. In some embodiments, the hairpin regions comprise fewer nucleotides than shown in Table 3. In some embodiments, the hairpin regions comprise more nucleotides than shown in Table 3. When a hairpin region comprises fewer or more nucleotides than shown in the schematic of Table 3, the modification pattern, as will be apparent to the skilled artisan, should be maintained.
In some embodiments, a hairpin region has nucleotides that are complementary in nucleic acid sequence when read in opposite directions. In some embodiments, the hairpin regions may not be perfectly complimentary to each other when read in opposite directions (e.g., the top or loop of the hairpin comprises unpaired nucleotides).
In some embodiments, the sgRNA or short-sgRNA comprises replacement of hairpin 1 with nucleotides “n”, wherein “n” is an integer between 1 and 50, 40, 30, 20, 15, 10, 5, 4, 3, and 2. In some embodiments, the hairpin 1 region of an sgRNA is replaced by 2 nucleotides.
3′ Terminus
The sgRNA or short-sgRNA has a 3′ end, which is the last nucleotide of the sgRNA. The 3′ terminus region includes the last 1-7 nucleotides from the 3′ end. In some embodiments, the 3′ end is the end of hairpin 2. In some embodiments, the sgRNA comprises nucleotides after the hairpin region(s). In some embodiments, the sgRNA includes a 3′ tail region, in which case the last nucleotide of the 3′ tail is the 3′ terminus. In some embodiments, the 3′ tail comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15 or 20 or more nucleotides, e.g. that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ tail region comprises 1, 2, 3, or 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ tail region comprises 4 nucleotides that are not associated with the secondary structure of a hairpin. In some embodiments, the 3′ tail region comprises 1, 2, or 3 nucleotides that are not associated with the secondary structure of a hairpin.
gRNAs comprising Modifications, including Modifications of YA Sites
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA) described herein comprises modifications at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or more YA sites (e.g., in the conserved region and/or the guide region) and/or a modification, such as a YA modification, at one or more nucleotides located at or after nucleotide 6 from the 5′ end of the 5′ terminus. In some embodiments, the pyrimidine of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the pyrimidine). In some embodiments, the adenine of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the adenine). In some embodiments, the pyrimidine and the adenine of the YA site comprise modifications, such as sugar, base, or internucleoside linkage modifications. The YA modifications can be any of the types of modifications set forth herein. In some embodiments, the YA modifications comprise one or more of phosphorothioate, 2′-OMe, or 2′-fluoro. In some embodiments, the YA modifications comprise pyrimidine modifications comprising one or more of phosphorothioate, 2′-OMe, or 2′-fluoro. In some embodiments, the YA modification comprises a bicyclic ribose analog (e.g., an LNA, BNA, or ENA) within an RNA duplex region that contains one or more YA sites. In some embodiments, the YA modification comprises a bicyclic ribose analog (e.g., an LNA, BNA, or ENA) within an RNA duplex region that contains a YA site, wherein the YA modification is distal to the YA site.
Any of the embodiments described above may be combined with the following: (i) at least one of nucleotides 8-11, 13, 14, 17, or 18 from the 5′ end of the 5′ terminus does not comprise a 2′-fluoro modification, and/or (ii) at least one of nucleotides 6-10 from the 5′ end of the 5′ terminus does not comprise a phosphorothioate linkage; and (i) at least one of nucleotides 7-10 from the 5′ end of the 5′ terminus does not comprise a 2′-OMe modification, (ii) nucleotide 20 from the 5′ end of the 5′ terminus does not comprise a 2′-OMe modification, and/or (iii) or the guide RNA comprises a 2′-fluoro modification at any one or more of nucleotides 1-20 from the 5′ end of the 5′ terminus and at least one of nucleotides 11, 12, 13, 14, 17, or 18 from the 5′ end of the 5′ terminus does not comprise a 2′-fluoro modification, optionally wherein nucleotide 12 from the 5′ end of the 5′ terminus does not comprise a 2′-fluoro modification. Such embodiments may be further, or alternatively, combined with any other one or more embodiments described herein to the extent feasible.
Guide Region Modifications including YA Site Modifications
In some embodiments, the guide region comprises one or more modifications, optionally including YA site modifications. In some embodiments, the guide region comprises 1, 2, 3, 4, 5, or more YA sites (“guide region YA sites”) that may comprise YA modifications. In some embodiments, one or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus (where “5-end”, etc., refers to position 5 to the 3′ end of the guide region, i.e., the most 3′ nucleotide in the guide region) comprise YA modifications. In some embodiments, two or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, three or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, four or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, five or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. A modified guide region YA site comprises a YA modification.
In some embodiments, a modified guide region YA site is within 17, 16, 15, 14, 13, 12, 11, 10, or 9 nucleotides of the 3′ terminal nucleotide of the guide region. For example, if a modified guide region YA site is within 10 nucleotides of the 3′ terminal nucleotide of the guide region and the guide region is 20 nucleotides long, then the modified nucleotide of the modified guide region YA site is located at any of positions 11-20. In some embodiments, a YA modification is located within a YA site 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 nucleotides from the 3′ terminal nucleotide of the guide region. In some embodiments, a YA modification is located 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 nucleotides from the 3′ terminal nucleotide of the guide region.
In some embodiments, a modified guide region YA site is at or after nucleotide 4, 5, 6, 7, 8, 9, 10, or 11 from the 5′ end of the 5′ terminus.
In some embodiments, a modified guide region YA site is other than a 5′ end modification. For example, a gRNA can comprise a 5′ end modification as described herein and further comprise a modified guide region YA site. Alternatively, a gRNA can comprise an unmodified 5′ end and a modified guide region YA site. Alternatively, a gRNA can comprise a modified 5′ end and an unmodified guide region YA site.
In some embodiments, a modified guide region YA site comprises a modification that at least one nucleotide located 5′ of the guide region YA site does not comprise. For example, if nucleotides 1-3 comprise phosphorothioates, nucleotide 4 comprises only a 2′-OMe modification, and nucleotide 5 is the pyrimidine of a YA site and comprises a phosphorothioate, then the modified guide region YA site comprises a modification (phosphorothioate) that at least one nucleotide located 5′ of the guide region YA site (nucleotide 4) does not comprise. In another example, if nucleotides 1-3 comprise phosphorothioates, and nucleotide 4 is the pyrimidine of a YA site and comprises a 2′-OMe, then the modified guide region YA site comprises a modification (2′-OMe) that at least one nucleotide located 5′ of the guide region YA site (any of nucleotides 1-3) does not comprise. This condition is also always satisfied if an unmodified nucleotide is located 5′ of the modified guide region YA site.
In some embodiments, the guide region comprises modifications at 1-14 of nucleotides 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 17, and 18 of the guide region. Such modifications may be 2′-OMe, 2′-fluoro, 2′-H, inosine, or phosphorothioate modifications, or a combination thereof. For example, 2′-OMe modifications may be included at any or all of nucleotides 1-4 and 12; phosphorothioate modifications may be included at any or all of nucleotides 1-3 and 6-10; and/or 2′-fluoro modifications may be included at any or all of nucleotides 8-11, 13, 14, 17, and 18. In negative terms, 2′-OMe modifications may be excluded from nucleotides 6-11 and 13-end; 2′-fluoro modifications may be excluded from nucleotides 1-7, 15, 16, and 20 (if present); and/or phosphorothioate modifications may be excluded from nucleotides 4, 5, 11-14, 17, and 18. In some embodiments, nucleotides are modified in a YA-site dependent manner, e.g., if a YA site is present at any of nucleotides 5-6, 12-13, 15-16, 16-17, or 19-20, then at least one nucleotide of the YA site is modified, e.g., at least the pyrimidine of the YA site is modified, optionally wherein the nucleotides at positions 5, 12, 15, 16, and 19 are unmodified if they are not the pyrimidine of a YA site. In some embodiments, the modification at nucleotide 5 when it is the pyrimidine of a YA site is 2′-OMe; the modification at nucleotide 12 when it is the pyrimidine of a YA site is 2′-OMe; the modification at nucleotide 15 when it is the pyrimidine of a YA site is phosphorothioate; the modification at nucleotide 16 when it is the pyrimidine of a YA site is phosphorothioate; and/or the modification at nucleotide 19 when it is the pyrimidine of a YA site is phosphorothioate. Recognizing that YA sites cannot be present at both positions 15-16 and 16-17, it is thus possible for there to be up to four modifications contingent on the presence of YA sites. In an alternative embodiment, the modification at nucleotide 19 may instead be a 2′-fluoro. This can be present in a YA site-dependent manner or it can be present regardless of whether there is a YA site at positions 19-20. In some embodiments, nucleotide 15 and 16 are unmodified or modified only with a phosphorothioate, e.g., only at a nucleotide which is the pyrimidine of a YA site located at nucleotides 15-16 or 16-17. In some embodiments, nucleotides 15 and 16 comprise unmodified riboses and/or unmodified nucleobases. In some embodiments, nucleotide 5 is unmodified or is modified only with 2′-OMe if it is the pyrimidine of a YA site. In some embodiments, nucleotide 12 is unmodified or is modified only with 2′-OMe if it is the pyrimidine of a YA site. In some embodiments, nucleotide 20 (or the 3′-terminal nucleotide of the guide region) is unmodified. In any of the foregoing embodiments, the guide region may consist of 20 nucleotides.
In some embodiments, a gRNA comprises a guide region that comprises a modification at one or more of nucleotide 5 and/or 12. The modifications at nucleotide 5 and/or 12 may be independently selected from modifications described herein, e.g., 2′-OMe, 2′-F, phosphorothioate, and 2′-H (a deoxyribonucleotide). Such modifications may be combined with another modification pattern or nucleotide modifications described herein, e.g., as shown in a gRNA described herein. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of 2′-OMe modifications at nucleotides 1-4, phosphorothioate modifications at nucleotides 1-3 and 6-10, and/or 2′-F modifications at nucleotides 8-11, 13, 14, 17, and 18.
In some embodiments, a gRNA comprises a guide region that comprises modifications at any one, two, or all of nucleotides 8-10. Such modifications may be combined with another modification pattern or nucleotide modifications described herein, e.g., as shown in a gRNA described herein. The modifications may be independently selected from modifications described herein, e.g., 2′-F modifications and phosphorothioate modifications, or a combination thereof. In some embodiments, any one, two, or all of nucleotides 8-10 comprise 2′-F modifications. In some embodiments, any one, two, or all of nucleotides 8-10 comprise 2′-F modifications but not phosphorothioate modifications. In some embodiments, any one, two, or all of nucleotides 8-10 comprise 2′-F modifications and phosphorothioate modifications. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of 2′-OMe modifications at nucleotides 1-4, phosphorothioate modifications at nucleotides 1-3 and 6-7, and/or 2′-F modifications at nucleotides 11, 13, 14, 17, and 18.
In some embodiments, a gRNA comprises a guide region that comprises modifications at any one or both of nucleotides 5 and 6. The modifications may be independently selected from modifications described herein, e.g., 2′-F modifications and phosphorothioate modifications, or a combination thereof. In some embodiments, any one or both of nucleotides 5 and 6 comprise 2′-F modifications. In some embodiments, any one or both of nucleotides 5 and 6 comprise 2′-F modifications but not phosphorothioate modifications. In some embodiments, any one or both of nucleotides 5 and 6 comprise 2′-F modifications and phosphorothioate modifications. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of 2′-OMe modifications at nucleotides 1-4, phosphorothioate modifications at nucleotides 1-3 and 7-10, and/or 2′-F modifications at nucleotides 8-11, 13, 14, 17, and 18.
In some embodiments, a gRNA comprises a guide region that comprises modifications at at least 1, 2, 3, 4, 5, or 6 of nucleotides 6-11. The modifications may be independently selected from modifications described herein, e.g., 2′-F modifications. In some embodiments, 2′-F modifications at 1, 2, 3, 4, 5, or 6 of nucleotides 6-11 are combined with another compatible modification, such as a phosphorothioate modification, at one or more of the positions comprising a 2′-F modification. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of 2′-OMe modifications at nucleotides 1-4, phosphorothioate modifications at nucleotides 1-3, and/or 2′-F modifications at nucleotides 13, 14, 17, and 18.
In some embodiments, a gRNA comprises a guide region that comprises modifications at at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 of nucleotides 1-4 and 6-11. The modifications may be independently selected from modifications described herein, e.g., 2′-F modifications. In some embodiments, 2′-F modifications at 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 of nucleotides 1-4 and 6-11 are combined with another compatible modification, such as a phosphorothioate modification, at one or more of the positions comprising a 2′-F modification. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of phosphorothioate modifications at nucleotides 1-3 and/or 2′-F modifications at nucleotides 13, 14, 17, and 18.
In some embodiments, a gRNA comprises a guide region that comprises 2′-OMe modifications at at least 1, 2, 3, or 4 of nucleotides 9, 11, 13, and 14. In some embodiments, 2′-OMe modifications at at least 1, 2, 3, or 4 of nucleotides 9, 11, 13, and 14 are combined with another compatible modification, such as a phosphorothioate modification, at one or more of the positions comprising a 2′-OMe modification. Particular examples of such embodiments are described herein, e.g., in certain numbered embodiments set forth above and in modification patterns represented by sequences in the Table of Sequences. In some embodiments, such a modification is combined with one or more, or all, of 2′-OMe modifications at nucleotides 1-4 and/or phosphorothioate modifications at nucleotides 1-3 and 6-10.
In some embodiments, the modified guide region YA sites comprise modifications as described for YA sites above.
Additional embodiments of guide region YA site modifications are set forth in the summary above. Any embodiments set forth elsewhere in this disclosure may be combined to the extent feasible with any of the foregoing embodiments.
Conserved Region YA Site Modifications
Conserved region YA sites 1-10 are illustrated in
In some embodiments, conserved region YA sites 1, 8, or 1 and 8 comprise YA modifications. In some embodiments, conserved region YA sites 1, 2, 3, 4, and 10 comprise YA modifications. In some embodiments, YA sites 2, 3, 4, 8, and 10 comprise YA modifications. In some embodiments, conserved region YA sites 1, 2, 3, and 10 comprise YA modifications. In some embodiments, YA sites 2, 3, 8, and 10 comprise YA modifications. In some embodiments, YA sites 1, 2, 3, 4, 8, and 10 comprise YA modifications. In some embodiments, 1, 2, 3, 4, 5, 6, 7, or 8 additional conserved region YA sites comprise YA modifications.
In some embodiments, 1, 2, 3, or 4 of conserved region YA sites 2, 3, 4, and 10 comprise YA modifications. In some embodiments, 1, 2, 3, 4, 5, 6, 7, or 8 additional conserved region YA sites comprise YA modifications.
In some embodiments, the modified conserved region YA sites comprise modifications as described for YA sites above.
Additional embodiments of conserved region YA site modifications are set forth in the summary above. Any embodiments set forth elsewhere in this disclosure may be combined to the extent feasible with any of the foregoing embodiments.
In some embodiments, the 5′ and/or 3′ terminus regions of a gRNA (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA) are modified.
3′ Terminus Region Modifications
In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. Throughout, this modification may be referred to as a “3′ end modification”. In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region comprise more than one modification. In some embodiments, at least one of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, at least two of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, at least three of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, the modification comprises a PS linkage. In some embodiments, the modification to the 3′ terminus region is a 3′ protective end modification. In some embodiments, the 3′ end modification comprises a 3′ protective end modification.
In some embodiments, the 3′ end modification comprises a modified nucleotide selected from 2′-O-methyl (2′-O-Me) modified nucleotide, 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, a phosphorothioate (PS) linkage between nucleotides, an inverted abasic modified nucleotide, an ENA, a UNA, a 2′-H (DNA), or combinations thereof.
In some embodiments, the 3′ end modification comprises or further comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises an inverted abasic modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises an ENA.
In some embodiments, the 3′ end modification comprises or further comprises a UNA.
In some embodiments, the 3′ end modification comprises or further comprises a 2′-H (DNA).
In some embodiments, the 3′ end modification comprises or further comprises a modification of any one or more of the last 7, 6, 5, 4, 3, 2, or 1 nucleotides. In some embodiments, the 3′ end modification comprises or further comprises one modified nucleotide. In some embodiments, the 3′ end modification comprises or further comprises two modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises three modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises four modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises five modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises six modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises seven modified nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises a modification of between 1 and 7 or between 1 and 5 nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises modifications of 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 3′ terminus of the gRNA.
In some embodiments, the 3′ end modification comprises or further comprises modifications of about 1-3, 1-5, 1-6, or 1-7 nucleotides at the 3′ terminus of the gRNA.
In some embodiments, the 3′ end modification comprises or further comprises any one or more of the following: a phosphorothioate (PS) linkage between nucleotides, a 2′-O-Me modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, an inverted abasic modified nucleotide, an ENA, a UNA, and a combination thereof.
In some embodiments, the 3′ end modification comprises or further comprises 1, 2, 3, 4, 5, 6, or 7 PS linkages between nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises at least one 2′-O-Me, 2′-O-moe, inverted abasic, or 2′-F modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises one PS linkage, wherein the linkage is between the last and second to last nucleotide. In some embodiments, the 3′ end modification comprises or further comprises two PS linkages between the last three nucleotides. In some embodiments, the 3′ end modification comprises or further comprises four PS linkages between the last four nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last four nucleotides. In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last five nucleotides. In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last 2, 3, 4, 5, 6, or 7 nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises a modification of one or more of the last 1-7 nucleotides, wherein the modification is a PS linkage, inverted abasic nucleotide, 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last nucleotide with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and an optionally one or two PS linkages to the next nucleotide and/or the first nucleotide of the 3′ tail.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last and/or second to last nucleotide with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, and/or third to last nucleotides with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, third to last, and/or fourth to last nucleotides with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, third to last, fourth to last, and/or fifth to last nucleotides with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, an sgRNA or short-sgRNA comprising a 3′ end modification comprises or further comprises a 3′ tail, wherein the 3′ tail comprises a modification of any one or more of the nucleotides present in the 3′ tail. In some embodiments, the 3′ tail is fully modified. In some embodiments, the 3′ tail comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, or 1-10 nucleotides, optionally where any one or more of these nucleotides are modified.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises the 3′ end modification as shown in any one of SEQ ID Nos: 1-132. In some embodiments, an sgRNA is provided comprising a 3′ protective end modification.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) a 2′-OMe modified nucleotide at the last nucleotide of the conserved region of an sgRNA or sgRNA (ii) three consecutive 2′O-moe modified nucleotides immediately 5′ to the 2′-OMe modified nucleotide, and (iii) three consecutive PS linkages between the last three nucleotides.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) five consecutive 2′-OMe modified nucleotides from the last nucleotide of the conserved region of an sgRNA or sgRNA, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises an inverted abasic modified nucleotide at the last nucleotide of the conserved region of an sgRNA or sgRNA.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) an inverted abasic modified nucleotide at the last nucleotide of the conserved region of an sgRNA or short-sgRNA, and (ii) three consecutive 2′-OMe modified nucleotides at the last three nucleotides of the conserved region of an sgRNA or short-sgRNA.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) 15 consecutive 2′-OMe modified nucleotides from the last nucleotide of the conserved region of an sgRNA, (ii) five consecutive 2′-F modified nucleotides immediately 5′ to the 2′-OMe modified nucleotides, and (iii) three PS linkages between the last three nucleotides.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) alternating 2′-OMe modified nucleotides and 2′-F modified nucleotides at the last 20 nucleotides of the conserved region of an sgRNA or sgRNA, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) two or three consecutive 2′-OMe modified nucleotides, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises one PS linkage between the last and next to last nucleotides.
In some embodiments, the sgRNA or short-sgRNA comprises a 5′ end modification and a 3′ end modification.
3′ Tail
In some embodiments, the sgRNA comprises a 3′ terminus comprising a 3′ tail, which follows the 3′ end of the conserved portion of an sgRNA. In some embodiments, the 3′ tail comprises between 1 and about 20 nucleotides, between 1 and about 15 nucleotides, between 1 and about 10 nucleotides, between 1 and about 5 nucleotides, between 1 and about 4 nucleotides, between 1 and about 3 nucleotides, and between 1 and about 2 nucleotides. In some embodiments, the 3′ tail comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. In some embodiments, the 3′ tail comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. In some embodiments, the 3′ tail comprises 1 nucleotide. In some embodiments, the 3′ tail comprises 2 nucleotides. In some embodiments, the 3′ tail comprises 3 nucleotides. In some embodiments, the 3′ tail comprises 4 nucleotides. In some embodiments, the 3′ tail comprises about 1-2, 1-3, 1-4, 1-5, 1-7, 1-10, at least 1-5, at least 1-3, at least 1-4, at least 1-5, at least 1-5, at least 1-7, or at least 1-10 nucleotides.
In some embodiments, the 3′ tail comprising between 1 and 20 nucleotides and follows the 3′ end of the conserved portion of an sgRNA.
In some embodiments, the 3′ tail comprises or further comprises one or more of a protective end modification, a phosphorothioate (PS) linkage between nucleotides, a 2′-O-Me modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, an inverted abasic modified nucleotide, and a combination thereof.
In some embodiments, the 3′ tail comprises or further comprises one or more phosphorothioate (PS) linkages between nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-O-Me modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-O-moe modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-F modified nucleotide. In some embodiments, the 3′ tail comprises or further comprises one or more an inverted abasic modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more protective end modifications. In some embodiments, the 3′ tail comprises or further comprises a combination of one or more of a phosphorothioate (PS) linkage between nucleotides, a 2′-O-Me modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, and an inverted abasic modified nucleotide.
In some embodiments, the sgRNA does not comprise a 3′ tail.
5′ Terminus Region Modifications
In some embodiments, the 5′ terminus region is modified, for example, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA) are modified. Throughout, this modification may be referred to as a “5′ end modification”. In some embodiments, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides of the 5′ terminus region (i.e., the first 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end of the 5′ terminus) comprise more than one modification. In some embodiments, at least one of the terminal (i.e., first) 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end of the 5′ terminus are modified. In some embodiments, at least two of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end of the 5′ terminus are modified. In some embodiments, at least three of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end of the 5′ terminus are modified. In some embodiments, the modification comprises a PS linkage. In some embodiments, the modification to the 5′ terminus region is a 5′ protective end modification. In some embodiments, the 5′ end modification comprises a 5′ protective end modification.
In some embodiments, both the 5′ and 3′ terminus regions of the sgRNA or short-sgRNA are modified (e.g., including the first and last nucleotides of the gRNA). In some embodiments, only the 5′ terminus region of the sgRNA or short-sgRNA is modified. In some embodiments, only the 3′ terminus region (plus or minus a 3′ tail) of the conserved portion of an sgRNA or short-sgRNA is modified.
In some embodiments, the gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the first 7 nucleotides from the 5′ end of the 5′ terminus of the gRNA. In some embodiments, the gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the 7 terminal nucleotides from the 3′ end of the 3′ terminus. In some embodiments, 2, 3, or 4 of the first 4 nucleotides from the 5′ end of the 5′ terminus, and/or 2, 3, or 4 of the terminal 4 nucleotides from the 3′ end of the 3′ terminus are modified. In some embodiments, 2, 3, or 4 of the first 4 nucleotides from the 5′ end of the 5′ terminus are linked with phosphorothioate (PS) bonds.
In some embodiments, the modification to the 5′ terminus and/or 3′ terminus comprises a 2′-O-methyl (2′-O-Me) or 2′-O-(2-methoxyethyl) (2′-O-moe) modification. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modification to a nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) linkage between nucleotides. In some embodiments, the modification comprises an inverted abasic nucleotide. In some embodiments, the modification comprises a protective end modification. In some embodiments, the modification comprises a more than one modification selected from protective end modification, 2′-O-Me, 2′-O-moe, 2′-fluoro (2′-F), a phosphorothioate (PS) linkage between nucleotides, 2′-H (DNA), an ENA, a UNA, and an inverted abasic nucleotide. In some embodiments, an equivalent modification is encompassed.
In some embodiments, the gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises one or more phosphorothioate (PS) linkages between the first one, two, three, four, five, six, or seven nucleotides at the 5′ terminus. In some embodiments, the sgRNA comprises one or more PS linkages between the last one, two, three, four, five, six, or seven nucleotides at the 3′ terminus. In some embodiments, the sgRNA or short-sgRNA comprises one or more PS linkages between both the last one, two, three, four, five, six, or seven nucleotides at the 3′ terminus and the first 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end of the 5′ terminus. In some embodiments, in addition to PS linkages, the 5′ and 3′ terminal nucleotides may comprise 2′-O-Me, 2′-O-moe, or 2′-F modified nucleotides.
In some embodiments, the sgRNA comprises a 5′ end modification, e.g., wherein the first nucleotide of the guide region is modified. In some embodiments, the sgRNA comprises a 5′ end modification, wherein the first nucleotide of the guide region comprises a 5′ protective end modification.
In some embodiments, the 5′ end modification comprises a modified nucleotide selected from 2′-O-methyl (2′-O-Me) modified nucleotide, 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, a phosphorothioate (PS) linkage between nucleotides, an inverted abasic modified nucleotide, an ENA, a UNA, a 2′H (DNA), or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises an inverted abasic modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises an ENA.
In some embodiments, the 5′ end modification comprises or further comprises a UNA.
In some embodiments, the 5′ end modification comprises or further comprises a 2′-H (DNA).
In some embodiments, the 5′ end modification comprises or further comprises a modification of any one or more of nucleotides 1-7 of the guide region of an gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises one modified nucleotide. In some embodiments, the 5′ end modification comprises or further comprises two modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises three modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises four modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises five modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises six modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises seven modified nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises a modification of between 1 and 7, between 1 and 5, between 1 and 4, between 1 and 3, or between 1 and 2 nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises modifications of 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end. In some embodiments, the 5′ end modification comprises or further comprises modifications of about 1-3, 1-4, 1-5, 1-6, or 1-7 nucleotides from the 5′ end.
In some embodiments, the 5′ end modification comprises or further comprises modifications at the first nucleotide from the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications at the first and second nucleotide from the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications from the first, second, and third nucleotide at the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, and fourth nucleotide from the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, fourth, and fifth nucleotide from the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, fourth, fifth, and sixth nucleotide from the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA). In some embodiments, the 5′ end modification comprises or further comprises modifications from the first, second, third, fourth, fifth, sixth, and seventh nucleotide at the 5′ end of the gRNA (e.g., sgRNA, short-sgRNA, or crRNA).
In some embodiments, the 5′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides, and/or a 2′-O-Me modified nucleotide, and/or a 2′-O-moe modified nucleotide, and/or a 2′-F modified nucleotide, and/or an inverted abasic modified nucleotide, and/or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises 1, 2, 3, 4, 5, 6, and/or 7 PS linkages between nucleotides. In some embodiments, the 5′ end modification comprises or further comprises about 1-2, 1-3, 1-4, 1-5, 1-6, or 1-7 PS linkages between nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises at least one PS linkage, wherein if there is one PS linkage, the linkage is between nucleotides 1 and 2 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises at least two PS linkages, and the linkages are between nucleotides 1 and 2, and 2 and 3 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, and 4 and 5 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, 5 and 6, and 7 and 8 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises a modification of one or more of nucleotides 1-7 of the guide region, wherein the modification is a PS linkage, inverted abasic nucleotide, 2′-O-Me, 2′-O-moe, 2′-F, and/or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first nucleotide of the guide region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and an optional PS linkage to the next nucleotide;
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first and/or second nucleotide of the guide region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide and/or between the second and third nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, and/or third nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, and/or between the third and the fourth nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, third, and/or fourth nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, between the third and the fourth nucleotide, and/or between the fourth and the fifth nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, third, fourth, and/or fifth nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, between the third and the fourth nucleotide, between the fourth and the fifth nucleotide, and/or between the fifth and the sixth nucleotide.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in any one of SEQ ID Nos: 401-532, 1001, or 1007-1132. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in nucleotides 1-3 of any one of SEQ ID Nos: 401-532, 1001, or 1007-1132. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in nucleotides 1-4 of any one of SEQ ID Nos: 401-532, 1001, or 1007-1132. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in nucleotides 1-5 of any one of SEQ ID Nos: 401-532, 1001, or 1007-1132. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in nucleotides 1-6 of any one of SEQ ID Nos: 401-532, 1001, or 1007-1132. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in nucleotides 1-7 of any one of SEQ ID Nos: 401-532, 1001, or 1007-1132.
In some embodiments, the gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises a 5′ end modification comprising a 5′ protective end modification. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, 3, 4, and 5 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, 3, 4, and 5 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′O-moe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′O-moe modified nucleotides at nucleotides 1, 2, and 3 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region.
In some embodiments, agRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region and 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, agRNA (e.g., sgRNA, short-sgRNA, or crRNA) is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region, 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region, and PS linkages between nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a 5′ end modification and a 3′ end modification. Any of the 5′ end modifications discussed above and/or otherwise disclosed herein can be combined with a 3′ end modification, such as a 3′ end modification represented in the Table of Sequences and/or discussed below.
In some embodiments, the sgRNA or short-sgRNA comprises modified nucleotides at the 5′ and 3′ terminus, and modified nucleotides in one or more other regions described in Table 3.
In some embodiments, the sgRNA or short-sgRNA comprises modified nucleotides that are not at the 5′ or 3′ ends. Exemplary patterns of modifications are described below and in Table 1.
In some embodiments, an gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises a modification that stabilizes a secondary structure, e.g. a duplex region. Increased stability of a secondary structure can be determined empirically, e.g., by a melting temperature analysis. To simplify the analysis, the secondary structure element sought to be stabilized can be tested in isolation from the rest of the sgRNA structure. Increased stability of a secondary structure can also be determined by a reduction in accessibility of an endonucleolytic cleavage site (e.g., YA site) wherein the modification does not alter the endonucleolytic cleavage site primary structure but does occur in or affect a secondary structure comprising the endonucleolytic cleavage site. This is referred to as having a distal effect on the endonucleolytic cleavage site. In some embodiments, the endonucleolytic cleavage site is in the lower stem. In some embodiments, the endonucleolytic cleavage site is conserved region YA site 1. In some embodiments, the endonucleolytic cleavage site is conserved region YA site 2. In some embodiments, the endonucleolytic cleavage site is conserved region YA site 3. In some embodiments, the endonucleolytic cleavage site is conserved region YA site 10. In some embodiments, the modification is a bicyclic ribose analog modification, such as, a locked nucleic acid (LNA) or LNA-like modification. In some embodiments, the modification is an ENA modification. In some embodiments, nucleotide LS8 comprises a modification that stabilizes a secondary structure. In some embodiments, nucleotide LS11 comprises a modification that stabilizes a secondary structure. In some embodiments, one or both of nucleotides LS8 and LS11 collectively comprise one or more modifications (e.g., 2 modifications), such as ENA modifications, that stabilize a secondary structure. See discussion of G10008 and G10038 in the examples.
In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at at least 15 of nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at at least 16 of nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at at least 17 of nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at at least 18 of nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at at least 19 of nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA (e.g., sgRNA, short-sgRNA, or crRNA) comprises modifications and/or unmodified nucleotides at nucleotides 1-20 from the 5′ end of the 5′ terminus that match the modification pattern at nucleotides 1-20 of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 75% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 80% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 85% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 90% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 95% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches at least 98% of the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, a gRNA comprises a modification pattern that matches the modification pattern of a gRNA described herein, e.g., in Table 1. In some embodiments, an sgRNA or short-sgRNA comprises modifications in any one or more of the regions shown as modified in Table 1. In some embodiments, an sgRNA or short-sgRNA comprises modifications at any of the positions shown as modified in Table 1. In some embodiments, an sgRNA or short-sgRNA comprises any of the modifications shown in Table 1. Additional modifications are set forth in the summary section above, which may be combined to the extent feasible with modifications disclosed elsewhere herein, such as YA site modifications.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification to any one or more of US1-US12 in the upper stem region.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or all 12 nucleotides in the upper stem region.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification of about 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10, or 1-12 nucleotides in the upper stem region.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises 1, 2, 3, 4, or 5 YA modifications in a YA site. In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises at least 1, 2, 3, 4, or 5 YA modifications. In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises one YA modification. In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises 2 YA modifications. In some embodiments, the upper stem modification comprises 3 YA modifications. In some embodiments, one or more YA modifications are in a YA site. In some embodiments, one or more YA modifications are distal to a YA site.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-O-Me modified nucleotide. In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-O-moe modified nucleotide. In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-F modified nucleotide.
In some embodiments, an sgRNA or short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-O-Me modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, and/or combinations thereof.
In some embodiments, the sgRNA or short-sgRNA comprises an upper stem modification as shown in any one of the sequences in Table 1. In some embodiments, such an upper stem modification is combined with a 5′ protective end modification, e.g. as shown for the corresponding sequence in Table 1. In some embodiments, such an upper stem modification is combined with a 3′ protective end modification, e.g. as shown for the corresponding sequence in Table 1. In some embodiments, such an upper stem modification is combined with 5′ and 3′ end modifications as shown for the corresponding sequence in Table 1.
In some embodiments, the sgRNA or short-sgRNA comprises a 5′ end modification and an upper stem modification. In some embodiments, the sgRNA or short-sgRNA comprises a 3′ end modification and an upper stem modification. In some embodiments, the sgRNA or short-sgRNA comprises a 5′ end modification, a 3′ end modification and an upper stem modification.
In some embodiments, the sgRNA or short-sgRNA comprises a modification in the hairpin region. In some embodiments, the hairpin region modification is in hairpin 1. In some embodiments, the hairpin region modification is in hairpin 2. In some embodiments, modifications are within hairpin 1 and 2, optionally wherein the “n” between hairpin 1 and 2 is also modified. In some embodiments, the hairpin region modification comprises at least one modified nucleotide selected from a 2′H modified nucleotide (DNA), PS modified nucleotide, a YA modification, a 2′-O-methyl (2′-O-Me) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, and/or combinations thereof.
In some embodiments, an sgRNA or short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises 1, 2, or 3 YA modifications in a YA site. In some embodiments, an sgRNA or short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises at least 1, 2, 3, 4, 5, or 6 YA modifications. In some embodiments, an sgRNA or short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises one YA modification. In some embodiments, an sgRNA or short-sgRNA is provided comprising a hairpin modification, wherein hairpin modification comprises 2 YA modifications. In some embodiments, the hairpin modification comprises 3 YA modifications. In some embodiments, one or more YA modifications are in a YA site. In some embodiments, one or more YA modifications are distal to a YA site.
In some embodiments, the hairpin modification comprises or further comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
In some embodiments, the hairpin modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the sgRNA or short-sgRNA comprises a 3′ end modification, and a modification in the hairpin region.
In some embodiments, the sgRNA or short-sgRNA comprises a 5′ end modification, and a modification in the hairpin region.
In some embodiments, the sgRNA or short-sgRNA comprises an upper stem modification, and a modification in the hairpin region.
In some embodiments, the sgRNA or short-sgRNA comprises a hairpin modification as shown in any one of the sequences in Table 1. In some embodiments, such a hairpin modification is combined with a 5′ end modification as shown for the corresponding sequence in Table 1. In some embodiments, such a hairpin modification is combined with a 3′ end modification as shown for the corresponding sequence in Table 1. In some embodiments, such an hairpin modification is combined with 5′ and 3′ end modifications as shown for the corresponding sequence in Table 1.
In some embodiments, the sgRNA or short-sgRNA comprises a 3′ end modification, a modification in the hairpin region, an upper stem modification, and a 5′ end modification.
In some embodiments, the sgRNA or short-sgRNA comprising one or more modifications of YA sites is a short-sgRNA as described herein, e.g., comprising a hairpin region that lacks at least 5-10 nucleotides, e.g., as defined herein or relative to the hairpin region shown in Table 2. Such an sgRNA may have any of the features set forth herein with respect to an sgRNA, e.g., in the summary and in the detailed description section regarding short-sgRNAs above.
Exemplary Modified sgRNAs
In some embodiments, the sgRNAs described herein comprise or consist of any of the sequences shown in Table 1. Further, sgRNAs are encompassed that comprise the modifications of any of the sequences shown in Table 1, and identified therein by SEQ ID No. That is, the nucleotides may be the same or different, but the modification pattern shown may be the same or similar to a modification pattern of a guide sequence of Table 1. A modification pattern includes the relative position and identity of modifications of the sgRNA (e.g. 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, 3′ tail region).
In some embodiments, the modification pattern contains at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% of the modifications of any one of the sequences shown in the sequence column of Table 1, or over one or more regions of the sequence. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of any one of the sequences shown in the sequence column of Table 1. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over 1, 2, 3, 4, 5, 6, 7, or 8 regions of the sequence shown in Table 1, e.g., a 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, and/or 3′ terminus region.
For example, in some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of a sequence over the 5′ terminus region. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the lower stem. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the bulge. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the upper stem. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the nexus. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 1. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 2. In some embodiments, an sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the 3′ terminus. In some embodiments, the modification pattern differs from the modification pattern of a sequence of Table 1, or a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of such a sequence, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the sgRNA comprises modifications that differ from the modifications of a sequence of Table 1, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the sgRNA comprises modifications that differ from modifications of a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of a sequence of Table 1, at 0, 1, 2, 3, 4, 5, or 6 nucleotides.
In some embodiments, the sgRNA comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the sgRNA comprises a 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide. In some embodiments, the sgRNA comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the sgRNA comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the sgRNA comprises a YA modification.
In some embodiments, the sgRNA comprises a 5′ end modification, a 3′ end modification, or 5′ and 3′ end modification and further comprises a YA modification. In some embodiments, the 5′ end modification comprises a protective end modification. In some embodiments, the 5′ end modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the 5′ end modification comprises a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the 5′ end modification comprises at least one phosphorothioate (PS) bond and one or more of a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. The end modification may comprise a phosphorothioate (PS), 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modification. Equivalent end modifications are also encompassed by embodiments described herein. In some embodiments, the sgRNA comprises an end modification in combination with a modification of one or more regions of the sgRNA.
Modified sgRNAs comprising combinations of 5′ end modifications, 3′ end modifications, upper stem modifications, hairpin modifications, and 3′ terminus modifications, as described above, are encompassed. Exemplary modified sgRNAs are described below.
In some embodiments, an sgRNA is provided comprising or consisting of any one of the sequences described in SEQ ID Nos: 401-535, 601, 607-732, 801, 807-932, 1001, or 1007-1132.
In some embodiments, an sgRNA is provided comprising any one of the modified sequences of SEQ ID Nos: 601 or 607-732, wherein the sgRNA further comprises a guide region that is complementary to a target sequence, and directs a Cas9 to its target for cleavage. In some instances, an sgRNA is provided comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 401-535, 601, 607-732, 801, 807-932, 1001, or 1007-1132, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier in Table 1. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, the sgRNA comprises modifications at 1, 2, 3, or 4 of the first 4 nucleotides at its 5′ end. In some embodiments, the first three or four nucleotides at the 5′ terminus, and the last three or four nucleotides at the 3′ terminus are modified. In some embodiments, the first four nucleotides at the 5′ end, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the modification comprises 2′-O-Me. In some embodiments, the modification comprises 2′-F. In some embodiments, the modification comprises 2′-O-moe.
In some embodiments, the sgRNA comprises, if the nucleotide mentioned is present in the sgRNA, modifications at 1, 2, 3, or 4 of the first 4 nucleotides at the 5′ end. In some embodiments, the sgRNA comprises modifications at 1, 2, 3, or 4 of the last 4 nucleotides at the 3′ end (3′ tail or conserved portion of an sgRNA). In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me or 2′-O-moe modifications.
In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
In some embodiments, an sgRNA is provided, if the nucleotide mentioned is present in the sgRNA, wherein LS1, LS6, LS7, LS8, LS11, and LS12 are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the bulge region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the upper stem region of the sgRNA are modified with 2′-O-Me. In some embodiments, N16, N17, and N18 in the nexus region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the hairpin 1 region of the sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the hairpin 2 region of the sgRNA are modified with 2′-O-Me.
In some embodiments, the sgRNA comprises 2′-O-Me modified nucleotides at the following nucleotides: the first three nucleotides at the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12; B1 and B2 in the bulge region; each of the nucleotides in the upper stem region of the sgRNA; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; each of the nucleotides in the hairpin 2 region; and last four nucleotides at the 3′ terminus.
In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at the last four nucleotides at the 3′ terminus. In some embodiments, LS9 and LS10 are modified with 2′-F. In some embodiments, N15, N16, N17, and N18 are modified with 2′-F. In some embodiments, H2-9, H2-10, H2-11, H2-12, H2-13, HS-14, and H2-15 are modified with 2′-F. In some embodiments, the second to last, third to last, and fourth to last nucleotides at the 3′ terminus are modified with 2′-F.
In some embodiments, sgRNA is provided comprising 2′-F modified nucleic acids at the following nucleotides: LS9 and LS10 in the lower stem region; N15, N16, N17, and N18 in the nexus region; and H2-9, H2-10, H2-11, H2-12, H2-13, HS-14, and H2-15 in the hairpin 2 region. In some embodiments, the sgRNA further comprises 2′-F modified nucleotides at the second to last, third to last, and fourth to last nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me or 2′-F modified nucleic acids at the first three nucleotides at the 5′ terminus, and 2′-O-Me or 2′-F modified nucleic acids at three of the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising: 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1 and LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-F modified nucleotides at LS1-LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-F modified nucleotides at LS2-LS5; 2′-O-Me modified nucleotides at LS1 and LS6; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at LS7, LS8, LS11, and LS12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide at “n” between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at LS8, LS10, and LS12; 2′-O-F modified nucleotides at LS7, LS9, and LS11; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1, LS6, LS7, LS8, LS11, and LS12; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at LS1, LS6, LS7, LS8, LS11, and LS12; 2′-F modified nucleotides at LS9 and LS10; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-15; and 2′-O-Me modified nucleotides at the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-1-H1-12; a 2′-O-Me modified nucleotide between Hairpin 1 and Hairpin 2; 2′-O-Me modified nucleotides at H2-1-H2-8; 2′-F modified nucleotides at H2-9-H2-15; 2′-F modified nucleotides at the second from last, third from last, and fourth from last nucleotide at the 3′ terminus; and a 2′-O-Me modified nucleotide at the last nucleotide at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at the first three nucleotides at the 5′ terminus; 2′-O-Me modified nucleotides at US1-US12; 2′-O-Me modified nucleotides at H1-2, H1-4, H1-6, H1-8, H1-10, and H1-12; 2′-F modified nucleotides at H1-1, H1-3, H1-5, H1-7, H1-9, and H1-11; a 2′-F modified nucleotide between Hairpin 1 and Hairpin 2; 2′-F modified nucleotides at H2-2, H2-4, H2-6, H2-8, H2-10, H2-12; and H2-14; 2′-O-Me modified nucleotides at H2-1, H2-3, H2-5, H2-7, H2-9, H2-11; H2-13, and H2-15; 2′-F modified nucleotides at the second from last, and fourth from last nucleotide at the 3′ terminus; and 2′-O-Me modified nucleotide at the third from last, and last nucleotide at the 3′ terminus. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
Disclosed herein, in some embodiments, is an sgRNA comprising 2′-O-Me modifications at nucleotides LS8, LS10, LS12, H1-2, H1-4, H1-6, H1-8, H1-10, H1-12, H2-1, H2-3, H2-5, H2-7, H2-9, H2-11, H2-13, and H2-15; and 2′-F modifications at LS7, LS9, LS11; H1-1, H1-3, H1-5, H1-7, H1-9, H1-11, H1-13, H2-2, H2-4, H2-6, H2-8, H2-10, H2-12, and H2-14. In some embodiments, the sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises 2′-O-Me modified nucleotides at the last and third to last nucleotide at the 3′ terminus; and 2′-F modified nucleotides at the second to last and third to last nucleotide at the 3′ terminus.
In some embodiments, an sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises at least two phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, an sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages at the 5′ end. In some embodiments, one or more phosphorothioate bonds link the 5′ terminal nucleotides.
In some embodiments, an sgRNA is provided, comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, an sgRNA comprising any one of the modified sequences of SEQ ID Nos: 601 or 607-732 is provided, wherein the sgRNA further comprises a 5′ guide region that is at least partially complementary to a target sequence, and optionally directs a Cas9 to its target for cleavage.
In some embodiments, an sgRNA comprising nucleotides having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleotides of any one of SEQ ID Nos: 401-532, 601, 607-732, 801, 807-932, 1001, or 1007-1132, is provided, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. That is, the nucleotides A, U (and/or T in the case of deoxyribonucleotide modifications), C, and G may differ by 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% compared to what is shown in in the sequences, but the modification remains unchanged.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1 and B2 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides at the 3′ terminus.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1-B6 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides in the 3′ terminus region.
In some embodiments, an sgRNA is provided comprising 2′-F modified nucleotides at: LS9 and LS10 in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, an sgRNA is provided comprising 2′-F modified nucleotides at: each nucleotide in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, an sgRNA is provided comprising 2′-O-Me modified nucleotides at LS8, LS10, LS12, H1-2, H1-4, H1-6, H1-8, H1-10, H1-12, H2-1, H2-3, H2-5, H2-7, H2-9, H2-11, H2-13, H2-15, and the last and third to last nucleotides in the 3′ terminus region; and 2′-F modifications at LS7, LS9, LS11; H1-1, H1-3, H1-5, H1-7, H1-9, H1-11, H1-13, H2-2, H2-4, H2-6, H2-8, H2-10, H2-12, H2-14, and the second to last and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, a single guide RNA (sgRNA) comprises one or more guide region YA site modifications or conserved region YA modifications, a 5′ end modification and one or more modification in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region, wherein the 5′ end modification comprises at least two phosphorothioate linkages within the first seven nucleotides at the 5′ end of the 5′ terminus. In some instances, the modification is a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the modification is a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications, modifications at US1 to US12 and/or a modification at H1-1 and/or a modification in H2-1. In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at H1-1 to H1-12 and/or H2-1 to H2-15. In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and one or more modifications in each of the upper stem region, the hairpin 1 region, and the hairpin 2 region. In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and a modified nucleotide between hairpin 1 and hairpin 2 regions. In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and a modification in the lower stem region.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and a modification in the bulge region. In some embodiments, 50% of the nucleotides in the bulge region are modified, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and a modification in the nexus region. In some embodiments, the sgRNA comprises modifications at N15, N16, N17, and/or N18 in the nexus region, wherein the modification is 2′-O-Me or 2′-F. In some instances, N16, N17, and N18 are linked with PS bonds.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at the first four nucleotides at the 5′ end of the 5′ terminus and the last four nucleotides at the 3′ end of the 3′terminus. In some instances, these modifications are linking PS bond (i.e., PS bonds that link the first four and last four nucleotides). In some embodiments, the sgRNA further comprises 2′-O-Me modifications at the first three nucleotides at the 5′ end of the 5′ terminus and the last three nucleotides at the 3′ end of the 3′ terminus.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications LS1, LS6, LS7, LS8, LS11, and LS12, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at each of the nucleotides in the bulge region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at each of the nucleotides in the upper stem region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at each of the nucleotides in the hairpin 1 region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, the sgRNA comprises one or more guide region YA site modifications or conserved region YA modifications and modifications at each of the nucleotides in the hairpin 2 region, wherein the modification is 2′-O-Me or 2′-F.
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising 2′-O-Me modified nucleotides at the following positions:
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and 2′-F modified nucleotides at the following positions:
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and:
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, an sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
In some embodiments, a sgRNA is encompassed comprising one or more guide region YA site modifications or conserved region YA modifications and further comprising:
Any of the foregoing modification patterns can be combined with a modification pattern set forth in the embodiments described above, e.g., in the summary section or Table 1, to the extent that they are non-overlapping. In the event that combining a foregoing modification pattern with a modification pattern set forth in the summary section or Table 1 would result in incompatible modifications (e.g., the same position would be both 2′-OMe and 2′-fluoro), the modification set forth in the summary section or Table 1 controls.
In some embodiments, an sgRNA provided herein is a short-single guide RNAs (short-sgRNAs), e.g., comprising a conserved portion of an sgRNA comprising a hairpin region, wherein the hairpin region lacks at least 5-10 nucleotides or 6-10 nucleotides. In some embodiments, the sgRNA is from S. pyogenes Cas9 (“spyCas9”) or a spyCas9 equivalent. In some embodiments, the sgRNA is not from S. pyogenes Cas9 (“non-spyCas9”). In some embodiments, the 5-10 nucleotides or 6-10 nucleotides are consecutive.
In some embodiments, a short-sgRNA lacks at least nucleotides 54-58 (AAAAA) of the conserved portion of a spyCas9 sgRNA, as shown in Table 2. In some embodiments, a short-sgRNA is a non-spyCas9 sgRNA that lacks nucleotides corresponding to nucleotides 54-58 (AAAAA) of the conserved portion of a spyCas9 as determined, for example, by pairwise or structural alignment. In some embodiments, the non-spyCas9 sgRNA is Staphylococcus aureus Cas9 (“saCas9”) sgRNA.
In some embodiments, the hairpin regions lacks 5, 6, 7, 8, 9, 10, 11, or 12 nucleotides. In some embodiments, the hairpin 1 portion lacks 5, 6, 7, 8, 9, 10, 11, or 12 nucleotides. In some embodiments, the hairpin 2 portion lacks 5, 6, 7, 8, 9, 10, 11, or 12 nucleotides. In some embodiments, the hairpin regions lacks 5, 6, 7, 8, 9, 10, 11, or 12 consecutive nucleotides. In some embodiments, the hairpin 1 portion lacks 5, 6, 7, 8, 9, 10, 11, or 12 consecutive nucleotides. In some embodiments, the hairpin 2 portion lacks 5, 6, 7, 8, 9, 10, 11, or 12 consecutive nucleotides. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are within hairpin 1. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are within hairpin 2. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are within hairpin 1 and hairpin 2. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are within hairpin 1 or hairpin 2. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are consecutive and include the “N” between hairpin 1 and hairpin 2. In some embodiments, the 5-10 or 6-10 lacking nucleotides include the “N” between hairpin 1 and hairpin 2. In some embodiments, the 5-10 or 6-10 lacking nucleotides are consecutive and span at least a portion of hairpin 1. In some embodiments, the 5-10 or 6-10 lacking nucleotides are consecutive and span at least a portion of hairpin 2. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are consecutive and span at least a portion of hairpin 1 and a portion of hairpin 2. In some embodiments, the 5-10 lacking nucleotides or 6-10 lacking nucleotides are consecutive and span at least a portion of hairpin 1 and the “N” between hairpin 1 and hairpin 2. In some embodiments, the 5-10 lacking nucleotides comprise or consist of nucleotides 54-58, 54-61, or 53-60 of SEQ ID NO: 400.
In some embodiments, the short-sgRNA described herein further comprises a nexus region, wherein the nexus region lacks at least one nucleotide. In some embodiments, the short-sgRNA lacks at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides in the nexus region. In some embodiments, the short-sgRNA lacks at least 1-2, 1-3, 1-4 nucleotides, 1-5 nucleotides, 1-6 nucleotides, 1-10 nucleotides, or 1-15 nucleotides in the nexus region. In some embodiments, the short-sgRNA lacks each nucleotide in the nexus region.
In some embodiments, the short-sgRNA further comprises a guide region. In some embodiments, the guide region comprises the first 1-10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides at the 5′ end of the short-sgRNA. In some embodiments, the guide region comprises 20 nucleotides. In some embodiments, the guide region comprises 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 or more nucleotides. In some embodiments, the guide region comprises 17 nucleotides. In some embodiments, the guide region comprises 18 nucleotides. In some embodiments, the guide region comprises 19 nucleotides.
In some embodiments, the selection of the guide region is determined based on target sequences within the gene of interest for editing. For example, in some embodiments, the short-sgRNA comprises a guide region that is complementary to target sequences of a gene of interest.
In some embodiments, the target sequence in the gene of interest may be complementary to the guide region of the short-sgRNA. In some embodiments, the degree of complementarity or identity between a guide region of a short-sgRNA and its corresponding target sequence in the gene of interest may be about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the guide region of a short-sgRNA and the target region of a gene of interest may be 100% complementary or identical. In other embodiments, the guide region of a short-sgRNA and the target region of a gene of interest may contain at least one mismatch. For example, the guide region of a short-sgRNA and the target sequence of a gene of interest may contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches, where the total length of the target sequence is at least about 17, 18, 19, 20 or more base pairs. In some embodiments, the guide region of a short-sgRNA and the target region of a gene of interest may contain 1-6 mismatches where the guide sequence comprises at least about 17, 18, 19, 20 or more nucleotides. In some embodiments, the guide region of a short-sgRNA and the target region of a gene of interest may contain 1, 2, 3, 4, 5, or 6 mismatches where the guide sequence comprises about 20 nucleotides. The 5′ terminus may comprise nucleotides that are not considered guide regions (i.e., do not function to direct a Cas9 protein to a target nucleic acid).
In some embodiments, the short-sgRNA is modified. The term “modified” or “modification” in the context of a short-sgRNA described herein includes, the modifications described above, including, for example, (a) end modifications, e.g., 5′ end modifications or 3′ end modifications, including 5′ or 3′ protective end modifications, (b) nucleobase (or “base”) modifications, including replacement or removal of bases, (c) sugar modifications, including modifications at the 2′, 3′, and/or 4′ positions, (d) internucleoside linkage modifications, and (e) backbone modifications, which can include modification or replacement of the phosphodiester linkages and/or the ribose sugar. A modification of a nucleotide at a given position includes a modification or replacement of the phosphodiester linkage immediately 3′ of the sugar of the nucleotide. Thus, for example, a nucleic acid comprising a phosphorothioate between the first and second sugars from the 5′ end is considered to comprise a modification at position 1. The term “modified short-sgRNA” generally refers to a short-sgRNA having a modification to the chemical structure of one or more of the base, the sugar, and the phosphodiester linkage or backbone portions, including nucleotide phosphates, all as detailed and exemplified herein.
Exemplary patterns of modifications are shown in Table 1. Additional exemplary patterns are discussed below.
Modifications of Guide Regions and/or YA Sites
In some embodiments, a short-sgRNA comprises modifications at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or more YA sites. In some embodiments, the pyrimidine of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the pyrimidine). In some embodiments, the adenine of the YA site comprises a modification (which includes a modification altering the internucleoside linkage immediately 3′ of the sugar of the adenine). In some embodiments, the pyrimidine and the adenine of the YA site comprise modifications, such as sugar, base, or internucleoside linkage modifications. The YA modifications can be any of the types of modifications set forth herein. In some embodiments, the YA modifications comprise one or more of phosphorothioate, 2′-OMe, or 2′-fluoro. In some embodiments, the YA modifications comprise pyrimidine modifications comprising one or more of phosphorothioate, 2′-OMe, 2′-H, inosine, or 2′-fluoro. In some embodiments, the YA modification comprises a bicyclic ribose analog (e.g., an LNA, BNA, or ENA) within an RNA duplex region that contains one or more YA sites. In some embodiments, the YA modification comprises a bicyclic ribose analog (e.g., an LNA, BNA, or ENA) within an RNA duplex region that contains a YA site, wherein the YA modification is distal to the YA site.
Guide Region Modifications, including YA Site Modifications
In some embodiments, the guide region comprises 1, 2, 3, 4, 5, or more YA sites (“guide region YA sites”) that may comprise YA modifications. In some embodiments, one or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus (where “5-end”, etc., refers to position 5 to the 3′ end of the guide region, i.e., the most 3′ nucleotide in the guide region) comprise YA modifications. In some embodiments, two or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, three or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, four or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. In some embodiments, five or more YA sites located at 5-end, 6-end, 7-end, 8-end, 9-end, or 10-end from the 5′ end of the 5′ terminus comprise YA modifications. A modified guide region YA site comprises a YA modification.
In some embodiments, a modified guide region YA site is within 17, 16, 15, 14, 13, 12, 11, 10, or 9 nucleotides of the 3′ terminal nucleotide of the guide region. For example, if a modified guide region YA site is within 10 nucleotides of the 3′ terminal nucleotide of the guide region and the guide region is 20 nucleotides long, then the modified nucleotide of the modified guide region YA site is located at any of positions 11-20. In some embodiments, a YA modification is located within a YA site 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 nucleotides from the 3′ terminal nucleotide of the guide region. In some embodiments, a YA modification is located 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 nucleotides from the 3′ terminal nucleotide of the guide region.
In some embodiments, a modified guide region YA site is at or after nucleotide 4, 5, 6, 7, 8, 9, 10, or 11 from the 5′ end of the 5′ terminus.
In some embodiments, a modified guide region YA site is other than a 5′ end modification. For example, a short-sgRNA can comprise a 5′ end modification as described herein and further comprise a modified guide region YA site. Alternatively, a short-sgRNA can comprise an unmodified 5′ end and a modified guide region YA site. Alternatively, a short-sgRNA can comprise a modified 5′ end and an unmodified guide region YA site.
In some embodiments, a modified guide region YA site comprises a modification that at least one nucleotide located 5′ of the guide region YA site does not comprise. For example, if nucleotides 1-3 comprise phosphorothioates, nucleotide 4 comprises only a 2′-OMe modification, and nucleotide 5 is the pyrimidine of a YA site and comprises a phosphorothioate, then the modified guide region YA site comprises a modification (phosphorothioate) that at least one nucleotide located 5′ of the guide region YA site (nucleotide 4) does not comprise. In another example, if nucleotides 1-3 comprise phosphorothioates, and nucleotide 4 is the pyrimidine of a YA site and comprises a 2′-OMe, then the modified guide region YA site comprises a modification (2′-OMe) that at least one nucleotide located 5′ of the guide region YA site (any of nucleotides 1-3) does not comprise. This condition is also always satisfied if an unmodified nucleotide is located 5′ of the modified guide region YA site.
In some embodiments, the modified guide region YA sites comprise modifications as described for YA sites above.
Additional embodiments of guide region modifications, including guide region YA site modifications, are set forth elsewhere herein, including in the summary above and in the discussion of gRNAs comprising modifications, including modifications at YA sites above, and elsewhere herein. The guide region of a short-sgRNA may be modified according to any embodiment comprising a modified guide region set forth herein. Any embodiments set forth elsewhere in this disclosure may be combined to the extent feasible with any of the foregoing embodiments.
Conserved Region YA Site Modifications
Conserved region YA sites 1-10 are illustrated in
In some embodiments, conserved region YA sites 1, 8, or 1 and 8 comprise YA modifications. In some embodiments, conserved region YA sites 1, 2, 3, 4, and 10 comprise YA modifications. In some embodiments, YA sites 2, 3, 4, 8, and 10 comprise YA modifications. In some embodiments, conserved region YA sites 1, 2, 3, and 10 comprise YA modifications. In some embodiments, YA sites 2, 3, 8, and 10 comprise YA modifications. In some embodiments, YA sites 1, 2, 3, 4, 8, and 10 comprise YA modifications. In some embodiments, 1, 2, 3, 4, 5, 6, 7, or 8 additional conserved region YA sites comprise YA modifications.
In some embodiments, 1, 2, 3, or 4 of conserved region YA sites 2, 3, 4, and 10 comprise YA modifications. In some embodiments, 1, 2, 3, 4, 5, 6, 7, or 8 additional conserved region YA sites comprise YA modifications.
In some embodiments, the modified conserved region YA sites comprise modifications as described for YA sites above.
Additional embodiments of conserved region YA site modifications are set forth in the summary above. Any embodiments set forth elsewhere in this disclosure may be combined to the extent feasible with any of the foregoing embodiments.
Modifications to Terminal Nucleotides
In some embodiments, the 5′ and/or 3′ terminus regions of a short-sgRNA are modified.
3′ Terminus Region Modifications
In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. Throughout, this modification may be referred to as a “3′ end modification”. In some embodiments, the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region comprise more than one modification. In some embodiments, at least one of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, at least two of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, at least three of the terminal (i.e., last) 1, 2, 3, 4, 5, 6, or 7 nucleotides in the 3′ terminus region are modified. In some embodiments, the modification comprises a PS linkage. In some embodiments, the modification to the 3′ terminus region is a 3′ protective end modification. In some embodiments, the 3′ end modification comprises a 3′ protective end modification.
In some embodiments, the 3′ end modification comprises a modified nucleotide selected from 2′-O-methyl (2′-O-Me) modified nucleotide, 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, a phosphorothioate (PS) linkage between nucleotides, an inverted abasic modified nucleotide, or combinations thereof.
In some embodiments, the 3′ end modification comprises or further comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises an inverted abasic modified nucleotide.
In some embodiments, the 3′ end modification comprises or further comprises a modification of any one or more of the last 7, 6, 5, 4, 3, 2, or 1 nucleotides. In some embodiments, the 3′ end modification comprises or further comprises one modified nucleotide. In some embodiments, the 3′ end modification comprises or further comprises two modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises three modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises four modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises five modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises six modified nucleotides. In some embodiments, the 3′ end modification comprises or further comprises seven modified nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises a modification of between 1 and 7 or between 1 and 5 nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises modifications of 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 3′ end of the gRNA.
In some embodiments, the 3′ end modification comprises or further comprises modifications of about 1-3, 1-5, 1-6, or 1-7 nucleotides at the 3′ end of the gRNA.
In some embodiments, the 3′ end modification comprises or further comprises any one or more of the following: a phosphorothioate (PS) linkage between nucleotides, a 2′-O-Me modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, an inverted abasic modified nucleotide, and a combination thereof.
In some embodiments, the 3′ end modification comprises or further comprises 1, 2, 3, 4, 5, 6, or 7 PS linkages between nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises at least one 2′-O-Me, 2′-O-moe, inverted abasic, or 2′-F modified nucleotide. In some embodiments, the 3′ end modification comprises or further comprises one PS linkage, wherein the linkage is between the last and second to last nucleotide. In some embodiments, the 3′ end modification comprises or further comprises two PS linkages between the last three nucleotides. In some embodiments, the 3′ end modification comprises or further comprises four PS linkages between the last four nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last four nucleotides. In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last five nucleotides. In some embodiments, the 3′ end modification comprises or further comprises PS linkages between any one or more of the last 2, 3, 4, 5, 6, or 7 nucleotides.
In some embodiments, the 3′ end modification comprises or further comprises a modification of one or more of the last 1-7 nucleotides, wherein the modification is a PS linkage, inverted abasic nucleotide, 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last nucleotide with 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof, and an optionally one or two PS linkages to the next nucleotide and/or the first nucleotide of the 3′ tail.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last and/or second to last nucleotide with 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, and/or third to last nucleotides with 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, third to last, and/or fourth to last nucleotides with 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the 3′ end modification comprises or further comprises a modification to the last, second to last, third to last, fourth to last, and/or fifth to last nucleotides with 2′-OMe, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages.
In some embodiments, the gRNA comprising a 3′ end modification comprises or further comprises a 3′ tail, wherein the 3′ tail comprises a modification of any one or more of the nucleotides present in the 3′ tail. In some embodiments, the 3′ tail is fully modified. In some embodiments, the 3′ tail comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, or 1-10 nucleotides, optionally where any one or more of these nucleotides are modified.
In some embodiments, a gRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises the 3′ end modification as shown in any one of SEQ ID Nos: 1-54. In some embodiments, a gRNA is provided comprising a 3′ protective end modification.
In some embodiments, agRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) a 2′-OMe modified nucleotide at the last nucleotide of the conserved region of an gRNA or short-sgRNA (ii) three consecutive 2′O-moe modified nucleotides immediately 5′ to the 2′-OMe modified nucleotide, and (iii) three consecutive PS linkages between the last three nucleotides.
In some embodiments, a gRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) five consecutive 2′-OMe modified nucleotides from the last nucleotide of the conserved region of an sgRNA or the conserved region of a short-sgRNA, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, a gRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises an inverted abasic modified nucleotide at the last nucleotide of the conserved region of an sgRNA or the conserved region of a short-sgRNA.
In some embodiments, a gRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) an inverted abasic modified nucleotide at the last nucleotide of the conserved region of an sgRNA or short-sgRNA, and (ii) three consecutive 2′-OMe modified nucleotides at the last three nucleotides of the conserved region of an sgRNA or the conserved region of a short-sgRNA.
In some embodiments, a gRNA is provided comprising (i) 15 consecutive 2′-OMe modified nucleotides from the last nucleotide of the conserved region of an sgRNA or short-sgRNA, (ii) five consecutive 2′-F modified nucleotides immediately 5′ to the 2′-OMe modified nucleotides, and (iii) three PS linkages between the last three nucleotides.
In some embodiments, a short-sgRNA is provided comprising (i) alternating 2′-OMe modified nucleotides and 2′-F modified nucleotides at the last 20 nucleotides of the conserved region of an sgRNA or short-sgRNA, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, a short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises (i) two or three consecutive 2′-OMe modified nucleotides, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, a short-sgRNA is provided comprising a 3′ end modification, wherein the 3′ end modification comprises one PS linkage between the last and next to last nucleotides.
In some embodiments, a short-sgRNA is provided comprising (i) 15 or 20 consecutive 2′-OMe modified nucleotides, and (ii) three PS linkages between the last three nucleotides.
In some embodiments, the short-sgRNA comprises a 5′ end modification and a 3′ end modification.
In some embodiments, the short-sgRNA comprises a 3′ terminus comprising a 3′ tail, which follows and is 3′ of the conserved portion of a short-sgRNA. In some embodiments, the 3′ tail comprises between 1 and about 20 nucleotides, between 1 and about 15 nucleotides, between 1 and about 10 nucleotides, between 1 and about 5 nucleotides, between 1 and about 4 nucleotides, between 1 and about 3 nucleotides, and between 1 and about 2 nucleotides. In some embodiments, the 3′ tail comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. In some embodiments, the 3′ tail comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. In some embodiments, the 3′ tail comprises 1 nucleotide. In some embodiments, the 3′ tail comprises 2 nucleotides. In some embodiments, the 3′ tail comprises 3 nucleotides. In some embodiments, the 3′ tail comprises 4 nucleotides. In some embodiments, the 3′ tail comprises about 1-2, 1-3, 1-4, 1-5, 1-7, 1-10, at least 1-5, at least 1-3, at least 1-4, at least 1-5, at least 1-5, at least 1-7, or at least 1-10 nucleotides.
In some embodiments, the 3′ tail comprising between 1 and 20 nucleotides and follows the 3′ end of the conserved portion of a short-sgRNA.
In some embodiments, the 3′ tail comprises or further comprises one or more of a protective end modification, a phosphorothioate (PS) linkage between nucleotides, a 2′-OMe modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, an inverted abasic modified nucleotide, and a combination thereof.
In some embodiments, the 3′ tail comprises or further comprises one or more phosphorothioate (PS) linkages between nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-OMe modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-O-moe modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more 2′-F modified nucleotide. In some embodiments, the 3′ tail comprises or further comprises one or more an inverted abasic modified nucleotides. In some embodiments, the 3′ tail comprises or further comprises one or more protective end modifications. In some embodiments, the 3′ tail comprises or further comprises a combination of one or more of a phosphorothioate (PS) linkage between nucleotides, a 2′-OMe modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, and an inverted abasic modified nucleotide.
In some embodiments, the short-sgRNA does not comprise a 3′ tail.
5′ Terminus Region Modifications
In some embodiments, the 5′ terminus region is modified, for example, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides of the short-sgRNA are modified. Throughout, this modification may be referred to as a “5′ end modification”. In some embodiments, the first 1, 2, 3, 4, 5, 6, or 7 nucleotides of the 5′ terminus region comprise more than one modification. In some embodiments, at least one of the terminal (i.e., first) 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ end are modified. In some embodiments, at least two of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ terminus region are modified. In some embodiments, at least three of the terminal 1, 2, 3, 4, 5, 6, or 7 nucleotides at the 5′ terminus region are modified. In some embodiments, the 5′ end modification is a 5′ protective end modification.
In some embodiments, both the 5′ and 3′ terminus regions (e.g., ends) of the short-sgRNA are modified. In some embodiments, only the 5′ terminus region of the short-sgRNA is modified. In some embodiments, only the 3′ terminus region (plus or minus a 3′ tail) of the conserved portion of a short sgRNA is modified.
In some embodiments, the short-sgRNA comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the first 7 nucleotides at a 5′ terminus region of the short-sgRNA. In some embodiments, the short-sgRNA comprises modifications at 1, 2, 3, 4, 5, 6, or 7 of the 7 terminal nucleotides at a 3′ terminus region. In some embodiments, 2, 3, or 4 of the first 4 nucleotides at the 5′ terminus region, and/or 2, 3, or 4 of the terminal 4 nucleotides at the 3′ terminus region are modified. In some embodiments, 2, 3, or 4 of the first 4 nucleotides at the 5′ terminus region are linked with phosphorothioate (PS) bonds.
In some embodiments, the modification to the 5′ terminus and/or 3′ terminus comprises a 2′-O-methyl (2′-O-Me) or 2′-O-(2-methoxyethyl) (2′-O-moe) modification. In some embodiments, the modification comprises a 2′-fluoro (2′-F) modification to a nucleotide. In some embodiments, the modification comprises a phosphorothioate (PS) linkage between nucleotides. In some embodiments, the modification comprises an inverted abasic nucleotide. In some embodiments, the modification comprises a protective end modification. In some embodiments, the modification comprises a more than one modification selected from protective end modification, 2′-O-Me, 2′-O-moe, 2′-fluoro (2′-F), a phosphorothioate (PS) linkage between nucleotides, and an inverted abasic nucleotide. In some embodiments, an equivalent modification is encompassed.
In some embodiments, the short-sgRNA comprises one or more phosphorothioate (PS) linkages between the first one, two, three, four, five, six, or seven nucleotides at the 5′ terminus. In some embodiments, the short-sgRNA comprises one or more PS linkages between the last one, two, three, four, five, six, or seven nucleotides at the 3′ terminus. In some embodiments, the short-sgRNA comprises one or more PS linkages between both the last one, two, three, four, five, six, or seven nucleotides at the 3′ terminus and the first one, two, three, four, five, six, or seven nucleotides from the 5′ end of the 5′ terminus. In some embodiments, in addition to PS linkages, the 5′ and 3′ terminal nucleotides may comprise 2′-O-Me, 2′-O-moe, or 2′-F modified nucleotides.
In some embodiments, the short-sgRNA comprises a 5′ end modification, e.g., wherein the first nucleotide of the guide region is modified. In some embodiments, the short-sgRNA comprises a 5′ end modification, wherein the first nucleotide of the guide region comprises a 5′ protective end modification.
In some embodiments, the 5′ end modification comprises a modified nucleotide selected from 2′-O-methyl (2′-O-Me) modified nucleotide, 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, a phosphorothioate (PS) linkage between nucleotides, an inverted abasic modified nucleotide, or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises a 2′-O-methyl (2′-O-Me) modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises an inverted abasic modified nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification of any one or more of nucleotides 1-7 of the guide region of a short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises one modified nucleotide. In some embodiments, the 5′ end modification comprises or further comprises two modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises three modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises four modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises five modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises six modified nucleotides. In some embodiments, the 5′ end modification comprises or further comprises seven modified nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises a modification of between 1 and 7, between 1 and 5, between 1 and 4, between 1 and 3, or between 1 and 2 nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises modifications of 1, 2, 3, 4, 5, 6, or 7 nucleotides from the 5′ end. In some embodiments, the 5′ end modification comprises or further comprises modifications of about 1-3, 1-4, 1-5, 1-6, or 1-7 nucleotides from the 5′ end.
In some embodiments, the 5′ end modification comprises or further comprises modifications at the first nucleotide at the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first and second nucleotide from the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, and third nucleotide from the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, and fourth nucleotide from the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, fourth, and fifth nucleotide from the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, fourth, fifth, and sixth nucleotide from the 5′ end of the short-sgRNA. In some embodiments, the 5′ end modification comprises or further comprises modifications at the first, second, third, fourth, fifth, sixth, and seventh nucleotide from the 5′ end of the short-sgRNA.
In some embodiments, the 5′ end modification comprises or further comprises a phosphorothioate (PS) linkage between nucleotides, and/or a 2′-O-Me modified nucleotide, and/or a 2′-O-moe modified nucleotide, and/or a 2′-F modified nucleotide, and/or an inverted abasic modified nucleotide, and/or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises 1, 2, 3, 4, 5, 6, and/or 7 PS linkages between nucleotides. In some embodiments, the 5′ end modification comprises or further comprises about 1-2, 1-3, 1-4, 1-5, 1-6, or 1-7 PS linkages between nucleotides.
In some embodiments, the 5′ end modification comprises or further comprises at least one PS linkage, wherein if there is one PS linkage, the linkage is between nucleotides 1 and 2 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises at least two PS linkages, and the linkages are between nucleotides 1 and 2, and 2 and 3 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, and 4 and 5 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises PS linkages between any one or more of nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, 5 and 6, and 7 and 8 of the guide region.
In some embodiments, the 5′ end modification comprises or further comprises a modification of one or more of nucleotides 1-7 of the guide region, wherein the modification is a PS linkage, inverted abasic nucleotide, 2′-O-Me, 2′-O-moe, 2′-F, and/or combinations thereof.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first nucleotide of the guide region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and an optional PS linkage to the next nucleotide;
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first and/or second nucleotide of the guide region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide and/or between the second and third nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, and/or third nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, and/or between the third and the fourth nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, third, and/or fourth nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, between the third and the fourth nucleotide, and/or between the fourth and the fifth nucleotide.
In some embodiments, the 5′ end modification comprises or further comprises a modification to the first, second, third, fourth, and/or fifth nucleotides of the variable region with 2′-O-Me, 2′-O-moe, 2′-F, or combinations thereof, and optionally one or more PS linkages between the first and second nucleotide, between the second and third nucleotide, between the third and the fourth nucleotide, between the fourth and the fifth nucleotide, and/or between the fifth and the sixth nucleotide.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises a 5′ end modification as shown in any one of SEQ ID Nos: 1-54.
In some embodiments, the sgRNA comprises a 5′ end modification comprising a 5′ protective end modification. In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, 3, 4, and 5 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′-OMe modified nucleotides at nucleotides 1, 2, 3, 4, and 5 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′O-moe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises 2′O-moe modified nucleotides at nucleotides 1, 2, and 3 of the guide region and PS linkages between nucleotides 1 and 2, 2 and 3, and 3 and 4 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region and 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification, wherein the 5′ end modification comprises an inverted abasic modified nucleotide at nucleotide 1 of the guide region, 2′-OMe modified nucleotides at nucleotides 1, 2, and 3 of the guide region, and PS linkages between nucleotides 1 and 2, 2 and 3, 3 and 4, 4 and 5, and 5 and 6 of the guide region.
In some embodiments, a short-sgRNA is provided comprising a 5′ end modification and a 3′ end modification. In some embodiments, the sgRNA comprises modified nucleotides at the 5′ and 3′ terminus, and modified nucleotides in one or more other regions described in Table 3.
In some embodiments, the sgRNA comprises modified nucleotides that are not at the 5′ or 3′ ends. Exemplary patterns of modifications are described below and in Table 1.
Upper Stem Modifications
In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification to any one or more of US1-US12 in the upper stem region.
In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or all 12 nucleotides in the upper stem region.
In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a modification of about 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10, or 1-12 nucleotides in the upper stem region.
In some embodiments, an sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises 1, 2, 3, 4, or 5 YA modifications in a YA site. In some embodiments, an sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises at least 1, 2, 3, 4, or 5 YA modifications. In some embodiments, an sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises one YA modification. In some embodiments, an sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises 2 YA modifications. In some embodiments, the upper stem modification comprises 3 YA modifications. In some embodiments, one or more YA modifications are in a YA site. In some embodiments, one or more YA modifications are distal to a YA site.
In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-OMe modified nucleotide. In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-O-moe modified nucleotide. In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-F modified nucleotide.
In some embodiments, a short-sgRNA is provided comprising an upper stem modification, wherein the upper stem modification comprises a 2′-OMe modified nucleotide, a 2′-O-moe modified nucleotide, a 2′-F modified nucleotide, and/or combinations thereof.
In some embodiments, the sgRNA comprises an upper stem modification as shown in any one of the sequences in Table 1. In some embodiments, such an upper stem modification is combined with a 5′ protective end modification, e.g. as shown for the corresponding sequence in Table 1. In some embodiments, such an upper stem modification is combined with a 3′ protective end modification, e.g. as shown for the corresponding sequence in Table 1. In some embodiments, such an upper stem modification is combined with 5′ and 3′ end modifications as shown for the corresponding sequence in Table 1.
In some embodiments, the short-sgRNA comprises a 5′ end modification and an upper stem modification. In some embodiments, the short-sgRNA comprises a 3′ end modification and an upper stem modification. In some embodiments, the short-sgRNA comprises a 5′ end modification, a 3′ end modification and an upper stem modification.
Hairpin Modifications
In some embodiments, the short-sgRNA comprises a modification in the hairpin region. In some embodiments, the hairpin region modification comprises at least one modified nucleotide selected from a 2′-O-methyl (2′-OMe) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, and/or combinations thereof.
In some embodiments, the hairpin region modification is in the hairpin 1 region. In some embodiments, the hairpin region modification is in the hairpin 2 region. In some embodiments, modifications are within the hairpin 1 and hairpin 2 regions, optionally wherein the “n” between hairpin 1 and 2 is also modified.
In some embodiments, a short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises 1, 2, or 3 YA modifications in a YA site. In some embodiments, a short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises at least 1, 2, 3, 4, 5, or 6 YA modifications. In some embodiments, a short-sgRNA is provided comprising a hairpin modification, wherein the hairpin modification comprises one YA modification. In some embodiments, a short-sgRNA is provided comprising a hairpin modification, wherein hairpin modification comprises 2 YA modifications. In some embodiments, the hairpin modification comprises 3 YA modifications. In some embodiments, one or more YA modifications are in a YA site. In some embodiments, one or more YA modifications are distal to a YA site.
In some embodiments, the hairpin modification comprises or further comprises a 2′-O-methyl (2′-OMe) modified nucleotide.
In some embodiments, the hairpin modification comprises or further comprises a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, the hairpin region modification comprises at least one modified nucleotide selected from a 2′H modified nucleotide (DNA), PS modified nucleotide, a YA modification, a 2′-O-methyl (2′-O-Me) modified nucleotide, a 2′-fluoro (2′-F) modified nucleotide, and/or combinations thereof.
In some embodiments, the short-sgRNA comprises a 3′ end modification, and a modification in the hairpin region.
In some embodiments, the short-sgRNA comprises a 5′ end modification, and a modification in the hairpin region.
In some embodiments, the short-sgRNA comprises an upper stem modification, and a modification in the hairpin region.
In some embodiments, the short-sgRNA comprises a hairpin modification as shown in any one of the sequences in Table 1. In some embodiments, such a hairpin modification is combined with a 5′ end modification as shown for the corresponding sequence in Table 1. In some embodiments, such a hairpin modification is combined with a 3′ end modification as shown for the corresponding sequence in Table 1. In some embodiments, such a hairpin modification is combined with 5′ and 3′ end modifications as shown for the corresponding sequence in Table 1.
In some embodiments, the short-sgRNA comprises a 3′ end modification, a modification in the hairpin region, an upper stem modification, and a 5′ end modification.
Exemplary Modified Short-sgRNAs
In some embodiments, the short-sgRNAs described herein comprise or consist of any of the sequences shown in Table 1. Further, short-sgRNAs are encompassed that comprise the modifications of any of the sequences shown in Table 1, and identified therein by SEQ ID No. That is, the nucleotides may be the same or different, but the modification pattern shown may be the same or similar to a modification pattern of a guide sequence of Table 1. A modification pattern includes the relative position and identity of modifications of the short-sgRNA (e.g. 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, 3′ tail region).
In some embodiments, the modification pattern contains at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% of the modifications of any one of the sequences shown in the sequence column of Table 1, or over one or more regions of the sequence. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of any one of the sequences shown in the sequence column of Table 1. In some embodiments, the modification pattern is at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over one or more (e.g., 1, 2, 3, 4, 5, 6, 7, or 8) regions of the sequence shown in Table 1, e.g., a 5′ terminus region, lower stem region, bulge region, upper stem region, nexus region, hairpin 1 region, hairpin 2 region, and/or 3′ terminus region.
For example, in some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical to the modification pattern of a sequence over the 5′ terminus region. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the lower stem. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the bulge. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the upper stem. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the nexus. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 1. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the hairpin 2. In some embodiments, a short-sgRNA is encompassed wherein the modification pattern is least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99% identical over the 3′ terminus. In some embodiments, the modification pattern differs from the modification pattern of a sequence of Table 1, or a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of such a sequence, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the short-sgRNA comprises modifications that differ from the modifications of a sequence of Table 1, at 0, 1, 2, 3, 4, 5, or 6 nucleotides. In some embodiments, the short-sgRNA comprises modifications that differ from modifications of a region (e.g. 5′ terminus, lower stem, bulge, upper stem, nexus, hairpin 1, hairpin 2, 3′ terminus) of a sequence of Table 1, at 0, 1, 2, 3, 4, 5, or 6 nucleotides.
In some embodiments, the short-sgRNA comprises a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the short-sgRNA comprises a 2′-O-(2-methoxyethyl) (2′-O-moe) modified nucleotide. In some embodiments, the short-sgRNA comprises a 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the short-sgRNA comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the sgRNA comprises a YA modification.
In some embodiments, the short-sgRNA comprises a 5′ end modification, a 3′ end modification, or 5′ and 3′ end modification, such as a protective end modification. In some embodiments, the 5′ end modification comprises a phosphorothioate (PS) bond between nucleotides. In some embodiments, the 5′ end modification comprises a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. In some embodiments, the 5′ end modification comprises at least one phosphorothioate (PS) bond and one or more of a 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modified nucleotide. The end modification may comprise a phosphorothioate (PS), 2′-O-methyl (2′-O-Me), 2′-O-(2-methoxyethyl) (2′-O-moe), and/or 2′-fluoro (2′-F) modification. Equivalent end modifications are also encompassed by embodiments described herein. In some embodiments, the short-sgRNA comprises an end modification in combination with a modification of one or more regions of the short-sgRNA.
Modified short-sgRNAs comprising combinations of 5′ end modifications, 3′ end modifications, upper stem modifications, hairpin modifications, and 3′ terminus modifications, as described above, are encompassed. Exemplary modified short-sgRNAs are described below.
In some embodiments, the invention comprises a short-sgRNA comprising or consisting of any one of the sequences described in SEQ ID Nos: 1-54, 201-254, and 301-354.
In some embodiments, a short-sgRNA is provided comprising any one of the modified sequences of SEQ ID Nos: 201-254, and 301-354, wherein the short-sgRNA further comprises a guide region that is complementary to a target sequence, and directs a Cas9 to its target for cleavage. In some instances, the invention comprises short-sgRNA comprising nucleic acids having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleic acids of any one of SEQ ID Nos: 1-54, 201-254, and 301-354, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier in Table 1. In some embodiments, the short-sgRNA further comprises three phosphorothioate (PS) bonds linking the first four nucleotides at the 5′ terminus and three PS bonds linking the last four nucleotides at the 3′ terminus.
In some embodiments, the short-sgRNA comprises modifications at 1, 2, 3, or 4 of the first 4 nucleotides at its 5′ end. In some embodiments, the first three or four nucleotides at the 5′ terminus, and the last three or four nucleotides at the 3′ terminus are modified. In some embodiments, the first four nucleotides at the 5′ end, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds. In some embodiments, the modification comprises 2′-O-Me. In some embodiments, the modification comprises 2′-F. In some embodiments, the modification comprises 2′-O-moe.
In some embodiments, the short-sgRNA comprises, if the nucleotide mentioned is present in the short-sgRNA, modifications at 1, 2, 3, or 4 of the first 4 nucleotides at the 5′ end. In some embodiments, the short-sgRNA comprises modifications at 1, 2, 3, or 4 of the last 4 nucleotides at the 3′ end (3′ tail or conserved portion of an sgRNA). In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-O-Me or 2′-O-moe modifications.
In some embodiments, the first four nucleotides at the 5′ terminus and the last four nucleotides at the 3′ terminus are linked with a PS bond, and the first three nucleotides at the 5′ terminus and the last three nucleotides at the 3′ terminus comprise 2′-F modifications.
In some embodiments, a short-sgRNA is provided, if the nucleotide mentioned is present in the short-sgRNA, wherein LS1, LS6, LS7, LS8, LS11, and LS12 are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the bulge region of the short-sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides in the upper stem region of the short-sgRNA are modified with 2′-O-Me. In some embodiments, N16, N17, and N18 in the nexus region of the short-sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides remaining in the hairpin 1 region of the short-sgRNA are modified with 2′-O-Me. In some embodiments, each of the nucleotides remaining in the hairpin 2 region of the short-sgRNA are modified with 2′-O-Me.
In some embodiments, a short-sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises at least two phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, a short-sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages at the 5′ end. In some embodiments, one or more phorphorothioate bonds link the 5′ terminal nucleotides.
In some embodiments, a short-sgRNA comprising a 5′ end modification and one or more modifications in one or more of: the upper stem region; the hairpin 1 region; and the hairpin 2 region is provided, wherein the 5′ end modification comprises one or more phosphorothioate linkages within the first seven nucleotides of the 5′ terminus.
In some embodiments, the invention comprises a short-sgRNA comprising any one of the modified sequences of SEQ ID Nos: 201-254, and 301-354, wherein the short-sgRNA further comprises a 5′ guide region that is at least partially complementary to a target sequence, and optionally directs a Cas9 to its target for cleavage.
In some embodiments, the invention comprises a short-sgRNA comprising nucleotides having at least 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% identity to the nucleotides of any one of SEQ ID Nos: 1-54, 201-254, and 301-354, wherein the modification pattern is identical to the modification pattern shown in the reference sequence identifier. That is, the nucleotides A, U, C, and G may differ by 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 85, 80, 75, or 70% compared to what is shown in in the sequences, but the modification remains unchanged.
In some embodiments, a short-sgRNA is provided comprising, if the nucleotide mentioned is present in the short guide, 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1 and B2 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides at the 3′ terminus.
In some embodiments, a short-sgRNA is provided comprising, if the nucleotide mentioned is present in the short guide, 2′-O-Me modified nucleotides at: the first three nucleotides in the 5′ terminus; LS1, LS6, LS7, LS8, LS11, and LS12 in the lower stem; B1-B6 in the bulge region; each of the nucleotides in the upper stem region; N16, N17, and N18 in the nexus region; each of the nucleotides in the hairpin 1 region; one nucleotide between hairpin 1 and hairpin 2; each of the nucleotides in the hairpin 2 region; and the last four nucleotides at the 3′ terminus. In some embodiments, the sgRNA further comprises three PS bonds between the first four nucleotides at the 5′ terminus and three PS bonds between the last four nucleotides at the 3′ terminus.
In some embodiments, a short-sgRNA is provided comprising 2′-F modified nucleotides at: LS9 and LS10 in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, a short-sgRNA is provided comprising 2′-F modified nucleotides at: each nucleotide in the lower stem; 15-N18 in the nexus region; H2-9-HS-15 in the hairpin 2 region; and the second to last, third to last, and fourth to last nucleotide in the 3′ terminus region.
In some embodiments, a short-sgRNA is provided comprising, if the nucleotide mentioned is present in the short guide, 2′-OMe modified nucleotides at LS8, LS10, LS12, H1-2, H1-4, H1-6, H1-8, H1-10, H1-12, H2-1, H2-3, H2-5, H2-7, H2-9, H2-11, H2-13, H2-15, and the last and third to last nucleotides at the 3′ terminus; and 2′-F modifications at LS7, LS9, LS11; H1-1, H1-3, H1-5, H1-7, H1-9, H1-11, H1-13, H2-2, H2-4, H2-6, H2-8, H2-10, H2-12, H2-14, and the second to last and fourth to last nucleotide at the 3′ terminus.
Any of the foregoing modification patterns can be combined with a modification pattern set forth in the embodiments described above, e.g., in the summary section or Table 1, to the extent that they are non-overlapping. In the event that combining a foregoing modification pattern with a modification pattern set forth in the summary section or Table 1 would result in incompatible modifications (e.g., the same position would be both 2′-OMe and 2′-fluoro), the modification set forth in the summary section or Table 1 controls.
Compositions comprising any of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs) described herein and a carrier, excipient, diluent, or the like are encompassed. In some instances, the excipient or diluent is inert. In some instances, the excipient or diluent is not inert. In some embodiments, a pharmaceutical formulation is provided comprising any of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs) described herein and a pharmaceutically acceptable carrier, excipient, diluent, or the like. In some embodiments, the pharmaceutical formulation further comprises an LNP. In some embodiments, the pharmaceutical formulation further comprises a Cas9 protein or an mRNA encoding a Cas9 protein. In some embodiments, the pharmaceutical formulation comprises any one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), an LNP, and a Cas9 protein or mRNA encoding a Cas9 protein.
Also provided are kits comprising one or more gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein. In some embodiments, a kit further comprises one or more of a solvent, solution, buffer, each separate from the composition or pharmaceutical formulation, instructions, or desiccant.
Compositions comprising an RNA-Guided DNA Binding Agent or mRNA Encoding RNA-Guided DNA Binding Agent
In some embodiments, compositions or pharmaceutical formulations are provided comprising at least one gRNA (e.g., sgRNA, short-sgRNA, dgRNA, or crRNA) described herein and a nuclease or a nucleic acid (e.g., an mRNA) encoding a nuclease. In some embodiments, the nuclease is an RNA-guided DNA binding agent, such as a Cas protein. In some embodiments, the short-sgRNA together with a Cas protein or nucleic acid (e.g., mRNA) encoding Cas protein is called a Cas RNP. In some embodiments, the RNA-guided DNA binding agent is one that functions with the short-sgRNA to direct a RNA-guided DNA binding agent to a target nucleic acid sequence. In some embodiments, the RNA-guided DNA binding agent is a Cas protein from the Type-II CRISPR/Cas system. In some embodiments, the Cas protein is Cas9. In some embodiments, the Cas9 protein is a wild type Cas9. In some embodiments, the Cas9 protein is derived from the Streptococcus pyogenes Cas9 protein, e.g., a S. pyogenes Cas9 (sypCas9). In some embodiments, compositions are provided comprising at least one short-sgRNA and a nuclease or an mRNA encoding a spyCas9. In some embodiments, the Cas9 protein is not derived from S. pyogenes, but functions in the same way as S. pyogenes Cas9 such that short-sgRNA that is specific to S. pyogenes Cas9 will direct the non-S. pyogenes Cas9 to its target site. In some embodiments, the Cas9 protein is derived from the Staphylococcus aureus Cas9 protein, e.g., a SaCas9. In some embodiments, compositions are provided comprising at least one short-sgRNA and a nuclease or an mRNA encoding a saCas9. In some embodiments, the Cas induces a double strand break in target DNA. Equivalents of spyCas9 and saCas9 protein are encompassed by the embodiments described herein.
RNA-guided DNA binding agents, including Cas9, encompass modified and variants thereof. Modified versions having one catalytic domain, either RuvC or HNH, that is inactive are termed “nickases.” Nickases cut only one strand on the target DNA, thus creating a single-strand break. A single-strand break may also be known as a “nick.” In some embodiments, the compositions and methods comprise nickases. In some embodiments, the compositions and methods comprise a nickase RNA-guided DNA binding agent, such as a nickase Cas9, that induces a nick rather than a double strand break in the target DNA.
In some embodiments, the RNA-guided DNA binding agent may be modified to contain only one functional nuclease domain. For example, the RNA-guided DNA binding agent may be modified such that one of the nuclease domains is mutated or fully or partially deleted to reduce its nucleic acid cleavage activity. In some embodiments, a nickase Cas is used having a RuvC domain with reduced activity. In some embodiments, a nickase Cas is used having an inactive RuvC domain. In some embodiments, a nickase Cas is used having an HNH domain with reduced activity. In some embodiments, a nickase Cas is used having an inactive HNH domain.
In some embodiments, a conserved amino acid within an RNA-guided DNA binding agent nuclease domain is substituted to reduce or alter nuclease activity. In some embodiments, a Cas protein may comprise an amino acid substitution in the RuvC or RuvC-like nuclease domain. Exemplary amino acid substitutions in the RuvC or RuvC-like nuclease domain include D10A (based on the S. pyogenes Cas9 protein). In some embodiments, the Cas protein may comprise an amino acid substitution in the HNH or HNH-like nuclease domain. Exemplary amino acid substitutions in the HNH or HNH-like nuclease domain include E762A, H840A, N863A, H983A, and D986A (based on the spyCas9 protein).
In some embodiments, the RNP complex described herein comprises a nickase or an mRNA encoding a nickase and a pair of gRNAs (one or both of which may be sgRNAs and/or short-sgRNAs) that are complementary to the sense and antisense strands of the target sequence, respectively. In this embodiment, the gRNAs (e.g., sgRNAs and/or short-sgRNAs) direct the nickase to a target sequence and introduce a double stranded break (DSB) by generating a nick on opposite strands of the target sequence (i.e., double nicking). In some embodiments, use of double nicking may improve specificity and reduce off-target effects. In some embodiments, a nickase RNA-guided DNA binding agent is used together with two separate short-sgRNAs targeting opposite strands of DNA to produce a double nick in the target DNA. In some embodiments, a nickase RNA-guided DNA binding agent is used together with two separate gRNAs (e.g., sgRNAs or short-sgRNAs) that are selected to be in close proximity to produce a double nick in the target DNA.
In some embodiments, chimeric Cas proteins are used, where one domain or region of the protein is replaced by a portion of a different protein. In some embodiments, a Cas nuclease domain may be replaced with a domain from a different nuclease such as Fok1. In some embodiments, a Cas protein may be a modified nuclease.
In some embodiments, the Cas protein comprises a fusion protein comprising a catalytically inactive Cas (e.g., Cas9) linked to a heterologous functional domain (see, e.g., WO2014152432). In some embodiments, the catalytically inactive Cas9 is from S. pyogenes. In some embodiments, the catalytically inactive Cas comprises mutations that inactivate the Cas. In some embodiments, the heterologous functional domain is a domain that modifies gene expression, histones, or DNA. In some embodiments, the heterologous functional domain is a transcriptional activation domain or a transcriptional repressor domain.
In some embodiments, the target sequence may be adjacent to a PAM. In some embodiments, the PAM may be adjacent to or within 1, 2, 3, or 4, nucleotides of the 3′ end of the target sequence. The length and the sequence of the PAM may depend on the Cas protein used. For example, the PAM may be selected from a consensus or a particular PAM sequence for a specific Cas9 protein or Cas9 ortholog, including those disclosed in
In some embodiments, an nucleic acid (e.g., mRNA) comprising an ORF encoding an RNA-guided DNA binding agent is used which has one or more of the following features. In some embodiments, the ORF encoding the RNA-guided DNA-binding agent, e.g. a Cas9 nuclease such as an S. pyogenes Cas9, has an adenine content ranging from its minimum adenine content to about 150% of its minimum adenine content. In some embodiments, the adenine content of the ORF is less than or equal to about 145%, 140%, 135%, 130%, 125%, 120%, 115%, 110%, 105%, 104%, 103%, 102%, or 101% of its minimum adenine content. In some embodiments, the ORF has an adenine content equal to its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 150% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 145% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 140% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 135% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 130% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 125% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 120% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 115% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 110% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 105% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 104% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 103% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 102% of its minimum adenine content. In some embodiments, the ORF has an adenine content less than or equal to about 101% of its minimum adenine content.
In some embodiments, the ORF has an adenine dinucleotide content ranging from its minimum adenine dinucleotide content to 200% of its minimum adenine dinucleotide content. In some embodiments, the adenine dinucleotide content of the ORF is less than or equal to about 195%, 190%, 185%, 180%, 175%, 170%, 165%, 160%, 155%, 150%, 145%, 140%, 135%, 130%, 125%, 120%, 115%, 110%, 105%, 104%, 103%, 102%, or 101% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content equal to its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 200% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 195% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 190% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 185% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 180% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 175% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 170% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 165% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 160% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 155% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content equal to its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 150% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 145% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 140% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 135% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 130% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 125% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 120% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 115% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 110% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 105% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 104% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 103% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 102% of its minimum adenine dinucleotide content. In some embodiments, the ORF has an adenine dinucleotide content less than or equal to about 101% of its minimum adenine dinucleotide content.
In some embodiments, the ORF has an adenine dinucleotide content ranging from its minimum adenine dinucleotide content to the adenine dinucleotide content that is 90% or lower of the maximum adenine dinucleotide content of a reference sequence that encodes the same protein as the mRNA in question. In some embodiments, the adenine dinucleotide content of the ORF is less than or equal to about 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5% of the maximum adenine dinucleotide content of a reference sequence that encodes the same protein as the mRNA in question.
In some embodiments, the ORF has an adenine trinucleotide content ranging from 0 adenine trinucleotides to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, or 50 adenine trinucleotides (where a longer run of adenines counts as the number of unique three-adenine segments within it, e.g., an adenine tetranucleotide contains two adenine trinucleotides, an adenine pentanucleotide contains three adenine trinucleotides, etc.). In some embodiments, the ORF has an adenine trinucleotide content ranging from 0% adenine trinucleotides to 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1%, 1.5%, or 2% adenine trinucleotides, where the percentage content of adenine trinucleotides is calculated as the percentage of positions in a sequence that are occupied by adenines that form part of an adenine trinucleotide (or longer run of adenines), such that the sequences UUUAAA and UUUUAAAA would each have an adenine trinucleotide content of 50%. For example, in some embodiments, the ORF has an adenine trinucleotide content less than or equal to 2%. For example, in some embodiments, the ORF has an adenine trinucleotide content less than or equal to 1.5%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 1%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.9%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.8%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.7%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.6%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.5%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.4%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.3%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.2%. In some embodiments, the ORF has an adenine trinucleotide content less than or equal to 0.1%. In some embodiments, a nucleic acid is provided that encodes an RNA-guided DNA-binding agent comprising an ORF containing no adenine trinucleotides.
In some embodiments, the ORF has an adenine trinucleotide content ranging from its minimum adenine trinucleotide content to the adenine trinucleotide content that is 90% or lower of the maximum adenine trinucleotide content of a reference sequence that encodes the same protein as the mRNA in question. In some embodiments, the adenine trinucleotide content of the ORF is less than or equal to about 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5% of the maximum adenine trinucleotide content of a reference sequence that encodes the same protein as the mRNA in question.
A given ORF can be reduced in adenine content or adenine dinucleotide content or adenine trinucleotide content, for example, by using minimal adenine codons in a sufficient fraction of the ORF. For example, an amino acid sequence for an RNA-guided DNA-binding agent can be back-translated into an ORF sequence by converting amino acids to codons, wherein some or all of the ORF uses the exemplary minimal adenine codons shown below. In some embodiments, at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% of the codons in the ORF are codons listed in Table 4.
In some embodiments, a nucleic acid is provided that encodes an RNA-guided DNA-binding agent, e.g. a Cas9 nuclease such as an S. pyogenes Cas9, comprising an ORF consisting of a set of codons of which at least about 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% of the codons are codons listed in Table 4. In some embodiments, the ORF has minimal nucleotide homopolymers, e.g., repetitive strings of the same nucleotides. For example, in some embodiments, when selecting a minimal uridine codon from the codons listed in Table 4, a nucleic acid is constructed by selecting the minimal adenine codons that reduce the number and length of nucleotide homopolymers, e.g., selecting GCG instead of GCC for alanine or selecting GGC instead of GGG for glycine.
In any of the foregoing embodiments, the nucleic acid may be an mRNA.
In some embodiments, at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the codons in an ORF are codons from a codon set shown in Table 5 (e.g., the low U, low A, or low A/U codon set). The codons in the low U, low A, and low A/U sets use codons that minimize the indicated nucleotides while also using codons corresponding to highly expressed tRNAs where more than one option is available. In some embodiments, at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the codons in an ORF are codons from the low U codon set shown in Table 5. In some embodiments, at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the codons in an ORF are codons from the low A codon set shown in Table 5. In some embodiments, at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the codons in an ORF are codons from the low A/U codon set shown in Table 5.
Exemplary Sequences
In some embodiments, the ORF encoding the RNA-guided DNA binding agent comprises a sequence with at least 90%, 93%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 100% identity to any one of SEQ ID NOs: 3502-3522, 3525, 3526, or 3529-3546; and/or the ORF has at least 90%, 93%, 95%, 96%, 97%, 98%, 99%, 99.5%, or 100% identity to any one of SEQ ID NOs: 3502-3522, 3525, 3526, or 3529-3546 over at least its first 50, 200, 250, or 300 nucleotides, or at least 95% identity to any one of SEQ ID NOs: 3502-3522, 3525, 3526, or 3529-3546 over at least its first 30, 50, 70, 100, 150, 200, 250, or 300 nucleotides; and/or the ORF consists of a set of codons of which at least 95%, 96%, 97%, 98%, 99%, 99.5%, or 100% of the codons are codons listed in Table 4 or 5; and/or the ORF has an adenine content ranging from its minimum adenine content to 123% of the minimum adenine content; and/or the ORF has an adenine dinucleotide content ranging from its minimum adenine dinucleotide content to 150% of the minimum adenine dinucleotide content. In some embodiments, the polynucleotide encoding the RNA-guided DNA binding agent comprises a sequence with at least 95%, 96%, 97%, 98%, 99%, 99.5%, or 100% identity to any one of SEQ ID NOs: 3502-3522, 3525, 3526, or 3529-3546.
In some embodiments, the mRNA comprises a sequence with at least 90% identity to any one of SEQ ID NOs: 3501, 3523, 3524, or 3527, wherein the sequence comprises an ORF encoding an RNA-guided DNA binding agent. In some embodiments, the mRNA comprises a sequence with at least 90% identity to any one of SEQ ID NOs: 3501, 3523, 3524, or 3527, wherein the sequence comprises an ORF encoding an RNA-guided DNA binding agent, wherein the first three nucleotides of SEQ ID NOs: 3501, 3523, 3524, or 3527 are omitted. In some embodiments, the mRNA comprises a sequence with at least 90% identity to any one of SEQ ID NOs: 3501, 3523, 3524, or 3527, wherein the sequence comprises an ORF encoding an RNA-guided DNA binding agent, wherein the first three nucleotides of SEQ ID NOs: 3501, 3523, 3524, or 3527 are omitted and/or the ORF coding sequence contained within SEQ ID NO: 3501, 3523, 3524, or 3527 is substituted with the coding sequence of any one of SEQ ID NOs: 3502-3522, 3525, 3526, or 3529-3546. In some embodiments, any of the foregoing levels of identity is at least 95%, at least 98%, at least 99%, or 100%.
In some embodiments, any one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein is for use in preparing a medicament for treating or preventing a disease or disorder in a subject.
In some embodiments, the invention comprises a method of treating or preventing a disease or disorder in subject comprising administering any one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein.
In some embodiments, the invention comprises a method or use of modifying a target DNA comprising, administering or delivering any one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein.
In some embodiments, the invention comprises a method or use for modulation of a target gene comprising, administering or delivering any one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein. In some embodiments, the modulation is editing of the target gene. In some embodiments, the modulation is a change in expression of the protein encoded by the target gene.
In some embodiments, the method or use results in gene editing. In some embodiments, the method or use results in a double-stranded break within the target gene. In some embodiments, the method or use results in formation of indel mutations during non-homologous end joining of the DSB. In some embodiments, the method or use results in an insertion or deletion of nucleotides in a target gene. In some embodiments, the insertion or deletion of nucleotides in a target gene leads to a frameshift mutation or premature stop codon that results in a non-functional protein. In some embodiments, the insertion or deletion of nucleotides in a target gene leads to a knockdown or elimination of target gene expression. In some embodiments, the method or use comprises homology directed repair of a DSB. In some embodiments, the method or use further comprises delivering to the cell a template, wherein at least a part of the template incorporates into a target DNA at or near a double strand break site induced by the nuclease.
In some embodiments, the method or use results in gene modulation. In some embodiments, the gene modulation is an increase or decrease in gene expression, a change in methylation state of DNA, or modification of a histone subunit. In some embodiments, the method or use results in increased or decreased expression of the protein encoded by the target gene.
The efficacy of gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs) can be tested in vitro and in vivo. In some embodiments, the invention comprises one or more of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations described herein, wherein the short-sgRNA results in gene modulation when provided to a cell together with Cas9 or mRNA encoding Cas9. In some embodiments, the efficacy of short-sgRNA can be measured in vitro or in vivo.
In some embodiments, the activity of a Cas RNP comprising a short-sgRNA is compared to the activity of a Cas RNP comprising an unmodified sgRNA or a reference sgRNA lacking modifications present in the sgRNA or short-sgRNA, such as YA site modifications.
In some embodiments, the efficiency of a sgRNA or short-sgRNA in increasing or decreasing target protein expression is determined by measuring the amount of target protein.
In some embodiments, the efficiency of editing with specific gRNAs is determined by the editing present at the target location in the genome following delivery of Cas9 and the gRNA. In some embodiments, the efficiency of editing with specific gRNAs is measured by next-generation sequencing. In some embodiments, the editing percentage of the target region of interest is determined. In some embodiments, the total number of sequence reads with insertions or deletions of nucleotides into the target region of interest over the total number of sequence reads is measured following delivery of a gRNA and Cas9.
In some embodiments, the efficiency of editing with specific gRNAs is measured by the presence of insertions or deletions of nucleotides introduced by successful gene editing. In some embodiments, activity of a Cas9 and gRNAs is tested in biochemical assays. In some embodiments, activity of a Cas9 and gRNAs is tested in a cell-free cleavage assay. In some embodiments, activity of a Cas9 and gRNAs is tested in Neuro2A cells.
In some embodiments, the activity of modified gRNAs is measured after in vivo dosing of LNPs comprising modified gRNAs and Cas protein or mRNA encoding Cas protein.
In some embodiments, in vivo efficacy of a gRNA or composition provided herein is determined by editing efficacy measured in DNA extracted from tissue (e.g., liver tissue) after administration of gRNA and Cas9.
In some embodiments, activation of the subject's immune response is measured by serum concentrations of cytokine(s) following in vivo dosing of sgRNA together with Cas9 mRNA or protein (e.g., formulated in a LNP). In some embodiments, the cytokine is interferon-alpha (IFN-alpha), interleukin 6 (IL-6), monocyte chemotactic protein 1 (MCP-1), and/or tumor necrosis factor alpha (TNF-alpha).
In some embodiments, administration of Cas RNP or Cas9 mRNA together with the modified gRNA (e.g., sgRNA, short-sgRNA, or dgRNA) produces lower serum concentration(s) of immune cytokines compared to administration of unmodified sgRNA. In some embodiments, the invention comprises a method of reducing a subject's serum concentration of immune cytokines comprising, administering any one of the gRNAs disclosed herein, wherein the gRNA produces a lower concentration of immune cytokines in a subject's serum as compared to a control gRNA that is not similarly modified.
LNP Delivery of gRNA
Lipid nanoparticles (LNPs) are a well-known means for delivery of nucleotide and protein cargo, and may be used for delivery of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs), compositions, or pharmaceutical formulations disclosed herein. In some embodiments, the LNPs deliver nucleic acid, protein, or nucleic acid together with protein.
In some embodiments, the invention comprises a method for delivering any one of the gRNAs (e.g., sgRNAs, short-sgRNAs, dgRNAs, or crRNAs) disclosed herein to a subject, wherein the gRNA is associated with an LNP. In some embodiments, the gRNA/LNP is also associated with a Cas9 or an mRNA encoding Cas9.
In some embodiments, the invention comprises a composition comprising any one of the gRNAs disclosed and an LNP. In some embodiments, the composition further comprises a Cas9 or an mRNA encoding Cas9.
In some embodiments, the LNPs comprise cationic lipids. In some embodiments, the LNPs comprise (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate). In some embodiments, the LNPs comprise molar ratios of a cationic lipid amine to RNA phosphate (N:P) of about 4.5.
In some embodiments, LNPs associated with the gRNAs disclosed herein are for use in preparing a medicament for treating a disease or disorder.
Electroporation is a well-known means for delivery of cargo, and any electroporation methodology may be used for delivery of any one of the gRNAs disclosed herein. In some embodiments, electroporation may be used to deliver any one of the gRNAs disclosed herein and Cas9 or an mRNA encoding Cas9.
In some embodiments, the invention comprises a method for delivering any one of the gRNAs disclosed herein to an ex vivo cell, wherein the gRNA is associated with an LNP or not associated with an LNP. In some embodiments, the gRNA/LNP or gRNA is also associated with a Cas9 or an mRNA encoding Cas9.
This description and exemplary embodiments should not be taken as limiting. For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing quantities, percentages, or proportions, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about,” to the extent they are not already so modified. “About” indicates a degree of variation that does not substantially affect the properties of the described subject matter, e.g., within 10%, 5%, 2%, or 1%. Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.
It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the,” and any singular use of any word, include plural referents unless expressly and unequivocally limited to one referent. As used herein, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
The following examples are provided to illustrate certain disclosed embodiments and are not to be construed as limiting the scope of this disclosure in any way.
Synthetic sgRNA and Short-Single Guide RNA (Short-sgRNA)
Single-guide RNA (sgRNA) and short-single guide (short-sgRNA) was chemically synthesized by commercial vendors or using standard in vitro synthesis techniques with modified nucleotides as provided in Table 1.
In Vitro Transcription (“IVT”) of Cas9 mRNA
Capped and polyadenylated Cas9 mRNA containing N1-methyl pseudouridine was generated by in vitro transcription using a linearized plasmid DNA template and T7 RNA polymerase. Plasmid DNA containing a T7 promoter and a sequence for transcription (for producing mRNA comprising an mRNA described herein (see SEQ ID NOs: 3499, 3500, 3501, 3523, 3524, and 3527 for exemplary transcripts and SEQ ID NOs: 3502-3522, 3525, 3526, and 3529-3546 for exemplary ORFs) was linearized by incubating at 37° C. to complete digestion with XbaI with the following conditions. The XbaI may be heat inactivated. The linearized plasmid was purified from enzyme and buffer salts and analyzed by agarose gel to confirm linearization. The IVT reaction to generate Cas9 modified mRNA was incubated at 37° C. for 2-4 hours in the following conditions: 50 ng/μL linearized plasmid; 2 mM each of GTP, ATP, CTP, and N1-methyl pseudo-UTP (Trilink); 10 mM ARCA (Trilink); 5 U/μL T7 RNA polymerase (NEB); 1 U/μL Murine RNase inhibitor (NEB); 0.004 U/μL inorganic E. coli pyrophosphatase (NEB); and 1× reaction buffer. After the 4-hour incubation, TURBO DNase (ThermoFisher) was added to a final concentration of 0.01 U/μL, and the reaction was incubated for an additional 30 minutes to remove the DNA template. The Cas9 mRNA was purified from enzyme and nucleotides using a MegaClear Transcription Clean-up kit per the manufacturer's protocol (ThermoFisher). Alternatively, the mRNA was purified through a precipitation protocol, which in some cases was followed by HPLC-based purification. Briefly, after the DNase digestion, the mRNA was precipitated by adding 0.21× volume of a 7.5 M LiCl solution and mixing, and the precipitated mRNA was pelleted by centrifugation. Once the supernatant was removed, the mRNA was reconstituted in water. The mRNA was precipitated again using ammonium acetate and ethanol. 5M Ammonium acetate was added to the mRNA solution for a final concentration of 2M along with 2× volume of 100% EtOH. The solution was mixed and incubated at −20° C. for 15 min. The precipitated mRNA was again pelleted by centrifugation, the supernatant was removed, and the mRNA was reconstituted in water. As a final step, the mRNA was precipitated using sodium acetate and ethanol. 1/10 volume of 3 M sodium acetate (pH 5.5) was added to the solution along with 2× volume of 100% EtOH. The solution was mixed and incubated at −20° C. for 15 min. The precipitated mRNA was again pelleted by centrifugation, the supernatant was removed, the pellet was washed with 70% cold ethanol and allowed to air dry. The mRNA was reconstituted in water. For HPLC purified mRNA, after the LiCl precipitation and reconstitution, the mRNA was purified by RP-IP HPLC (see, e.g., Kariko, et al. Nucleic Acids Research, 2011, Vol. 39, No. 21 e142). The fractions chosen for pooling were combined and desalted by sodium acetate/ethanol precipitation as described above. The transcript concentration was determined by measuring the light absorbance at 260 nm (Nanodrop), and the transcript was analyzed by capillary electrophoresis by Bioanalyzer (Agilent).
Cas9 mRNA and gRNA Transfections in Neuro2A Cells
The mouse cell line Neuro2A was cultured in DMEM media supplemented with 10% fetal bovine serum and was plated at a density of 15,000 cells/well in a 96-well plate 24 hours prior to transfection. On the day of transfection, the media was aspirated from cells and replaced with fresh media. Lipofectamine-2000 (Invitrogen) was diluted 1:50 (v/v) in Opti-MEM (Invitrogen). Cas9 mRNA and guide RNA were diluted separately in Opti-MEM. Both Cas9 mRNA and gRNA were mixed separately 1:1 (v/v) with diluted Lipofectamine-2000, producing two lipoplexes. After 5 minutes of incubation, lipoplexes were added in succession to cells, for a final concentration of 100 ng Cas9 mRNA and 0.4 μL total lipofection reagent per well. Guides were tested at four dose levels, including 3 nM, 0.3 nM, 0.03 nM, and 0.003 nM. Cells were lysed 24 hours post transfection, and lysates were used directly in the PCR reaction that was analyzed for editing by NGS.
Primary human liver hepatocytes (PHH), primary cynomolgus liver hepatocytes (PCH), or primary mouse liver hepatocytes (PMH) (Thermo Fisher) were cultured per the manufacturer's protocol (Invitrogen, protocol 11.28.2012). In brief, the cells were thawed and resuspended in hepatocyte thawing medium (Thermo Fisher, Cat. CM7000) followed by centrifugation at 100 g for 10 minutes for PHH, 100 g for 4 min for PMH, or 80 g for 4 minutes for PCH. The supernatant was discarded and the pelleted cells resuspended in hepatocyte plating medium plus supplement pack (Invitrogen, Cat. A1217601 and CM3000). Cells were counted and plated on Bio-coat collagen I coated 96-well plates (Thermo Fisher, Cat. 877272) at a density of 30,000-35,000 cells/well for PHH, 50,000-60,000 cells/well for PCH, or 15,000-20,000 cells/well for PMH. Plated cells were allowed to settle and adhere for 4 to 6 hours in a tissue culture incubator at 37° C. and 5% CO2 atmosphere. After incubation, cells were checked for monolayer formation. Cells were then washed with hepatocyte maintenance media/culture media with serum-free supplement pack (Invitrogen, Cat. A1217601 and CM4000) and then fresh hepatocyte maintenance media was added on to the cells
For lipoplex transfection experiments, Lipofectamine RNAiMax (ThermoFisher, Cat. 13778150) based transfections were conducted as per the manufacturer's protocol. Cells were transfected with a single lipoplex containing Spy Cas9 mRNA (100 ng for PMH, 50 ng for PHH and 25 ng for PCH) and OptiMem sgRNA (25 nM for PMH, 12.5 nM for PHH and 0.125 nM for PCH) and OptiMem and Lipofectamine RNAiMax (1 μL/well for PHH and PCH both and 2 uL/well for PMH).
For experiments involving LNP treatment, after 4-6 hours, the plating media was removed, cells were then washed with hepatocyte maintenance media/culture media with serum-free supplement pack (Invitrogen, Cat. A1217601 and CM4000), and replaced with supplemented hepatocyte culture medium (Invitrogen, Cat. A1217601 and CM4000) containing LNP formulated Cas9 mRNA and guide RNA plus 3% serum. LNPs were diluted from a starting dose level of 100 ng Cas9 mRNA and approximately 30 nM guide RNA per well, carrying out serial dilutions down to 0.1 ng mRNA and 0.03 nM guide per well. Cells were incubated for approximately 72 hours at 37° C. and 5% CO2 atmosphere before cell lysis and NGS analysis as described herein.
The human hepatocellular carcinoma cell line HepG2 (American Type Culture Collection, Cat. HB-8065) was cultured in DMEM media containing penstrep supplemented with 10% fetal bovine serum. Cells were counted and plated on Bio-coat collagen I coated 96-well plates (ThermoFisher, Cat. 877272) at a density of 10,000 cells/well in a 96-well plate 24 hours prior to transfection.
After 4-6 hours, the plating media was removed, cells were then washed with hepatocyte maintenance media/culture media with serum-free supplement pack (Invitrogen, Cat. A1217601 and CM4000), and replaced with supplemented hepatocyte culture medium (Invitrogen, Cat. A1217601 and CM4000) containing LNP formulated Cas9 mRNA and guide RNA plus 3% serum. LNPs were diluted from a starting dose level of 100 ng Cas9 mRNA and approximately 30 nM guide RNA per well, carrying out serial dilutions down to 0.1 ng mRNA and 0.03 nM guide per well. Cells were incubated for approximately 72 hours at 37° C. and 5% CO2 atmosphere before cell lysis and NGS analysis as described herein.
LNP Procedure A: LNPs were formulated with a cationic lipid amine to RNA phosphate (N:P) molar ratio of about 4.5. The lipid nanoparticle components were dissolved in 100% ethanol with the following molar ratios: 45 mol-% (12.7 mM) cationic lipid (e.g., (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)-carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)-butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate); 44 mol-% (12.4 mM) helper lipid (e.g., cholesterol); 9 mol-% (2.53 mM) neutral lipid (e.g., DSPC); and 2 mol-% (0.563 mM) PEG (e.g., PEG2k-DMG).
The LNPs were formed by microfluidic mixing of the lipid and RNA solutions using a Precision Nanosystems NanoAssemblr™ Benchtop Instrument, according to the manufacturer's protocol. A 2:1 ratio of aqueous to organic solvent was maintained during mixing using differential flow rates.
The RNA cargo were prepared in 25 mM sodium acetate buffer, pH 4.5, resulting in a concentration of RNA cargo of approximately 0.45 mg/mL, with a ratio of Cas9 mRNA:sgRNA of 1:1 (wt/wt). After mixing, the LNPs were collected, diluted in 50 mM Tris at pH 7.5 (approximately 1:1), and then LNPs were exchanged into 50 mM Tris at pH 7.5 (100-fold excess of sample volume), overnight at 4° C. under gentle stirring using a 10 kDa Slide-a-Lyzer™ G2 Dialysis Cassette (ThermoFisher Scientific). The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve twice the desired concentration. These concentrated LNPs were mixed 1:1 with 50 mM Tris, 90 mM NaCl, 10% sucrose at pH 7.5 (2× TSS). The resulting mixture was then filtered using a 0.2 μM sterile filter. The resulting filtrate was stored at 2-8° C.
LNP Procedure B: LNPs were formulated with a cationic lipid amine to RNA phosphate (N:P) molar ratio of about 4.5. The lipid nanoparticle components were dissolved in 100% ethanol with the following molar ratios: 45 mol-% (12.7 mM) cationic lipid (e.g., (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)-carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((((4-bis(octyloxy)-butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate); 44 mol-% (12.4 mM) helper lipid (e.g., cholesterol); 9 mol-% (2.53 mM) neutral lipid (e.g., DSPC); and 2 mol-% (0.563 mM) PEG (e.g., PEG2k-DMG).
The LNPs were formed by microfluidic mixing of the lipid and RNA solutions using a Precision Nanosystems NanoAssemblr™ Benchtop Instrument, according to the manufacturer's protocol. A 2:1 ratio of aqueous to organic solvent was maintained during mixing using differential flow rates.
The RNA cargo were prepared in 25 mM sodium citrate, 100 mM sodium chloride at pH 5 resulting in a concentration of RNA cargo of approximately 0.45 mg/mL. After mixing, the LNPs were collected in water at the ratio of 3:1. The LNPs were incubated for an hour at room temperature and mixed 1:1 with water. Then they were buffer-exchanged into 1× TSS (50 mM Tris, 45 mM NaCl, 5% sucrose at pH 7.5) on PD-10 columns (GE Healthcare), using manufacturer's protocol. The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve the desired concentration. The resulting mixture was then filtered using a 0.2 μm sterile filter. The resulting filtrate was stored at −80° C.
LNP Procedure C: LNPs were formulated with a cationic lipid amine to RNA phosphate (N:P) molar ratio of about 6. The lipid nanoparticle components were dissolved in 100% ethanol with the following molar ratios: 50 mol-% (12.7 mM) cationic lipid (e.g., (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)-carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)-butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate); 38 mol-% (12.4 mM) helper lipid (e.g., cholesterol); 9 mol-% (2.53 mM) neutral lipid (e.g., DSPC); and 3 mol-% (0.563 mM) PEG (e.g., PEG2k-DMG).
The LNPs were formed by microfluidic mixing of the lipid and RNA solutions using a Precision Nanosystems NanoAssemblr™ Benchtop Instrument, according to the manufacturer's protocol. A 2:1 ratio of aqueous to organic solvent was maintained during mixing using differential flow rates.
The RNA cargo was prepared in 25 mM sodium citrate, 100 mM sodium chloride at pH 5 resulting in a concentration of RNA cargo of approximately 0.45 mg/mL. LNPs were formed by an impinging jet mixing method where one stream of lipids in ethanol were mixed with two streams of RNA in citrate buffer through a 0.25 mm ID Cross piece. The two RNA streams mix perpendicular to the ethanol stream. A fourth stream of water for injection (WFI) meets the resulting particles in an in-line dilution through a 0.5 mm ID Tee piece. All four streams are delivered at 10 mL/min using a syringe pump. These LNPs were incubated for an hour at room temperature and then they were buffer-exchanged into 1× TSS (50 mM Tris, 45 mM NaCl, 5% sucrose at pH 7.5) on PD-10 columns (GE Healthcare), using manufacturer's protocol. The LNPs were concentrated using 10 kDa Amicon spin filter (centrifugation at 4000 g at 4° C.) to achieve the desired concentration. The resulting mixture was then filtered using a 0.2 μm sterile filter. The resulting filtrate was stored at −80° C.
LNP Procedure D: LNPs were formulated with a cationic lipid amine to RNA phosphate (N:P) molar ratio of about 4.5. The lipid nanoparticle components were dissolved in 100% ethanol with the following molar ratios: 45 mol-% (12.7 mM) cationic lipid (e.g., (9Z,12Z)-3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl octadeca-9,12-dienoate, also called 3-((4,4-bis(octyloxy)butanoyl)oxy)-2-((((3-(diethylamino)propoxy)carbonyl)oxy)methyl)propyl (9Z,12Z)-octadeca-9,12-dienoate); 44 mol-% (12.4 mM) helper lipid (e.g., cholesterol); 9 mol-% (2.53 mM) neutral lipid (e.g., DSPC); and 2 mol-% (0.563 mM) PEG (e.g., PEG2k-DMG). The RNA cargo were prepared in 25 mM sodium acetate buffer, pH 4.5, resulting in a concentration of RNA cargo of approximately 0.45 mg/mL, with a ratio of Cas9 mRNA: sgRNA of 1:1 (wt/wt).
LNPs were prepared using a cross-flow technique by impinging jet mixing of the lipid in ethanol with two volumes of RNA solutions and one volume of water. The lipid in ethanol is mixed through a mixing cross with the two volumes of RNA solution. A fourth stream of water is mixed with the outlet stream of the cross through an inline tee. (See WO2016010840
To quantitatively determine the efficiency of editing at the target location in the genome, deep sequencing was utilized to identify the presence of insertions and deletions introduced by gene editing.
PCR primers were designed around the target site (e.g., within the target gene of interest (e.g., TTR)), and the genomic area of interest was amplified.
Additional PCR was performed according to the manufacturer's protocols (Illumina) to add the necessary chemistry for sequencing. The amplicons were sequenced on an Illumina MiSeq instrument. The reads were aligned to the human reference genome (e.g., hg38) after eliminating those having low quality scores. The resulting files containing the reads were mapped to the reference genome (BAM files), where reads that overlapped the target region of interest were selected and the number of wild type reads versus the number of reads which contain an insertion, substitution, or deletion was calculated.
The editing percentage (e.g., the “editing efficiency” or “percent editing”) is defined as the total number of sequence reads with insertions or deletions over the total number of sequence reads, including wild type.
CD-1 female mice, ranging 6-10 weeks of age were used in each study involving mice. Sprague-Dawley female rats, ranging 6-10 weeks of age were used in each study involving rats. Animals were weighed and grouped according to body weight for preparing dosing solutions based on group average weight. LNPs were dosed via the lateral tail vein in a volume of 0.2 mL per animal (approximately 10 mL per kilogram body weight). The animals were observed at approximately 6 hours post dose for adverse effects. Body weight was measured at twenty-four hours post-administration, and animals were euthanized at various time points by exsanguination via cardiac puncture under isoflourane anesthesia. Blood was collected into serum separator tubes or into tubes containing buffered sodium citrate for plasma as described herein. For studies involving in vivo editing, liver tissue was collected from the median lobe from each animal for DNA extraction and analysis.
For the in vivo studies, genomic DNA was extracted from 10 mg of tissue using a bead-based extraction kit, MagMAX-96 DNA Multi-Sample Kit (ThermoFisher, Cat #4413020) according to the manufacturer's protocol, which includes homogenizing the tissue in lysis buffer (approximately 400 μL/10 mg tissue). All DNA samples were normalized to 100 ng/μL concentration for PCR and subsequent NGS analysis, as described herein.
Blood was collected, and the serum was isolated as indicated. The total TTR serum levels were determined using a Mouse Prealbumin (Transthyretin) ELISA Kit (Aviva Systems Biology, Cat. OKIA00111); rat TTR serum levels were measured using a rat specific ELISA kit (Aviva Systems Biology catalog number OKIA00159). Kit reagents and standards were prepared according to the manufacturer's protocol. Mouse serum was diluted to a final dilution of 10,000-fold with 1× assay diluent. This was done by carrying out two sequential 50-fold dilutions resulting in a 2500-fold dilution. A final 4-fold dilution step was carried out for a total sample dilution of 10,000-fold. Both standard curve dilutions (100 μL each) and diluted serum samples were added to each well of the ELISA plate pre-coated with capture antibody. The plate was incubated at room temperature for 30 minutes before washing. Enzyme-antibody conjugate (100 μL per well) was added for a 20-minute incubation. Unbound antibody conjugate was removed and the plate was washed again before the addition of the chromogenic substrate solution. The plate was incubated for 10 minutes before adding 100 μL of the stop solution, e.g., sulfuric acid (approximately 0.3 M). The plate was read on a SpectraMax M5 plate reader at an absorbance of 450 nm. Serum TTR levels were calculated by SoftMax Pro software ver. 6.4.2 using a four parameter logistic curve fit off the standard curve. Final serum values were adjusted for the assay dilution. Percent knockdown (% KD) values were determined relative to controls, which generally were animals sham-treated with vehicle (transport and storage solution or TSS) unless otherwise indicated.
Assays to determine and quantify where sgRNA cleavage occurs upon exposure to hepatocyte cytosol and to assess the effect of sgRNA modifications on stability were performed as follows. sgRNAs at 15 μM were incubated with human liver cytosol (XenoTech Product H0610.C) (adjusted to 0.01 mg/mL final protein concentration using pH 7.4 phosphate-buffered saline unless otherwise indicated) for a time period as indicated below. Reactions were stopped by adding 67 μL of proteinase K cell lysis buffer solution, which consisted of 3.230 mL water, 2.125 mL of tissue and cell lysis solution (Epicentre Product MTC096H), and 340 μL of proteinase K (50 mg/mL from Epicentre Product MPRK092) and incubating for 30 minutes at 65° C. in a thermo-mixer shaking at 750 RPM. 8 μL of 3M KCl was then added and the mixture was incubated for 10 minutes at 0° C. The mixtures were then centrifuged for 15 minutes at 1500 g to precipitate detergent. The supernatant was removed, diluted with 0.95 mL of dilution buffer (consisting of 0.01% Tween20 in water), and mixed with 1 mL of pH 4.3 loading/dilution buffer (consisting of 10 mM sodium acetate, 10% acetonitrile, 0.01% Tween 20, 10 mM EDTA, and 1 mM TCEP) and the mixture was loaded on a Clarity® OTX™ SPE oligonucleotide purification cartridge. Washes were performed at pH 4.3, 5.5, and about 7, followed by elution at pH 9.0. The eluate was dried under vacuum and resuspended in 100 mM triethyl ammonium acetate (TEAAc).
Samples were then analyzed by LC/MS.
A survey of chemical modification within crRNA was performed to identify negative influence of chemical modification at specific positions on editing efficacy. Each crRNA in this survey targeted the identical sequence within the human BCL11A gene. Test guides contained modifications within the spacer region of the crRNA (positions 1-20 from the 5′ end) limited to either a single modified base or two adjacent bases with the same chemical modification as described in Table 6. Phosophothioate bonds (PS), inosine substitution, DNA bases, 2′OMe modifications, and unlocked nucleic acids (UNA) were assayed.
The human embryonic kidney adenocarcinoma cell line HEK293 constitutively expressing Spy Cas9 (“HEK293_Cas9”) was cultured in DMEM media supplemented with 10% fetal bovine serum. Cells were plated at a density of 15,000 cells/well in a 96-well plate 20 hours prior to transfection (˜70% confluent at time of transfection). Cells were transfected with Lipofectamine RNAiMAX (ThermoFisher, Cat. 13778150) according to the manufacturer's protocol. Cells were transfected with a lipoplex containing individual crRNA (3.1 nM), trRNA TR000002 (3.1 nM), Lipofectamine RNAiMAX (0.45 μL/well) and OptiMem. Cells were lysed 48 hours post transfection, and lysates were used directly in the PCR reaction that was analyzed for editing by NGS.
The editing results are shown in Table 6 and
The impacts of chemical modification type and position were evaluated in an editing screen of modified crRNAs. The screen assayed guides modified with 2′F, 2′OMe and PS. The complete pattern set was applied to guides targeting six distinct sites in the TTR gene. The final data set contained 1704 distinct guides and 284 unique modification patterns.
Guide modification patterns were computationally selected to minimize the number of modification combinations required to explore the large combinatorial space of possible modification patterns. Patterns were chosen to create a uniform distribution of modifications at each individual position and also at each pair of positions, so that no particular position or combination of positions was over-represented in the final set. This bias-minimization approach allowed for testing of individual positional effects as well as for detection of higher-order interaction effects between positions. Appropriate controls were added to the set to control for nuisance effects such as guide domain sequence, transfection efficiency and other experimental variability.
Each pattern in this set contained only one type of modification on positions 4 to 20; modification types were not mixed within patterns. The final set of patterns consisted of 3 groups of 96 patterns on positions 4 to 20 having 0-4 2′F modifications, 0-4 2′Ome modifications or 0-15 PS modifications. PS modifications were more heavily applied because previous observations indicated that the PS modification is better tolerated than 2′Flu or 2′Ome, and therefore less likely to show detectable effects when present in small numbers.
The human embryonic kidney adenocarcinoma cell line HEK293 constitutively expressing Spy Cas9 (“HEK293_Cas9”) was cultured in DMEM media supplemented with 10% fetal bovine serum. Cells were plated at a density of 10,000 cells/well in a 96-well plate about 24 hours prior to transfection (˜70% confluent at time of transfection). Cells were transfected with Lipofectamine RNAiMAX (ThermoFisher, Cat. 13778150) according to the manufacturer's protocol. Cells were transfected with a lipoplex containing individual crRNA (25 nM), trRNA TR009880 (25 nM), Lipofectamine RNAiMAX (0.3 μL/well) and OptiMem. Cells were lysed 48 hours post transfection, and lysates were used directly in the PCR reaction that was analyzed for editing by NGS.
The editing results are described in Table 8. Each row number represents a single modification pattern. The first two rows in the table show controls.
The overall impact of the modification type, targeting location, and guide domain sequence were evaluated in the modified guides. The data show a wide range of activity within the modified oligonucleotides, from close to 0 to nearly 90% editing. In general, the 2′F modifications were more well-tolerated than 2′OMe or PS modified guides, though the PS guides were more heavily modified, on average, than the other modifications. We observed a clear impact of guide domain's nucleobase sequence on response to modification. G480 and G490 variants were strongly impaired by 2′OMe modifications, but were resistant to 2′F and PS modifications, whereas G494, G499 and G502 variants were most strongly impacted by PS modifications, and less so by 2′OMe, and G488 responded similarly to all three modifications.
A regression-based analysis was conducted to identify the modification types with significant impact on guide activity. The editing data were first corrected for guide sequence and plate effects prior to modeling. Positional modification impacts were then modeled as independent linearly additive factors using standard regression techniques. A separate analysis examined whether there was evidence of interaction and non-linear relationships among positions. No significant higher-order effects were observed; the results reported below come from the initial linear regression analysis. The editing data for all the modified guides were corrected for guide sequence effects and then modeled for modification impact.
2′ modifications at a number of positions displayed negative impacts on editing as shown in Table 7 and
The impacts of chemical modification type and position within the guide domain were evaluated in an editing screen of modified sgRNAs. The screen assayed guides modified with 2′F, 2′OMe and PS. The complete pattern set was applied to the nucleobase sequence of three guides. Test modification patterns in this example were applied to three guide domain nucleobase sequences, specifically those described in Table 1 for G000486, G000502, or G000415. Guide domain modification patterns were assayed either the conserved region described in SEQ ID No. 695 or in a short-sgRNA format using the conserved region described in SEQ ID No. 253. The final data set contained 270 distinct guides and 45 unique modification patterns in the guide domain.
Rows 1-12 in Tables 9-12 show test guides assessed for editing efficacy with 2′OMe, 2′F, PS, and 2′H modifications at positions 5, 12, and 15. Rows 13-19 in Tables 9-12 show editing data for variants that substitute 2′F single modification for 2′F+PS modifications at positions 8-10. Rows 20-27 in Tables 9-12 show editing data for variants that substitute 2′F for PS at positions 4-20.
Editing was assayed in PCH and PHH cells as described in Example 1 with the following modifications. Cells were counted and plated at a density of 30,000-35,000 cells/well for PHH, and 40,000-60,000 cells/well for PCH Transfections used pre-mixed lipid formulations in which the lipid components were reconstituted in 100% ethanol at a molar ratio of 50% Lipid A, 9% DSPC, 38% cholesterol, and 3% PEG2k-DMG. The lipid mixture was then mixed with RNA cargos (e.g., Cas9 mRNA and gRNA) at a lipid amine to RNA phosphate (N:P) molar ratio of about 6.0. Transfections were performed with a final concentration of 100 nM gRNA, 3% cyno serum, and 50 ng Cas9 mRNA per well. Cells were incubated for approximately 48 hours before cell lysis and NGS analysis. The experiment was performed in duplicate. The editing results are described in Tables 9-12. Each row represents a single modification pattern design with the same conserved region. Rows 46-48 are controls.
The data were analyzed to estimate the impact of several variables, including whether the guide was or was not a short-sgRNA, variable region modification pattern, and individual modification position, where possible. Short-sgRNA guides were significantly more active than non-short-sgRNA guides in PCH (all sites) and PHH (G502 variants). In PCH, short-sgRNA guides added an additional 14% over equivalent non-short-sgRNAs.
Many of the modification patterns in this study were designed to incorporate and further test well tolerated modifications into highly modified gRNA molecules. In general this was successful; nearly all patterns showed similar activity to controls overall. A number of individual positions were also tested in this study. Position 5, 12 and 15 were modified individually. Position 5 was highly tolerant of modification. Position 12 was tolerant of PS, 2′-F and 2′-OMe, but was significantly sensitive to 2′-H modification (reduction of editing percentage by 7.5, p<0.00002). Position 15 was tolerant of 2′-H modification, but as in other work presented here, highly sensitive to 2′F and 2′OMe (p<10−13).
sgRNAs targeting the human TTR gene were designed as shown in Table 1 and transfected into primary cyno hepatocytes (PCH), primary human hepatocytes (PHH), and HepG2 cells in vitro at concentrations as indicated in the figures and editing efficiency (e.g., percent editing) was measured by NGS, as described in Example 1. The LNPs used in these transfections were made according to LNP Procedure C in Example 1(F).
Dose response curves of editing efficiency by concentration are shown in
5′- and 3′-end-labeled versions of G282, G480, G481, G502, and G504 were assayed as described in Example 1(K). Fragment lengths were mapped onto the G282 sequence (
Cleavage was consistently observed following nucleotides 25, 45, 50, 64, and 67 in G282, G480, G481, G502, and G504 (except that for G481, there was little to no cleavage at position 25) (
G502 was compared to sgRNAs with additional modifications in the guide domain (
In G9571, cleavage after positions 8 and 11 was reduced or eliminated, consistent with nuclease protection by the 2′-fluoro modifications of these positions (
In G10015, cleavage after positions 4, 8, and 11 was reduced or eliminated, consistent with nuclease protection by the 2′-OMe modification of position 4 and the 2′-fluoro modifications of positions 8 and 11 (
Assays on G282, G480, G481, G502, G504, and G509 assembled into ribonucleoproteins (RNPs) with Cas9 were also performed using a higher HLC concentration of 8.5 mg/ml but otherwise following the procedure described above. The RNPs showed reduced susceptibility compared to the experiments using sgRNA alone despite the higher HLC concentration, indicating that the sgRNA within an RNP is less accessible to nuclease, but the cleavage pattern remained qualitatively similar, with most cleavage occurring at YA sites (data not shown).
G10039, which comprises modifications at all YA sites, was assayed with 0.01 mg/ml HLC and found to show only a very small amount of cleavage at position 16, consistent with protection by the phosphorothioate modification at that position (
G10039 (as free sgRNA) was also treated with 8.5 mg/ml HLC. Degradation increased at position 16 and was also observed at several other positions, some of which were not YA sites (
A series of sgRNAs were designed by systematically introducing 2′-OMe modifications at individual YA sites in the conserved regions that were unmodified in G282. Thus, the sgRNAs sequentially numbered from G9989-G9994 have 2′-OMe modifications at positions 25, 45, 50, 56, 64, and 67, respectively, which are positions LS5, LS7, LS12, N6, N14, and N17 as shown in
Similarly, a series of sgRNAs were designed by systematically introducing 2′-fluoro modifications at individual YA sites in the conserved regions that were unmodified in G282. Thus, the sgRNAs sequentially numbered from G9995-G10000 have 2′-fluoro modifications at positions 25, 45, 50, 56, 64, and 67, respectively. The sgRNAs sequentially numbered from G10025-G10030 have the same 2′-fluoro modifications as G9995-G10000, respectively, but the nucleobase sequence is identical to that of G502 instead of G282.
A further series of sgRNAs were designed by systematically introducing phosphorothioate modifications at individual YA sites in the conserved regions that were unmodified in G282. Thus, the sgRNAs sequentially numbered from G10001-G10006 have 2′-fluoro modifications at positions 25, 45, 50, 56, 64, and 67. The sgRNAs sequentially numbered from G10031-G10036 have the same phosphorothioate modifications as G10001-G10006, respectively, but the nucleobase sequence is identical to that of G502 instead of G282.
Also tested were guides modified with ENA (G9878, G10007, and G10008 had the G282 sequence, and G10037 and G10038 had the G502 sequence). The modifications in G10007 and G10037 were at the 45th and 50th nucleotides (positions LS7 and LS12 as indicated in
Also tested were guides modified with deoxyribonucleotides (G9423-G9427) and UNA (G9879), all of which had the G282 sequence. The positions of the modifications in these guides are shown in the Sequence Table.
The guides described above were incorporated into lipoplexes and transfected into PMH as described above and percent editing was determined (
Several series of modified sgRNAs were designed based on the nucleobase sequences of G000282 and G000502. Specific modifications are described in Table 1. G00282 variants were assayed for editing in duplicate (unless otherwise noted) in primary mouse hepatocyte cells (PMH) in vitro. G000502 variants were similarly assayed in duplicate (unless otherwise noted) in primary cyno hepatocyte (PCH) and PMH cells in vitro. All data is reported in Tables 14 and 15 below.
A series of sgRNAs were designed assaying modifications in the guide domain combined with the modified conserved region described in SEQ ID No. 201. The sgRNAs sequentially numbered from G012421-G012425, G012689, G012690, G012426-G012431 have the same modifications as G012693-G012705, respectively, but the nucleobase sequence is identical to that of G000502 instead of 0000282. Data for these guides are presented in Table 14.
Similarly, a series of sgRNAs were designed assaying modifications in the conserved region combined with the modified guide domain of either the G0000502 variant, 6012402, or the modified guide domain of G000282 variant, G009533. The sgRNAs sequentially numbered from G012432-G012438, G012691, G012439-G012440, G012692 have the same modifications as G012706-G12716, respectively, but the nucleobase sequence is identical to that of G000502 instead of G000282. Data for these guides are presented in Table 14.
A further series of sgRNAs were designed combining various guide domain and conserved region modification patterns. The sgRNAs sequentially numbered from G012441-G012451 sequentially have the same modifications as G012717-G012727, respectively, but the nucleobase sequence is identical to that of G000502 instead of G000282. Data for these guides are presented in Table 14 and
Similarly, a series of sgRNAs were designed assaying modifications in the context of short single guide variants of G000282 and G000502. The sgRNAs sequentially numbered from G012452-G012461 are based on the short guide variant of G000502, specifically G012401. These modified variants have the same modifications as G012728-G12737, respectively, but the nucleobase sequence of G012728-G12737 is identical to that of G000639, the short guide variant of G000282. Data for this series of guides are presented in Table 14.
Lastly, a series of sgRNAs as indicated in Table 15 were designed assaying modifications in the nucleobase sequence of G000502 (See Table 1 for sgRNA nucleotide sequences). Data for this series of guides are presented in Table 15 and
Lipid nanoparticle (LNP) formulations of modified sgRNAs targeting were tested on primary human hepatocytes and primary cynomolgus hepatocytes in a dose response assay with guides targeting the human genes HAO1 or LDHA. All methods are as described in Example 1 unless otherwise noted. Both cell lines were incubated at 37° C., 5% CO2 for 48 hours prior to treatment with LNPs. LNPs were incubated in media containing 3% cynomolgus serum at 37° C. for 10 minutes and administered to cells in amounts as further provided herein. Post-incubation the LNPs were added to the human or cynomolgus hepatocytes in an 8 point 3-fold dose response curve starting at 300 ng Cas9 mRNA. The cells were lysed 96 hours post-treatment for NGS analysis as described in Example 1
Table 16 shows the average editing and standard deviation the tested control sgRNAs at 10.75 nM delivered with Spy Cas9 via LNP in PHH and PCH. These samples were generated in triplicate.
Table 17 shows the average editing and standard deviation for sgRNAs targing HAO1 delivered with Spy Cas9 via LNP to PHH or PCH. These samples were generated in at least duplicate. The dose response curve plot for these data are shown in
Table 18 shows the average editing and standard deviation for sgRNAs targeting SerpinA1 delivered with Spy Cas9 via LNP to PHH. G000480 and G000502 are controls that target TTR. These samples were generated in at least duplicate. The dose response curve plot for these data are shown in
Table 19 shows the average editing and standard deviation for sgRNAs targeting SerpinA1 delivered with Spy Cas9 via LNP to PCH. G000480 and G000502 are controls that target TTR. These samples were generated in at least duplicate. The dose response curve plot for these data are shown in
LNPs prepared as described above in Example 1(F), comprising chemically synthesized sgRNAs (including short-sgRNAs) targeting the mouse TTR gene and IVT Cas9 mRNA in a 1:1 weight ratio, were administered to CD-1 female mice (N indicated below) or Sprague-Dawley female rats as described above in Example 1(H). Eight days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively, as described above in Example 1. Animals were weighed 24 hours post dose for overall wellness assessment.
The same sgRNAs listed in Tables 23A and 23B (Table 1) were tested in vitro by transfection into Neuro2A cells, as per Example 1(C). Results are shown in
LNPs prepared as described above in Example 1(F) (LNP Procedure D), comprising chemically synthesized sgRNAs targeting the mouse TTR gene and IVT Cas9 mRNA in a 1:1 weight ratio, were administered to CD-1 female mice (N indicated below) as described above in Example 1(E). Eight days post dose at necropsy, livers and blood were collected for NGS measurements of editing efficiency and serum TTR analysis, respectively, as described above in Example 1. Animals were weighed 24 hours post dose for overall wellness assessment.
Table 40 shows the editing efficiency and TTR protein levels, respectively, for LNPs containing the indicated sgRNAs (See Table 1 for sgRNA nucleotide sequences) which all target the same sequence in the TTR gene. The LNPs were made according to LNP Procedure D in Example 1(F). The data shown in Table 40 are from CD-1 female mice administered 0.1 mg/kg of total RNA.
Chemically synthesized sgRNAs (G502 and G9565-G9576) and IVT Cas9 mRNA were administered to primary hepatocytes as lipoplex transfections or LNP transfections as described in Example 1(D) and 1(F) (LNP Procedure D), respectively. Editing of the TTR gene was determined by NGS as described above in Example 1(G). The same sgRNAs were also administered to CD-1 female mice as described in Example 5, particularly the section describing data shown in
The % editing from in vitro lipoplex transfections of PMH compared to in vivo editing is shown
The % editing from in vitro LNP transfections of PMH (at 0.3 ng, 1 ng, 3 ng, 10 ng, and 30 ng) compared to in vivo editing are shown in
This application is a Continuation of International Application No. PCT/US2019/036160, filed on Jun. 7, 2019, which claims the benefit of U.S. Provisional Patent Application No. 62/682,838, filed Jun. 8, 2018, and U.S. Provisional Patent Application No. 62/682,820, filed Jun. 8, 2018, each of which is incorporated herein by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
62682838 | Jun 2018 | US | |
62682820 | Jun 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2019/036160 | Jun 2019 | US |
Child | 17111769 | US |