The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Sep. 7, 2021, is named 2021-09-07_01245-0024-00US_ST25.txt and is 646,734 bytes in size.
Muscular dystrophies (MD) are a group of more than 30 genetic diseases characterized by progressive weakness and degeneration of the skeletal muscles that control movement. Duchenne muscular dystrophy (DMD) is one of the most severe forms of MD that affects approximately 1 in 5000 boys and is characterized by progressive muscle weakness and premature death. Cardiomyopathy and heart failure are common, incurable and lethal features of DMD. The disease is caused by mutations in the gene encoding dystrophin (DMD), which result in loss of expression of dystrophin, causing muscle membrane fragility and progressive muscle wasting.
CRISPR-based genome editing can provide sequence-specific cleavage of genomic DNA using a Cas9 and a guide RNA. For example, a nucleic acid encoding the Cas9 enzyme and a nucleic acid encoding for the appropriate guide RNA can be provided on separate vectors or together on a single vector and administered in vivo or in vitro to knockout or correct a genetic mutation, for example. The approximately 20 nucleotides at the 5′ end of the guide RNA serves as the guide or spacer sequence that can be any sequence complementary to one strand of a genomic target location that has an adjacent protospacer adjacent motif (PAM). The PAM sequence is a short sequence adjacent to the Cas9 nuclease cut site that the Cas9 molecule requires for appropriate binding. The nucleotides 3′ of the guide or spacer sequence of the guide RNA serve as a scaffold sequence for interacting with Cas9. When a guide RNA and a Cas9 are expressed, the guide RNA will bind to Cas9 and direct it to the sequence complementary to the guide sequence, where it will then initiate a double-stranded break (DSB). To repair these breaks, cells typically use an error prone mechanism of non-homologous end joining (NHEJ) which can lead to disruption of function in the target gene through insertions or deletion of codons, shifts in the reading frame, or result in a premature stop codon triggering nonsense-mediated decay. See, e.g., Kumar et al. (2018) Front. Mol. Neurosci. Vol. 11, Article 413.
While gene editing strategies using systems (e.g., CRISPR) for treating DMD have been previously explored, these strategies have focused primarily on either: a) cutting at multiple different sites to excise large portions (e.g., one or more exons) of the dystrophin gene (see, e.g., Ousterout et al., 2015, Nat Commun 6:6244); or b) cutting in a single site to introduce indels that either result in a frame-shifting mutation and/or that destroy a splice acceptor/donor site in the dystrophin gene (see, e.g., Amoassi et al., 2018, Science, 362(6410):86-91). However, there remains a need for additional alternative and effective gene editing strategies for treating diseases like DMD.
In order for gene editing systems like CRISPR to be effective in treating diseases and disorders like DMD in a patient, effective delivery vectors of the CRISPR system components are necessary. Adeno-associated virus (AAV) administration of the CRISPR-Cas components in vivo or in vitro is attractive due to the early and ongoing successes of AAV vector design, manufacturing, and clinical stage administration for gene therapy. See, e.g., Wang et al. (2019) Nature Reviews Drug Discovery 18:358-378; Ran et al. (2015a) Nature 520: 186-101. However, the commonly used Streptococcus pyogenes (spCas9) is very large, and when used in AAV-based CRISPR/Cas systems, requires two AAV vectors—one vector carrying the nucleic acid encoding the spCas9, and the other carrying the nucleic acid encoding the guide RNA. One possible way to overcome this technical hurdle is to take advantage of the smaller orthologs of Cas9 derived from different prokaryotic species. Smaller Cas9's such as Staphylococcus aureus (SaCas9) and Staphylococcus lugdunensis (SluCas9) may be able to be manufactured on a single AAV vector together with a nucleic acid encoding one or more guide RNAs. One advantage of incorporating one or more guide RNAs on a single vector together with the smaller SaCas9 or SluCas9 is that doing so allows extreme design flexibility in situations where more than one guide RNA is desired for optimal performance. For example, one vector may be utilized to express SaCas9 or SluCas9 and one or more guide RNAs targeting a first genomic target (e.g., a pair of guide RNAs that together bind regions flanking a genomic target), and a second vector may be utilized to express multiple copies of the same (e.g., the same pair of guides in the first vector) or different guide RNAs targeting the same or a different genomic target. As another example, multiple copies of the same guide RNA may be beneficial, and the use of smaller Cas9's allow for multiple copies of guide RNA to be incorporated on the same vector as the Cas9, and also for even more copies of guide RNA when combined with a second vector. Compositions and methods utilizing these configurations have the benefit of reducing manufacturing costs, reducing complexity of administration routes and protocols, and allowing maximum flexibility with regard to using multiple copies of the same or different guide RNAs targeting the same or different genomic target sequences. In some instances, providing multiple copies of the same guide RNA improves the efficiency of the guide, improving an already successful system.
Provided herein are compositions and methods for treating DMD utilizing the smaller Cas9s from Staphylococcus aureus (SaCas9) and Staphylococcus lugdunensis (SluCas9). Compositions comprising a single AAV vector comprising a nucleic acid molecule encoding SaCas9 or SluCas9 and a guide RNA are provided.
Accordingly, the following non-limiting embodiments are provided.
a. a single nucleic acid molecule comprising:
b. two nucleic acid molecules comprising:
wherein the guide RNA(s) target a region in the dystrophin gene.
a nucleic acid encoding a Staphylococcus lugdunensis (SluCas9).
wherein the pair of guide RNAs and SaCas9 excise a portion of the exon.
wherein the pair of guide RNAs and SaCas9 excise a portion of the exon.
Reference will now be made in detail to certain embodiments of the invention, examples of which are illustrated in the accompanying drawings. While the invention is described in conjunction with the illustrated embodiments, it will be understood that they are not intended to limit the invention to those embodiments. On the contrary, the invention is intended to cover all alternatives, modifications, and equivalents, which may be included within the invention as defined by the appended claims and included embodiments.
Before describing the present teachings in detail, it is to be understood that the disclosure is not limited to specific compositions or process steps, as such may vary. It should be noted that, as used in this specification and the appended claims, the singular form “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. Thus, for example, reference to “a guide” includes a plurality of guides and reference to “a cell” includes a plurality of cells and the like.
Numeric ranges are inclusive of the numbers defining the range. Measured and measurable values are understood to be approximate, taking into account significant digits and the error associated with the measurement. Also, the use of “comprise”, “comprises”, “comprising”, “contain”, “contains”, “containing”, “include”, “includes”, and “including” are not intended to be limiting. It is to be understood that both the foregoing general description and detailed description are exemplary and explanatory only and are not restrictive of the teachings.
Unless specifically noted in the specification, embodiments in the specification that recite “comprising” various components are also contemplated as “consisting of” or “consisting essentially of” the recited components; embodiments in the specification that recite “consisting of” various components are also contemplated as “comprising” or “consisting essentially of” the recited components; and embodiments in the specification that recite “consisting essentially of” various components are also contemplated as “consisting of” or “comprising” the recited components (this interchangeability does not apply to the use of these terms in the claims). The term “or” is used in an inclusive sense, i.e., equivalent to “and/or,” unless the context clearly indicates otherwise.
The section headings used herein are for organizational purposes only and are not to be construed as limiting the desired subject matter in any way. In the event that any material incorporated by reference contradicts any term defined in this specification or any other express content of this specification, this specification controls. While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
Unless stated otherwise, the following terms and phrases as used herein are intended to have the following meanings:
“Polynucleotide,” “nucleic acid,” and “nucleic acid molecule,” are used herein to refer to a multimeric compound comprising nucleosides or nucleoside analogs which have nitrogenous heterocyclic bases or base analogs linked together along a backbone, including conventional RNA, DNA, mixed RNA-DNA, and polymers that are analogs thereof. A nucleic acid “backbone” can be made up of a variety of linkages, including one or more of sugar-phosphodiester linkages, peptide-nucleic acid bonds (“peptide nucleic acids” or PNA; PCT No. WO 95/32305), phosphorothioate linkages, methylphosphonate linkages, or combinations thereof. Sugar moieties of a nucleic acid can be ribose, deoxyribose, or similar compounds with substitutions, e.g., 2′ methoxy or 2′ halide substitutions. Nitrogenous bases can be conventional bases (A, G, C, T, U), analogs thereof (e.g., modified uridines such as 5-methoxyuridine, pseudouridine, or N1-methylpseudouridine, or others); inosine; derivatives of purines or pyrimidines (e.g., N4-methyl deoxyguanosine, deaza- or aza-purines, deaza- or aza-pyrimidines, pyrimidine bases with substituent groups at the 5 or 6 position (e.g., 5-methylcytosine), purine bases with a substituent at the 2, 6, or 8 positions, 2-amino-6-methylaminopurine, O6-methylguanine, 4-thio-pyrimidines, 4-amino-pyrimidines, 4-dimethylhydrazine-pyrimidines, and O4-alkyl-pyrimidines; U.S. Pat. No. 5,378,825 and PCT No. WO 93/13121). For general discussion see The Biochemistry of the Nucleic Acids 5-36, Adams et al., ed., 11th ed., 1992). Nucleic acids can include one or more “abasic” residues where the backbone includes no nitrogenous base for position(s) of the polymer (U.S. Pat. No. 5,585,481). A nucleic acid can comprise only conventional RNA or DNA sugars, bases and linkages, or can include both conventional components and substitutions (e.g., conventional bases with 2′ methoxy linkages, or polymers containing both conventional bases and one or more base analogs). Nucleic acid includes “locked nucleic acid” (LNA), an analogue containing one or more LNA nucleotide monomers with a bicyclic furanose unit locked in an RNA mimicking sugar conformation, which enhance hybridization affinity toward complementary RNA and DNA sequences (Vester and Wengel, 2004, Biochemistry 43(42):13233-41). RNA and DNA have different sugar moieties and can differ by the presence of uracil or analogs thereof in RNA and thymine or analogs thereof in DNA.
“Guide RNA”, “guide RNA”, and simply “guide” are used herein interchangeably to refer to either a crRNA (also known as CRISPR RNA), or the combination of a crRNA and a trRNA (also known as tracrRNA). The crRNA and trRNA may be associated as a single RNA molecule (single guide RNA, sgRNA) or in two separate RNA molecules (dual guide RNA, dgRNA). “Guide RNA” or “guide RNA” refers to each type. The trRNA may be a naturally-occurring sequence, or a trRNA sequence with modifications or variations compared to naturally-occurring sequences. For clarity, the terms “guide RNA” or “guide” as used herein, and unless specifically stated otherwise, may refer to an RNA molecule (comprising A, C, G, and U nucleotides) or to a DNA molecule encoding such an RNA molecule (comprising A, C, G, and T nucleotides) or complementary sequences thereof. In general, in the case of a DNA nucleic acid construct encoding a guide RNA, the U residues in any of the RNA sequences described herein may be replaced with T residues, and in the case of a guide RNA construct encoded by any of the DNA sequences described herein, the T residues may be replaced with U residues.
As used herein, a “spacer sequence,” sometimes also referred to herein and in the literature as a “spacer,” “protospacer,” “guide sequence,” or “targeting sequence” refers to a sequence within a guide RNA that is complementary to a target sequence and functions to direct a guide RNA to a target sequence for cleavage by a Cas9. A guide sequence can be 24, 23, 22, 21, 20 or fewer base pairs in length, e.g., in the case of Staphylococcus lugdunensis (i.e., SluCas9) or Staphylococcus aureus (i.e., SaCas9) and related Cas9 homologs/orthologs. In preferred embodiments, a guide/spacer sequence in the case of SluCas9 or SaCas9 is at least 20 base pairs in length, or more specifically, within 20-25 base pairs in length (see, e.g., Schmidt et al., 2021, Nature Communications, “Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases”). Shorter or longer sequences can also be used as guides, e.g., 15-, 16-, 17-, 18-, 19-, 20-, 21-, 22-, 23-, 24-, or 25-nucleotides in length. For example, in some embodiments, the guide sequence comprises at least 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of a sequence selected from SEQ ID NOs: 1-35 (for SaCas9), and 100-225 (for SluCas9). In some embodiments, the guide sequence comprises a sequence selected from SEQ ID NOs: 1-35, 100-225, 1000-1078, 2000-2116, 3000-3069, or 4000-4251. In some embodiments, the target sequence is in a gene or on a chromosome, for example, and is complementary to the guide sequence. In some embodiments, the degree of complementarity or identity between a guide sequence and its corresponding target sequence may be about 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%. For example, in some embodiments, the guide sequence comprises a sequence with about 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identity to at least 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of a sequence selected from SEQ ID NOs: 1-35, 100-225, 1000-1078, 2000-2116, 3000-3069, or 4000-4251. In some embodiments, the guide sequence comprises a sequence with about 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identity to a sequence selected from SEQ ID NOs: 1-35, 100-225, 1000-1078, 2000-2116, 3000-3069, or 4000-4251. In some embodiments, the guide sequence and the target region may be 100% complementary or identical. In other embodiments, the guide sequence and the target region may contain at least one mismatch. For example, the guide sequence and the target sequence may contain 1, 2, 3, or 4 mismatches, where the total length of the target sequence is at least 17, 18, 19, 20 or more base pairs. In some embodiments, the guide sequence and the target region may contain 1-4 mismatches where the guide sequence comprises at least 17, 18, 19, 20 or more nucleotides. In some embodiments, the guide sequence and the target region may contain 1, 2, 3, or 4 mismatches where the guide sequence comprises 20 nucleotides. In some embodiments, the guide sequence and the target region do not contain any mismatches.
In some embodiments, the guide sequence comprises a sequence selected from SEQ ID NOs: 1-35, 100-225, 1000-1078, 2000-2116, 3000-3069, or 4000-4251, wherein if the 5′ terminal nucleotide is not guanine, one or more guanine (g) is added to the sequence at its 5′ end. The 5′ g or gg is required in some instances for transcription, for example, for expression by the RNA polymerase III-dependent U6 promoter or the T7 promoter. In some embodiments, a 5′ guanine is added to any one of the guide sequences or pairs of guide sequences disclosed herein.
Target sequences for Cas9s include both the positive and negative strands of genomic DNA (i.e., the sequence given and the sequence's reverse compliment), as a nucleic acid substrate for a Cas9 is a double stranded nucleic acid. Accordingly, where a guide sequence is said to be “complementary to a target sequence”, it is to be understood that the guide sequence may direct a guide RNA to bind to the reverse complement of a target sequence. Thus, in some embodiments, where the guide sequence binds the reverse complement of a target sequence, the guide sequence is identical to certain nucleotides of the target sequence (e.g., the target sequence not including the PAM) except for the substitution of U for T in the guide sequence.
As used herein, “ribonucleoprotein” (RNP) or “RNP complex” refers to a guide RNA together with a Cas9. In some embodiments, the guide RNA guides the Cas9 such as Cas9 to a target sequence, and the guide RNA hybridizes with and the agent binds to the target sequence, which can be followed by cleaving or nicking (in the context of a modified “nickase” Cas9).
As used herein, a first sequence is considered to “comprise a sequence with at least X % identity to” a second sequence if an alignment of the first sequence to the second sequence shows that X % or more of the positions of the second sequence in its entirety are matched by the first sequence. For example, the sequence AAGA comprises a sequence with 100% identity to the sequence AAG because an alignment would give 100% identity in that there are matches to all three positions of the second sequence. The differences between RNA and DNA (generally the exchange of uridine for thymidine or vice versa) and the presence of nucleoside analogs such as modified uridines do not contribute to differences in identity or complementarity among polynucleotides as long as the relevant nucleotides (such as thymidine, uridine, or modified uridine) have the same complement (e.g., adenosine for all of thymidine, uridine, or modified uridine; another example is cytosine and 5-methylcytosine, both of which have guanosine or modified guanosine as a complement). Thus, for example, the sequence 5′-AXG where X is any modified uridine, such as pseudouridine, N1-methyl pseudouridine, or 5-methoxyuridine, is considered 100% identical to AUG in that both are perfectly complementary to the same sequence (5′-CAU). Exemplary alignment algorithms are the Smith-Waterman and Needleman-Wunsch algorithms, which are well-known in the art. One skilled in the art will understand what choice of algorithm and parameter settings are appropriate for a given pair of sequences to be aligned; for sequences of generally similar length and expected identity >50% for amino acids or >75% for nucleotides, the Needleman-Wunsch algorithm with default settings of the Needleman-Wunsch algorithm interface provided by the EBI at the www.ebi.ac.uk web server is generally appropriate.
“mRNA” is used herein to refer to a polynucleotide that is not DNA and comprises an open reading frame that can be translated into a polypeptide (i.e., can serve as a substrate for translation by a ribosome and amino-acylated tRNAs). mRNA can comprise a phosphate-sugar backbone including ribose residues or analogs thereof, e.g., 2′-methoxy ribose residues. In some embodiments, the sugars of an mRNA phosphate-sugar backbone consist essentially of ribose residues, 2′-methoxy ribose residues, or a combination thereof.
Guide sequences useful in the guide RNA compositions and methods described herein are shown, for example, in Table 1A, Table 1B, and Table 5 and throughout the specification.
As used herein, a “target sequence” refers to a sequence of nucleic acid in a target gene that has complementarity to at least a portion of the guide sequence of the guide RNA. The interaction of the target sequence and the guide sequence directs a Cas9 to bind, and potentially nick or cleave (depending on the activity of the agent), within the target sequence.
As used herein, “treatment” refers to any administration or application of a therapeutic for disease or disorder in a subject, and includes inhibiting the disease or development of the disease (which may occur before or after the disease is formally diagnosed, e.g., in cases where a subject has a genotype that has the potential or is likely to result in development of the disease), arresting its development, relieving one or more symptoms of the disease, curing the disease, or preventing reoccurrence of one or more symptoms of the disease. For example, treatment of DMD may comprise alleviating symptoms of DMD.
As used herein, “ameliorating” refers to any beneficial effect on a phenotype or symptom, such as reducing its severity, slowing or delaying its development, arresting its development, or partially or completely reversing or eliminating it. In the case of quantitative phenotypes such as expression levels, ameliorating encompasses changing the expression level so that it is closer to the expression level seen in healthy or unaffected cells or individuals.
A “pharmaceutically acceptable excipient” refers to an agent that is included in a pharmaceutical formulation that is not the active ingredient. Pharmaceutically acceptable excipients may e.g., aid in drug delivery or support or enhance stability or bioavailability.
The term “about” or “approximately” means an acceptable error for a particular value as determined by one of ordinary skill in the art, which depends in part on how the value is measured or determined.
As used herein, “Staphylococcus aureus Cas9” may also be referred to as SaCas9, and includes wild type SaCas9 (e.g., SEQ ID NO: 711) and variants thereof. A variant of SaCas9 comprises one or more amino acid changes as compared to SEQ ID NO: 711, including insertion, deletion, or substitution of one or more amino acids, or a chemical modification to one or more amino acids.
As used herein, “Staphylococcus lugdunensis Cas9” may also be referred to as SluCas9, and includes wild type SluCas9 (e.g., SEQ ID NO: 712) and variants thereof. A variant of SluCas9 comprises one or more amino acid changes as compared to SEQ ID NO: 712, including insertion, deletion, or substitution of one or more amino acids, or a chemical modification to one or more amino acids.
Provided herein are compositions useful for treating Duchenne Muscular Dystrophy (DMD), e.g., using a single or multiple (e.g., at least 2) nucleic acid molecule encoding 1) one or more guide RNAs comprising one or more guide sequences of Table 1A, Table 1B, or Table 5; and 2) SaCas9 (for SEQ ID NOs: 1-35, 1000-1078 or 3000-3069) or SluCas9 (for SEQ ID NOs: 100-225, 2000-2116, or 4000-4251). Such compositions may be administered to subjects having or suspected of having DMD.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of a single nucleic acid molecule comprising: i) a nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and at least one, at least two, or at least three guide RNAs; or ii) a nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid; or iii) a nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and one to three guide RNAs.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of at least two nucleic acid molecules comprising a first nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9); and a second nucleic acid that does not encode a SaCas9 or SluCas9 and encodes any one of i) at least one, at least two, at least three, at least four, at least five, or at least six guide RNAs; or ii) from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid; or iii) from one to six guide RNAs.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of at least two nucleic acid molecules comprising a first nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and i) at least one, at least two, or at least three guide RNAs; or ii) from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid; or iii) one to three guide RNAs; and a second nucleic acid that does not encode a SaCas9 or SluCas9, optionally wherein the second nucleic acid comprises any one of i) at least one, at least two, at least three, at least four, at least five, or at least six guide RNAs; or ii) from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid; or iii) from one to six guide RNAs.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of at least two nucleic acid molecules comprising a first nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and at least one, at least two, or at least three guide RNAs; and a second nucleic acid that does not encode a SaCas9 or SluCas9 and encodes from one to six guide RNAs.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of at least two nucleic acid molecules comprising a first nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and at least two guide RNAs, wherein at least one guide RNA binds upstream of sequence to be excised and at least one guide RNA binds downstream of sequence to be excised; and a second nucleic acid that does not encode a SaCas9 or SluCas9 and encodes at least one additional copy of the guide RNAs encoded in the first nucleic acid. In some embodiments, the guide RNA excises a portion of a DMD gene, optionally an exon, intron, or exon/intron junction.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of at least two nucleic acid molecules comprising a first nucleic acid encoding Staphylococcus aureus Cas9 (SaCas9) or Staphylococcus lugdunensis (SluCas9) and a first and a second guide RNA that function to excise a portion of a DMD gene; and a second nucleic acid encoding at least 2 or at least 3 copies of the first guide RNA and at least 2 or at least 3 copies of the second guide RNA.
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of one or more nucleic acid molecules encoding an endonuclease and a pair of guide RNAs, wherein each guide RNA targets a different sequence in a DMD gene, wherein the endonuclease and pair of guide RNAs are capable of excising a target sequence in DNA that is between 5-250 nucleotides in length. In some embodiments, the endonuclease is a class 2, type II Cas endonuclease. In some embodiments, the class 2, type II Cas endonuclease is SpCas9, SaCas9, or SluCas9. In some embodiments, the endonuclease is not a class 2, type V Cas endonuclease. In some embodiments, the excised target sequence comprises a splice acceptor site or a splice donor site. In some embodiments, the excised target sequence comprises a premature stop codon in the DMD gene. In some embodiments, the excised target sequence does not comprise an entire exon of the DMD gene. In some embodiments, any of the methods and/or ribonucleoprotein complexes disclosed herein do not destroy/specifically alter the sequence of a splice acceptor site, splice donor site, or premature stop codon site.
In some embodiments, the guide RNA in the composition comprises any one of the following:
In some embodiments, the disclosure provides for a composition comprising: a) one or more nucleic acid molecules encoding a SluCas9 and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 148 and 134. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SluCas9 and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 145 and 131. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SluCas9 and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 144 and 149. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SluCas9 and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 144 and 150. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SluCas9 and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 146 and 148. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SaCas9 (e.g., an SaCas9-KKH) and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 12 and 1013. In some embodiments, the composition comprises: a) one or more nucleic acid molecules encoding a SaCas9 (e.g., an SaCas9-KKH) and b) a first spacer sequence and a second spacer sequence selected from SEQ ID Nos: 12 and 1016.
In some embodiments, the one or more guide RNAs direct the Cas9 to a site in or near any one of exon 43, 44, 45, 50, 51, or 53 of dystrophin. For example, the Cas9 may be directed to cut within 10, 20, 30, 40, or 50 nucleotides of a target sequence.
The disclosure herein may reference a “first and a second spacer” or a “first and a second guide RNA, gRNA, or sgRNA” followed by one or more pairs of specific sequences. It should be noted that the order of the sequences in the pair is not intended to be restricted to the order in which they are presented. For example, the phrase “the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 15” could mean that the first sgRNA comprises the sequence of SEQ ID NO: 10 and the second sgRNA sequence comprises the sequence of SEQ ID NO: 15, or this phrase could mean that the first sgRNA comprises the sequence of SEQ ID NO: 15 and the second sgRNA sequence comprises the sequence of SEQ ID NO: 10.
In some embodiments, a composition comprising a single nucleic acid molecule encoding one or more guide RNAs and a Cas9 is provided, wherein the single nucleic acid molecule comprises:
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of one or more nucleic acid molecules encoding a Staphylococcus lugdunensis (SluCas9) and at least two guide RNAs, wherein a first and second guide RNA target different sequences in a DMD gene, wherein the first and a second guide RNA comprise a sequence that is at least 90% identical to a first and second spacer sequence selected from any one of:
In some embodiments, a composition is provided comprising, consisting of, or consisting essentially of one or more nucleic acid molecules encoding an endonuclease and at least two guide RNAs, wherein the guide RNAs each target a different sequence in a DMD gene, wherein the guide RNAs each comprise a sequence that is at least 90% identical to a first and second spacer sequence selected from any one of:
a second nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9).
In some embodiments, the first and/or the second nucleic acid, if present, comprises at least two guide RNAs. In some embodiments, the first and/or the second nucleic acid, if present, comprises at least three guide RNAs. In some embodiments, the first and/or the second nucleic acid, if present, comprises at least four guide RNAs. In some embodiments, the first and/or the second nucleic acid, if present, comprises at least five guide RNAs. In some embodiments, the first and/or the second nucleic acid, if present, comprises at least six guide RNAs. In some embodiments, the first nucleic acid comprises an endonuclease and at least one, at least two, or at least three guide RNAs. In some embodiments, the first nucleic acid comprises an endonuclease and from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid. In some embodiments, the first nucleic acid comprises an endonuclease and from one to three guide RNAs.
In some embodiments, the second nucleic acid, if present, encodes at least one, at least two, at least three, at least four, at least five, or at least six guide RNAs. In some embodiments, the second nucleic acid, if present, encodes from one to n guide RNAs, wherein n is no more than the maximum number of guide RNAs that can be expressed from said nucleic acid In some embodiments, the second nucleic acid, if present, encodes from one to six guide RNAs. In some embodiments, the second nucleic acid, if present, encodes 2-10, 2-9, 2-8, 2-7, 2-6, 2-5, 2-4, or 2-3 guide RNAs. In some embodiments, the second nucleic acid, if present, encodes 2, 3, 4, 5, or 6 guide RNAs.
In some embodiments comprising at least two nucleic acid molecules, the guide RNA encoded by the first nucleic acid and the second nucleic acid are the same. In some embodiments comprising at least two nucleic acid molecules, the guide RNA encoded by the first nucleic acid and the second nucleic acid are different. In some embodiments comprising at least two nucleic acid molecules, at least one guide RNA binds to a target sequence within an exon in the DMD gene that is upstream of a premature stop codon, and wherein at least one guide RNA binds to a target sequence within an exon in the DMD gene that is downstream of a premature stop codon. In some embodiments comprising at least two nucleic acid molecules, the same guide RNA is encoded by the nucleic acid of the first and second nucleic acid molecule. In some embodiments comprising at least two nucleic acid molecules, the second nucleic acid molecule encodes a guide RNA that binds to the same target sequence as the guide RNA in the first nucleic acid molecule. In some embodiments comprising at least two nucleic acid molecules, the second nucleic acid molecule encodes at least 2, at least 3, at least 4, at least 5, or at least 6 guide RNAs, wherein the guide RNAs in the second nucleic acid molecule bind to the same target sequence as the guide RNA in the first nucleic acid molecule.
In some embodiments, the guide RNA binds to one or more target sequences within a DMD gene. In some embodiments, the guide RNA binds to one or more target sequences within an exon of the DMD gene. In some embodiments comprising two guide RNAs, i) each guide RNA targets a sequence within an exon; ii) one guide RNA targets a sequence within an exon and one targets a sequence within an intron; or iii) each guide RNA targets a sequence within an intron.
In some embodiments comprising at least two guide RNAs, i) each guide RNA targets the same genomic target sequence; ii) each guide RNA targets a different target sequence; or iii) at least one guide RNA targets one sequence and at least one guide RNA targets a different sequence.
In some embodiments comprising a guide RNA that binds to an exon of the DMD gene, the exon is selected from exon 43, 44, 45, 50, 51, and 53.
In some embodiments comprising at least two guide RNAs that binds to an exon of the DMD gene, at least one guide RNA binds to a target sequence within an exon in the DMD gene, and at least one guide RNA binds to a different target sequence within the same exon in the DMD gene.
In some embodiments comprising at least two guide RNAs, at least one guide RNA binds to a target sequence within an exon that is upstream of a premature stop codon, and at least one guide RNA binds to a target sequence within an exon that is downstream of a premature stop codon.
In some embodiments comprising at least two guide RNAs, at least one guide RNA binds to a target sequence within an exon in the DMD gene, and at least one guide RNA binds to a different target sequence within the same exon in the DMD gene, wherein when expressed in vivo or in vivo, a portion of the exon is excised. In some embodiments comprising at least two guide RNAs, wherein once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon.
In some embodiments comprising at least two guide RNAs, wherein once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon, and wherein the portion of the exon remaining after excision are rejoined with a one nucleotide insertion. In some embodiments comprising at least two guide RNAs, wherein once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon, wherein the portion of the exon remaining after excision is rejoined without a nucleotide insertion.
In some embodiments comprising at least two guide RNAs, wherein once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon, wherein the size of excised portion of the exon is between 5, 6, 7, 8, 9, 10, 15, or 20 and 250 nucleotides in length. In some embodiments comprising at least two guide RNAs, once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon, wherein the size of excised portion of the exon is between 5 and 250, 5 and 200, 5 and 150, 5 and 100, 5 and 75, 5 and 50, 5 and 25, 5 and 10, 20 and 250, 20 and 200, 20 and 150, 20 and 100, 20 and 75, 20 and 50, 20 and 25, 50 and 250, 50 and 200, 50 and 150, 50 and 100, and 50 and 75 nucleotides.
In some embodiments comprising at least two guide RNAs, wherein once expressed in vivo or in vivo, the guide RNAs excise a portion of the exon, wherein the size of excised portion of the exon is between 8 and 167 nucleotides.
In some embodiments, a guide RNA and a Cas9 are provided on a single nucleic acid molecule. In some embodiments, the single nucleic acid molecule comprises a nucleic acid encoding a guide RNA and a nucleic acid encoding a SaCas9 or SluCas9. In some embodiments, two guide RNAs and a Cas9 are provided on a single nucleic acid molecule. In some embodiments, the single nucleic acid molecule comprises a nucleic acid encoding a first guide RNA, a nucleic acid encoding a second guide RNA, and a nucleic acid encoding a SaCas9 or SluCas9. In some embodiments, the spacer sequences of the first and second guide RNAs are identical. In some embodiments, the spacer sequences of the first and second guide RNAs are not identical.
In some embodiments, the single nucleic acid molecule is a single vector. In some embodiments, the single vector expresses the one or two guide RNAs and Cas9 according to the schemes of
Each of the guide sequences shown in Table 1A, Table 1B, and Table 5 may further comprise additional nucleotides to form or encode a crRNA, e.g., using any known sequence appropriate for the Cas9 being used. In some embodiments, the crRNA comprises (5′ to 3′) at least a spacer sequence and a first complementarity domain. The first complementary domain is sufficiently complementary to a second complementarity domain, which may be part of the same molecule in the case of an sgRNA or in a tracrRNA in the case of a dual or modular gRNA, to form a duplex. See, e.g., US 2017/0007679 for detailed discussion of crRNA and gRNA domains, including first and second complementarity domains.
A single-molecule guide RNA (sgRNA) can comprise, in the 5′ to 3′ direction, an optional spacer extension sequence, a spacer sequence, a minimum CRISPR repeat sequence, a single-molecule guide linker, a minimum tracrRNA sequence, a 3′ tracrRNA sequence and/or an optional tracrRNA extension sequence. The optional tracrRNA extension can comprise elements that contribute additional functionality (e.g., stability) to the guide RNA. The single-molecule guide linker can link the minimum CRISPR repeat and the minimum tracrRNA sequence to form a hairpin structure. The optional tracrRNA extension can comprise one or more hairpins. In particular embodiments, the disclosure provides for an sgRNA comprising a spacer sequence and a tracrRNA sequence.
An exemplary scaffold sequence suitable for use with SaCas9 to follow the guide sequence at its 3′ end is: GTTTAAGTACTCTGTGCTGGAAACAGCACAGAATCTACTTAAACAAGGCAAAATGCCGT GTTTATCTCGTCAACTTGTTGGCGAGA (SEQ ID NO: 500) in 5′ to 3′ orientation. In some embodiments, an exemplary scaffold sequence for use with SaCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 500, or a sequence that differs from SEQ ID NO: 500 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, a variant of an SaCas9 scaffold sequence may be used. In some embodiments, the SaCas9 scaffold to follow the guide sequence at its 3′ end is referred to as “SaScaffoldV1” and is: GTTTTAGTACTCTGGAAACAGAATCTACTAAAACAAGGCAAAATGCCGTGTTTATCTCGT CAACTTGTTGGCGAGAT (SEQ ID NO: 501) in 5′ to 3′ orientation. In some embodiments, an exemplary scaffold sequence for use with SaCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 910, or a sequence that differs from SEQ ID NO: 910 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, a variant of an SaCas9 scaffold sequence may be used. In some embodiments, the SaCas9 scaffold to follow the guide sequence at its 3′ end is referred to as “SaScaffoldV2” and is: GTTTAAGTACTCTGTGCTGGAAACAGCACAGAATCTACTTAAACAAGGCAAAATGCCGT GTTTATCTCGTCAACTTGTTGGCGAGAT (SEQ ID NO: 502) in 5′ to 3′ orientation. In some embodiments, an exemplary scaffold sequence for use with SaCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 911, or a sequence that differs from SEQ ID NO: 911 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, a variant of an SaCas9 scaffold sequence may be used. In some embodiments, the SaCas9 scaffold to follow the guide sequence at its 3′ end is referred to as “SaScaffoldV3” and is: GTTTAAGTACTCTGGAAACAGAATCTACTTAAACAAGGCAAAATGCCGTGTTTATCTCGT CAACTTGTTGGCGAGAT (SEQ ID NO: 503) in 5′ to 3′ orientation. In some embodiments, an exemplary scaffold sequence for use with SaCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 912, or a sequence that differs from SEQ ID NO: 912 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, a variant of an SaCas9 scaffold sequence may be used. In some embodiments, the SaCas9 scaffold to follow the guide sequence at its 3′ end is referred to as “SaScaffoldV5” and is: GTTTCAGTACTCTGGAAACAGAATCTACTGAAACAAGGCAAAATGCCGTGTTTATCTCGT CAACTTGTTGGCGAGAT (SEQ ID NO: 932) in 5′ to 3′ orientation. In some embodiments, an exemplary scaffold sequence for use with SaCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 932, or a sequence that differs from SEQ ID NO: 932 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
Two exemplary scaffold sequences suitable for use with SluCas9 to follow the guide sequence at its 3′ end is: GTTTTAGTACTCTGGAAACAGAATCTACTGAAACAAGACAATATGTCGTGTTTATCCCAT CAATTTATTGGTGGGA (SEQ ID NO: 900), and GTTTAAGTACTCTGTGCTGGAAACAGCACAGAATCTACTGAAACAAGACAATATGTCGT GTTTATCCCATCAATTTATTGGTGGGA (SEQ ID NO: 601) in 5′ to 3′ orientation. In some embodiments, an exemplary sequence for use with SluCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 900 or SEQ ID NO: 601, or a sequence that differs from SEQ ID NO: 900 or SEQ ID NO: 601 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
Exemplary scaffold sequences suitable for use with SluCas9 to follow the guide sequence at its 3′ end are also shown below in the 5′ to 3′ orientation:
In some embodiments, the scaffold sequence suitable for use with SluCas9 to follow the guide sequence at its 3′ end is selected from any one of SEQ ID NOs: 900 or 601, or 901-917 in 5′ to 3 orientation (see below). In some embodiments, an exemplary sequence for use with SluCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one off SEQ ID NOs: 900 or 601, or 901-917, or a sequence that differs from any one of SEQ ID NOs: 900 or 601, or 901-917 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, the scaffold sequence suitable for use with SluCas9 to follow the guide sequence at its 3′ end is selected from any one of SEQ ID NOs: 901-917 in 5′ to 3 orientation (see below). In some embodiments, an exemplary sequence for use with SluCas9 to follow the 3′ end of the guide sequence is a sequence that is at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one off SEQ ID NOs: 901-917, or a sequence that differs from any one of SEQ ID NOs: 901-917 by no more than 1, 2, 3, 4, 5, 10, 15, 20, or 25 nucleotides.
In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 900. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 601. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 900. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 901. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 902. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 903. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 904. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 905. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 906. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 907. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 908. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 909. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 910. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 911. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 912. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 913. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 914. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 915. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 916. In some embodiments, the nucleic acid encoding the gRNA or the nucleic acid encoding the pair of gRNAs comprises a sequence comprising SEQ ID NO: 917. In some embodiments, comprising a pair of gRNAs, one of the gRNAs comprises a sequence selected from any one of SEQ ID NOs: 900 or 601, or 901-917. In some embodiments, comprising a pair of gRNAs, both of the gRNAs comprise a sequence selected from any one of SEQ ID NOs: 900 or 601, or 901-917. In some embodiments, comprising a pair of gRNAs, the nucleotides 3′ of the guide sequence of the gRNAs are the same sequence. In some embodiments, comprising a pair of gRNAs, the nucleotides 3′ of the guide sequence of the gRNAs are different sequences.
In some embodiments, the scaffold sequence comprises one or more alterations in the stem loop 1 as compared to the stem loop 1 of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). In some embodiments, the scaffold sequence comprises one or more alterations in the stem loop 2 as compared to the stem loop 2 of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). In some embodiments, the scaffold sequence comprises one or more alterations in the tetraloop as compared to the tetraloop of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). In some embodiments, the scaffold sequence comprises one or more alterations in the repeat region as compared to the repeat region of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). In some embodiments, the scaffold sequence comprises one or more alterations in the anti-repeat region as compared to the anti-repeat region of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). In some embodiments, the scaffold sequence comprises one or more alterations in the linker region as compared to the linker region of a wildtype SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 900) or a reference SluCas9 scaffold sequence (e.g., a scaffold comprising the sequence of SEQ ID NO: 901). See, e.g., Nishimasu et al., 2015, Cell, 162:1113-1126 for description of regions of a scaffold.
Where a tracrRNA is used, in some embodiments, it comprises (5′ to 3′) a second complementary domain and a proximal domain. In the case of a sgRNA, guide sequences together with additional nucleotides (e.g., SEQ ID Nos: 500, or 910-912 (for SaCas9), and 900 or 601, or 901-917 (for SluCas9)) form or encode a sgRNA. In some embodiments, an sgRNA comprises (5′ to 3′) at least a spacer sequence, a first complementary domain, a linking domain, a second complementary domain, and a proximal domain. A sgRNA or tracrRNA may further comprise a tail domain. The linking domain may be hairpin-forming. See, e.g., US 2017/0007679 for detailed discussion and examples of crRNA and gRNA domains, including second complementarity domains, linking domains, proximal domains, and tail domains.
In general, in the case of a DNA nucleic acid construct encoding a guide RNA, the U residues in any of the RNA sequences described herein may be replaced with T residues, and in the case of a guide RNA construct encoded by a DNA, the T residues may be replaced with U residues.
Provided herein are compositions comprising one or more guide RNAs or one or more nucleic acids encoding one or more guide RNAs comprising a guide sequence disclosed herein in Table 1A, Table 1B, and Table 5 and throughout the specification.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises 17, 18, 19, or 20 contiguous nucleotides of any one of the guide sequences disclosed herein in Table 1A, Table 1B, and Table 5 and throughout the specification.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a sequence with about 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identity to at least 17, 18, 19, or 20 contiguous nucleotides of a guide sequence shown in Table 1A, Table 1B, and Table 5 and throughout the specification.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a sequence with about 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identity to a guide sequence shown in Table 1A, Table 1B, and Table 5 and throughout the specification.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a spacer sequence selected from any one of SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018 (for SaCas9). In some embodiments, the spacer sequence is SEQ ID NO: 10. In some embodiments, the spacer sequence is SEQ ID NO: 12. In some embodiments, the spacer sequence is SEQ ID NO: 15. In some embodiments, the spacer sequence is SEQ ID NO: 16. In some embodiments, the spacer sequence is SEQ ID NO: 20. In some embodiments, the spacer sequence is SEQ ID NO: 27. In some embodiments, the spacer sequence is SEQ ID NO: 28. In some embodiments, the spacer sequence is SEQ ID NO: 32. In some embodiments, the spacer sequence is SEQ ID NO: 33. In some embodiments, the spacer sequence is SEQ ID NO: 35. In some embodiments, the spacer sequence is SEQ ID NO: 1001. In some embodiments, the spacer sequence is SEQ ID NO: 1003. In some embodiments, the spacer sequence is SEQ ID NO: 1005. In some embodiments, the spacer sequence is SEQ ID NO: 1010. In some embodiments, the spacer sequence is SEQ ID NO: 1012. In some embodiments, the spacer sequence is SEQ ID NO: 1013. In some embodiments, the spacer sequence is SEQ ID NO: 1016. In some embodiments, the spacer sequence is SEQ ID NO: 1017. In some embodiments, the spacer sequence is SEQ ID NO: 1018.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a spacer sequence selected from any one of SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069 (for SaCas9). In some embodiments, the spacer sequence is SEQ ID NO: 3022. In some embodiments, the spacer sequence is SEQ ID NO: 3023. In some embodiments, the spacer sequence is SEQ ID NO: 3028. In some embodiments, the spacer sequence is SEQ ID NO: 3029. In some embodiments, the spacer sequence is SEQ ID NO: 3030. In some embodiments, the spacer sequence is SEQ ID NO: 3031. In some embodiments, the spacer sequence is SEQ ID NO: 3038. In some embodiments, the spacer sequence is SEQ ID NO: 3039. In some embodiments, the spacer sequence is SEQ ID NO: 3052. In some embodiments, the spacer sequence is SEQ ID NO: 3053. In some embodiments, the spacer sequence is SEQ ID NO: 3054. In some embodiments, the spacer sequence is SEQ ID NO: 3055. In some embodiments, the spacer sequence is SEQ ID NO: 3062. In some embodiments, the spacer sequence is SEQ ID NO: 3063. In some embodiments, the spacer sequence is SEQ ID NO: 3064. In some embodiments, the spacer sequence is SEQ ID NO: 3065. In some embodiments, the spacer sequence is SEQ ID NO: 3068. In some embodiments, the spacer sequence is SEQ ID NO: 3069.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a spacer sequence selected from any one of SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225 (for SluCas9). In some embodiments, the spacer sequence is SEQ ID NO: 131. In some embodiments, the spacer sequence is SEQ ID NO: 134. In some embodiments, the spacer sequence is SEQ ID NO: 135. In some embodiments, the spacer sequence is SEQ ID NO: 136. In some embodiments, the spacer sequence is SEQ ID NO: 139. In some embodiments, the spacer sequence is SEQ ID NO: 140. In some embodiments, the spacer sequence is SEQ ID NO: 141. In some embodiments, the spacer sequence is SEQ ID NO: 144. In some embodiments, the spacer sequence is SEQ ID NO: 145. In some embodiments, the spacer sequence is SEQ ID NO: 146. In some embodiments, the spacer sequence is SEQ ID NO: 148. In some embodiments, the spacer sequence is SEQ ID NO: 149. In some embodiments, the spacer sequence is SEQ ID NO: 150. In some embodiments, the spacer sequence is SEQ ID NO: 151. In some embodiments, the spacer sequence is SEQ ID NO: 179. In some embodiments, the spacer sequence is SEQ ID NO: 184. In some embodiments, the spacer sequence is SEQ ID NO: 201. In some embodiments, the spacer sequence is SEQ ID NO: 223. In some embodiments, the spacer sequence is SEQ ID NO: 224. In some embodiments, the spacer sequence is SEQ ID NO: 225.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA comprises a spacer sequence selected from any one of SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251 (for SluCas9). In some embodiments, the spacer sequence is SEQ ID NO: 4062. In some embodiments, the spacer sequence is SEQ ID NO: 4063. In some embodiments, the spacer sequence is SEQ ID NO: 4068. In some embodiments, the spacer sequence is SEQ ID NO: 4069. In some embodiments, the spacer sequence is SEQ ID NO: 4070. In some embodiments, the spacer sequence is SEQ ID NO: 4071. In some embodiments, the spacer sequence is SEQ ID NO: 4072. In some embodiments, the spacer sequence is SEQ ID NO: 4073. In some embodiments, the spacer sequence is SEQ ID NO: 4078. In some embodiments, the spacer sequence is SEQ ID NO: 4079. In some embodiments, the spacer sequence is SEQ ID NO: 4088. In some embodiments, the spacer sequence is SEQ ID NO: 4089. In some embodiments, the spacer sequence is SEQ ID NO: 4096. In some embodiments, the spacer sequence is SEQ ID NO: 4097. In some embodiments, the spacer sequence is SEQ ID NO: 4098. In some embodiments, the spacer sequence is SEQ ID NO: 4099. In some embodiments, the spacer sequence is SEQ ID NO: 4100. In some embodiments, the spacer sequence is SEQ ID NO: 4101. In some embodiments, the spacer sequence is SEQ ID NO: 4102. In some embodiments, the spacer sequence is SEQ ID NO: 4103. In some embodiments, the spacer sequence is SEQ ID NO: 4158. In some embodiments, the spacer sequence is SEQ ID NO: 4159. In some embodiments, the spacer sequence is SEQ ID NO: 4168. In some embodiments, the spacer sequence is SEQ ID NO: 4169. In some embodiments, the spacer sequence is SEQ ID NO: 4202. In some embodiments, the spacer sequence is SEQ ID NO: 4203. In some embodiments, the spacer sequence is SEQ ID NO: 4220. In some embodiments, the spacer sequence is SEQ ID NO: 4221. In some embodiments, the spacer sequence is SEQ ID NO: 4246. In some embodiments, the spacer sequence is SEQ ID NO: 4247. In some embodiments, the spacer sequence is SEQ ID NO: 4248. In some embodiments, the spacer sequence is SEQ ID NO: 4249. In some embodiments, the spacer sequence is SEQ ID NO: 4250. In some embodiments, the spacer sequence is SEQ ID NO: 4251.
In some embodiments, a composition is provided comprising a guide RNA, or nucleic acid encoding a guide RNA, wherein the guide RNA further comprises a trRNA. In each composition and method embodiment described herein, the crRNA (comprising the spacer sequence) and trRNA may be associated as a single RNA (sgRNA) or may be on separate RNAs (dgRNA). In the context of sgRNAs, the crRNA and trRNA components may be covalently linked, e.g., via a phosphodiester bond or other covalent bond.
In one aspect, a composition is provided comprising a single nucleic acid molecule encoding, or two nucleic acid molecules where one molecule encodes, 1) one or more guide RNA that comprises a guide sequence selected from any one of SEQ ID NOs: 1-35, 1000-1078, or 3000-3069; and 2) a SaCas9. In one aspect, a composition is provided comprising a single nucleic acid molecule encoding, or two nucleic acid molecules where one molecule encodes, 1) one or more guide RNA that comprises a guide sequence selected from any one of SEQ ID NOs: 100-225, 2000-2116, or 4000-4251; and 2) a SluCas9.
In one aspect, a composition is provided comprising a single nucleic acid molecule encoding, or two nucleic acid molecules where one molecule encodes, 1) one or more guide RNA that comprises a guide sequence that is at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, or 90% identical to any one of SEQ ID NOs: 1-35, 1000-1078, 3000-3069; and 2) a SaCas9. In one aspect, a composition is provided comprising a single nucleic acid molecule encoding 1) one or more guide RNA that comprises a guide sequence that is at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, or 90% identical to any one of SEQ ID NOs: 100-225, 2000-2116, or 4000-4251; and 2) a SluCas9.
In another aspect, a composition is provided comprising a single nucleic acid molecule encoding, or two nucleic acid molecules where one molecule encodes, 1) one or more guide RNA that comprises a guide sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from any one of SEQ ID NOs: 1-35, 1000-1078, or 3000-3069; and 2) a SaCas9. In another aspect, a composition is provided comprising a single nucleic acid molecule encoding, or two nucleic acid molecules where one molecule encodes, 1) one or more guide RNA that comprises a guide sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from any one of SEQ ID NOs: 100-225, 2000-2116, or 4000-4251; and 2) a SluCas9.
In any embodiment comprising a nucleic acid molecule encoding a guide RNA and/or a Cas9, the nucleic acid molecule may be a vector. In some embodiments, a composition is provided comprising a single nucleic acid molecule encoding a guide RNA and Cas9, wherein the nucleic acid molecule is a vector.
Any type of vector, such as any of those described herein, may be used. In some embodiments, the vector is a viral vector. In some embodiments, the viral vector is a non-integrating viral vector (i.e., that does not insert sequence from the vector into a host chromosome). In some embodiments, the viral vector is an adeno-associated virus vector (AAV), a lentiviral vector, an integrase-deficient lentiviral vector, an adenoviral vector, a vaccinia viral vector, an alphaviral vector, or a herpes simplex viral vector. In some embodiments, the vector comprises a muscle-specific promoter. Exemplary muscle-specific promoters include a muscle creatine kinase promoter, a desmin promoter, an MHCK7 promoter, or an SPc5-12 promoter. See US 2004/0175727 A1; Wang et al., Expert Opin Drug Deliv. (2014) 11, 345-364; Wang et al., Gene Therapy (2008) 15, 1489-1499. In some embodiments, the muscle-specific promoter is a CK8 promoter. In some embodiments, the muscle-specific promoter is a CK8e promoter. In any of the foregoing embodiments, the vector may be an adeno-associated virus vector (AAV).
In some embodiments, the muscle specific promoter is the CK8 promoter. The CK8 promoter has the following sequence (SEQ ID NO. 700):
In some embodiments, the muscle-cell cell specific promoter is a variant of the CK8 promoter, called CK8e. In some embodiments, the size of the CK8e promoter is 436 bp. The CK8e promoter has the following sequence (SEQ ID NO. 701):
In some embodiments, the vector comprises one or more of a U6, H1, or 7SK promoter. In some embodiments, the U6 promoter is the human U6 promoter (e.g., the U6L promoter or U6S promoter). In some embodiments, the promoter is the murine U6 promoter. In some embodiments, the 7SK promoter is a human 7SK promoter. In some embodiments, the 7SK promoter is the 7SK1 promoter. In some embodiments, the 7SK promoter is the 7SK2 promoter. In some embodiments, the H1 promoter is a human H1 promoter (e.g., the H1L promoter or the H1S promoter). In some embodiments, the vector comprises multiple guide sequences, wherein each guide sequence is under the control of a separate promoter. In some embodiments, each of the multiple guide sequences comprises a different sequence. In some embodiments, each of the multiple guide sequences comprise the same sequence (e.g., each of the multiple guide sequences comprise the same spacer sequence). In some embodiments, each of the multiple guide sequences comprises the same spacer sequence and the same scaffold sequence. In some embodiments, each of the multiple guide sequences comprises different spacer sequences and different scaffold sequences. In some embodiments, each of the multiple guide sequences comprises the same spacer sequence, but comprises a different scaffold sequence. In some embodiments, each of the multiple guide sequences comprises different spacer sequences and different scaffold sequences. In some embodiments, each of the separate promoters comprises the same nucleotide sequence (e.g., the U6 promoter sequence). In some embodiments, each of the separate promoters comprises a different nucleotide sequence (e.g., the U6, H1, and/or 7SK promoter sequence).
In some embodiments, the U6 promoter comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 702:
In some embodiments, the H1 promoter comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 703:
In some embodiments, the 7SK promoter comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO. 704:
In some embodiments, the U6 promoter is a hU6c promoter and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 705:
In some embodiments, the U6 promoter is a variant of the hU6c promoter. In some embodiments, the variant of the hU6c promoter comprises alternative nucleotides as compared to the sequence of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter comprises fewer nucleotides as compared to the 249 nucleotides of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter has fewer nucleotides in the nucleosome binding sequence of the hU6c promoter of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter lacks all of or at least a portion of (e.g., at least 5, 10, 15, 20, 25, or 30 nucleotides) the nucleotides corresponding to nucleotides 96-125 of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter lacks all of or at least a portion of (e.g., at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, or 60 nucleotides) the nucleotides corresponding to nucleotides 81-140 of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter lacks all of or at least a portion of (e.g., at least 10, 20, 30, 40, 50, 60, 65, 70, 75, 80, or 85 nucleotides) the nucleotides corresponding to nucleotides 66-150 of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter lacks all of or at least a portion of (e.g., at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, or 120 nucleotides) the nucleotides corresponding to nucleotides 51-170 of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter lacks the nucleotides corresponding to nucleotides 96-125 of SEQ ID NO: 705. In some embodiments, the variant of the hU6c promoter comprises 129-219 nucleotides. In some embodiments, the variant of the hU6c promoter comprises 219 nucleotides. In some embodiments, the variant of the hU6c promoter comprises 189 nucleotides. In some embodiments, the variant of the hU6c promoter comprises 159 nucleotides. In some embodiments, the variant of the hU6c promoter comprises 129 nucleotides.
In some embodiments, the U6 promoter is hU6d30 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9001:
In some embodiments, the U6 promoter is hU6d60 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9002:
In some embodiments, the U6 promoter is hU6d90 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9003:
In some embodiments, the U6 promoter is hU6d120 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9004:
In some embodiments, the 7SK promoter is a 7SK2 promoter and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 706:
In some embodiments, the 7SK promoter is a variant of the 7SK2 promoter. In some embodiments, the variant of the 7SK2 promoter comprises alternative nucleotides as compared to the sequence of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter e.g., comprises fewer nucleotides as compared to the 243 nucleotides of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter has fewer nucleotides in the nucleosome binding sequence of the 7SK2 promoter of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter lacks all of or at least a portion of (e.g., at least 5, 10, 15, 20, 25, or 30 nucleotides) the nucleotides corresponding to nucleotides 95-124 of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter lacks all of or at least a portion of (e.g., at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, or 60 nucleotides) the nucleotides corresponding to nucleotides 81-140 of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter lacks all of or at least a portion of (e.g., at least 10, 20, 30, 40, 50, 60, 65, 70, 75, 80, 85 or 90 nucleotides) the nucleotides corresponding to nucleotides 67-156 of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter lacks all of or at least a portion of (e.g., at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, or 120 nucleotides) the nucleotides corresponding to nucleotides 52-171 of SEQ ID NO: 706. In some embodiments, the variant of the 7SK2 promoter comprises 123-213 nucleotides. In some embodiments, the variant of the 7SK2 promoter comprises 213 nucleotides. In some embodiments, the variant of the 7SK2 promoter comprises 183 nucleotides. In some embodiments, the variant of the 7SK2 promoter comprises 153 nucleotides. In some embodiments, the variant of the 7SK2 promoter comprises 123 nucleotides.
In some embodiments, the 7SK promoter is 7SKd30 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9006:
In some embodiments, the 7SK promoter is 7SKd60 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9007:
In some embodiments, the 7SK promoter is 7SKd90 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9008:
In some embodiments, the 7SK promoter is 7SKd120 and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 9009:
In some embodiments, the H1 promoter is a H1m or mH1 promoter and comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 707:
In some embodiments, the Ck8e promoter comprises a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 701
In some embodiments, the vector comprises multiple inverted terminal repeats (ITRs). These ITRs may be of an AAV1, AAV2, AAV3, AAV4, AAVS, AAV6, AAV7, AAV8, or AAV9 serotype. In some embodiments, the ITRs are of an AAV2 serotype. In some embodiments, the 5′ ITR comprises a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequence of SEQ ID NO: 709:
In some embodiments, the 3′ITR comprises a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequence of SEQ ID NO: 710:
In some embodiments, a vector comprising a single nucleic acid molecule encoding 1) one or more guide RNA comprising any one or more of the spacer sequences of SEQ ID Nos: 1-35, 100-225, 1000-1078, 2000-2116, 3000-3069, or 4000-4251; and 2) a SaCas9 (for SEQ ID Nos: 1-35, 1000-1078, and 3000-3069) or SluCas9 (for SEQ ID NO: 100-225, 2000-2116, and 4000-4251) is provided. In some embodiments, the vector is an AAV vector. In some embodiments, the AAV vector is administered to a subject to treat DMD. In some embodiments, only one vector is needed due to the use of a particular guide sequence that is useful in the context of SaCas9 or SluCas9.
In some embodiments, the vector comprises a nucleic acid encoding a Cas9 protein (e.g., an SaCas9 or SluCas9 protein) and further comprises a nucleic acid encoding one or more single guide RNA(s). In some embodiments, the nucleic acid encoding the Cas9 protein is under the control of a CK8e promoter. In some embodiments, the nucleic acid encoding the guide RNA sequence is under the control of a hU6c promoter. In some embodiments, the vector is AAV9.
In some embodiments, the vector comprises multiple nucleic acids encoding more than one guide RNA. In some embodiments, the vector comprises two nucleic acids encoding two guide RNA sequences.
In some embodiments, the vector comprises a nucleic acid encoding a Cas9 protein (e.g., an SaCas9 protein or SluCas9 protein), a nucleic acid encoding a first guide RNA, and a nucleic acid encoding a second guide RNA. In some embodiments, the vector does not comprise a nucleic acid encoding more than two guide RNAs. In some embodiments, the nucleic acid encoding the first guide RNA is the same as the nucleic acid encoding the second guide RNA. In some embodiments, the nucleic acid encoding the first guide RNA is different from the nucleic acid encoding the second guide RNA. In some embodiments, the vector comprises a single nucleic acid molecule, wherein the single nucleic acid molecule comprises a nucleic acid encoding a Cas9 protein, a nucleic acid encoding a first guide RNA, and a nucleic acid that is the reverse complement to the coding sequence for the second guide RNA. In some embodiments, the vector comprises a single nucleic acid molecule, wherein the single nucleic acid molecule comprises a nucleic acid encoding a Cas9 protein, a nucleic acid that is the reverse complement to the coding sequence for the first guide RNA, and a nucleic acid that is the reverse complement to the coding sequence for the second guide RNA. In some embodiments, the nucleic acid encoding a Cas9 protein (e.g., an SaCas9 or SluCas9 protein) is under the control of the CK8e promoter. In some embodiments, the first guide is under the control of the 7SK2 promoter, and the second guide is under the control of the H1m promoter. In some embodiments, the first guide is under the control of the H1m promoter, and the second guide is under the control of the 7SK2 promoter. In some embodiments, the first guide is under the control of the hU6c promoter, and the second guide is under the control of the H1m promoter. In some embodiments, the first guide is under the control of the H1m promoter, and the second guide is under the control of the hU6c promoter. In some embodiments, the nucleic acid encoding the Cas9 protein is: a) between the nucleic acids encoding the guide RNAs, b) between the nucleic acids that are the reverse complement to the coding sequences for the guide RNAs, c) between the nucleic acid encoding the first guide RNA and the nucleic acid that is the reverse complement to the coding sequence for the second guide RNA, d) between the nucleic acid encoding the second guide RNA and the nucleic acid that is the reverse complement to the coding sequence for the first guide RNA, e) 5′ to the nucleic acids encoding the guide RNAs, f) 5′ to the nucleic acids that are the reverse complements to the coding sequences for the guide RNAs, g) 5′ to a nucleic acid encoding one of the guide RNAs and 5′ to a nucleic acid that is the reverse complement to the coding sequence for the other guide RNA, h) 3′ to the nucleic acids encoding the guide RNAs, i) 3′ to the nucleic acids that are the reverse complements to the coding sequences for the guide RNAs, or j) 3′ to a nucleic acid encoding one of the guide RNAs and 3′ to a nucleic acid that is the reverse complement to the coding sequence for the other guide RNA. In some embodiments, any of the vectors disclosed herein is AAV9. In preferred embodiments, the AAV9 vector is less than 5 kb from ITR to ITR in size, inclusive of both ITRs. In particular embodiments, the AAV9 vector is less than 4.9 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.85 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.8 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.75 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.7 kb from ITR to ITR in size, inclusive of both ITRs. In some embodiments, the vector is between 3.9-5 kb, 4-5 kb, 4.2-5 kb, 4.4-5 kb, 4.6-5 kb, 4.7-5 kb, 3.9-4.9 kb, 4.2-4.9 kb, 4.4-4.9 kb, 4.7-4.9 kb, 3.9-4.85 kb, 4.2-4.85 kb, 4.4-4.85 kb, 4.6-4.85 kb, 4.7-4.85 kb, 4.7-4.9 kb, 3.9-4.8 kb, 4.2-4.8 kb, 4.4-4.8 kb or 4.6-4.8 kb from ITR to ITR in size, inclusive of both ITRs. In some embodiments, the vector is between 4.4-4.85 kb from ITR to ITR in size, inclusive of both ITRs. In some embodiments, the vector is an AAV9 vector.
In some embodiments, any of the vectors disclosed herein comprises a nucleic acid encoding at least a first guide RNA and a second guide RNA. In some embodiments, the nucleic acid comprises a spacer-encoding sequence for the first guide RNA, a scaffold-encoding sequence for the first guide RNA, a spacer-encoding sequence for the second guide RNA, and a scaffold-encoding sequence of the second guide RNA. In some embodiments, the spacer-encoding sequence (e.g., encoding any of the spacer sequences disclosed herein) for the first guide RNA is identical to the spacer-encoding sequence for the second guide RNA. In some embodiments, the spacer-encoding sequence (e.g., encoding any of the spacer sequences disclosed herein) for the first guide RNA is different from the spacer-encoding sequence for the second guide RNA. In some embodiments, the scaffold-encoding sequence for the first guide RNA is identical to the scaffold-encoding sequence for the second guide RNA. In some embodiments, the scaffold-encoding sequence for the first guide RNA is different from the scaffold-encoding sequence for the nucleic acid encoding the second guide RNA. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises a sequence selected from the group consisting of SEQ ID Nos: 901-916, and the scaffold-encoding sequence for the second guide RNA comprises a different sequence selected from the group consisting of SEQ ID Nos: 901-916. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 902. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 903. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 904. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 905. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 906. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 907. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 908. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 909. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 910. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 911. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 912. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 913. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 914. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 915. In some embodiments, the scaffold-encoding sequence for the first guide RNA comprises the sequence of SEQ ID NO: 901, and the scaffold-encoding sequence for the second guide RNA comprises the sequence of SEQ ID NO: 916. In some embodiments, the spacer encoding sequence for the first guide RNA is the same as the spacer-encoding sequence in the second guide RNA, and the scaffold-encoding sequence for the first guide RNA is different from the scaffold-encoding sequence in the nucleic acid encoding the second guide RNA.
The disclosure provides for novel AAV vector configurations. Some examples of these novel AAV vector configurations are provided herein, and the order of elements in these exemplary vectors are referenced in a 5′ to 3′ manner with respect to the plus strand. For these configurations, it should be understood that the recited elements may not be directly contiguous, and that one or more nucleotides or one or more additional elements may be present between the recited elements. However, in some embodiments, it is possible that no nucleotides or no additional elements are present between the recited elements. Also, unless otherwise stated, “a promoter for expression of element X” means that the promoter is oriented in a manner to facilitate expression of the recited element X. In addition, unless otherwise stated, references to an “sgRNA scaffold sequence” or “a guide RNA scaffold sequence” are synonymous with “a nucleotide sequence/nucleic acid encoding an sgRNA scaffold sequence” or “a nucleotide sequence/nucleic acid encoding a guide RNA scaffold sequence.” In some embodiments, the disclosure provides for a nucleic acid encoding an SaCas9 (e.g., an SaCas9-KKH) or SluCas9. In some embodiments, the nucleic acid encodes for one or more nuclear localization signals (e.g., the SV40 NLS and/or the c-Myc NLS) on the C-terminus of the encoded SaCas9 or SluCas9. In some embodiments, the nucleic acid encodes for one or more NLSs (e.g., the SV40 NLS and/or the c-Myc NLS) on the C-terminus of the encoded SaCas9 or SluCas9, and the nucleic acid does not encode for an NLS on the N-terminus of the encoded SaCas9 or SluCas9. In some embodiments, the nucleic acid encodes for one or more nuclear localization signals (e.g., the SV40 NLS and/or the c-Myc NLS) on the N-terminus of the encoded SaCas9 or SluCas9. In some embodiments, the nucleic acid encodes for one or more NLSs (e.g., the SV40 NLS and/or the c-Myc NLS) on the N-terminus of the encoded SaCas9 or SluCas9, and the nucleic acid does not encode for an NLS on the C-terminus of the encoded SaCas9 or SluCas9. In some embodiments, the nucleic acid encodes for one or more nuclear localization signals (e.g., the SV40 NLS and/or the c-Myc NLS) on the C-terminus of the encoded SaCas9 or SluCas9 and also encodes for one or more NLSs on the N-terminus of the encoded SaCas9 or SluCas9 (e.g., the SV40 NLS and/or the c-Myc NLS). In some embodiments, the nucleic acid encodes one NLS. In some embodiments, the nucleic acid encodes two NLSs. In some embodiments, the nucleic acid encodes three NLSs. The one, two, or three NLS may all be C-terminal, N-terminal, or any combination of C- and N-terminal. The NLS may be fused/attached directly to the C- or N-terminus or to another NLS, or may be fused/attached indirectly attached through a linker. In some embodiments, an additional domain may be: a) fused to the N- or C-terminus of the Cas protein (e.g., a Cas9 protein), b) fused to the N-terminus of an NLS fused to the N-terminus of a Cas protein, or c) fused to the C-terminus of an NLS fused to the C-terminus of a Cas protein, with or without a linker. In some embodiments, an NLS is fused to the N- and/or C-terminus of the Cas protein by means of a linker. In some embodiments, an NLS is fused to the N-terminus of an N-terminally-fused NLS on a Cas protein by means of a linker, and/or an NLS is fused to the C-terminus of a C-terminally fused NLS on a Cas protein by means of a linker. In some embodiments, the linker is GSVD (SEQ ID NO: 550) or GSGS (SEQ ID NO: 551). In some embodiments, the Cas protein comprises a c-Myc NLS fused to the N-terminus of the Cas protein (or to an N-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises an SV40 NLS fused to the C-terminus of the Cas protein (or to a C-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises a nucleoplasmin NLS fused to the C-terminus of the Cas protein (or to a C-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises: a) a c-Myc NLS fused to the N-terminus of the Cas protein, optionally by means of a linker, b) an SV40 NLS fused to the C-terminus of the Cas protein, optionally by means of a linker, and c) a nucleoplasmin NLS fused to the C-terminus of the SV40 NLS, optionally by means of a linker. In some embodiments, the Cas protein comprises: a) a c-Myc NLS fused to the N-terminus of the Cas protein, optionally by means of a linker, b) a nucleoplasmin NLS fused to the C-terminus of the Cas protein, optionally by means of a linker, and c) an SV40 NLS fused to the C-terminus of the nucleoplasmin NLS, optionally by means of a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of a promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9, a polyadenylation sequence, a promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA is any of the hU6c promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9001. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9002. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9003. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9004. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA is any of the 7SK2 promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9006. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9007. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9008. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9009. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the hU6c promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9001. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9002. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9003. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9004. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the 7SK2 promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9006. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9007. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9008. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9009. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the H1m promoters disclosed herein. In some embodiments, the promoter for SaCas9 is the CK8e promoter. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9, a polyadenylation sequence, an hU6c promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9, a polyadenylation sequence, an 7SK promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9, a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an 7SK promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9, a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9-KKH (e.g., a sequence that is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 715 or functional fragments thereof), a polyadenylation sequence, an hU6c promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9-KKH (e.g., a sequence that is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 715 or functional fragments thereof), a polyadenylation sequence, an 7SK promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9-KKH (e.g., a sequence that is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 715 or functional fragments thereof), a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an 7SK promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SaCas9 (e.g., CK8e), a nucleic acid encoding SaCas9-KKH (e.g., a sequence that is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 715 or functional fragments thereof), a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 15. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1001 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 1005. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1010. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1012. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 12 and 1013. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 10 and 1016. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1017 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1018 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 15 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1001. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1005 and 1003. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1003 and 16. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1010 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1012 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1013 and 12. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 1016 and 10. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1017. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 16 and 1018. In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SaCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SaCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of a promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SluCas9 (e.g., CK8e), a nucleic acid encoding SluCas9, a polyadenylation sequence, a promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA is any of the hU6c promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9001. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9002. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9003. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9004. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA is any of the 7SK2 promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 7005. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9006. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9007. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9008. In some embodiments the promoter for expression of the nucleic acid encoding the first sgRNA comprises SEQ ID NO: 9009. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the hU6c promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9001. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9002. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9003. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9004. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the 7SK2 promoters disclosed herein. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 705. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9006. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9007. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9008. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA comprises SEQ ID NO: 9009. In some embodiments the promoter for expression of the nucleic acid encoding the second sgRNA is any of the H1m promoters disclosed herein. In some embodiments, the promoter for SluCas9 is the CK8e promoter. In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SluCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SluCas9 (e.g., CK8e), a nucleic acid encoding SluCas9, a polyadenylation sequence, an hU6c promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 134. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 139 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 141 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 146 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 134 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 139. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 141. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 146. In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SluCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SluCas9 (e.g., CK8e), a nucleic acid encoding SluCas9, a polyadenylation sequence, an 7SK promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 134. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 139 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 141 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 146 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 134 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 139. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 141. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 146. In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SluCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an hU6c promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SluCas9 (e.g., CK8e), a nucleic acid encoding SluCas9, a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 134. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 139 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 141 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 146 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 134 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 139. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 141. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 146. In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SluCas9 with a linker.
In some embodiments, the AAV vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first sgRNA scaffold sequence, the reverse complement of a nucleic acid encoding a first sgRNA guide sequence, the reverse complement of an 7SK promoter for expression of the nucleic acid encoding the first sgRNA, a promoter for expression of a nucleic acid encoding SluCas9 (e.g., CK8e), a nucleic acid encoding SluCas9, a polyadenylation sequence, an H1m promoter for expression of a second sgRNA, a second sgRNA guide sequence, and a second sgRNA scaffold sequence. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 134. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 135. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 136. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 139 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 140 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 141 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 144 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 145 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 146 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 134 and 148. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 149. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 135 and 150. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 131. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 136 and 151. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 139. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 140. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 141. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 149 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 150 and 144. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 131 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 151 and 145. In some embodiments, the first sgRNA and the second sgRNA comprise the sequences of SEQ ID NOs: 148 and 146. In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to a nucleic acid sequence encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to two nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the nucleic acid sequence encoding SluCas9 is fused to three nucleic acid sequences each encoding a nuclear localization sequence (NLS). In some embodiments, the one or more NLSs is an SV40 NLS. In some embodiments, the one or more NLSs is a c-Myc NLS. In some embodiments, the NLS is fused to the SluCas9 with a linker.
In some embodiments, the disclosure provides for a composition comprising at least two nucleic acids. In some embodiments, the composition comprises at least two nucleic acid molecules, wherein the first nucleic acid molecule comprises a sequence encoding any of the endonucleases disclosed herein (e.g., a SaCas9 or SluCas9), wherein the second nucleic acid molecule encodes a first guide RNA and a second guide RNA, wherein the first guide RNA and the second guide RNA are not the same sequence, and wherein the second nucleic acid molecule does not encode an endonuclease. In some embodiments, the first nucleic acid molecule also encodes a copy of the first guide RNA and a copy of the second guide RNA. In some embodiments, the first nucleic acid molecule does not encode any guide RNAs. In some embodiments, the second nucleic acid molecule encodes two copies of the first guide RNA and two copies of the second guide RNA. In some embodiments, the second nucleic molecule encodes two copies of the first guide RNA, and one copy of the second guide RNA. In some embodiments, the second nucleic acid molecule encodes one copy of the first guide RNA, and two copies of the second guide RNA. In some embodiments, the second nucleic acid molecule comprises two copies of the first guide RNA, and three copies of the second guide RNA. In some embodiments, the second nucleic acid molecule comprises three copies of the first guide RNA, and two copies of the second guide RNA. In some embodiments, the second nucleic acid molecule encodes three copies of the first guide RNA and three copies of the second guide RNA. In some embodiments, the first nucleic acid molecule comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a first guide RNA scaffold sequence, the reverse complement of a nucleotide sequence encoding the first guide RNA sequence, the reverse complement of a promoter for expression of the nucleotide sequence encoding the first guide RNA sequence, a promoter for expression of a nucleotide sequence encoding the endonuclease, a nucleotide sequence encoding an endonuclease, a polyadenylation sequence, a promoter for expression of the second guide RNA in the same direction as the promoter for the endonuclease, the second guide RNA sequence, and a second guide RNA scaffold sequence. In some embodiments, the promoter for expression of the nucleotide sequence encoding the first guide RNA sequence in the first nucleic acid molecule is a U6 promoter and the promoter for expression of the nucleotide sequence encoding the second guide RNA in the first nucleic acid molecule is a U6 promoter. In some embodiments, the first nucleic acid molecule encodes a Staphylococcus aureus Cas9 (SaCas9) endonuclease, and the first guide RNA comprises the first sequence and the second guide RNA comprises the second sequence from any one the following pairs of sequences: SEQ ID NOs: 10 and 15; 10 and 16; 12 and 16; 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; 1018 and 16; 15 and 10; 16 and 10; 16 and 12; 1005 and 1001; 15 and 1001; 16 and 1001; 1005 and 1003; 1003 and 16; 1010 and 12; 1012 and 12; 1013 and 12; 1016 and 10; 1005 and 1017; 16 and 1017; and 16 and 1018. In some embodiments, the first guide RNA comprises the sequence of SEQ ID NO: 12 and the second guide RNA comprises the sequence of SEQ ID NO: 1013. In some embodiments, the first guide RNA comprises the sequence of SEQ ID NO: 1013 and the second guide RNA comprises the sequence of SEQ ID NO: 12. In some embodiments, the first nucleic acid molecule encodes a Staphylococcus lugdunensis (SluCas9) endonuclease, and wherein the first guide RNA comprises the first sequence and the second guide RNAs comprises the second sequence from any one the following pairs of sequences: SEQ ID NOs: 148 and 134; 149 and 135; 150 and 135; 131 and 136; 151 and 136; 139 and 131; 139 and 151; 140 and 131; 140 and 151; 141 and 148; 144 and 149; 144 and 150; 145 and 131; 145 and 151; 146 and 148; 134 and 148; 135 and 149; 135 and 150; 136 and 131; 136 and 151; 131 and 139; 151 and 139; 131 and 140; 151 and 140; 148 and 141; 149 and 144; 150 and 144; 131 and 145; 151 and 145; and 148 and 146. In some embodiments, a) the first guide RNA comprises the sequence of SEQ ID NO: 148 and the second guide RNA comprises the sequence of SEQ ID NO: 134; b) the second guide RNA comprises the sequence of SEQ ID NO: 145 and the second guide RNA comprises the sequence of SEQ ID NO: 131, c) the first guide RNA comprises the sequence of SEQ ID NO: 134 and the second guide RNA comprises the sequence of SEQ ID NO: 148; d) the second guide RNA comprises the sequence of SEQ ID NO: 131 and the second guide RNA comprises the sequence of SEQ ID NO: 145.
In some embodiments, the first nucleic acid molecule is in a first vector (e.g., AAV9), and the second nucleic acid is in a separate second vector. In some embodiments, the first vector is AAV9. In some embodiments, the second vector is AAV9. In preferred embodiments, the AAV9 vector is less than 5 kb from ITR to ITR in size, inclusive of both ITRs. In particular embodiments, the AAV9 vector is less than 4.9 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.85 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.8 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.75 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV9 vector is less than 4.7 kb from ITR to ITR in size, inclusive of both ITRs. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: a promoter for expression of a first copy of a first guide RNA (e.g., a U6 promoter), a first copy of a nucleotide sequence encoding a first guide RNA, a first copy of a nucleotide sequence encoding a first guide RNA scaffold, a promoter for expression of a second copy of the first guide RNA (e.g., a H1 promoter), a second copy of the nucleotide sequence encoding the first guide RNA, a second copy of the nucleotide sequence encoding the first guide RNA scaffold, a promoter for expression of a second guide RNA (e.g., a 7SK promoter), a nucleotide sequence encoding a second guide RNA, and a nucleotide sequence encoding a second guide RNA scaffold. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: a promoter for expression of a first guide RNA (e.g., a U6 promoter), a nucleotide sequence encoding a first guide RNA, a nucleotide sequence encoding a first guide RNA scaffold, a promoter for expression of a second guide RNA (e.g., a 7SK promoter), a nucleotide sequence encoding a second guide RNA, and a nucleotide sequence encoding a second guide RNA scaffold. In some embodiments, the second vector comprises a stuffer sequence (e.g., a 3′UTR desmin sequence) between the nucleotide sequence encoding the first guide scaffold sequence and the promoter for expression of the second guide sequence. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a nucleotide sequence encoding a scaffold for a first guide RNA, the reverse complement of a nucleotide sequence encoding a first guide RNA, the reverse complement of a promoter for expression of the first guide RNA (e.g., a U6c promoter), a promoter for expression of a second guide RNA (e.g., a U6c promoter), a nucleotide sequence encoding a second guide RNA, and a nucleotide sequence encoding a second guide RNA scaffold. In some embodiments, the second vector comprises a stuffer sequence (e.g., a 3′UTR desmin sequence) between the reverse complement of the promoter for expression of the first guide RNA and the promoter for expression of the second guide RNA. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a nucleotide sequence encoding a first copy of a first guide RNA scaffold, the reverse complement of a nucleotide sequence encoding a first copy of the first guide RNA, the reverse complement of a promoter for expression of the first copy of the first guide RNA (e.g., a 7SK2 promoter), the reverse complement of a second copy of the nucleotide sequence encoding the first guide RNA scaffold, the reverse complement of a second copy of the nucleotide sequence encoding the first guide RNA, the reverse complement of a promoter for expression of the second copy of the nucleotide sequence encoding the first guide RNA (e.g., a hU6c promoter), a promoter for expression of a first copy of a second guide RNA (e.g., a hU6c promoter), a first copy of a nucleotide sequence encoding a second guide RNA, a first copy of a nucleotide sequence encoding a second guide RNA scaffold, a promoter for expression of a second copy of the second guide RNA (e.g., a 7Sk2 promoter), a second copy of the nucleotide sequence encoding the second guide RNA, and a second copy of the nucleotide sequence encoding the second guide RNA scaffold. In some embodiments, the second vector comprises a stuffer sequence (e.g., a 3′UTR desmin sequence) between the reverse complement of the promoter for expression of the second copy of the first guide RNA and the promoter for expression of the first copy of the second guide RNA. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a nucleotide sequence encoding a first copy of a first guide RNA scaffold, the reverse complement of a first copy of a nucleotide sequence encoding the first guide RNA, the reverse complement of a promoter for expression of the first copy of the first guide RNA (e.g., a 7SK2 promoter), the reverse complement of a first copy of a nucleotide sequence encoding a second guide RNA scaffold, the reverse complement of a nucleotide sequence encoding the first copy of the second guide RNA, the reverse complement of a promoter for expression of the first copy of the second guide RNA (e.g., a hU6c promoter), a promoter for expression of a second copy of the second guide RNA (e.g., a hU6c promoter), a second copy of the nucleotide sequence encoding the second guide RNA, a second copy of the nucleotide sequence encoding the second guide RNA scaffold, a promoter for expression of a second copy of the first guide RNA (e.g., a 7SK2 promoter), a second copy of the nucleotide sequence encoding the first guide RNA, and a second copy of the nucleotide sequence encoding the first guide RNA scaffold. In some embodiments, the second vector comprises a stuffer sequence (e.g., a 3′UTR desmin sequence) between the reverse complement of the promoter for expression of the first copy of the second guide RNA and the promoter for expression of the second copy of the first guide RNA. In some embodiments, the second vector comprises from 5′ to 3′ with respect to the plus strand: the reverse complement of a nucleotide sequence encoding a first guide RNA scaffold, the reverse complement of a nucleotide sequence encoding a first guide RNA, the reverse complement of a promoter for expression of a first guide RNA (e.g., a hU6c promoter), a promoter for expression of a second guide RNA (e.g., a hU6c promoter), a nucleotide sequence encoding a second guide RNA, and a nucleotide sequence encoding a second guide RNA scaffold. In some embodiments, the second vector comprises a stuffer sequence (e.g., a 3′UTR desmin sequence) between the reverse complement of the promoter for expression of the first guide RNA and the promoter for expression of the second guide RNA. In particular embodiments, the first guide RNA is different from the second guide RNA. In some embodiments, the first guide RNA comprises the sequence of SEQ ID NO: 12 and the second guide RNA comprises the sequence of SEQ ID NO: 1013. In some embodiments, the first guide RNA comprises the sequence of SEQ ID NO: 1013 and the second guide RNA comprises the sequence of SEQ ID NO: 12. In some embodiments, a) the first guide RNA comprises the sequence of SEQ ID NO: 148 and the second guide RNA comprises the sequence of SEQ ID NO: 134; b) the second guide RNA comprises the sequence of SEQ ID NO: 145 and the second guide RNA comprises the sequence of SEQ ID NO: 131; c) the first guide RNA comprises the sequence of SEQ ID NO: 134 and the second guide RNA comprises the sequence of SEQ ID NO: 148; or d) the second guide RNA comprises the sequence of SEQ ID NO: 131 and the second guide RNA comprises the sequence of SEQ ID NO: 145. In some embodiments, the scaffold for the first guide RNA comprises the sequence of SEQ ID NO: 901. In some embodiments, the scaffold for the second guide RNA comprises the sequence of SEQ ID NO: 901. In some embodiments, any of the second vectors comprises a stuffer sequence. In some embodiments, the stuffer sequence is a 3′UTR sequence. In some embodiments, the 3′UTR desmin sequence comprises a sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequence of SEQ ID NO: 552 (taattaacagatttgt taatgaagaggaaatatacaaatattaatgaacaaaatagtgtgtttagaacaagactcacatacaggagacatacacatgttaaaggaggcattaga atagtggggaaaacttacaaattcagtatagggtaggagcactagatagctaagaggtggggtagggggcagagtgaaggctgtctctcttttact ccttctaaaaatattccagtggatcaaagatgtaggaaacgatggatccatgtaggcctcagccttcaagtactcctctgagatttcagggattatcctt taaaggagtcaggaagagggtagagatcacaaagataataagctcagacagtagtatgttaaataaacctcaggaggttttagtcttaaggtccttaa tctgaccttccaaactgacttttccagaaacttccaaaagcctctc) or fragments thereof.
In some embodiments, if the composition comprises one or more nucleic acids encoding an RNA-targeted endonuclease and one or more guide RNAs, the one or more nucleic acids are designed such that they express the one or more guide RNAs at an equivalent or higher level (e.g., a greater number of expressed transgene copies) as compared to the expression level of the RNA-targeted endonuclease. In some embodiments, the one or more nucleic acids are designed such that they express (e.g., on average in 100 cells) the one or more guide RNAs at least a 1.1, 1.2, 1.3, 1.4, or 1.5 times higher level (e.g., a greater number of expressed transgene copies) as compared to the expression level of the RNA-targeted endonuclease. In some embodiments, the one or more nucleic acids are designed such that they express the one or more guide RNAs at 1.01-1.5, 1.01-1.4, 1.01-1.3, 1.01-1.2, 1.01-1.1, 1.1-2.0, 1.1-1.8, 1.1-1.6, 1.1-1.4, 1.1-1.3, 1.2-2.0, 1.2-1.8, 1.2-1.6, 1.2-1.4, 1.4-2.0, 1.4-1.8, 1.4-1.6, 1.6-2.0, 1.6-1.8, or 1.8-2.0 times higher level (e.g., a greater number of expressed transgene copies) as compared to the expression level of the RNA-targeted endonuclease. In some embodiments, the one or more guide RNAs are designed to express a higher level than the RNA-targeted endonuclease by: a) utilizing one or more regulatory elements (e.g., promoters or enhancers) that express the one or more guide RNAs at a higher level as compared to the regulatory elements (e.g., promoters or enhancers) for expression of the RNA-targeted endonuclease; and/or b) expressing more copies of one or more of the guide RNAs as compared to the number of copies of the RNA-targeted endonuclease (e.g., 2× or 3× as many copies of the nucleotide sequences encoding the one or more guide RNAs as compared to the number of copies of the nucleotide sequences encoding the RNA-targeted endonuclease). For example, in some embodiments, the composition comprises multiple nucleic acid molecules (e.g., in multiple vectors), wherein for every nucleotide sequence encoding an RNA-targeted endonuclease in the nucleic acid molecules in the composition, there are two or three copies of the nucleotide sequence encoding the guide RNA in the nucleic acid molecules in the composition. In some embodiments, the composition comprises a first guide RNA and a second guide RNA, wherein the first guide RNA and the second guide RNA are not the same (e.g., any of the guide RNA pairs disclosed herein), and for every nucleotide sequence encoding an RNA-targeted endonuclease in the nucleic acid molecules in the composition, there are two or three copies of the nucleotide sequence encoding the first guide RNA and/or the second guide RNA.
In some embodiments, any of the nucleic acids disclosed herein encodes an RNA-targeted endonuclease. In some embodiments, the RNA-targeted endonuclease has cleavase activity, which can also be referred to as double-strand endonuclease activity. In some embodiments, the RNA-targeted endonuclease comprises a Cas nuclease. Examples of Cas9 nucleases include those of the type II CRISPR systems of S. pyogenes, S. aureus, and other prokaryotes (see, e.g., the list in the next paragraph), and modified (e.g., engineered or mutant) versions thereof. See, e.g., US2016/0312198 A1; US 2016/0312199 A1. In particular embodiments, the RNA-targeted endonuclease is a type II CRISPR Cas enzyme. Other examples of Cas nucleases include a Csm or Cmr complex of a type III CRISPR system or the Cas10, Csm1, or Cmr2 subunit thereof; and a Cascade complex of a type I CRISPR system, or the Cas3 subunit thereof. In some embodiments, the Cas nuclease may be from a Type-IIA, Type-IIB, or Type-IIC system. For discussion of various CRISPR systems and Cas nucleases see, e.g., Makarova et al., NAT. REV. MICROBIOL. 9:467-477 (2011); Makarova et al., N
Non-limiting exemplary species that the Cas nuclease can be derived from include Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp., Staphylococcus aureus, Listeria innocua, Lactobacillus gasseri, Francisella novicida, Wolinella succinogenes, Sutterella wadsworthensis, Gammaproteobacterium, Neisseria meningitidis, Campylobacter jejuni, Pasteurella multocida, Fibrobacter succinogene, Rhodospirillum rubrum, Nocardiopsis dassonvillei, Streptomyces pristinaespiralis, Streptomyces viridochromogenes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, Alicyclobacillus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Lactobacillus buchneri, Treponema denticola, Microscilla marina, Burkholderiales bacterium, Polaromonas naphthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Synechococcus sp., Acetohalobium arabaficum, Ammonifex degensii, Caldicelulosiruptor becscii, Candidatus Desulforudis, Clostridium botulinum, Clostridium difficile, Finegoldia magna, Natranaerobius thermophilus, Pelotomaculum thermopropionicum, Acidithiobacillus caldus, Acidithiobacillus ferrooxidans, Allochromafium vinosum, Marinobacter sp., Nitrosococcus halophilus, Nitrosococcus watsoni, Pseudoalteromonas haloplanktis, Ktedonobacter racemifer, Methanohalobium evestigatum, Anabaena variabilis, Nodularia spumigena, Nostoc sp., Arthrospira maxima, Arthrospira platensis, Arthrospira sp., Lyngbya sp., Microcoleus chthonoplastes, Oscillatoria sp., Petrotoga mobilis, Thermosipho africanus, Streptococcus pasteurianus, Neisseria cinerea, Campylobacter lari, Parvibaculum lavamentivorans, Corynebacterium diphtheria, Acidaminococcus sp., Lachnospiraceae bacterium ND2006, and Acaryochloris marina.
In some embodiments, the nucleic acid encoding SaCas9 encodes an SaCas9 comprising an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 711:
In some embodiments, the nucleic acid encoding SaCas9 comprises the nucleic acid of SEQ ID NO: 9014:
In some embodiments comprising a nucleic acid encoding SaCas9, the SaCas9 comprises an amino acid sequence of SEQ ID NO: 711.
In some embodiments, the SaCas9 is a variant of the amino acid sequence of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an E at the position corresponding to position 781 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an N at the position corresponding to position 967 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an R at the position corresponding to position 1014 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises a K at the position corresponding to position 781 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises a K at the position corresponding to position 967 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an H at the position corresponding to position 1014 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an E at the position corresponding to position 781 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 967 of SEQ ID NO: 711; and an amino acid other than an R at the position corresponding to position 1014 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises a K at the position corresponding to position 781 of SEQ ID NO: 711; a K at the position corresponding to position 967 of SEQ ID NO: 711; and an H at the position corresponding to position 1014 of SEQ ID NO: 711.
In some embodiments, the SaCas9 comprises an amino acid other than an R at the position corresponding to position 244 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an N at the position corresponding to position 412 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an N at the position corresponding to position 418 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an R at the position corresponding to position 653 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an amino acid other than an R at the position corresponding to position 244 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 412 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 418 of SEQ ID NO: 711; and an amino acid other than an R at the position corresponding to position 653 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 244 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 412 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 418 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 653 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 244 of SEQ ID NO: 711; an A at the position corresponding to position 412 of SEQ ID NO: 711; an A at the position corresponding to position 418 of SEQ ID NO: 711; and an A at the position corresponding to position 653 of SEQ ID NO: 711.
In some embodiments, the SaCas9 comprises an amino acid other than an R at the position corresponding to position 244 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 412 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 418 of SEQ ID NO: 711; an amino acid other than an R at the position corresponding to position 653 of SEQ ID NO: 711; an amino acid other than an E at the position corresponding to position 781 of SEQ ID NO: 711; an amino acid other than an N at the position corresponding to position 967 of SEQ ID NO: 711; and an amino acid other than an R at the position corresponding to position 1014 of SEQ ID NO: 711. In some embodiments, the SaCas9 comprises an A at the position corresponding to position 244 of SEQ ID NO: 711; an A at the position corresponding to position 412 of SEQ ID NO: 711; an A at the position corresponding to position 418 of SEQ ID NO: 711; an A at the position corresponding to position 653 of SEQ ID NO: 711; a K at the position corresponding to position 781 of SEQ ID NO: 711; a K at the position corresponding to position 967 of SEQ ID NO: 711; and an H at the position corresponding to position 1014 of SEQ ID NO: 711.
In some embodiments, the SaCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 715 (designated herein as SaCas9-KKH or SACAS9KKH):
In some embodiments, the SaCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 716 (designated herein as SaCas9-HF):
In some embodiments, the SaCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 717 (designated herein as SaCas9-KKH-HF):
In some embodiments, the nucleic acid encoding SluCas9 encodes a SluCas9 comprising an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 712:
In some embodiments, the SluCas9 is a variant of the amino acid sequence of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an Q at the position corresponding to position 781 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an R at the position corresponding to position 1013 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises a K at the position corresponding to position 781 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises a K at the position corresponding to position 966 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an H at the position corresponding to position 1013 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an Q at the position corresponding to position 781 of SEQ ID NO: 712; and an amino acid other than an R at the position corresponding to position 1013 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises a K at the position corresponding to position 781 of SEQ ID NO: 712; a K at the position corresponding to position 966 of SEQ ID NO: 712; and an H at the position corresponding to position 1013 of SEQ ID NO: 712.
In some embodiments, the SluCas9 comprises an amino acid other than an R at the position corresponding to position 246 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an N at the position corresponding to position 414 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than a T at the position corresponding to position 420 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an R at the position corresponding to position 655 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an amino acid other than an R at the position corresponding to position 246 of SEQ ID NO: 712; an amino acid other than an N at the position corresponding to position 414 of SEQ ID NO: 712; an amino acid other than a T at the position corresponding to position 420 of SEQ ID NO: 712; and an amino acid other than an R at the position corresponding to position 655 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 246 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 414 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 420 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 655 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 246 of SEQ ID NO: 712; an A at the position corresponding to position 414 of SEQ ID NO: 712; an A at the position corresponding to position 420 of SEQ ID NO: 712; and an A at the position corresponding to position 655 of SEQ ID NO: 712.
In some embodiments, the SluCas9 comprises an amino acid other than an R at the position corresponding to position 246 of SEQ ID NO: 712; an amino acid other than an N at the position corresponding to position 414 of SEQ ID NO: 712; an amino acid other than a T at the position corresponding to position 420 of SEQ ID NO: 712; an amino acid other than an R at the position corresponding to position 655 of SEQ ID NO: 712; an amino acid other than an Q at the position corresponding to position 781 of SEQ ID NO: 712; a K at the position corresponding to position 966 of SEQ ID NO: 712; and an amino acid other than an R at the position corresponding to position 1013 of SEQ ID NO: 712. In some embodiments, the SluCas9 comprises an A at the position corresponding to position 246 of SEQ ID NO: 712; an A at the position corresponding to position 414 of SEQ ID NO: 712; an A at the position corresponding to position 420 of SEQ ID NO: 712; an A at the position corresponding to position 655 of SEQ ID NO: 712; a K at the position corresponding to position 781 of SEQ ID NO: 712; a K at the position corresponding to position 966 of SEQ ID NO: 712; and an H at the position corresponding to position 1013 of SEQ ID NO: 712.
In some embodiments, the SluCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 718 (designated herein as SluCas9-KH or SLUCAS9KH):
In some embodiments, the SluCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 719 (designated herein as SluCas9-HF):
In some embodiments, the SluCas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 720 (designated herein as SluCas9-HF-KH):
In some embodiments, the Cas protein is any of the engineered Cas proteins disclosed in Schmidt et al., 2021, Nature Communications, “Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases.”
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7021 (designated herein as sRGN1):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7022 (designated herein as sRGN2):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7023 (designated herein as sRGN3):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7024 (designated herein as sRGN3.1):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7025 (designated herein as sRGN3.2):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7026 (designated herein as sRGN3.3):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7027 (designated herein as sRGN4):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7028 (designated herein as Staphylococcus hyicus Cas9 or ShyCas9):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7029 (designated herein as Staphylococcus microti Cas9 or Smi Cas9):
In some embodiments, the Cas9 comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7030 (designated herein as Staphylococcus pasteuri Cas9 or Spa Cas9):
In some embodiments, the Cas protein comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7031 (designated herein as Cas12i1):
In some embodiments, the Cas protein comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7032 (designated herein as Cas12i2):
In some embodiments, the Cas protein comprises an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence of SEQ ID NO: 7033 (designated herein as SpCas9):
In some embodiments, the guide RNA is chemically modified. A guide RNA comprising one or more modified nucleosides or nucleotides is called a “modified” guide RNA or “chemically modified” guide RNA, to describe the presence of one or more non-naturally and/or naturally occurring components or configurations that are used instead of or in addition to the canonical A, G, C, and U residues. In some embodiments, a modified guide RNA is synthesized with a non-canonical nucleoside or nucleotide, is here called “modified.” Modified nucleosides and nucleotides can include one or more of: (i) alteration, e.g., replacement, of one or both of the non-linking phosphate oxygens and/or of one or more of the linking phosphate oxygens in the phosphodiester backbone linkage (an exemplary backbone modification); (ii) alteration, e.g., replacement, of a constituent of the ribose sugar, e.g., of the 2′ hydroxyl on the ribose sugar (an exemplary sugar modification); (iii) wholesale replacement of the phosphate moiety with “dephospho” linkers (an exemplary backbone modification); (iv) modification or replacement of a naturally occurring nucleobase, including with a non-canonical nucleobase (an exemplary base modification); (v) replacement or modification of the ribose-phosphate backbone (an exemplary backbone modification); (vi) modification of the 3′ end or 5′ end of the oligonucleotide, e.g., removal, modification or replacement of a terminal phosphate group or conjugation of a moiety, cap or linker (such 3′ or 5′ cap modifications may comprise a sugar and/or backbone modification); and (vii) modification or replacement of the sugar (an exemplary sugar modification).
Chemical modifications such as those listed above can be combined to provide modified guide RNAs comprising nucleosides and nucleotides (collectively “residues”) that can have two, three, four, or more modifications. For example, a modified residue can have a modified sugar and a modified nucleobase, or a modified sugar and a modified phosphodiester. In some embodiments, every base of a guide RNA is modified, e.g., all bases have a modified phosphate group, such as a phosphorothioate group. In certain embodiments, all, or substantially all, of the phosphate groups of an guide RNA molecule are replaced with phosphorothioate groups. In some embodiments, modified guide RNAs comprise at least one modified residue at or near the 5′ end of the RNA. In some embodiments, modified guide RNAs comprise at least one modified residue at or near the 3′ end of the RNA.
In some embodiments, the guide RNA comprises one, two, three or more modified residues. In some embodiments, at least 5% (e.g., at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or 100%) of the positions in a modified guide RNA are modified nucleosides or nucleotides.
Unmodified nucleic acids can be prone to degradation by, e.g., intracellular nucleases or those found in serum. For example, nucleases can hydrolyze nucleic acid phosphodiester bonds. Accordingly, in one aspect the guide RNAs described herein can contain one or more modified nucleosides or nucleotides, e.g., to introduce stability toward intracellular or serum-based nucleases. In some embodiments, the modified guide RNA molecules described herein can exhibit a reduced innate immune response when introduced into a population of cells, both in vivo and ex vivo. The term “innate immune response” includes a cellular response to exogenous nucleic acids, including single stranded nucleic acids, which involves the induction of cytokine expression and release, particularly the interferons, and cell death.
In some embodiments of a backbone modification, the phosphate group of a modified residue can be modified by replacing one or more of the oxygens with a different substituent. Further, the modified residue, e.g., modified residue present in a modified nucleic acid, can include the wholesale replacement of an unmodified phosphate moiety with a modified phosphate group as described herein. In some embodiments, the backbone modification of the phosphate backbone can include alterations that result in either an uncharged linker or a charged linker with unsymmetrical charge distribution.
Examples of modified phosphate groups include, phosphorothioate, phosphoroselenates, borano phosphates, borano phosphate esters, hydrogen phosphonates, phosphoroamidates, alkyl or aryl phosphonates and phosphotriesters. The phosphorous atom in an unmodified phosphate group is achiral. However, replacement of one of the non-bridging oxygens with one of the above atoms or groups of atoms can render the phosphorous atom chiral. The stereogenic phosphorous atom can possess either the “R” configuration (herein Rp) or the “S” configuration (herein Sp). The backbone can also be modified by replacement of a bridging oxygen, (i.e., the oxygen that links the phosphate to the nucleoside), with nitrogen (bridged phosphoroamidates), sulfur (bridged phosphorothioates) and carbon (bridged methylenephosphonates). The replacement can occur at either linking oxygen or at both of the linking oxygens.
The phosphate group can be replaced by non-phosphorus containing connectors in certain backbone modifications. In some embodiments, the charged phosphate group can be replaced by a neutral moiety. Examples of moieties which can replace the phosphate group can include, without limitation, e.g., methyl phosphonate, hydroxylamino, siloxane, carbonate, carboxymethyl, carbamate, amide, thioether, ethylene oxide linker, sulfonate, sulfonamide, thioformacetal, formacetal, oxime, methyleneimino, methylenemethylimino, methylenehydrazo, methylenedimethylhydrazo and methyleneoxymethylimino.
Scaffolds that can mimic nucleic acids can also be constructed wherein the phosphate linker and ribose sugar are replaced by nuclease resistant nucleoside or nucleotide surrogates. Such modifications may comprise backbone and sugar modifications. In some embodiments, the nucleobases can be tethered by a surrogate backbone. Examples can include, without limitation, the morpholino, cyclobutyl, pyrrolidine and peptide nucleic acid (PNA) nucleoside surrogates.
The modified nucleosides and modified nucleotides can include one or more modifications to the sugar group, i.e. at sugar modification. For example, the 2′ hydroxyl group (OH) can be modified, e.g. replaced with a number of different “oxy” or “deoxy” substituents. In some embodiments, modifications to the 2′ hydroxyl group can enhance the stability of the nucleic acid since the hydroxyl can no longer be deprotonated to form a 2′-alkoxide ion.
Examples of 2′ hydroxyl group modifications can include alkoxy or aryloxy (OR, wherein “R” can be, e.g., alkyl, cycloalkyl, aryl, aralkyl, heteroaryl or a sugar); polyethyleneglycols (PEG), O(CH2CH2O)nCH2CH2OR wherein R can be, e.g., H or optionally substituted alkyl, and n can be an integer from 0 to 20 (e.g., from 0 to 4, from 0 to 8, from 0 to 10, from 0 to 16, from 1 to 4, from 1 to 8, from 1 to 10, from 1 to 16, from 1 to 20, from 2 to 4, from 2 to 8, from 2 to 10, from 2 to 16, from 2 to 20, from 4 to 8, from 4 to 10, from 4 to 16, and from 4 to 20). In some embodiments, the 2′ hydroxyl group modification can be 2′-O-Me. In some embodiments, the 2′ hydroxyl group modification can be a 2′-fluoro modification, which replaces the 2′ hydroxyl group with a fluoride. In some embodiments, the 2′ hydroxyl group modification can include “locked” nucleic acids (LNA) in which the 2′ hydroxyl can be connected, e.g., by a C1-6 alkylene or C1-6 heteroalkylene bridge, to the 4′ carbon of the same ribose sugar, where exemplary bridges can include methylene, propylene, ether, or amino bridges; O-amino (wherein amino can be, e.g., NH2; alkylamino, dialkylamino, heterocyclyl, arylamino, diarylamino, heteroarylamino, or diheteroarylamino, ethylenediamine, or polyamino) and aminoalkoxy, O(CH2)n-amino, (wherein amino can be, e.g., NH2; alkylamino, dialkylamino, heterocyclyl, arylamino, diarylamino, heteroarylamino, or diheteroarylamino, ethylenediamine, or polyamino) In some embodiments, the 2′ hydroxyl group modification can include “unlocked” nucleic acids (UNA) in which the ribose ring lacks the C2′-C3′ bond. In some embodiments, the 2′ hydroxyl group modification can include the methoxyethyl group (MOE), (OCH2CH2OCH3, e.g., a PEG derivative).
“Deoxy” 2′ modifications can include hydrogen (i.e. deoxyribose sugars, e.g., at the overhang portions of partially dsRNA); halo (e.g., bromo, chloro, fluoro, or iodo); amino (wherein amino can be, e.g., NH2; alkylamino, dialkylamino, heterocyclyl, arylamino, diarylamino, heteroarylamino, diheteroarylamino, or amino acid); NH(CH2CH2NH)nCH2CH2-amino (wherein amino can be, e.g., as described herein), —NHC(O)R (wherein R can be, e.g., alkyl, cycloalkyl, aryl, aralkyl, heteroaryl or sugar), cyano; mercapto; alkyl-thio-alkyl; thioalkoxy; and alkyl, cycloalkyl, aryl, alkenyl and alkynyl, which may be optionally substituted with e.g., an amino as described herein.
The sugar modification can comprise a sugar group which may also contain one or more carbons that possess the opposite stereochemical configuration than that of the corresponding carbon in ribose. Thus, a modified nucleic acid can include nucleotides containing e.g., arabinose, as the sugar. The modified nucleic acids can also include abasic sugars. These abasic sugars can also be further modified at one or more of the constituent sugar atoms. The modified nucleic acids can also include one or more sugars that are in the L form, e.g. L-nucleosides.
The modified nucleosides and modified nucleotides described herein, which can be incorporated into a modified nucleic acid, can include a modified base, also called a nucleobase. Examples of nucleobases include, but are not limited to, adenine (A), guanine (G), cytosine (C), and uracil (U). These nucleobases can be modified or wholly replaced to provide modified residues that can be incorporated into modified nucleic acids. The nucleobase of the nucleotide can be independently selected from a purine, a pyrimidine, a purine analog, or pyrimidine analog. In some embodiments, the nucleobase can include, for example, naturally-occurring and synthetic derivatives of a base.
In embodiments employing a dual guide RNA, each of the crRNA and the tracr RNA can contain modifications. Such modifications may be at one or both ends of the crRNA and/or tracr RNA. In embodiments comprising sgRNA, one or more residues at one or both ends of the sgRNA may be chemically modified, and/or internal nucleosides may be modified, and/or the entire sgRNA may be chemically modified. Certain embodiments comprise a 5′ end modification. Certain embodiments comprise a 3′ end modification.
Modifications of 2′-O-methyl are encompassed.
Another chemical modification that has been shown to influence nucleotide sugar rings is halogen substitution. For example, 2′-fluoro (2′-F) substitution on nucleotide sugar rings can increase oligonucleotide binding affinity and nuclease stability. Modifications of 2′-fluoro (2′-F) are encompassed.
Phosphorothioate (PS) linkage or bond refers to a bond where a sulfur is substituted for one nonbridging phosphate oxygen in a phosphodiester linkage, for example in the bonds between nucleotides bases. When phosphorothioates are used to generate oligonucleotides, the modified oligonucleotides may also be referred to as S-oligos.
Abasic nucleotides refer to those which lack nitrogenous bases.
Inverted bases refer to those with linkages that are inverted from the normal 5′ to 3′ linkage (i.e., either a 5′ to 5′ linkage or a 3′ to 3′ linkage).
An abasic nucleotide can be attached with an inverted linkage. For example, an abasic nucleotide may be attached to the terminal 5′ nucleotide via a 5′ to 5′ linkage, or an abasic nucleotide may be attached to the terminal 3′ nucleotide via a 3′ to 3′ linkage. An inverted abasic nucleotide at either the terminal 5′ or 3′ nucleotide may also be called an inverted abasic end cap.
In some embodiments, one or more of the first three, four, or five nucleotides at the 5′ terminus, and one or more of the last three, four, or five nucleotides at the 3′ terminus are modified. In some embodiments, the modification is a 2′-O-Me, 2′-F, inverted abasic nucleotide, PS bond, or other nucleotide modification well known in the art to increase stability and/or performance.
In some embodiments, the first four nucleotides at the 5′ terminus, and the last four nucleotides at the 3′ terminus are linked with phosphorothioate (PS) bonds.
In some embodiments, the first three nucleotides at the 5′ terminus, and the last three nucleotides at the 3′ terminus comprise a 2′-O-methyl (2′-O-Me) modified nucleotide. In some embodiments, the first three nucleotides at the 5′ terminus, and the last three nucleotides at the 3′ terminus comprise a 2′-fluoro (2′-F) modified nucleotide.
In some embodiments, a composition is encompassed comprising: a) one or more guide RNAs comprising one or more guide sequences from Table 1A, Table 1B, or Table 5 and b) saCas9 (when combined with a gRNA comprising any one of or combination of SEQ ID Nos: 1-35, 1000-1078, and 3000-3069) or sluCas9 (when combined with a gRNA comprising any one of or combination of SEQ ID Nos: 100-225, 2000-2116, and 4000-4251), or any of the mutant Cas9 proteins disclosed herein. In some embodiments, the guide RNA together with a Cas9 is called a ribonucleoprotein complex (RNP).
In some embodiments, the disclosure provides for an RNP complex, wherein the guide RNA (e.g., any of the guide RNAs disclosed herein) binds to or is capable of binding to a target sequence in the dystrophin gene (e.g., a splice acceptor site or a splice donor site for exons 45, 51, or 53, including e.g., the Exon 51 intron-exon junction having the sequence ccagagtaacagtctgagtaggagctaaaatattagggtattgcaa (SEQ ID NO: 721), or a target sequence bound by any of the sequences disclosed in Table 1A, Table 1B, and 5), wherein the dystrophin gene comprises a PAM recognition sequence position upstream of the target sequence, and wherein the RNP cuts at a position that is 3 nucleotides upstream (−3) of the PAM in the dystrophin gene. In some embodiments, the RNP also cuts at a position that is 2 nucleotides upstream (−2), 4 nucleotides upstream (−4), 5 nucleotides upstream (−5), or 6 nucleotides upstream (−6) of the PAM in the dystrophin gene. In some embodiments, the RNP cuts at a position that is 3 nucleotides upstream (−3) and 4 nucleotides upstream (−4) of the PAM in the dystrophin gene.
In some embodiments, chimeric Cas9 (SaCas9 or SluCas9) nucleases are used, where one domain or region of the protein is replaced by a portion of a different protein. In some embodiments, a Cas9 nuclease domain may be replaced with a domain from a different nuclease such as Fok1. In some embodiments, a Cas9 nuclease may be a modified nuclease.
In some embodiments, the Cas9 is modified to contain only one functional nuclease domain. For example, the agent protein may be modified such that one of the nuclease domains is mutated or fully or partially deleted to reduce its nucleic acid cleavage activity.
In some embodiments, a conserved amino acid within a Cas9 protein nuclease domain is substituted to reduce or alter nuclease activity. In some embodiments, a Cas9 nuclease may comprise an amino acid substitution in the RuvC or RuvC-like nuclease domain. Exemplary amino acid substitutions in the RuvC or RuvC-like nuclease domain include D10A (based on the S. pyogenes Cas9 protein). See, e.g., Zetsche et al. (2015) Cell October 22:163(3): 759-771. In some embodiments, the Cas9 nuclease may comprise an amino acid substitution in the HNH or HNH-like nuclease domain. Exemplary amino acid substitutions in the HNH or HNH-like nuclease domain include E762A, H840A, N863A, H983A, and D986A (based on the S. pyogenes Cas9 protein). See, e.g., Zetsche et al. (2015). Further exemplary amino acid substitutions include D917A, E1006A, and D1255A (based on the Francisella novicida U112 Cpf1 (FnCpf1) sequence (UniProtKB—A0Q7Q2 (CPF1_FRATN)). Further exemplary amino acid substitutions include D10A and N580A (based on the S. aureus Cas9 protein). See, e.g., Friedland et al., 2015, Genome Biol., 16:257.
In some embodiments, the Cas9 lacks cleavase activity. In some embodiments, the Cas9 comprises a dCas DNA-binding polypeptide. A dCas polypeptide has DNA-binding activity while essentially lacking catalytic (cleavase/nickase) activity. In some embodiments, the dCas polypeptide is a dCas9 polypeptide. In some embodiments, the Cas9 lacking cleavase activity or the dCas DNA-binding polypeptide is a version of a Cas nuclease (e.g., a Cas9 nuclease discussed above) in which its endonucleolytic active sites are inactivated, e.g., by one or more alterations (e.g., point mutations) in its catalytic domains. See, e.g., US 2014/0186958 A1; US 2015/0166980 A1.
In some embodiments, the Cas9 comprises one or more heterologous functional domains (e.g., is or comprises a fusion polypeptide).
In some embodiments, the heterologous functional domain may facilitate transport of the Cas9 into the nucleus of a cell. For example, the heterologous functional domain may be a nuclear localization signal (NLS). In some embodiments, the Cas9 may be fused with 1-10 NLS(s). In some embodiments, the Cas9 may be fused with 1-5 NLS(s). In some embodiments, the Cas9 may be fused with 1-3 NLS(s). In some embodiments, the Cas9 may be fused with one NLS. Where one NLS is used, the NLS may be attached at the N-terminus or the C-terminus of the Cas9 sequence, and may be directly fused/attached or fused/attached via a linker. It may also be inserted within the Cas9 sequence. In other embodiments, the Cas9 may be fused with more than one NLS. In some embodiments, the Cas9 may be fused with 2, 3, 4, or 5 NLSs. In some embodiments, the Cas9 may be fused with two NLSs. In certain circumstances, the two NLSs may be the same (e.g., two SV40 NLSs) or different. In some embodiments, the Cas9 protein is fused with one or more SV40 NLSs. In some embodiments, the SV40 NLS comprises the amino acid sequence of SEQ ID NO: 713 (PKKKRKV). In some embodiments, the Cas9 protein (e.g., the SaCas9 or SluCas9 protein) is fused to one or more nucleoplasmin NLSs. In some embodiments, the Cas protein is fused to one or more c-myc NLSs. In some embodiments, the Cas protein is fused to one or more E1A NLSs. In some embodiments, the Cas protein is fused to one or more BP (bipartite) NLSs. In some embodiments, the nucleoplasmin NLS comprises the amino acid sequence of SEQ ID NO: 714 (KRPAATKKAGQAKKKK). In some embodiments, the Cas9 protein is fused with a c-Myc NLS. In some embodiments, the c-Myc NLS is encoded by the nucleic acid sequence of SEQ ID NO: 722 (CCGGCAGCTAAGAAAAAGAAACTGGAT). In some embodiments, the Cas9 is fused to two SV40 NLS sequences linked at the carboxy terminus. In some embodiments, the Cas9 may be fused with two NLSs, one linked at the N-terminus and one at the C-terminus. In some embodiments, the Cas9 may be fused with 3 NLSs. In some embodiments, the Cas9 may be fused with 3 NLSs, two linked at the N-terminus and one linked at the C-terminus. In some embodiments, the Cas9 may be fused with 3 NLSs, one linked at the N-terminus and two linked at the C-terminus. In some embodiments, the Cas9 may be fused with no NLS. In some embodiments, the Cas9 may be fused with one NLS. In some embodiments, the Cas9 may be fused with an NLS on the C-terminus and does not comprise an NLS fused on the N-terminus. In some embodiments, the Cas9 may be fused with an NLS on the N-terminus and does not comprise an NLS fused on the C-terminus. In some embodiments, the Cas9 protein is fused to an SV40 NLS and to a nucleoplasmin NLS. In some embodiments, the Cas9 protein is fused to an SV40 NLS and to a c-Myc NLS. In some embodiments, the SV40 NLS is fused to the C-terminus of the Cas9, while the nucleoplasmin NLS is fused to the N-terminus of the Cas9 protein. In some embodiments, the SV40 NLS is fused to the C-terminus of the Cas9, while the c-Myc NLS is fused to the N-terminus of the Cas9 protein. In some embodiments, the SV40 NLS is fused to the N-terminus of the Cas9, while the nucleoplasmin NLS is fused to the C-terminus of the Cas9 protein. In some embodiments, the SV40 NLS is fused to the N-terminus of the Cas9, while the c-Myc NLS is fused to the C-terminus of the Cas9 protein. In some embodiments, the SV40 NLS is fused to the Cas9 protein by means of a linker. In some embodiments, the SV40 NLS and linker is encoded by the nucleic acid sequence of SEQ ID NO: 723 (ATGATGGCCCCAAAGAAGAAGCGGAAGGTCGGTATCCACGGAGTCCCAGCAGCC). In some embodiments, the nucleoplasmin NLS is fused to the Cas9 protein by means of a linker. In some embodiments, the c-Myc NLS is fused to the Cas9 protein by means of a linker. In some embodiments, an additional domain may be: a) fused to the N- or C-terminus of the Cas protein (e.g., a Cas9 protein), b) fused to the N-terminus of an NLS fused to the N-terminus of a Cas protein, or c) fused to the C-terminus of an NLS fused to the C-terminus of a Cas protein. In some embodiments, an NLS is fused to the N- and/or C-terminus of the Cas protein by means of a linker. In some embodiments, an NLS is fused to the N-terminus of an N-terminally-fused NLS on a Cas protein by means of a linker, and/or an NLS is fused to the C-terminus of a C-terminally fused NLS on a Cas protein by means of a linker. In some embodiments, the linker is GSVD (SEQ ID NO: 550) or GSGS (SEQ ID NO: 551). In some embodiments, the Cas protein comprises a c-Myc NLS fused to the N-terminus of the Cas protein (or to an N-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises an SV40 NLS fused to the C-terminus of the Cas protein (or to a C-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises a nucleoplasmin NLS fused to the C-terminus of the Cas protein (or to a C-terminally-fused NLS on the Cas protein), optionally by means of a linker. In some embodiments, the Cas protein comprises: a) a c-Myc NLS fused to the N-terminus of the Cas protein, optionally by means of a linker, b) an SV40 NLS fused to the C-terminus of the Cas protein, optionally by means of a linker, and c) a nucleoplasmin NLS fused to the C-terminus of the SV40 NLS, optionally by means of a linker. In some embodiments, the Cas protein comprises: a) a c-Myc NLS fused to the N-terminus of the Cas protein, optionally by means of a linker, b) a nucleoplasmin NLS fused to the C-terminus of the Cas protein, optionally by means of a linker, and c) an SV40 NLS fused to the C-terminus of the nucleoplasmin NLS, optionally by means of a linker.
In some embodiments, the heterologous functional domain may be capable of modifying the intracellular half-life of the Cas9. In some embodiments, the half-life of the Cas9 may be increased. In some embodiments, the half-life of the Cas9 may be reduced. In some embodiments, the heterologous functional domain may be capable of increasing the stability of the Cas9. In some embodiments, the heterologous functional domain may be capable of reducing the stability of the Cas9. In some embodiments, the heterologous functional domain may act as a signal peptide for protein degradation. In some embodiments, the protein degradation may be mediated by proteolytic enzymes, such as, for example, proteasomes, lysosomal proteases, or calpain proteases. In some embodiments, the heterologous functional domain may comprise a PEST sequence. In some embodiments, the Cas9 may be modified by addition of ubiquitin or a polyubiquitin chain. In some embodiments, the ubiquitin may be a ubiquitin-like protein (UBL). Non-limiting examples of ubiquitin-like proteins include small ubiquitin-like modifier (SUMO), ubiquitin cross-reactive protein (UCRP, also known as interferon-stimulated gene-15 (ISG15)), ubiquitin-related modifier-1 (URM1), neuronal-precursor-cell-expressed developmentally downregulated protein-8 (NEDD8, also called Rubl in S. cerevisiae), human leukocyte antigen F-associated (FAT10), autophagy-8 (ATG8) and -12 (ATG12), Fau ubiquitin-like protein (FUB1), membrane-anchored UBL (MUB), ubiquitin fold-modifier-1 (UFM1), and ubiquitin-like protein-5 (UBLS).
In some embodiments, the heterologous functional domain may be a marker domain. Non-limiting examples of marker domains include fluorescent proteins, purification tags, epitope tags, and reporter gene sequences. In some embodiments, the marker domain may be a fluorescent protein. Non-limiting examples of suitable fluorescent proteins include green fluorescent proteins (e.g., GFP, GFP-2, tagGFP, turboGFP, sfGFP, EGFP, Emerald, Azami Green, Monomeric Azami Green, CopGFP, AceGFP, ZsGreenl), yellow fluorescent proteins (e.g., YFP, EYFP, Citrine, Venus, YPet, PhiYFP, ZsYellow1), blue fluorescent proteins (e.g., EBFP, EBFP2, Azurite, mKalamal, GFPuv, Sapphire, T-sapphire,), cyan fluorescent proteins (e.g., ECFP, Cerulean, CyPet, AmCyan1, Midoriishi-Cyan), red fluorescent proteins (e.g., mKate, mKate2, mPlum, DsRed monomer, mCherry, mRFP1, DsRed-Express, DsRed2, DsRed-Monomer, HcRed-Tandem, HcRedl, AsRed2, eqFP611, mRasberry, mStrawberry, Jred), and orange fluorescent proteins (mOrange, mKO, Kusabira-Orange, Monomeric Kusabira-Orange, mTangerine, tdTomato) or any other suitable fluorescent protein. In other embodiments, the marker domain may be a purification tag and/or an epitope tag. Non-limiting exemplary tags include glutathione-S-transferase (GST), chitin binding protein (CBP), maltose binding protein (MBP), thioredoxin (TRX), poly(NANP), tandem affinity purification (TAP) tag, myc, AcV5, AU1, AUS, E, ECS, E2, FLAG, HA, nus, Softag 1, Softag 3, Strep, SBP, Glu-Glu, HSV, KT3, S, 51, T7, V5, VSV-G, 6×His, 8×His, biotin carboxyl carrier protein (BCCP), poly-His, and calmodulin. Non-limiting exemplary reporter genes include glutathione-S-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT), beta-galactosidase, beta-glucuronidase, luciferase, or fluorescent proteins.
In additional embodiments, the heterologous functional domain may target the Cas9 to a specific organelle, cell type, tissue, or organ. In some embodiments, the heterologous functional domain may target the Cas9 to muscle.
In further embodiments, the heterologous functional domain may be an effector domain. When the Cas9 is directed to its target sequence, e.g., when a Cas9 is directed to a target sequence by a guide RNA, the effector domain may modify or affect the target sequence. In some embodiments, the effector domain may be chosen from a nucleic acid binding domain or a nuclease domain (e.g., a non-Cas nuclease domain). In some embodiments, the heterologous functional domain is a nuclease, such as a FokI nuclease. See, e.g., U.S. Pat. No. 9,023,649.
In some embodiments, any of the compositions disclosed herein comprising any of the guides and/or endonucleases disclosed herein is sterile and/or substantially pyrogen-free. In particular embodiments, any of the compositions disclosed herein comprise a pharmaceutically acceptable carrier. The phrase “pharmaceutically or pharmacologically acceptable” refers to molecular entities and compositions that do not produce an adverse, allergic, or other untoward reaction when administered to an animal or human. As used herein “pharmaceutically acceptable carrier” includes any and all solvents (e.g., water), dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like that are physiologically compatible, including pharmaceutically acceptable cell culture media. Pharmaceutically acceptable carriers include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. In some embodiments, the composition comprises a preservative to prevent the growth of microorganisms.
In some embodiments, the efficacy of a guide RNA is determined when delivered or expressed together with other components forming an RNP. In some embodiments, the guide RNA is expressed together with a SaCas9 or SluCas9. In some embodiments, the guide RNA is delivered to or expressed in a cell line that already stably expresses an SaCas9 or SluCas9. In some embodiments the guide RNA is delivered to a cell as part of an RNP. In some embodiments, the guide RNA is delivered to a cell along with a nucleic acid (e.g., mRNA) encoding SaCas9 or SluCas9.
In some embodiments, the efficacy of particular guide RNAs is determined based on in vitro models. In some embodiments, the in vitro model is a cell line.
In some embodiments, the efficacy of particular guide RNAs is determined across multiple in vitro cell models for a guide RNA selection process. In some embodiments, a cell line comparison of data with selected guide RNAs is performed. In some embodiments, cross screening in multiple cell models is performed.
In some embodiments, the efficacy of particular guide RNAs is determined based on in vivo models. In some embodiments, the in vivo model is a rodent model. In some embodiments, the rodent model is a mouse which expresses a mutated dystrophin gene, e.g., a mdx mouse. In some embodiments, the in vivo model is a non-human primate, for example cynomolgus monkey.
This disclosure provides methods for gene editing and treating Duchenne Muscular Dystrophy (DMD). In some embodiments, any of the compositions described herein may be administered to a subject in need thereof for use in making a double or single strand break in any one or more of exons 43, 44, 45, 50, 51, or 53 of the dystrophin (DMD) gene. In some embodiments, pairs of guide RNAs described herein, in any of the vector configurations described herein, may be administered to a subject in need thereof to excise a portion of a DMD, thereby treating DMD. In some embodiments, any of the compositions described herein may be administered to a subject in need thereof for use in treating DMD. In some embodiments, a nucleic acid molecule comprising a first nucleic acid encoding one or more guide RNAs of Table 1A, Table 1B, or Table 5 and a second nucleic acid encoding either SaCas9 or SluCas9 (depending on the guide) is administered to a subject to treat DMD. In some embodiments, a single nucleic acid molecule (which may be a vector, including an AAV vector) comprising a first nucleic acid encoding one or more guide RNAs of Table 1A, Table 1B, or Table 5 and a second nucleic acid encoding either SaCas9 or SluCas9 (depending on the guide) is administered to a subject to treat DMD.
In some embodiments, any of the compositions described herein is administered to a subject in need thereof to treat Duchenne Muscular Dystrophy (DMD).
In some embodiments, any of the compositions described herein is administered to a subject in need thereof to induce a double strand break in any one or more of exons 43, 44, 45, 50, 51, or 53 of the dystrophin gene.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell any one of the compositions described herein, wherein the cell comprises a mutation in the dystrophin gene that is known to be associated with DMD.
In particular, in some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 1-35, 1000-1078, 3000-3069; and 2) a Staphylococcus aureus Cas9 (SaCas9) or a nucleic acid encoding (SaCas9).
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 1-35, 1000-1078, 3000-3069; and 2) a Staphylococcus aureus Cas9 (SaCas9) or a nucleic acid encoding (SaCas9); wherein the cell comprises a mutation in the dystrophin gene that is known to be associated with DMD.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 1-35, 1000-1078, and 3000-3069; and 2) a nucleic acid encoding SaCas9.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; a nucleic acid encoding one or more spacer sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; or a nucleic acid encoding one or more spacer sequence that is at least 90% identical to any one of SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; and 2) a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9). In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 10, 12, 15, 16, 20, 27, 28, 32, 33, 35, 1001, 1003, 1005, 1010, 1012, 1013, 1016, 1017, and 1018; and 2) a nucleic acid encoding SaCas9. In some embodiments, the spacer sequence is SEQ ID NO: 10. In some embodiments, the spacer sequence is SEQ ID NO: 12. In some embodiments, the spacer sequence is SEQ ID NO: 15. In some embodiments, the spacer sequence is SEQ ID NO: 16. In some embodiments, the spacer sequence is SEQ ID NO: 20. In some embodiments, the spacer sequence is SEQ ID NO: 27. In some embodiments, the spacer sequence is SEQ ID NO: 28. In some embodiments, the spacer sequence is SEQ ID NO: 32. In some embodiments, the spacer sequence is SEQ ID NO: 33. In some embodiments, the spacer sequence is SEQ ID NO: 35. In some embodiments, the spacer sequence is SEQ ID NO: 1001. In some embodiments, the spacer sequence is SEQ ID NO: 1003. In some embodiments, the spacer sequence is SEQ ID NO: 1005. In some embodiments, the spacer sequence is SEQ ID NO: 1010. In some embodiments, the spacer sequence is SEQ ID NO: 1012. In some embodiments, the spacer sequence is SEQ ID NO: 1013. In some embodiments, the spacer sequence is SEQ ID NO: 1016. In some embodiments, the spacer sequence is SEQ ID NO: 1017. In some embodiments, the spacer sequence is SEQ ID NO: 1018.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; a nucleic acid encoding one or more spacer sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; or a nucleic acid encoding one or more spacer sequence that is at least 90% identical to any one of SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; and 2) a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9). In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 3022, 3023, 3028, 3029, 3030, 3031, 3038, 3039, 3052, 3053, 3054, 3055, 3062, 3063, 3064, 3065, 3068, and 3069; and 2) a nucleic acid encoding SaCas9. In some embodiments, the spacer sequence is SEQ ID NO: 3022. In some embodiments, the spacer sequence is SEQ ID NO: 3023. In some embodiments, the spacer sequence is SEQ ID NO: 3028. In some embodiments, the spacer sequence is SEQ ID NO: 3029. In some embodiments, the spacer sequence is SEQ ID NO: 3030. In some embodiments, the spacer sequence is SEQ ID NO: 3031. In some embodiments, the spacer sequence is SEQ ID NO: 3038. In some embodiments, the spacer sequence is SEQ ID NO: 3039. In some embodiments, the spacer sequence is SEQ ID NO: 3052. In some embodiments, the spacer sequence is SEQ ID NO: 3053. In some embodiments, the spacer sequence is SEQ ID NO: 3054. In some embodiments, the spacer sequence is SEQ ID NO: 3055. In some embodiments, the spacer sequence is SEQ ID NO: 3062. In some embodiments, the spacer sequence is SEQ ID NO: 3063. In some embodiments, the spacer sequence is SEQ ID NO: 3064. In some embodiments, the spacer sequence is SEQ ID NO: 3065. In some embodiments, the spacer sequence is SEQ ID NO: 3068. In some embodiments, the spacer sequence is SEQ ID NO: 3069.
In particular, in some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; and 2) a Staphylococcus lugdunensis (SluCas9) or a nucleic acid molecule encoding SluCas9.
In particular, in some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; and 2) a Staphylococcus lugdunensis (SluCas9) or a nucleic acid molecule encoding SluCas9; wherein the cell comprises a mutation in the dystrophin gene that is known to be associated with DMD.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of a spacer sequence selected from SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 100-225, 2000-2116, and 4000-4251; and 2) a nucleic acid encoding Staphylococcus lugdunensis (SluCas9).
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; a nucleic acid encoding a spacer sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; and 2) a Staphylococcus lugdunensis (SluCas9) or a nucleic acid molecule encoding SluCas9. In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 131, 134, 135, 136, 139, 140, 141, 144, 145, 146, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 131, 134, 135, 136, 139, 144, 148, 149, 150, 151, 179, 184, 201, 210, 223, 224, 225; and 2) a nucleic acid encoding Staphylococcus lugdunensis (SluCas9). In some embodiments, the spacer sequence is SEQ ID NO: 131. In some embodiments, the spacer sequence is SEQ ID NO: 134. In some embodiments, the spacer sequence is SEQ ID NO: 135. In some embodiments, the spacer sequence is SEQ ID NO: 136. In some embodiments, the spacer sequence is SEQ ID NO: 139. In some embodiments, the spacer sequence is SEQ ID NO: 140. In some embodiments, the spacer sequence is SEQ ID NO: 141. In some embodiments, the spacer sequence is SEQ ID NO: 144. In some embodiments, the spacer sequence is SEQ ID NO: 145. In some embodiments, the spacer sequence is SEQ ID NO: 146. In some embodiments, the spacer sequence is SEQ ID NO: 148. In some embodiments, the spacer sequence is SEQ ID NO: 149. In some embodiments, the spacer sequence is SEQ ID NO: 150. In some embodiments, the spacer sequence is SEQ ID NO: 151. In some embodiments, the spacer sequence is SEQ ID NO: 179. In some embodiments, the spacer sequence is SEQ ID NO: 184. In some embodiments, the spacer sequence is SEQ ID NO: 201. In some embodiments, the spacer sequence is SEQ ID NO: 223. In some embodiments, the spacer sequence is SEQ ID NO: 224. In some embodiments, the spacer sequence is SEQ ID NO: 225.
In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: 1) a nucleic acid molecule comprising: a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; a nucleic acid encoding a spacer sequence comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; and 2) a Staphylococcus lugdunensis (SluCas9) or a nucleic acid molecule encoding SluCas9. In some embodiments, a method of treating Duchenne Muscular Dystrophy (DMD) is provided, the method comprising delivering to a cell: a single nucleic acid molecule comprising: 1) a nucleic acid encoding one or more spacer sequences selected from SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; a nucleic acid encoding one or more spacer sequences comprising at least 17, 18, 19, or 20 contiguous nucleotides of one or more spacer sequences selected from SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; or a nucleic acid encoding one or more spacer sequences that is at least 90% identical to any one of SEQ ID NOs: 4062, 4063, 4068, 4069, 4070, 4071, 4072, 4073, 4078, 4079, 4088, 4089, 4096, 4097, 4098, 4099, 4100, 4101, 4102, 4103, 4158, 4159, 4168, 4169, 4202, 4203, 4220, 4221, 4246, 4247, 4248, 4249, 4250, 4251; and 2) a nucleic acid encoding Staphylococcus lugdunensis (SluCas9). In some embodiments, the spacer sequence is SEQ ID NO: 4062. In some embodiments, the spacer sequence is SEQ ID NO: 4063. In some embodiments, the spacer sequence is SEQ ID NO: 4068. In some embodiments, the spacer sequence is SEQ ID NO: 4069. In some embodiments, the spacer sequence is SEQ ID NO: 4070. In some embodiments, the spacer sequence is SEQ ID NO: 4071. In some embodiments, the spacer sequence is SEQ ID NO: 4072. In some embodiments, the spacer sequence is SEQ ID NO: 4073. In some embodiments, the spacer sequence is SEQ ID NO: 4078. In some embodiments, the spacer sequence is SEQ ID NO: 4079. In some embodiments, the spacer sequence is SEQ ID NO: 4088. In some embodiments, the spacer sequence is SEQ ID NO: 4089. In some embodiments, the spacer sequence is SEQ ID NO: 4096. In some embodiments, the spacer sequence is SEQ ID NO: 4097. In some embodiments, the spacer sequence is SEQ ID NO: 4098. In some embodiments, the spacer sequence is SEQ ID NO: 4099. In some embodiments, the spacer sequence is SEQ ID NO: 4100. In some embodiments, the spacer sequence is SEQ ID NO: 4101. In some embodiments, the spacer sequence is SEQ ID NO: 4102. In some embodiments, the spacer sequence is SEQ ID NO: 4103. In some embodiments, the spacer sequence is SEQ ID NO: 4158. In some embodiments, the spacer sequence is SEQ ID NO: 4159. In some embodiments, the spacer sequence is SEQ ID NO: 4168. In some embodiments, the spacer sequence is SEQ ID NO: 4169. In some embodiments, the spacer sequence is SEQ ID NO: 4202. In some embodiments, the spacer sequence is SEQ ID NO: 4203. In some embodiments, the spacer sequence is SEQ ID NO: 4220. In some embodiments, the spacer sequence is SEQ ID NO: 4221. In some embodiments, the spacer sequence is SEQ ID NO: 4246. In some embodiments, the spacer sequence is SEQ ID NO: 4247. In some embodiments, the spacer sequence is SEQ ID NO: 4248. In some embodiments, the spacer sequence is SEQ ID NO: 4249. In some embodiments, the spacer sequence is SEQ ID NO: 4250. In some embodiments, the spacer sequence is SEQ ID NO: 4251.
In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SaCas9, wherein the SaCas9 comprises the amino acid sequence of SEQ ID NO: 711. In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SaCas9, wherein the SaCas9 is a variant of the amino acid sequence of SEQ ID NO: 711. In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SaCas9, wherein the SaCas9 comprises an amino acid sequence selected from any one of SEQ ID NOs: 715-717. In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SaCas9, wherein the SaCas9 comprises the amino acid of SEQ ID NO: 715.
In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SluCas9, wherein the SluCas9 comprises the amino acid sequence of SEQ ID NO: 712. In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SluCas9, wherein the SaCas9 is a variant of the amino acid sequence of SEQ ID NO: 712. In some embodiments, the methods comprise delivering to a cell a nucleic acid molecule encoding SluCas9, wherein the SluCas9 comprises an amino acid sequence selected from any one of SEQ ID NOs: 718-720.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence selected from any one of SEQ ID NOs: 10 and 15; 10 and 16; 12 and 16; 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9). In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 10 and 15. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 10 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 1005. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 15. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1003 and 1005. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 16 and 1003. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1010. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1012. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1013. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 10 and 1016. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1017 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1018 and 16.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence comprising at least 17, 18, 19, 20, or 21 contiguous nucleotides of any one of SEQ ID NOs: 10 and 15; 10 and 16; 12 and 16; 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9).
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence that are selected from spacer sequences that are at least 90% identical to any one of SEQ ID NOs: 10 and 15; 10 and 16; 12 and 16; 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9).
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence selected from any one of SEQ ID NOs: 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9), wherein the nucleic acid encoding the SaCas9 comprises the amino acid sequence of SEQ ID NO: 715. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 1005. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 15. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1001 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1003 and 1005. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 16 and 1003. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1010. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1012. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 12 and 1013. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 10 and 1016. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1017 and 16. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 1018 and 16.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence comprising at least 17, 18, 19, 20, or 21 contiguous nucleotides of any one of SEQ ID NOs: 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9), wherein the nucleic acid encoding the SaCas9 comprises the amino acid sequence of SEQ ID NO: 715.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence that is at least 90% identical to any one of SEQ ID NOs: 1001 and 1005; 1001 and 15; 1001 and 16; 1003 and 1005; 16 and 1003; 12 and 1010; 12 and 1012; 12 and 1013; 10 and 1016; 1017 and 1005; 1017 and 16; and 1018 and 16; and a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9), wherein the nucleic acid encoding the SaCas9 comprises the amino acid sequence of SEQ ID NO: 715.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence selected from any one of SEQ ID NOs: 148 and 134; 149 and 135; 150 and 135; 131 and 136; 151 and 136; 139 and 131; 139 and 151; 140 and 131; 140 and 151; 141 and 148; 144 and 149; 144 and 150; 145 and 131; 145 and 151; and 146 and 148; and a nucleic acid encoding a Staphylococcus lugdunensis (SluCas9). In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 148 and 134. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 149 and 135. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 150 and 135. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 131 and 136. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 151 and 136. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 139 and 131. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 139 and 151. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 140 and 131. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 140 and 151. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 141 and 148. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 144 and 149. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 144 and 150. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 145 and 131. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 145 and 151. In some embodiments, the first and second spacer sequence are selected from SEQ ID NOs: 146 and 148.
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence comprising at least 17, 18, 19, 20, or 21 contiguous nucleotides of any one of SEQ ID NOs: 148 and 134; 149 and 135; 150 and 135; 131 and 136; 151 and 136; 139 and 131; 139 and 151; 140 and 131; 140 and 151; 141 and 148; 144 and 149; 144 and 150; 145 and 131; 145 and 151; and 146 and 148; and a nucleic acid encoding a Staphylococcus lugdunensis (SluCas9).
In some embodiments, methods are provided for treating Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: a nucleic acid encoding a pair of guide RNAs comprising: a first and second spacer sequence that are selected from spacer sequences that are at least 90% identical to any one of SEQ ID NOs: 148 and 134; 149 and 135; 150 and 135; 131 and 136; 151 and 136; 139 and 131; 139 and 151; 140 and 131; 140 and 151; 141 and 148; 144 and 149; 144 and 150; 145 and 131; 145 and 151; and 146 and 148; and a nucleic acid encoding a Staphylococcus lugdunensis (SluCas9).
In some embodiments, methods are provided for excising a portion of an exon with a premature stop codon in a subject with Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: (i) a nucleic acid encoding a pair of guide RNAs wherein the first guide RNA binds to a target sequence within the exon that is upstream of the premature stop codon, and wherein the second guide RNA binds to a sequence that is downstream of the premature stop codon and downstream of the sequence bound by the first guide RNA; and (ii) a nucleic acid encoding a Staphylococcus aureus Cas9 (SaCas9); wherein the pair of guide RNAs and SaCas9 excise a portion of the exon. In some embodiments, the single nucleic acid molecule is delivered to the cell on a single vector. In some embodiments, the portions of the exon remaining after excision are rejoined with a one nucleotide insertion. In some embodiments, the portions of the exon remaining after excision are rejoined without a nucleotide insertion. Precise segmental deletion means that the dual cut process takes out a specific deletion. This precise deletion can lead to exon reframing (see e.g.,
In some embodiments, methods are provided for excising a portion of an exon with a premature stop codon in a subject with Duchenne Muscular Dystrophy (DMD), the method comprising delivering to a cell a single nucleic acid molecule comprising: (i) a nucleic acid encoding a pair of guide RNAs wherein the first guide RNA binds to a target sequence within the exon that is upstream of the premature stop codon, and wherein the second guide RNA binds to a sequence that is downstream of the premature stop codon and downstream of the sequence bound by the first guide RNA; and (ii) a nucleic acid encoding a Staphylococcus lugdunensis (SluCas9); wherein the pair of guide RNAs and SluCas9 excise a portion of the exon. In some embodiments, the single nucleic acid molecule is delivered to the cell on a single vector. In some embodiments, the portions of the exon remaining after excision are rejoined with a one nucleotide insertion. In some embodiments, the portions of the exon remaining after excision are rejoined without a nucleotide insertion. In some embodiments, the size of excised portion of the exon is between 8 and 167 nucleotides. In some embodiments, the exon is exon 45 of the dystrophin gene. In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 148 and 134 (or SEQ ID Nos: 134 and 148). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 149 and 135 (or SEQ ID Nos: 135 and 149). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 150 and 135 (or SEQ ID Nos: 135 and 150). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 131 and 136 (or SEQ ID Nos: 136 and 131). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 151 and 136 (or SEQ ID Nos: 136 and 151). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 139 and 131 (or SEQ ID Nos: 131 and 139). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 139 and 151 (or SEQ ID Nos: 151 and 139). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 140 and 131 (or SEQ ID Nos: 131 and 140). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 140 and 151 (or SEQ ID Nos: 151 and 140). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 141 and 148 (or SEQ ID Nos: 148 and 141). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 144 and 149 (or SEQ ID Nos: 149 and 144). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 144 and 150 (or SEQ ID Nos: 150 and 144). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 145 and 131 (or SEQ ID Nos: 131 and 145). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 145 and 151 (or SEQ ID Nos: 151 and 145). In some embodiments, the pair of guide RNAs comprise a first and second spacer sequence of SEQ ID NOs: 146 and 148 (or SEQ ID Nos: 148 and 146).
In some embodiments, the subject is a mammal. In some embodiments, the subject is human
For treatment of a subject (e.g., a human), any of the compositions disclosed herein may be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The compositions may be readily administered in a variety of dosage forms, such as injectable solutions. For parenteral administration in an aqueous solution, for example, the solution will generally be suitably buffered and the liquid diluent first rendered isotonic with, for example, sufficient saline or glucose. Such aqueous solutions may be used, for example, for intravenous, intramuscular, subcutaneous, and/or intraperitoneal administration.
In some embodiments, the invention comprises combination therapies comprising any of the methods or uses described herein together with an additional therapy suitable for ameliorating DMD.
The methods and uses disclosed herein may use any suitable approach for delivering the guide RNAs and compositions described herein. Exemplary delivery approaches include vectors, such as viral vectors; lipid nanoparticles; transfection; and electroporation. In some embodiments, vectors or LNPs associated with the single-vector guide RNAs/Cas9's disclosed herein are for use in preparing a medicament for treating DMD.
Where a vector is used, it may be a viral vector, such as a non-integrating viral vector. In some embodiments, viral vector is an adeno-associated virus vector, a lentiviral vector, an integrase-deficient lentiviral vector, an adenoviral vector, a vaccinia viral vector, an alphaviral vector, or a herpes simplex viral vector. In some embodiments, the viral vector is an adeno-associated virus (AAV) vector. In some embodiments, the AAV vector is an AAV1, AAV2, AAV3, AAV4, AAVS, AAV6, AAV7, AAV8, AAVrh10 (see, e.g., SEQ ID NO: 81 of U.S. Pat. No. 9,790,472, which is incorporated by reference herein in its entirety), AAVrh74 (see, e.g., SEQ ID NO: 1 of US 2015/0111955, which is incorporated by reference herein in its entirety), or AAV9 vector, wherein the number following AAV indicates the AAV serotype. In some embodiments, the AAV vector is a single-stranded AAV (ssAAV). In some embodiments, the AAV vector is a double-stranded AAV (dsAAV). Any variant of an AAV vector or serotype thereof, such as a self-complementary AAV (scAAV) vector, is encompassed within the general terms AAV vector, AAV1 vector, etc. See, e.g., McCarty et al., Gene Ther. 2001; 8:1248-54, Naso et al., BioDrugs 2017; 31:317-334, and references cited therein for detailed discussion of various AAV vectors. In some embodiments, the AAV vector size is measured in length of nucleotides from ITR to ITR, inclusive of both ITRs. In some embodiments, the AAV vector is less than 5 kb in size from ITR to ITR, inclusive of both ITRs. In particular embodiments, the AAV vector is less than 4.9 kb from ITR to ITR in size, inclusive of both ITRs. In further embodiments, the AAV vector is less than 4.85 kb in size from ITR to ITR, inclusive of both ITRs. In further embodiments, the AAV vector is less than 4.8 kb in size from ITR to ITR, inclusive of both ITRs. In further embodiments, the AAV vector is less than 4.75 kb in size from ITR to ITR, inclusive of both ITRs. In further embodiments, the AAV vector is less than 4.7 kb in size from ITR to ITR, inclusive of both ITRs. In some embodiments, the vector is between 3.9-5 kb, 4-5 kb, 4.2-5 kb, 4.4-5 kb, 4.6-5 kb, 4.7-5 kb, 3.9-4.9 kb, 4.2-4.9 kb, 4.4-4.9 kb, 4.7-4.9 kb, 3.9-4.85 kb, 4.2-4.85 kb, 4.4-4.85 kb, 4.6-4.85 kb, 4.7-4.85 kb, 4.7-4.9 kb, 3.9-4.8 kb, 4.2-4.8 kb, 4.4-4.8 kb or 4.6-4.8 kb from ITR to ITR in size, inclusive of both ITRs. In some embodiments, the vector is between 4.4-4.85 kb in size from ITR to ITR, inclusive of both ITRs. In some embodiments, the vector is an AAV9 vector.
In some embodiments, the vector (e.g., viral vector, such as an adeno-associated viral vector) comprises a tissue-specific (e.g., muscle-specific) promoter, e.g., which is operatively linked to a sequence encoding the guide RNA. In some embodiments, the muscle-specific promoter is a muscle creatine kinase promoter, a desmin promoter, an MHCK7 promoter, or an SPc5-12 promoter. In some embodiments, the muscle-specific promoter is a CK8 promoter. In some embodiments, the muscle-specific promoter is a CK8e promoter. Muscle-specific promoters are described in detail, e.g., in US2004/0175727 A1; Wang et al., Expert Opin Drug Deliv. (2014) 11, 345-364; Wang et al., Gene Therapy (2008) 15, 1489-1499. In some embodiments, the tissue-specific promoter is a neuron-specific promoter, such as an enolase promoter. See, e.g., Naso et al., BioDrugs 2017; 31:317-334; Dashkoff et al., Mol Ther Methods Clin Dev. 2016; 3:16081, and references cited therein for detailed discussion of tissue-specific promoters including neuron-specific promoters.
In some embodiments, in addition to guide RNA and Cas9 sequences, the vectors further comprise nucleic acids that do not encode guide RNAs. Nucleic acids that do not encode guide RNA and Cas9 include, but are not limited to, promoters, enhancers, and regulatory sequences. In some embodiments, the vector comprises one or more nucleotide sequence(s) encoding a crRNA, a trRNA, or a crRNA and trRNA.
Lipid nanoparticles (LNPs) are a known means for delivery of nucleotide and protein cargo, and may be used for delivery of the guide RNAs, compositions, or pharmaceutical formulations disclosed herein. In some embodiments, the LNPs deliver nucleic acid, protein, or nucleic acid together with protein.
Electroporation is a well-known means for delivery of cargo, and any electroporation methodology may be used for delivering the single vectors disclosed herein.
In some embodiments, the invention comprises a method for delivering any one of the single vectors disclosed herein to an ex vivo cell, wherein the guide RNA is encoded by a vector, associated with an LNP, or in aqueous solution. In some embodiments, the guide RNA/LNP or guide RNA is also associated with a Cas9 or sequence encoding Cas9 (e.g., in the same vector, LNP, or solution).
In some embodiments, the disclosure provides for methods of using any of the guides, endonucleases, cells, or compositions disclosed herein in research methods. For example, any of the guides or endonucleases disclosed herein may be used alone or in combination in experiments under various parameters (e.g., temperatures, pH, types of cells) or combined with other reagents to evaluate the activity of the guides and/or endonucleases.
Further embodiments encompassed by the disclosure are as follows.
NOs: 1-35, 1000-1078, or 3000-3069.
NOs: 100-225, 2000-2116, or 4000-4251.
Muscular Dystrophy (DMD).
The following examples are provided to illustrate certain disclosed embodiments and are not to be construed as limiting the scope of this disclosure in any way.
Guide RNA comprising the guide sequences shown in Table 1A and Table 1B below are prepared according to standard methods in a single guide (sgRNA) format. A single AAV vector is prepared that expresses one or more of the guide RNAs and a SaCas9 (for guide sequences having SEQ ID NOs: 1-35, or SEQ ID NOs: 3000-3069) or SluCas9 (for guide sequences having SEQ ID NOs: 100-225, or SEQ ID NOs: 4000-4251). See, Table 1A and Table 1B. The AAV vector is administered to cells in vitro and to mice (e.g., mdx mice) in vivo to assess the ability of the AAV to express the guide RNA and Cas9, edit the targeted exon (see Table 1A and Table 1B), and thereby treat DMD.
In particular, the ability of in vivo single AAV-mediated delivery of gene-editing components to successfully remove the mutant genomic sequence by exon skipping in the cardiac and skeletal muscle cells of mdx mice is tested.
A. Materials and Methods
1. sgRNA Selection
A subset of sgRNAs targeting the DMD gene were selected for indel frequency and profile evaluation. The selected sgRNAs are shown in Table 2 and were prepared according to standard methods. The criteria used to select these sgRNAs included their potential to induce exon reframing and or skipping, in addition to the existence of a mouse, dog and a non-human primate (NHP) homologue counterpart. This selection included 13 sgRNAs located within exon 45, three sgRNAs located within exon 51 and ten sgRNAs located within exon 53. The number of predicted off target sites was determined for each sgRNA.
2. Transfection of HECK293FT and Neuro-2a Cells
To evaluate indel frequency and profile, human HEK293FT and mouse Neuro-2a cell lines were used. HEK293FT and Neuro-2a cells were transfected in 12-well plates with 750 ng plasmid+2.25 μL of Lipofectamine 2000. Three days after transfection, cells were trypsinized and sorted for green fluorescent protein (GFP). GFP-positive cells were sorted directly into lysis buffer, and DNA extraction was performed using the GeneJet Genomic DNA Purification Kit. PCR was then performed on the DNA using exon-specific primers that targeted the relevant cut site.
3. Amplicon Deep Sequencing, Library Preparation, and Data Analysis
The relevant loci for each exon were amplified by PCR and the products were used to prepare sequencing libraries using MiSeq reagent kit V3. Indel analysis was performed using CRISPResso2±10-nt quantification window. (See, e.g., Clement et al., Nat Biotechnol. 2019 March; 37(3):224-226). Indel profiling consisted of 5 mutually exclusive indel categories, depicted in
The following sequences of selected primers were used for amplification of the specific human locus containing the sgRNA targeting sites (Tables 3A-3B) and the specific mouse locus containing the sgRNA targeting sites (Tables 4A-4B).
B. Results
A set of exemplary DMD sgRNAs were evaluated for indel frequency and editing profile with either SaCas9 or SluCas9 (as indicated in Table 2). Among this selection, 13 sgRNAs were located within exon 45, 3 sgRNAs were located within exon 51 and 10 sgRNAs were located within exon 53. To evaluate indel frequency and profile, plasmid transfection was performed in HEK293FT and Neuro-2a cell lines.
The average indel frequency of sgRNAs targeting exon 45 was determined in HEK293FT cells (
The average indel frequency of sgRNAs targeting exon 51 was determined in HEK293FT cells (
The average indel frequency of sgRNAs targeting exon 53 was determined in HEK293FT cells (
Additional exemplary DMD guide RNAs were designed that may be used with SaCas9 and variants of SaCas9 (e.g., guide sequences having SEQ ID NOs: 1000-1078) and SluCas9 and variants of SluCas9 (e.g., guide sequences having SEQ ID NOs: 2000-2116) in Table 5 below. In particular, guide RNAs were designed based on SaCas9-KKH (for guide sequences having SEQ ID NOs: 1000-1078) with a PAM sequence NNNRRT (N is any nucleotide, R is purine) and SluCas9-KH (for guide sequences having SEQ ID NOs: 2000-2116) with a PAM sequence NNRG.
Guide RNAs were designed focusing on genomic coordinate regions within exons 45, 51, and 53. For exon 45, the design region was genomic coordinates chrX: 31968307-31968546. For exon 51, the design region was genomic coordinates chrX: 31773928-31774224. For exon 53, the design region was genomic coordinates chrX: 31679343-31679618.
Off-target site prediction was computationally performed for two sets of mismatches: (1) 3 mismatches+0 bulge; and (2) 2 mismatches+1 bulge. Results are shown in Table 5.
Guide RNA comprising the guide sequences shown in Table 5 below are prepared according to standard methods in a single guide (sgRNA) format. A single AAV vector is prepared that expresses the guide RNA and a variant SaCas9 (for guide sequences having SEQ ID NOs: 1000-1078) or the guide RNA and a variant SluCas9 (for guide sequences having SEQ ID NOs: 2000-2116). See, Table 5. The AAV vector is administered to cells in vitro and to mice (e.g., mdx mice) in vivo to assess the ability of the AAV to express the guide RNA and Cas9, edit the targeted exon (see Table 5), and thereby treat DMD.
In particular, the ability of in vivo single AAV-mediated delivery of gene-editing components to successfully remove the mutant genomic sequence by exon skipping in the cardiac and skeletal muscle cells of mdx mice is tested.
A. Materials and Methods
1. sgRNA Selection
A subset of SaCas9-KKH or SluCas9 sgRNAs found within the DMD gene was selected for indel frequency and profile evaluation. The criteria used to select these sgRNAs included their potential to induce exon reframing and or skipping as a pair, in addition to the existence of a mouse, dog and NHP homologue counterpart. This selection included 27 sgRNAs located within exon 45, 39 sgRNAs located within exon 51 and 29 sgRNAs located within exon 53. The number of predicted off target sites was determined for each sgRNA.
2. Transfection of HEK293FT Cells
293FT cells were transfected in 12-well plates with 750 ng plasmid+2.25 uL of Lipofectamine 2000. Three days after transfection, cells were trypsinized and sorted for GFP. GFP-positive cells were sorted directly into lysis buffer, and DNA extraction was performed using the Promega Maxwell RSC Blood DNA Kit. PCR was then performed on the DNA using exon-specific primers that targeted the relevant cut site.
3. Amplicon Deep Sequencing Library Preparation and Data Analysis
The relevant loci for each exon were amplified by PCR and the products were used to prepare sequencing libraries using MiSeq reagent kit V3 600 cycle. Indel analysis was performed using CRISPResso2±10-nt quantification window. Indel profiling consisted of 8 indel categories listed in Table 6.
4. AAV Configurations for the Dual Cut Single Vector Candidates
A combination of promoter orientations, promoter configurations, NLSs and scaffolds were selected for generating AAV plasmids and evaluation on sgRNA transgene expression, AAV manufacturability, and editing efficiency in vitro and in vivo. AAV plasmid configurations listed in Table 7. Promoter, NLS and scaffold sequences listed in Table 8.
Sequences of selected primers used for amplification of the specific human locus containing the sgRNA sites are shown in Table 9.
5. Results
A set of sgRNAs found within the DMD gene was selected for evaluation of indel frequency and profile. Among this selection, 27 sgRNAs were located within exon 45, 39 sgRNAs were located within exon 51 and 29 sgRNAs were located within exon 53. To evaluate indel frequency and editing profiles, plasmid transfection was performed in HEK293FT (
1. Materials and Methods
Primary human skeletal muscle myoblasts (HsMM; Lonza CC-2580: lot #20TL070666, P0) were recovered and passaged in SkBM®-2 Skeletal Muscle Myoblast Basal Medium plus SkGM®-2 SingleQuots (CC-3246, CC-3244; Lonza) in the incubator at 37° C. with 5% CO2. When HsMM culture reached approximately 80% to 90% confluence and were actively proliferating, the cells were harvested for SluCas9 ribonucleoprotein (RNP) delivery. After thawing, the cells were passaged once before SluCas9 RNP delivery.
To form SluCas9 RNPs, the appropriate amount of synthetic sgRNA (Synthego: SO #7292552) and recombinant SluCas9 protein (Aldevron: Lot #M22536-01) were mixed in supplemented P5 Primary Cell nucleofection solution (Lonza V4XP-5032). In total, three sgRNA:SluCas9 doses were tested, including a low dose with 37.5 pmol:6.25 pmol, a middle dose 75 pmol:12.5 pmol, and a high dose 150:25. The sgRNAs and SluCas9 proteins were incubated for at least 10 minutes at room temperature for Cas9-sgRNA RNP formation.
While the SluCas9 RNP was forming, HsMMs were rinsed with HEPES buffered saline solution, dissociated from tissue culture flasks by trypsin, and centrifuge at 90×g for 10 minutes. The cell pellets were resuspended in fresh, pre-warmed, complete growth medium. The number of cells were counted. Appropriate number of cells were transfer into a new centrifuge tube, pelleted by centrifugation at 90×g for 10 minutes, and resuspended in supplemented nucleofection solution. About 200,000 cells in 15 μl nucleofection solution were mixed with about 7 μl of preformed SluCas9:sgRNA RNP complex.
Approximately 20 μl of the cell and RNP mix were transferred into a 16-well nucleofection strip tube, and then nucleofected with the DS-158 program in a 4D-Nucleofector (Lonza) Immediately after nucleofection, about 80 μl pre-warmed media were added into the each nucleocuvette and incubated at 37 degrees for 10 minutes. The contents (100 μl) of each nucleocuvette were transferred into one well in a 12-well plate filled with 2 ml media and incubated at 37 degrees for 48 hours.
To determine cell viability 48 hours after nucleofection, the cells were stained with Hoechst and Propidium Iodide (Life Technologies). Cell viability was then assessed using ImageXpress Micro (Molecular Devices). In general, samples with an overall cell viability above 70% were harvested and analyzed for indel analysis.
To isolate genomic DNA from HsMMs, the cells were washed with saline buffer, trypsinized and centrifuged. The cell pellets were treated with lysis buffer from the Maxwell RSC Blood DNA Kit (Promega #AS1400), and genomic DNAs were extracted using a Maxwell® RSC48 instrument (Promega #AS8500) according to the manufacturer's instruction. The concentrations of genomic DNAs were determined using Qubit™ 1× dsDNA HS Assay Kit (Thermo Fisher Scientific Q33231) according to the manufacturer's instruction.
To determine the gene editing efficiency, the genomic DNAs were amplified using primers flanking the DMD exon 45 genomic region. The following primer sequences were used: MiSeq_hE45_F TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGgtattctgtettgtatcctttgg (SEQ ID NO: 724) and MiSeq_hE45_R GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGaatgttagtgcctttcaccc (SEQ ID NO: 725). The size of the amplicons was verified by analyze a small amount of the PCR products on 2% E-gels (Thermo Fisher Scientific). A portion of the PCR product and the forward primer were then sent for sanger sequencing at Genewiz.
The sequencing results that pass the quality filter were used to determine editing efficiency and indel profile using the TIDE (Tracking of Indels by DEcomposition) algorithm. The following version of the algorithm was used for analysis https://shiny.vrtx.com/app/orrj/tide/ with the appropriate sgRNA sequence, the default analysis parameters, and a mock nucleofected sample as the control. Percentage of other insertions and deletions that have the potential to restore the reading frame of particular DMD patient mutations of interest were referred to as “RF other”. This represents the sum of 2, 5, 8, 11 bp deletions within the alignment window of −20 bp to +20 bp around the Cas9 cut site. The editing efficiency (% mutation) outputs for +1 bp insertion, RF. Other, and other indels from TIDE were then plotted using Prism 9.
2. Results
To optimize and improve gene editing efficiency of the top SluCas9 single-guide RNA (sgRNA) candidates, different sgRNA scaffold sequences were tested in primary human skeletal muscle myoblasts (HsMM) using synthetic sgRNAs as shown in Table 11 below. Two spacer sequences were tested: E45SL23 (SEQ ID NO: 150 (DNA); SEQ ID NO: 930 (RNA)) and E45SL24 (SEQ ID NO: 151 (DNA); SEQ ID NO: 931 (RNA)). Three scaffold sequences were tested: Slu-VCGT-4.5 (SEQ ID NO: 601 (DNA); SEQ ID NO: 918 (RNA)), Slu-VCGT-4 (SEQ ID NO: 917 (DNA); SEQ ID NO: 919 (RNA)), Slu-VCGT-5 (SEQ ID NO: 901 (DNA); SEQ ID NO: 920 (RNA)).
The three scaffold sequences differ by the nucleotide identity, and thus the stem-loop I in RNA secondary structure (
1. Materials and Methods
Transfection of HEK293FT Cells
293FT cells were transfected in 12-well plates with 750 ng plasmid+2.25 μL of Lipofectamine 2000. Three days after transfection, cells were trypsinized and sorted for GFP. GFP-positive cells were sorted directly into lysis buffer, and DNA extraction was performed using the Promega Maxwell RSC Blood DNA Kit. PCR was then performed on the DNA using exon-specific primers that targeted the relevant cut site.
Transfection of N2a Cells
N2a (Neuro2a) cells were transfected in 12-well plates after 24-hour of growth with 1000 ng plasmid+3 μL of Lipofectamine 2000. Three days after transfection, cell were trypsinized and sorted for GFP (green fluroescent protein). GFP-positive cells were sorted via FACS (flurorescence-activated cell sorting) directly into lysis buffer, and DNA extraction was performed using the Promega Maxwell RSC Blood DNA Kit. PCR was then performed on the DNA using exon-specific primers that targeted the relevant cut site.
Amplicon Deep Sequencing Library Preparation for HEK293FT Cells
Genomic DNA were extracted after sorting for 100 k GFP positive cells and were subjected for amplification by locus specific amplicon. The amplified products were purified by AMPure beads (0.8×) followed by QC with 1% E-gel and concentrations were measured using QuBiT for normalization of samples. Barcoding PCR was carried out with i5 and i7 indices and they were purified by 0.7× AMPure beads followed by QC with Tapestation or 1% E-gel. The barcoded samples were then measured by QuBiT and based on the concentration, the samples were pooled and subjected for library preparation for loading into MiSeq. 8 pM of library was loaded with 20% PhiX spike-in and the output were transferred to the Computational Genomics team for assessing the indel efficiencies for the guides.
Human Primary Skeletal Muscle Myoblasts (HsMMs) Culture
On day zero, two frozen vials of HsMMs (lot 20TL070666) was thawed and grown in complete growth media in a 37° C., 5% CO2, humidified incubator in two T75 flasks. The following day, on day 1, the media was changed. On day 3, the cells were passaged such that there were 9x10 cells seeded into 7 T175 flasks. On day 4, the media was changed in each flask. On day 6, RNP nucleofection was performed.
Nucleofection of HsMM Cells
HsMM cells were nucleofected with RNP using the Lonza 4D nucleofector. For each sample, 7 μL of RNP were combined with 0.3e6 cells in 15 μL P5 solution. RNP was prepared in a 6:1 gRNA:Cas9 ratio at various concentrations. For dual-cut samples that contain 2 gRNAs, RNP was pre-formed with a single gRNA first. RNP was formed by incubating gRNA and protein for 20 min at room temperature. After electroporation, 80 μL of complete growth media was added to each sample and samples were incubated in a 37° C., 5% CO2, humidified incubator. After 10 minutes, the samples were transferred to 12-well plates containing 2 mL of complete growth media that had been previously equilibrated in a 37° C., 5% CO2, humidified incubator.
Cell Harvesting and gDNA Extraction
To determine cell viability 48 hours after nucleofection, the cells were stained with Hoechst and Propidium Iodide (Life Technologies). Cell viability was then assessed using ImageXpress Micro (Molecular Devices). In general, samples with an overall cell viability above 70% were harvested and analyzed for indel analysis. To isolate genomic DNA from HsMMs, the cells were washed with saline buffer, trypsinized and centrifuged. The cell pellets were treated with lysis buffer from the Maxwell RSC Blood DNA Kit (Promega #AS1400), and genomic DNAs were extracted using a Maxwell® RSC48 instrument (Promega #AS8500) according to the manufacturer's instruction. The concentrations of genomic DNAs were determined using Qubit™ 1× dsDNA HS Assay Kit (Thermo Fisher Scientific Q33231) according to the manufacturer's instruction.
Amplicon Deep Sequencing Library Preparation for HsMM
To determine the gene editing efficiency, the genomic DNAs were amplified using primers flanking the DMD exon 45 genomic region. The following primer sequences were used: MiSeq_hE45_F TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGgtctttctgtcttgtatcctttgg (SEQ ID NO: 724) and MiSeq_hE45_R GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGaatgttagtgcctttcaccc (SEQ ID NO: 725).
The size of the amplicons was verified by analyze a small amount of the PCR products on 2% E-gels (Thermo Fisher Scientific). The PCR product was purified by AMPure XP beads (A63881). The purified PCR product was amplified again with primers that contain barcodes and Illumina adaptors. Multiple barcoded samples were pooled, combined with PhiX library, and loaded onto the Illumina Mi-Seq platform. MiSeq Reagent Kit v3 (MS-102-3003) was used to produce a 600-cycle run.
Maintenance and Differentiation of C2C12 Myotubes
C2C12 are maintained in DMEM supplemented with 10% FBS (Fetal Bovine Serum) and 1% Pen/Strep. For culture purposes, cells should not be allowed to reach >75% confluency. C2C12 myoblasts are differentiated in DMEM supplemented with 2% HS and 1% Pen/Strep. Briefly, cells are seeded at 42-45 k/cm2 and differentiation is started 24 hr after seeding (once cells reach ˜90-95% confluency). Differentiation medium is changed on day 1 and then refreshed every other day.
Transduction of C2C12 Myotubes
C2C12 myoblasts were allowed to differentiate to myotubes in differentiation medium (DMEM with 2% horse serum and 1% Pen/Strep) for 6 days. Two hours before viral transduction, myotubes were treated with Neuraminidase type III (Sigma-Aldrich, 50 mU/ml), followed by washing with differentiation medium twice. Myotubes were incubated with AAV (MOI 1.0E7) and centrifuged at 1000×g at 4 C for 1.5 hours. After spin transduction, the virus is aspirated, and the myotubes are washed with cold PBS once followed by two washes with differentiation medium. The myotubes are cultured in differentiation medium for an additional week (7 days) before being harvested for INDEL analysis (ICE/NGS) or fixed for immunohistochemistry analysis (IHC). DNA/RNA extraction was performed using the Qiagen AllPrep DNA/RNA Minikit
Cas9 Nuclear Localization by Immunofluorescence Staining of C2C12 Myotubes
7 days post AAV transduction (day 12 of myotube differentiation) C2C12 myotubes were fixed with 4% paraformaldehyde (PFA) for 20 mins at room temperature (RT). After fixation, cells were washed twice with PBS followed by permeabilization with PBS-0.5% Triton for 15 mins at RT. Prior to incubation with primary antibodies cells were incubated with a blocking buffer for 30 mins at RT (PBS+10% FBS+0.1% Triton). Cell were then incubated with primary antibodies in block for 2 hours at RT (or overnight at 4° C.) followed by 3 washes with PBS-Tween 0.1%. Secondary antibodies are added for 1.5-2 hours at RT. Antibodies for these studies are listed here at Table 12:
Vector Genome Quantitation
Quantitative polymerase chain reaction (qPCR) was used to quantify levels of AAV9 DNA in C2C12 differentiated cells. gDNA was extracted using the Qiagen AllPrep kit and quantified using the Qubit 4 Fluorometer and diluted to a final concentration of 2.5 ng/μL.
Absolute quantification of AAV9 vectors was performed by constructing a standard curve prepared from known quality of linearized plasmids encoding the region of interest. A set of Quality control (QC) were included to validate the methods of quantification. A set of non-template control (NTC) samples were included to confirm the specificity of reactions.
qPCR reactions were conducted in triplicate using a QuantStudio 6 Flex Real-Time PCR System (Thermo Fisher). A linear regression analysis was performed using threshold cycle (Ct) values of the standard curve. From this linear regression, the Ct values of the samples were used to quantify the number of copies per μg of gDNA of AAV9 present in each sample.
Cas9 and sgRNA Transgene Expression
Reverse transcription polymerase chain reaction (RT-qPCR) was used to quantify levels of Cas9 mRNA and the expression of gRNA in C2C12 differentiated cells. RNA was extracted using the Qiagen AllPrep kit and extracted RNA were quantified using the Qubit 4 Fluorometer and diluted to a final concentration of 20 ng/μL. Absolute quantification of gRNA and Cas9 mRNA were performed by constructing a standard curve prepared from known quality of a T7 transcript encoding the corresponding region of interest. A set of Quality control (QC) were included to validate the methods of quantification. A set of non-template control (NTC) samples were included to confirm the specificity of reactions.
RT-qPCR reactions were conducted in triplicate using a QuantStudio 6 Flex Real-Time PCR System (Thermo Fisher). A linear regression analysis was performed using threshold cycle (Ct) values of the standard curve. From this linear regression, the Ct values of the samples were used to quantify the number of copies per μg of RNA of either Cas9 mRNA or gRNA present in samples.
Amplicon Deep Sequencing Library Preparation for C2C12
gDNA was extracted using the Qiagen AllPrep kit. The relevant locus for exon 45 of mouse Dmd was amplified by PCR and amplified products were purified by AMPure beads (0.8×) followed by QC with 1% E-gel. DNA concentrations were measured using QuBiT for normalization of samples and Illumina sequencing libraries were created from these PCR products using a MiSeq Reagent Kit v3 (600-cycle).
Next Generation Sequencing Data Analysis
A custom bioinformatic workflow was used to process the Illumina sequencing data. First, poor quality reads were removed. Reads that passed the quality filter were trimmed using Trimmomatic to remove adapters and low quality bases. Then reads mapping to the PhiX genome were removed, and paired end reads were merged using PEAR. Based on the expected cut sites corresponding to the two guides, three reference amplicon sequences were created—wildtype amplicon, deletion amplicon with a deletion between the cutsites for the two guides, and inversion amplicon with an inversion of the sequence between the two the cutsites for the two guides. The merged reads were assigned to one of three amplicons (wild-type, deletion, inversion) based on the alignment score provided by the Needleman Wunsch algorithm as implemented in ParasailNeedle. The CIGAR string provided by the alignment algorithm was parsed to identify the sequence of indel events. The summarized table of indel events and their frequencies was overlaid with information on exon length, exon frame, and position of the premature stop codon introduced as a result of the DMD disease causing mutation, to characterize Cas9-associated indel events as productive (e.g., precise deletion, RF+1, RF Other, Exon skipping) or non-productive (e.g., OE) (Table 13). Throughout this workflow, a stringent set of QQC criteria was applied to filter poor quality samples. (Table 13).
Animal and Study Design
This study was designed to evaluate all-in-one gene editing vector candidates that have been created to include both the sgRNA and a Cas9 endonuclease to mediate gene editing at the DMD locus. In this study, the efficacy of the one-vector gene editing candidates was assessed in vivo by measuring dystrophin restoration, on-target gene editing, tissue vector genomes, and Cas9 and sgRNA transgene expression following an intraperitoneal administration to dEx44 mice at postnatal day 4 or 5.
Table 14A describes the in-vivo study design:
Amplicon Deep Sequencing Library Preparation for In Vivo Study
gDNA was extracted from mouse heart, quadricep, and triceps tissue using the Maxwell RSC Tissue DNA Kit and quantified via Qubit. The relevant locus of exon 45 of the mouse DMD gene was amplified using PCR. PCR products were visualized using both E-gel and TapeStation to confirm proper size and were purified using AMPure XP beads. PCR products then underwent a second PCR reaction to add unique 5′ and 3′ barcodes corresponding to each sample. These PCR products were quantified via Qubit and normalized to 4 nM each. The normalized products were then pooled, combined with a PhiX library to increase diversity, and loaded onto a MiSeq instrument. The library was sequenced using a MiSeq Reagent Kit v3 (600-cycle) and raw data was transferred to the DCS team.
Next Generation Sequencing Data Analysis for In Vivo Study
A custom bioinformatic workflow was used to process the Illumina sequencing data. First, poor quality reads (below Q30) were removed. The surviving reads were trimmed using Trimmomatic Trimmomatic to remove adapters and low quality bases. Then reads mapping to the PhiX genome were removed, and paired end reads were merged using PEAR. Based on the expected cut sites corresponding to the two guides, three reference amplicon sequences were created—wildtype amplicon, deletion amplicon with a deletion between the cut sites for the two guides, and inversion amplicon with an inversion of the sequence between the two the cut sites for the two guides. The merged reads were assigned to one of three amplicons (wild-type, deletion, inversion) based on the alignment score provided by the Needleman Wunsch algorithm as implemented in ParasailNeedle. The CIGAR string provided by the alignment algorithm was parsed to identify the sequence of indel events. The summarized table of indel events and their frequencies was overlaid with information on exon length, exon frame, and position of the premature stop codon introduced as a result of the DMD disease causing mutation, to characterize Cas9-associated indel events as productive (e.g., precise deletion, RF+1, RF Other, Exon skipping) or non-productive (e.g., OE) (Table 1). Throughout this workflow, a stringent set of QC criteria was applied to filter poor quality samples and all samples are checked for potential contaminations from other treatment groups in the same study.
2. Results
Dual Cut sgRNA In Vitro Screening
The average indel frequency of sgRNAs targeting exon 45 was tested in primary human skeletal muscle myoblasts (HsMM) (
The average indel frequency of sgRNAs targeting exon 51 was determined in HEK293FT cells (
AAV Configurations for the Dual Cut Single Vector Candidates
A combination of promoter orientations, and configurations, NLS sequences and sgRNA scaffolds were selected for generating AAV plasmids and evaluation on sgRNA transgene expression, AAV manufacturability, and editing efficiency in vitro and in vivo. AAV plasmid configurations are listed in Table 15 and Table 16. Promoter, NLS and scaffold sequences are listed in Table 17.
Guide RNA were prepared according to standard methods in a single guide (sgRNA) format. A single AAV vector was prepared that expresses the guide RNA pair and a variant SaCas9 or the guide RNA pair and a variant SluCas9. The AAV vectors were administered to C2C12 cells in vitro and to mice (e.g., dEx44 mice) in vivo to assess the ability of the AAV to express the guide RNA and Cas9, edit the targeted exon, and reframe the dystrophin gene (in vivo studies only).
In particular, the ability of in vitro single AAV-mediated delivery of gene-editing components to introduce productive editing and precise segmental deletion in the C2C12 mouse myotubes was tested (Table 18B,
Selected AAV configurations were evaluated for vector genome quantitation (
The ability of in vivo single AAV-mediated delivery of gene-editing components to successfully reframe exon 45 in the cardiac and skeletal muscle of dEx44 mice was tested. All selected AAV configurations showed an average total indel frequency higher than 12.5% and E45Slu18/4 sgRNA pair showed an average precise segmental deletion frequency higher than 15% in the heart (
Similar experiments were performed with a variety of sgRNAs (see spacer sequences in Table 19;
This description and exemplary embodiments should not be taken as limiting. For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing quantities, percentages, or proportions, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about,” to the extent they are not already so modified. Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.
It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the,” and any singular use of any word, include plural referents unless expressly and unequivocally limited to one referent. As used herein, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
This application claims the benefit of priority to U.S. Provisional Application No. 63/076,250, filed Sep. 9, 2020; U.S. Provisional Application No. 63/152,114, filed Feb. 22, 2021; U.S. Provisional Patent Application No. 63/166,174, filed Mar. 25, 2021; and U.S. Provisional Application No. 63/179,850, filed Apr. 26, 2021; all of which are incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
63179850 | Apr 2021 | US | |
63166174 | Mar 2021 | US | |
63152114 | Feb 2021 | US | |
63076250 | Sep 2020 | US |