The instant application contains a Sequence Listing which has been submitted electronically is hereby incorporated by reference in its entirety. The sequence listing was created on Jun. 20, 2019, is named UTFD_P3391WO.txt and is ˜6,540,494 bytes in size.
The present disclosure relates to the fields of molecular biology, medicine and genetics. More particularly, the disclosure relates to the use of genome editing to create humanized animal models for different forms of Duchenne muscular dystrophy (DMD), each containing distinct DMD mutations.
Muscular dystrophies (MD) are a group of more than 30 genetic diseases characterized by progressive weakness and degeneration of the skeletal muscles that control movement. Duchenne muscular dystrophy (DMD) is one of the most severe forms of MD that affects approximately 1 in 5000 boys and is characterized by progressive muscle weakness and premature death. Cardiomyopathy and heart failure are common, incurable and lethal features of DMD. The disease is caused by mutations in the gene encoding dystrophin (DMD), a large intracellular protein that links the dystroglycan complex at the cell surface with the underlying cytoskeleton, thereby maintaining integrity of the muscle cell membrane during contraction. Mutations in the dystrophin gene result in loss of expression of dystrophin causing muscle membrane fragility and progressive muscle wasting.
There remains a need in the art for treatments and cures for DMD, and for mouse models to be used for developing the same.
In accordance with the present disclosure, there is provided a nucleic acid comprising a sequence encoding a single guide RNA (sgRNA) comprising a spacer sequence and a scaffold sequence; wherein the spacer sequence comprises the sequence of any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617. The scaffold sequence may comprise, for example, the sequence of any one of SEQ ID NO: 147-153. In some embodiments, the nucleic acid comprises one copy of the sequence encoding the sgRNA. In some embodiments, the nucleic acid comprises two, three, four, or five copies of the sequence encoding the sgRNA. The nucleic acid may comprise a sequence encoding a promoter, wherein the promoter drives expression of the sgRNA. In some embodiments, the nucleic acid comprises three copies of the sequence encoding the sgRNA, wherein the nucleic acid comprises a sequence encoding a first promoter and expression of the first copy of the sgRNA is driven by the first promoter, wherein the nucleic acid comprises a sequence encoding a second promoter and expression of the second copy of the sgRNA is driven by the second promoter, and wherein the nucleic acid comprises a sequence encoding a third promoter and expression of the third copy of the sgRNA is driven by the third promoter. In some embodiments, the nucleic acid further comprises a sequence encoding a nuclease. In some embodiments, the nuclease is a Type II, Type V-A, Type V-B, Type V-C, Type V-U, or Type VI-B nuclease. In some embodiments, the nuclease is a TAL nuclease, a meganuclease, or a zinc-finger nuclease. In some embodiments, the nuclease is a Cas9, Cas12a, Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease. The Cas9 nuclease may be, for example, a Streptococcus pyogenes or Streptococcus aureus Cas9. In some embodiments, the Cas9 nuclease is a modified Cas9 nuclease, such as a modified Streptococcus pyogenes Cas9 or a modified Streptococcus aureus Cas9.
Also provided is a recombinant vector comprising a nucleic acid of the disclosure. In some embodiments, the recombinant vector is an expression vector. In some embodiments, the recombinant vector is a viral vector. In some embodiments, the viral vector is a lentiviral vector, a retroviral vector, an adenoviral vector, or an adeno-associated virus (AAV) vector. In some embodiments, the viral vector is an adeno-associated virus (AAV) vector. The AAV vector may have a serotype of, for example, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, or ovine AAV. In some embodiments, the serotype of the AAV vector is AAV9. In some embodiments, the AAV vector is replication-defective or conditionally replication defective.
Also provided is a non-viral vector comprising a nucleic acid of the disclosure. The non-viral vector may comprise calcium phosphate, liposomes, nanoparticles, and/or lipid emulsions.
Also provided is an AAV expression cassette comprising a first inverted terminal repeat (ITR); a first promoter; a nucleic acid of the disclosure; and a second ITR. In some embodiments, the AAV expression cassette further comprises a polyadenosine (polyA) sequence. In some embodiments, one or both of the first ITR and the second ITR are isolated or derived from any one of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, or ovine AAV.
Also provided is an AAV vector comprising a nucleic acid or an AAV expression cassette of the disclosure. In some embodiments, the AAV vector has the serotype of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, or ovine AAV. In some embodiments, the serotype of the AAV vector is AAV9. In some embodiments, the AAV vector is replication-defective or conditionally replication defective.
Also provided is a composition comprising a nucleic acid, an AAV expression cassette, or an AAV vector of the disclosure. In some embodiments, the composition further comprises a pharmaceutically acceptable carrier.
Also provided is a cell comprising a nucleic acid, an expression cassette, an AAV vector, or a composition of the disclosure. In some embodiments, the cell is a stem cell. In some embodiments, the cell is a mammalian cell such as a human cell.
Also provided is a method of correcting a gene defect in a cell, the method comprising contacting the cell with a nucleic acid, a recombinant vector, a non-viral vector, an AAV vector, or a composition of the disclosure. In some embodiments, the cell is a stem cell. In some embodiments, the cell is a mammalian cell such as a human cell.
Also provided is a method of treating a subject suffering from Duchenne muscular dystrophy, the method comprising administering to the subject a therapeutically effective amount of a nucleic acid, a recombinant vector, a non-viral vector, an AAV vector, or a composition of the disclosure.
Also provided is a method of treating a subject suffering from Duchenne muscular dystrophy, the method comprising administering to the subject a first vector (e.g., a recombinant vector or non-viral vector of the disclosure), and a second vector, wherein the second vector encodes a nuclease. In some embodiments, the nuclease is a Type II, Type V-A, Type V-B, Type V-C, Type V-U, or Type VI-B nuclease. In some embodiments, the nuclease is a TAL nuclease, a meganuclease, or a zinc-finger nuclease. In some embodiments, the nuclease is a Cas9, Cas12a, Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease. The Cas9 nuclease may be, for example, a Streptococcus pyogenes or Streptococcus aureus Cas9. In some embodiments, the Cas9 is a modified Cas9 nuclease, such as a modified Streptococcus pyogenes Cas9 or a modified Streptococcus aureus Cas9. In some embodiments, the second vector is a plasmid. In some embodiments, the second vector is an expression vector. In some embodiments, the second vector is a viral vector. In embodiments wherein the second vector is a viral vector, it may be a lentiviral vector, a retroviral vector, an adenoviral vector, or an adeno-associated virus (AAV) vector. In some embodiments, the viral vector is an adeno-associated virus (AAV) vector. The serotype of the AAV vector may be selected from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, and ovine AAV. In some embodiments, the second vector is a non-viral vector, wherein the non-viral vector comprises calcium phosphate, liposomes, nanoparticles, and/or lipid emulsions. In some embodiments, the administering induces a frameshift mutation in a target nucleic acid sequence in a cell of the patient. In some embodiments, the frameshift mutation comprises a deletion of at least one nucleotide, wherein the number of nucleotides deleted is not a multiple of 3 (e.g., a deletion of 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19 or 20 nucleotides.) In some embodiments, the frameshift mutation comprises an insertion of at least one nucleotide, wherein the number of nucleotides inserted is not a multiple of 3 (e.g., an insertion of 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19 or 20 nucleotides.) In some embodiments, the frameshift mutation comprises an insertion of 1 nucleotide. The first vector and the second vector may be administered simultaneously, or may be administered sequentially. In some embodiments, the first vector and the second vector may be administered locally (e.g., to a muscle tissue), or may be administered systemically. In some embodiments, the first vector and the second vector are administered by an oral, rectal, transmucosal, topical, transdermal, inhalation, intravenous, subcutaneous, intradermal, intramuscular, intra-articular, intrathecal, intraventricular, intravenous, intraperitoneal, intranasal, or intraocular route of administration. The subject suffering from DMD may be greater than or equal to 18 years old, less than 18 years old, or less than 2 years old. In some embodiments, the subject is a human. In some embodiments, the ratio of the first vector to the second vector is 1:1 to 1:100. In some embodiments, the ratio of the second vector to the first vector is 1:1 to 1:100.
Also provided is a combination therapy comprising a first composition comprising a first vector comprising a nucleic acid of the disclosure, and a second composition comprising a second vector comprising a nucleic acid that encodes a nuclease. The first and/or the second composition may comprise a pharmaceutically acceptable carrier. In some embodiments, the nuclease is a Type II, Type V-A, Type V-B, Type V-C, Type V-U, or Type VI-B nuclease. In some embodiments, the nuclease is a TAL nuclease, a meganuclease, or a zinc-finger nuclease. In some embodiments, the nuclease is a Cas9, Cas12a, Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease. In embodiments wherein the nuclease is a Cas9 nuclease it may be, for example, a Streptococcus pyogenes or Streptococcus aureus Cas9. In some embodiments, the nuclease is a modified Cas9 nuclease, such as a modified Streptococcus pyogenes Cas9 or a modified Streptococcus aureus Cas9.
Also provided is a composition, a recombinant vector, or a non-viral vector of the disclosure for use as a medicament.
Also provided is a composition, a recombinant vector, or a non-viral vector of the disclosure for use in the treatment of Duchenne muscular dystrophy.
Also provided are three different mouse models, wherein the genome of each mouse model comprises a deletion of exon 43, exon 45, or exon 52 of the dystrophin gene, resulting in an out of frame shift and a premature stop codon in exon 44, exon 46, and exon 53, respectively. These mutations are similar to mutations found in approximately 18% of human DMD patients, and correction of these deletions through exon skipping or reframing of surrounding exons can be used to treat DMD. The genome of these mice may further comprise a reporter gene located downstream of and in frame with exon 79 of the dystrophin gene, and upstream of a dystrophin 3′-UTR, wherein said reporter gene is expressed when exon 79 is translated in frame with exon 45, exon 47, or exon 54. The reporter gene may be luciferase. The genome of the mouse may further comprise a protease coding sequence upstream of and in frame with said reporter gene, and downstream of and in frame with exon 79. The protease may be autocatalytic, such as 2A protease. The mouse may be heterozygous for said deletion, or homozygous for said deletion. The mouse may exhibit increased creatine kinase levels, and/or may not exhibit detectable dystrophin protein in heart or skeletal muscle.
Also provided is a method of producing the mouse described above comprising (a) contacting a fertilized oocyte with CRISPR/Cas9 elements and two single guide RNA (sgRNA) targeting sequences flanking exon 43, exon 45, or exon 52, thereby creating a modified oocyte, wherein deletion of exon 43, exon 45, or exon 52 by CRISPR/Cas9 results in an out of frame shift and a premature stop codon in exon 44, exon 46, or exon 53; (b) transferring said modified oocyte into a recipient female. The oocyte genome may comprise a dystrophin gene having a reporter gene located downstream of and in frame with exon 79 of said dystrophin gene, and upstream of a dystrophin 3′-UTR, wherein said reporter gene is expressed when exon 79 is translated in frame with exon 45, exon 47, or exon 54. The reporter gene may be luciferase. The oocyte genome may further comprise a protease coding sequence upstream of and in frame with said reporter gene, and downstream of and in frame with exon 79. The protease may be autocatalytic, such as 2A protease. The mouse may be heterozygous for said deletion, or homozygous for said deletion. The mouse may exhibit increased creatine kinase levels and/or may not exhibit detectable dystrophin protein in heart or skeletal muscle.
In another embodiment, there is provided an isolated cell obtained from the mouse described above. The genome of the cell may further comprise a reporter gene located downstream of and in frame with exon 79 of the dystrophin gene, and upstream of a dystrophin 3′-UTR, wherein said reporter gene is expressed when exon 79 is translated in frame with exon 45, exon 47, or exon 54. The reporter gene may be luciferase. The genome of the cell may further comprise a protease coding sequence upstream of and in frame with said reporter gene, and downstream of and in frame with exon 79. The protease may be autocatalytic, such as 2A protease. The cell may be heterozygous for said deletion, or homozygous for said deletion.
In a further embodiment, there is provided a mouse produced by a method comprising the steps of (a) contacting a fertilized oocyte with CRISPR/Cas9 elements and two single guide RNA (sgRNA) targeting sequences flanking exon 43, exon 45, or exon 52, thereby creating a modified oocyte, wherein deletion of exon 43, exon 45, or exon 52 by CRISPR/Ca9 results in an out of frame shift and a premature stop codon in exon 44, exon 46, or exon 53; (b) transferring said modified oocyte into a recipient female.
These mice provide an important system for assessing the efficacy of a variety of therapeutic analogues for correction of DMD mutations. In one embodiment, CRISPR/Cas9 can be used to skip or reframe exon 44, exon 46 or exon 53, putting the dystrophin protein back in frame. The mice allow for rapid optimization of the method. In another embodiment, the mice can be used to test exon-skipping oligonucleotides or small molecules or other therapeutic modalities in a “humanized” system. In still a further embodiment, there is provided a method of screening a candidate substance for DMD exon-skipping activity comprising (a) treating a mouse (e.g., a mouse from one of the mouse models described herein) with a candidate substance; and (b) assessing in frame transcription and/or translation of exon 79, wherein the presence of in frame transcription and/or translation of exon 79 indicates said candidate substance exhibits exon-skipping activity.
A further embodiment comprises an isolated nucleic acid comprising a sequence of any one of SEQ ID NO: 1-72, 340-359, or 360-515. Also provided is a double-stranded nucleic acid formed by hybridization of SEQ ID NO: 1 and 2, SEQ ID NO: 3 and 4, SEQ ID NO: 5 and 6, SEQ ID NO: 7 and 8, SEQ ID NO: 9 and 10, SEQ ID NO: 11 and 12, SEQ ID NO: 13 and 14, SEQ ID NO: 15 and 16, SEQ ID NO: 17 and 18, SEQ ID NO: 19 and 20, SEQ ID NO: 21 and 22, SEQ ID NO: 23 and 24, SEQ ID NO: 25 and 26, SEQ ID NO: 27 and 28, SEQ ID NO: 29 and 30, SEQ ID NO: 31 and 32, SEQ ID NO: 33 and 34, SEQ ID NO: 35 and 36, SEQ ID NO: 37 and 38, SEQ ID NO: 39 and 40, SEQ ID NO: 41 and 42, SEQ ID NO: 43 and 44, SEQ ID NO: 45 and 46, SEQ ID NO: 47 and 48, SEQ ID NO: 49 and 50, SEQ ID NO: 51 and 52, SEQ ID NO: 53 and 54, SEQ ID NO: 55 and 56, SEQ ID NO: 57 and 58, SEQ ID NO: 59 and 60, SEQ ID NO: 61 and 62, SEQ ID NO: 63 and 64, SEQ ID NO: 65 and 66, SEQ ID NO: 67 and 68, SEQ ID NO: 69 and 70, and SEQ ID NO: 71 and 72, and an expression construct comprising a nucleic acid formed by hybridization of SEQ ID NO: 1 and 2, SEQ ID NO: 3 and 4, SEQ ID NO: 5 and 6, SEQ ID NO: 7 and 8, SEQ ID NO: 9 and 10, SEQ ID NO: 11 and 12, SEQ ID NO: 13 and 14, SEQ ID NO: 15 and 16, SEQ ID NO: 17 and 18, SEQ ID NO: 19 and 20, SEQ ID NO: 21 and 22, SEQ ID NO: 23 and 24, SEQ ID NO: 25 and 26, SEQ ID NO: 27 and 28, SEQ ID NO: 29 and 30, SEQ ID NO: 31 and 32, SEQ ID NO: 33 and 34, SEQ ID NO: 35 and 36, SEQ ID NO: 37 and 38, SEQ ID NO: 39 and 40, SEQ ID NO: 41 and 42, SEQ ID NO: 43 and 44, SEQ ID NO: 45 and 46, SEQ ID NO: 47 and 48, SEQ ID NO: 49 and 50, SEQ ID NO: 51 and 52, SEQ ID NO: 53 and 54, SEQ ID NO: 55 and 56, SEQ ID NO: 57 and 58, SEQ ID NO: 59 and 60, SEQ ID NO: 61 and 62, SEQ ID NO: 63 and 64, SEQ ID NO: 65 and 66, SEQ ID NO: 67 and 68, SEQ ID NO: 69 and 70, and SEQ ID NO: 71 and 72, such as a viral or non-viral vector. Additionally, a kit comprising one or more isolated nucleic acids comprising the sequence of any one of SEQ ID NO: 1-72, 340-359, or 360-515 is provided.
Still a further embodiment comprises a method of correcting a dystrophin gene defect in exon 44, exon 46, or exon 53 of the DMD gene in a subject comprising contacting a cell in said subject with Cpf1 or Cas9 and a DMD guide RNA as defined above, resulting in selective skipping of a mutant DMD exon. The cell may be a muscle cell, a satellite cell, or an induced pluripotent stem cell (iPSC) or iPSC-derived cardiomyocyte (iPSC-CM). Cpf1 and/or DMD guide RNA may be provided to said cell through expression from one or more expression vectors coding therefore, such as a viral vector (e.g., adeno-associated viral vector) or as a non-viral vector. Cpf1 or Cas9 may be provided to said cell as naked plasmid DNA or chemically-modified mRNA.
The method of may further comprise contacting said cell with a single-stranded DMD oligonucleotide to effect homology directed repair or nonhomologous end joining (NHEJ). Cpf1 or Cas9, DMD guide RNA and/or single-stranded DMD oligonucleotide, or expression vectors coding therefor, may be provided to said cell in one or more nanoparticles. Cpf1 or Cas9, DMD guide RNA and/or single-stranded DMD oligonucleotide may be delivered directly to a muscle tissue, such as tibialis anterior, quadricep, soleus, diaphragm or heart. Cpf1 or Cas9, DMD guide RNA and/or single-stranded DMD oligonucleotide may be delivered systemically.
The subject may exhibit normal dystrophin-positive myofibers and/or mosaic dystrophin-positive myofibers containing centralized nuclei. The subject may exhibit a decreased serum CK level as compared to a serum CK level prior to contacting. The subject may exhibit improved grip strength as compared to a serum CK level prior to contacting. The correction may be permanent skipping of said mutant DMD exon, or more than one mutant DMD exon. The Cpf1 or Cas9 and/or DMD guide RNA may be delivered to a human iPSC with an adeno-associated viral vector.
It is contemplated that any method or composition described herein can be implemented with respect to any other method or composition described herein.
Other objects, features and advantages of the present disclosure will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating specific embodiments of the disclosure, are given by way of illustration only, since various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure. The disclosure may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
DMD is a new mutation syndrome, and more than 4,000 independent causative mutations that have been identified in humans (world-wide web at dmd.nl). The majority of patient mutations carry deletions that cluster in a hotspot, and thus a therapeutic approach for skipping certain exon applies to large group of patients. The rationale of the exon skipping approach is based on the genetic difference between DMD and Becker muscular dystrophy (BMD) patients. In DMD patients, the reading frame of dystrophin mRNA is disrupted resulting in prematurely truncated, non-functional dystrophin proteins. BMD patients have mutations in the DMD gene that maintain the reading frame allowing the production of internally deleted, but partially functional dystrophins leading to much milder disease symptoms compared to DMD patients.
One the most common mutational hot spots in DMD is the genetic region between exons 44 and 51. Therapeutic approaches involving skipping or reframing of exon 44, exon 46, and exon 53 would treat approximately 18% of the DMD population. Here, the efficiency of CRISPR/Cas9 mediated correction of DMD mutations in patient-derived iPSCs is shown. To further assess the efficiency and optimize CRISPR/Cas9-mediated exon skipping in vivo, a mimic of the human “hot spot” region was generated in mouse models by deleting the exon 43, exon 45, or exon 52 using CRISPR/Cas9 system directed by two single guide RNAs (sgRNAs). The Δ43, Δ45, and Δ52 mouse models exhibit dystrophic myofibers, increased serum creatine kinase level, and reduced muscle function, thus providing a new set of representative models of DMD. These and other aspects of the disclosure are reproduced below.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. The terminology used in the detailed description herein is for the purpose of describing particular embodiments only and is not intended to be limiting.
All publications, patent applications, patents, GenBank or other accession numbers and other references mentioned herein are each incorporated by reference herein in their entirety.
The singular forms “a,” “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise.
Furthermore, the terms “about” and “approximately” as used herein when referring to a measurable value such as an amount of the length of a polynucleotide or polypeptide sequence, dose, time, temperature, and the like, is meant to encompass variations of ±20%, ±10%, ±5%, ±1%, ±0.5%, or even ±0.1% of the specified amount.
Also as used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations when interpreted in the alternative (“or”).
Reference to a vector or other DNA sequences as “recombinant” merely acknowledges the operable linkage of DNA sequences which are not typically operably linked as isolated from or found in nature.
The terms “gRNA” and “sgRNA” are used interchangeably herein, and refer to a short synthetic RNA composed of a “spacer” (or “targeting”) sequence and a “scaffold” sequence. In some embodiments, the gRNA may further comprise a polyA tail.
In accordance with some embodiments of the disclosure, a “frameshift mutation” (or “frame-shift mutation” or “frameshift”) is caused by a deletion or insertion in a DNA sequence that shifts the reading frame of the DNA sequence.
In accordance with some embodiments of the disclosure, “exon skipping” (or “exon-skipping”) refers to a strategy which causes sections (e.g. mutated sections) of a gene to be “skipped” during RNA splicing, allowing the expression of a partially or fully functional protein.
Unless the context indicates otherwise, it is specifically intended that the various features described herein can be used in any combination.
Provided herein are gene editing systems which produce an insertion, deletion, or replacement of DNA at a specific site in the genome of an organism or cell. In some embodiments, the genome editing systems introduce a loss of function mutation or a gain of function mutation. In some embodiments, the genome editing systems of the disclosure are capable of modulating splicing or causing a frameshift in a target DNA sequence. In some embodiments, the genome editing systems correct DNA mutations in vitro and/or in vivo.
The genome editing systems of the disclosure may comprise at least one nuclease (or catalytic domain thereof) and at least one gRNA, or nucleic acids encoding the at least one nuclease (or catalytic domain thereof) and the at least one gRNA. A sequence encoding the at least one nuclease and a sequence encoding the at least one gRNA may be delivered using the same vector (e.g., an AAV vector), or using different vectors (e.g., a first AAV vector for delivering the sequence encoding the nuclease, and a second AAV vector for delivering the sequence encoding the at least one gRNA).
In some embodiments, the nuclease is a Type II, Type V-A, Type V-B, Type V-C, Type V-U, Type VI-B nuclease. In some embodiments, the nuclease is a transcription activator-like effector nuclease (TALEN), a meganuclease, or a zinc-finger nuclease. In some embodiments, the nuclease is a Cas9, Cas12a (Cpf1), Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease. For example, in some embodiments, the nuclease is a Cas9 nuclease or a Cpf1 nuclease.
In some embodiments, the nuclease is a modified form or variant of a Cas9, Cas12a (Cpf1), Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease. In some embodiments, the nuclease is a modified form or variant of a TAL nuclease, a meganuclease, or a zinc-finger nuclease. A “modified” or “variant” nuclease is one that is, for example, truncated, fused to another protein (such as another nuclease), catalytically inactivated, etc. In some embodiments, the nuclease may have at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100% sequence identity to a naturally occurring Cas9, Cas12a (Cpf1), Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, Cas14 nuclease, or a TALEN, meganuclease, or zinc-finger nuclease.
In embodiments, the nuclease is a Cas9 nuclease derived from S. pyogenes (SpCas9). An exemplary SpCas9 sequence is provided in SEQ ID NO: 166. In some embodiments, the nuclease has at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 166, shown below:
In embodiments, the nuclease is a Cas9 derived from S. aureus (SaCas9). An exemplary SaCas9 sequence is provided in SEQ ID NO: 167. In some embodiments, the nuclease has at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 167, shown below:
In embodiments, the nuclease is a Cpf1 enzyme from Acidaminococcus (species BV3L6, UniProt Accession No. U2UMQ6). For example, the Cpf1 enzyme may have the sequence set forth below (SEQ ID NO: 168), or a sequence with at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity thereto:
In some embodiments, the nuclease is a Cpf1 enzyme from Lachnospiraceae (species ND2006, UniProt Accession No. A0A182DWE3). An exemplary Lachnospiraceae Cpf1 sequence is provided in SEQ ID NO: 169. In some embodiments, the nuclease has at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 169, which is provided below:
In some embodiments, a sequence encoding the nuclease is codon optimized for expression in mammalian cells. In some embodiments, the sequence encoding the nuclease is codon optimized for expression in human cells or mouse cells.
In some embodiments, the disclosure provides a nucleic acid comprising a sequence encoding a single guide RNA (sgRNA) comprising a spacer sequence and a scaffold sequence.
A. Spacer
A spacer sequence is a short nucleic acid sequence used to target a nuclease (e.g., a Cas9 nuclease) to a specific nucleotide region of interest (e.g., a genomic DNA sequence to be cleaved).
In some embodiments, the spacer may be about 17-24 base pairs in length, such as about 20 base pairs in length. In some embodiments, the spacer may be about 15, about 16, about 17, about 18, about 19, about 20, about 21, about 22, about 23, about 24, about 25, about 26, about 27, about 28, about 29, or about 30 base pairs in length. In some embodiments, the spacer may be at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, or at least 30 base pairs in length. In some embodiments, the spacer may be 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 base pairs in length. In some embodiments, the spacer sequence has between about 40% to about 80% GC content.
In some embodiments, the spacer targets a site that immediately precedes a 5′ protospacer adjacent motif (PAM). The PAM sequence may be selected based on the desired nuclease. For example, the PAM sequence may be any one of the PAM sequences shown in Table 1 below, wherein N refers to any nucleic acid, R refers to A or G, Y refers to C or T, W refers to A or T, and V refers to A or C or G.
Streptococcus pyogenes
Staphylococcus aureus
Neisseria meningitidis
Campylobacter jejuni
Streptococcus thermophilus
Lachnospiraceae bacterium
Acidaminococcus sp.
In some embodiments, a spacer may target a sequence of a mammalian gene, such as a human gene. In some embodiments, the spacer may target a mutant gene. In some embodiments, the spacer may target a coding sequence. In some embodiments, the spacer targets the dystrophin (DMD) gene. An exemplary wild-type dystrophin sequence includes the human DNA sequence (see GenBank Accession NO. NC_000023.11), located on the human X chromosome, which codes for the protein dystrophin (GenBank Accession No. AAA53189), the sequence of which is reproduced below:
In some embodiments, the spacer sequence targets a sequence of the DMD gene. In some embodiments, the spacer targets an exon of the DMD gene. In some embodiments, the spacer targets exon 43, exon 44, exon 46, exon 50 or exon 53 of the DMD gene.
In some embodiments, the spacer may have a sequence of any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617 (shown in Table 2 below). In some embodiments, a spacer may have a sequence at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617. In some embodiments, a spacer may have a sequence of any one of the spacers shown in Table 2, or a sequence that is at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto.
In some embodiments, the spacer may have a sequence of any one of SEQ ID NOs: 261-329 (shown in Table 3 below). In some embodiments, a spacer may have a sequence at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of any one of SEQ ID NOs: 261-329. In some embodiments, a spacer may have a sequence of any one of the spacers shown in Table 3, or a sequence that is at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto.
B. Scaffold
The scaffold sequence is the sequence within the gRNA that is responsible for nuclease (e.g., Cas9) binding. The scaffold sequence does not include the spacer/targeting sequence.
In some embodiments, the scaffold may be about 60 to about 70, about 70 to about 80, about 80 to about 90, about 90 to about 100, about 100 to about 110, about 110 to about 120, or about 120 to about 130 nucleotides in length. In some embodiments, the scaffold may be about 60, about 61, about 62, about 63, about 64, about 65, about 66, about 67, about 68, about 69, about 70, about 71, about 72, about 73, about 74, about 75, about 76, about 77, about 78, about 79, about 80, about 81, about 82, about 83, about 84, about 85, about 86, about 87, about 88, about 89, about 90, about 91, about 92, about 93, about 94, about 95, about 96, about 97, about 98, about 99, about 100, about 101, about 102, about 103, about 104, about 105, about 106, about 107, about 108, about 109, about 110, about 111, about 112, about 113, about 114, about 115, about 116, about 117, about 118, about 119, about 120, about 121, about 122, about 123, about 124, or about 125 nucleotides in length. In some embodiments, the scaffold may be at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, or at least 125 nucleotides in length. In some embodiments, the scaffold may be 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, or 125 nucleotides in length.
In some embodiments, the scaffold may comprise a sequence of any one of SEQ ID NOs: 147-153 (shown in Table 4 below), or a sequence that is at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto.
In some embodiments, a gRNA (spacer+scaffold) comprises a scaffold and a spacer as shown in Table 5 below, wherein “X” indicates that the particular combination is contemplated by the instant disclosure.
In some embodiments, the sgRNA has a sequence (spacer+scaffold) of any one of SEQ ID NO: 154 to 165 (shown in Table 6, below), or a sequence that is at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto.
In some embodiments, a nucleic acid comprises one copy of the sequence encoding the sgRNA. In some embodiments, a nucleic acid comprises two, three, four, or five copies of the sequence encoding the sgRNA.
In some embodiments, a nucleic acid comprises a sequence encoding a promoter, wherein the promoter drives expression of the sgRNA. In some embodiments, the nucleic acid comprises two copies of the sequence encoding a sgRNA, wherein expression of the first copy of the sgRNA is driven by a first promoter, and expression of the second copy of the sgRNA is driven by a second promoter.
In some embodiments, the nucleic acid comprises three copies of the sequence encoding a sgRNA, wherein expression of the first copy of the sgRNA is driven by a first promoter, expression of the second copy of the sgRNA is driven by a second promoter, and expression of the third copy of the sgRNA is driven by a third promoter.
In some embodiments, the nucleic acid comprises four copies of the sequence encoding a sgRNA, wherein expression of the first copy of the sgRNA is driven by a first promoter, expression of the second copy of the sgRNA is driven by a second promoter, expression of the third copy of the sgRNA is driven by a third promoter, and expression of the fourth copy of the sgRNA is driven by a fourth promoter.
In some embodiments, the nucleic acid comprises five copies of the sequence encoding a sgRNA, wherein expression of the first copy of the sgRNA is driven by a first promoter, expression of the second copy of the sgRNA is driven by a second promoter, expression of the third copy of the sgRNA is driven by a third promoter, expression of the fourth copy of the sgRNA is driven by a fourth promoter, and expression of the fifth copy of the sgRNA is driven by a fifth promoter.
In some embodiments, a nucleic acid sequence comprising a sequence encoding a sgRNA further comprises a sequence encoding a nuclease. The nuclease may be, for example, a Type II, Type V-A, Type V-B, Type V-C, Type V-U, or Type VI-B nuclease. Exemplary nucleases include, but are not limited to a TALEN, a meganuclease, a zinc-finger nuclease, or a Cas9, Cas12a, Cas12b, Cas12c, Tnp-B like, Cas13a (C2c2), Cas13b, or Cas14 nuclease.
In some embodiments, the nuclease is a Cas9 nuclease. In some embodiments, the Cas9 nuclease is a Streptococcus pyogenes or Streptococcus aureus Cas9. In some embodiments, the nuclease is a modified Cas9 nuclease. In some embodiments, the nuclease is a modified Streptococcus pyogenes Cas9 or a modified Streptococcus aureus Cas9.
CRISPRs (clustered regularly interspaced short palindromic repeats) are DNA loci containing short repetitions of base sequences. Each repetition is followed by short segments of “spacer DNA” from previous exposures to a virus. CRISPRs are found in approximately 40% of sequenced eubacteria genomes and 90% of sequenced archaea. CRISPRs are often associated with Cas genes that code for proteins related to CRISPRs. The CRISPR/Cas system is a prokaryotic immune system that confers resistance to foreign genetic elements such as plasmids and phages and provides a form of acquired immunity. CRISPR spacers recognize and silence these exogenous genetic elements like RNAi in eukaryotic organisms.
CRISPR repeats can range in size from 24 to 48 base pairs. They usually show some dyad symmetry, implying the formation of a secondary structure such as a hairpin, but are not truly palindromic. Repeats are separated by spacers of similar length.
CRISPR-associated (cas) genes are often associated with CRISPR repeat-spacer arrays. More than forty different Cas protein families have been described. Of these protein families, Cast appears to be ubiquitous among different CRISPR/Cas systems. Particular combinations of cas genes and repeat structures have been used to define 8 CRISPR subtypes (E. coli, Ypest, Nmeni, Dvulg, Tneap, Hmari, Apern, and Mtube), some of which are associated with an additional gene module encoding repeat-associated mysterious proteins (RAMPs). More than one CRISPR subtype may occur in a single genome. The sporadic distribution of the CRISPR/Cas subtypes suggests that the system is subject to horizontal gene transfer during microbial evolution.
Exogenous DNA is processed by proteins encoded by Cas genes into small elements (˜30 base pairs in length), which are then inserted into the CRISPR locus near the leader sequence. RNAs from the CRISPR loci are constitutively expressed and are processed by Cas proteins to small RNAs composed of individual, exogenously-derived sequence elements with a flanking repeat sequence. The RNAs guide other Cas proteins to silence exogenous genetic elements at the RNA or DNA level. Evidence suggests functional diversity among CRISPR subtypes. The Cse (Cas subtype E. coli) proteins (called CasA-E in E. coli) form a functional complex, Cascade, that processes CRISPR RNA transcripts into spacer-repeat units that Cascade retains. In other prokaryotes, Cas6 processes the CRISPR transcripts. Interestingly, CRISPR-based phage inactivation in E. coli requires Cascade and Cas3, but not Cas1 and Cas2. The Cmr (Cas RAMP module) proteins found in Pyrococcus furiosus and other prokaryotes form a functional complex with small CRISPR RNAs that recognizes and cleaves complementary target RNAs. RNA-guided CRISPR enzymes are classified as type V restriction enzymes.
Cas9 is a nuclease, an enzyme specialized for cutting DNA, with two active cutting sites, one for each strand of the double helix. One or both sites may be inactivated while preserving Cas9's ability to locate its target DNA. tracrRNA (i.e., a scaffold sequence) and spacer RNA may be combined into a “single-guide RNA” molecule that, mixed with Cas9, could find and cut the correct DNA targets. Such synthetic guide RNAs can be used for gene editing.
Cas9 proteins are highly enriched in pathogenic and commensal bacteria. CRISPR/Cas-mediated gene regulation may contribute to the regulation of endogenous bacterial genes, particularly during bacterial interaction with eukaryotic hosts. For example, Cas protein Cas9 of Francisella novicida uses a unique, small, CRISPR/Cas-associated RNA (scaRNA) to repress an endogenous transcript encoding a bacterial lipoprotein that is critical for F. novicida to dampen host response and promote virulence. Delivery of Cas9 DNA sequences also is contemplated.
Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 or CRISPR/Cpf1 is a DNA-editing technology analogous to the CRISPR/Cas9 system. Cpf1 is an RNA-guided endonuclease of a class II CRISPR/Cas system. This acquired immune mechanism is found in Prevotella and Francisella bacteria. It prevents genetic damage from viruses. Cpf1 genes are associated with the CRISPR locus, coding for an endonuclease that use a guide RNA to find and cleave viral DNA. Cpf1 is a smaller and simpler endonuclease than Cas9, overcoming some of the CRISPR/Cas9 system limitations. CRISPR/Cpf1 has multiple applications, including treatment of genetic illnesses and degenerative conditions.
Cpf1 appears in many bacterial species. The Two Cpf1 enzymes from Acidaminococcus and Lachnospiraceae display efficient genome-editing activity in human cells.
A smaller version of Cas9 from the bacterium Staphylococcus aureus is a potential alternative to Cpf1.
The systems CRISPR/Cas are separated into three classes. Class 1 uses several Cas proteins together with the CRISPR RNAs (crRNA) to build a functional endonuclease. Class 2 CRISPR systems use a single Cas protein with a crRNA. Cpf1 has been recently identified as a Class II, Type V CRISPR/Cas systems containing a 1,300 amino acid protein.
The Cpf1 locus contains a mixed alpha/beta domain, a RuvC-I followed by a helical region, a RuvC-II and a zinc finger-like domain. The Cpf1 protein has a RuvC-like endonuclease domain that is similar to the RuvC domain of Cas9. Furthermore, Cpf1 does not have a HNH endonuclease domain, and the N-terminal of Cpf1 does not have the alfa-helical recognition lobe of Cas9.
Cpf1 CRISPR-Cas domain architecture shows that Cpf1 is functionally unique, being classified as Class 2, type V CRISPR system. The Cpf1 loci encode Cas1, Cas2 and Cas4 proteins more similar to types I and III than from type II systems. Database searches suggest the abundance of Cpf1-family proteins in many bacterial species.
Functional Cpf1 doesn't need the tracrRNA, therefore, only crRNA is required. This benefits genome editing because Cpf1 is not only smaller than Cas9, but also it has a smaller sgRNA molecule (proximately half as many nucleotides as Cas9).
The Cpf1-crRNA complex cleaves target DNA or RNA by identification of a protospacer adjacent motif 5′-YTN-3′ (where “Y” is a pyrimidine and “N” is any nucleobase) or 5′-TTN-3′, in contrast to the G-rich PAM targeted by Cas9. After identification of PAM, Cpf1 introduces a sticky-end-like DNA double-stranded break of 4 or 5 nucleotides overhang.
The CRISPR/Cpf1 system consist of a Cpf1 enzyme and a guide RNA that finds and positions the complex at the correct spot on the double helix to cleave target DNA. CRISPR/Cpf1 systems activity has three stages:
As discussed above, in certain embodiments, expression cassettes are employed to express a transcription factor product, either for subsequent purification and delivery to a cell/subject, or for use directly in a genetic-based delivery approach. Expression requires that appropriate signals be provided in the vectors, and include various regulatory elements such as enhancers/promoters from both viral and mammalian sources that drive expression of the genes of interest in cells. Elements designed to optimize messenger RNA stability and translatability in host cells also are defined. The conditions for the use of a number of dominant drug selection markers for establishing permanent, stable cell clones expressing the products are also provided, as is an element that links expression of the drug selection markers to expression of the polypeptide.
A. Regulatory Elements
Throughout this application, the term “expression cassette” is meant to include any type of genetic construct containing a nucleic acid coding for a gene product in which part or all of the nucleic acid encoding sequence is capable of being transcribed and translated, i.e., is under the control of a promoter. A “promoter” refers to a DNA sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, responsible for initiating the specific transcription of a gene. The phrase “under transcriptional control” means that the promoter is in the correct location and orientation in relation to the nucleic acid to control RNA polymerase initiation and expression of the gene. An “expression vector” is meant to include expression cassettes comprised in a genetic construct that is capable of replication, and thus including one or more of origins of replication, transcription termination signals, poly-A regions, selectable markers, and multipurpose cloning sites.
The term promoter will be used here to refer to a group of transcriptional control modules that are clustered around the initiation site for RNA polymerase II. Much of the thinking about how promoters are organized derives from analyses of several viral promoters, including those for the HSV thymidine kinase (tk) and SV40 early transcription units. These studies, augmented by more recent work, have shown that promoters are composed of discrete functional modules, each consisting of approximately 7-20 bp of DNA, and containing one or more recognition sites for transcriptional activator or repressor proteins.
At least one module in each promoter functions to position the start site for RNA synthesis. The best-known example of this is the TATA box, but in some promoters lacking a TATA box, such as the promoter for the mammalian terminal deoxynucleotidyl transferase gene and the promoter for the SV40 late genes, a discrete element overlying the start site itself helps to fix the place of initiation.
In some embodiments, the sgRNA, Cas9 or Cpf1 constructs of the disclosure are expressed by a muscle-cell specific promoter. This muscle-cell specific promoter may be constitutively active or may be an inducible promoter.
Additional promoter elements regulate the frequency of transcriptional initiation. Typically, these are located in the region 30-110 bp upstream of the start site, although a number of promoters have recently been shown to contain functional elements downstream of the start site as well. The spacing between promoter elements frequently is flexible, so that promoter function is preserved when elements are inverted or moved relative to one another. In the tk promoter, the spacing between promoter elements can be increased to 50 bp apart before activity begins to decline. Depending on the promoter, it appears that individual elements can function either co-operatively or independently to activate transcription.
In certain embodiments, viral promoters such as the human cytomegalovirus (CMV) immediate early gene promoter, the SV40 early promoter, the Rous sarcoma virus long terminal repeat, rat insulin promoter and glyceraldehyde-3-phosphate dehydrogenase can be used to obtain high-level expression of the coding sequence of interest. The use of other viral or mammalian cellular or bacterial phage promoters which are well-known in the art to achieve expression of a coding sequence of interest is contemplated as well, provided that the levels of expression are sufficient for a given purpose. By employing a promoter with well-known properties, the level and pattern of expression of the protein of interest following transfection or transformation can be optimized. Further, selection of a promoter that is regulated in response to specific physiologic signals can permit inducible expression of the gene product.
Enhancers are genetic elements that increase transcription from a promoter located at a distant position on the same molecule of DNA. Enhancers are organized much like promoters. That is, they are composed of many individual elements, each of which binds to one or more transcriptional proteins. The basic distinction between enhancers and promoters is operational. An enhancer region as a whole is able to stimulate transcription at a distance; this need not be true of a promoter region or its component elements. On the other hand, a promoter has one or more elements that direct initiation of RNA synthesis at a particular site and in a particular orientation, whereas enhancers lack these specificities. Promoters and enhancers are often overlapping and contiguous, often seeming to have a very similar modular organization.
Below is a list of promoters/enhancers and inducible promoters/enhancers that could be used to drive expression of a nucleic acid encoding a gene of interest in an expression construct. Additionally, any promoter/enhancer combination (as per the Eukaryotic Promoter Data Base EPDB) could also be used to drive expression of the gene. Eukaryotic cells can support cytoplasmic transcription from certain bacterial promoters if the appropriate bacterial polymerase is provided, either as part of the delivery complex or as an additional genetic expression construct.
The promoter and/or enhancer may be, for example, immunoglobulin light chain, immunoglobulin heavy chain, T-cell receptor, HLA DQ a and/or DQ β, β-interferon, interleukin-2, interleukin-2 receptor, MHC class II 5, MHC class II HLA-Dra, β-Actin, muscle creatine kinase (MCK), prealbumin (transthyretin), elastase I, metallothionein collagenase, albumin, α-fetoprotein, t-globin, β-fos, c-HA-ras, insulin, neural cell adhesion molecule (NCAM), α1-antitrypain, H2B (TH2B) histone, mouse and/or type I collagen, glucose-regulated proteins (GRP94 and GRP78), rat growth hormone, human serum amyloid A (SAA), troponin I (TN I), platelet-derived growth factor (PDGF), duchenne muscular dystrophy, SV40, polyoma, retroviruses, papilloma virus, hepatitis B virus, human immunodeficiency virus, cytomegalovirus (CMV), and gibbon ape leukemia virus.
In some embodiments, inducible elements may be used. In some embodiments, the inducible element is, for example, MTII, MMTV (mouse mammary tumor virus), β-interferon, adenovirus 5 E2, collagenase, stromelysin, SV40, murine MX gene, GRP78 gene, α-2-macroglobulin, vimentin, MHC class I gene H-2κb, HSP70, proliferin, tumor necrosis factor, and/or thyroid stimulating hormone a gene. In some embodiments, the inducer is phorbol ester (TFA), heavy metals, glucocorticoids, poly(rDx, poly(rc), ElA, phorbol ester (TPA), interferon, Newcastle Disease Virus, A23187, IL-6, serum, interferon, SV40 large T antigen, PMA, and/or thyroid hormone. Any of the inducible elements described herein may be used with any of the inducers described herein.
Of particular interest are muscle specific promoters. These include the myosin light chain-2 promoter, the α-actin promoter, the troponin 1 promoter; the Na+/Ca2+ exchanger promoter, the dystrophin promoter, the α7 integrin promoter, the brain natriuretic peptide promoter and the αB-crystallin/small heat shock protein promoter, α-myosin heavy chain promoter and the ANF promoter.
In some embodiments, the muscle specific promoter is the CK8 promoter. The CK8 promoter has the following sequence (SEQ ID NO: 331):
In some embodiments, the muscle-cell cell specific promoter is a variant of the CK8 promoter, called CK8e. The CK8e promoter has the following sequence (SEQ ID NO. 332):
Where a cDNA insert is employed, one will typically desire to include a polyadenylation signal to effect proper polyadenylation of the gene transcript. Any polyadenylation sequence may be employed such as human growth hormone and SV40 polyadenylation signals. Also contemplated as an element of the expression cassette is a terminator. These elements can serve to enhance message levels and to minimize read through from the cassette into other sequences.
B. Self-Cleaving Peptides
In some embodiments, the nucleic acids and/or expression constructs disclosed herein may encode a self-cleaving peptide.
In some embodiments of self-cleaving peptides of the disclosure, the self-cleaving peptide is a 2A peptide. In some embodiments, a 2A-like self-cleaving domain from the insect virus Thosea asigna (TaV 2A peptide) (SEQ ID NO: 333, EGRGSLLTCGDVEENPGP) is used. These 2A-like domains have been shown to function across eukaryotes and cause cleavage of amino acids to occur co-translationally within the 2A-like peptide domain. Therefore, inclusion of TaV 2A peptide allows the expression of multiple proteins from a single mRNA transcript. Importantly, the domain of TaV when tested in eukaryotic systems has shown greater than 99% cleavage activity. Other acceptable 2A-like peptides include, but are not limited to, equine rhinitis A virus (ERAV) 2A peptide (SEQ ID NO: 334; QCTNYALLKLAGDVESNPGP), porcine teschovirus-1 (PTV1) 2A peptide (SEQ ID NO: 335; ATNFSLLKQAGDVEENPGP) and foot and mouth disease virus (FMDV) 2A peptide (SEQ ID NO: 336; PVKQLLNFDLLKLAGDVESNPGP) or modified versions thereof.
In some embodiments, the 2A peptide is used to express a reporter and a Cas9 or a Cpf1 simultaneously. The reporter may be, for example, GFP.
Other self-cleaving peptides that may be used include, but are not limited to nuclear inclusion protein a (Nia) protease, a P1 protease, a 3C protease, an L protease, a 3C-like protease, or modified versions thereof.
C. Delivery of Expression Vectors
There are a number of other ways in which expression vectors may introduced into cells. In certain embodiments, the expression construct comprises a virus or engineered construct derived from a viral genome. The ability of certain viruses to enter cells via receptor-mediated endocytosis, to integrate into host cell genome and express viral genes stably and efficiently have made them attractive candidates for the transfer of foreign genes into mammalian cells. In some embodiments, the gene editing compositions described herein are administered to a cell or to a subject using a non-viral vector or a viral vector. In some embodiments, the gene editing compositions described herein are administered to a cell or to a subject using a recombinant vector (e.g., a recombinant viral or a recombinant non-viral vector). In some embodiments, a recombinant vector comprises a nucleic acid of the disclosure, i.e., a nucleic acid comprising a sequence encoding a single guide RNA (sgRNA) comprising a spacer sequence and a scaffold sequence, wherein the spacer sequence targets an exon sequence of the DMD gene, such as a sequence of exon 43, 44, 46, 50, or 53. In some embodiments, the recombinant vector is a plasmid. In some embodiments, the recombinant vector is an expression vector.
Exemplary non-viral vectors for use with the compositions and methods described herein comprise nanoparticles (e.g., polymeric nanoparticles), liposomes (e.g., cationic liposomes), naked DNA, cationic lipid-DNA complexes, lipid emulsions, calcium phosphate, polymer complexes, or combinations thereof.
Exemplary viral vectors for use with the compositions and methods described herein include vectors based on adeno-associated virus (AAV), adenovirus, lentivirus, retrovirus, or a hybrid virus. In some embodiments, the viral vectors of the instant disclosure are replication defective, or at least conditionally replication defective.
The AAV genome may be from any naturally derived serotype or isolate or clade of AAV. Thus, the AAV genome may be the full genome of a naturally occurring AAV virus. As is known to the skilled person, AAV viruses occurring in nature may be classified according to various biological systems.
Commonly, AAV viruses are referred to in terms of their serotype. A serotype corresponds to a variant subspecies of AAV which owing to its profile of expression of capsid surface antigens has a distinctive reactivity which can be used to distinguish it from other variant subspecies. Typically, a virus having a particular AAV serotype does not efficiently cross-react with neutralizing antibodies specific for any other AAV serotype. AAV serotypes include AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10 and AAV11, also recombinant serotypes, such as Rec2 and Rec3, recently identified from primate brain. The sequences of AAV genomes or of elements of AAV genomes including ITR sequences, rep or cap genes for use methods and compositions described herein may be derived from the following accession numbers for AAV whole genome sequences: Adeno-associated virus 1 NC_002077, AF063497; Adeno-associated virus 2 NC_001401; Adeno-associated virus 3 NC_001729; Adeno-associated virus 3B NC_001863; Adeno-associated virus 4 NC_001829; Adeno-associated virus 5 Y18065, AF085716; Adeno-associated virus 6 NC_001862; Avian AAV ATCC VR-865 AY186198, AY629583, NC_004828; Avian AAV strain DA-1 NC_006263, AY629583; Bovine AAV NC_005889, AY388617.
AAV viruses may also be referred to in terms of clades or clones. This refers to the phylogenetic relationship of naturally derived AAV viruses, and typically to a phylogenetic group of AAV viruses which can be traced back to a common ancestor, and includes all descendants thereof. Additionally, AAV viruses may be referred to in terms of a specific isolate, i.e., a genetic isolate of a specific AAV virus found in nature. The term genetic isolate describes a population of AAV viruses which has undergone limited genetic mixing with other naturally occurring AAV viruses, thereby defining a recognizably distinct population at a genetic level.
In some embodiments, the gene editing compositions of the instant disclosure are delivered to a cell or to a patient using one or more AAV vectors. An AAV vector typically comprises an AAV expression cassette encapsidated by an AAV capsid protein. The serotype of the AAV vector may be selected from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, and ovine AAV. In some embodiments, the AAV vector may be replication-defective or conditionally replication defective.
In some embodiments, the AAV vector is selected from any of the AAV vectors disclosed in Table 1 of WO 2019/028306, which is incorporated by reference herein in its entirety. In some embodiments, the AAV vector is selected from one of the serotypes listed in Table 7.
The single-stranded DNA genome of wild-type AAV is about 4.7 kilobases (kb). In practice, AAV genomes of up to about 5.0 kb appear to be completely packaged, i.e., be full-length, into AAV virus particles. With two AAV inverted terminal repeats (ITRs) of about 145 bases, the DNA packaging capacity of an AAV vector is such that a maximum of about 4.4 kb of protein.
The wild-type AAV genome comprises two open reading frames, Rep and Cap, flanked by two inverted terminal repeats (ITRs). Typically, when producing a recombinant AAV, the sequence between the two ITRs is replaced with one or more sequence of interest (e.g., a transgene), and the Rep and Cap sequences are provided in trans. The recombinant AAV genome construct, comprising two ITRs flanking a sequence of interest (such as a transgene), is referred to herein as an AAV expression cassette. The disclosure provides AAV expression cassettes for production of AAV viral vectors.
In some embodiments, an AAV expression cassette comprises a nucleic acid of the disclosure, i.e., a nucleic acid comprising a sequence encoding a single guide RNA (sgRNA) comprising a spacer sequence and a scaffold sequence wherein the spacer sequence targets an exon sequence of the DMD gene, such as a sequence of exon 43, 44, 45, 50, or 53 of the DMD gene.
In some embodiments, an AAV expression cassette comprises a first ITR, a transgene sequence, and a second ITR. In some embodiments, an AAV expression cassette comprises a first ITR, an expression control sequence (such as a promoter or enhancer), a transgene sequence, and a second ITR. In some embodiments, an AAV expression cassette comprises a first ITR, an expression control sequence (such as a promoter or enhancer), a transgene sequence, a stuffer sequence, and a second ITR. The transgene may comprise all or part of a nucleic acid of the disclosure. For example, the transgene may comprise a gRNA sequence (i.e., spacer+scaffold sequences), wherein the gRNA targets an exon sequence of the DMD gene, such as a sequence of exon 43, 44, 45, 50, or 53 of the DMD gene.
In some embodiments, an AAV expression cassette comprises a first ITR, a gRNA sequence, and a second ITR. In some embodiments, an AAV expression cassette comprises a first ITR, an expression control sequence (such as a promoter or enhancer), a gRNA sequence, and a second ITR. In some embodiments, an AAV expression cassette comprises a first ITR, an expression control sequence (such as a promoter or enhancer), a gRNA sequence, a stuffer sequence, and a second ITR.
In some embodiments, the transgene comprises more than one guide RNA sequence, such as two, three, four, five, six, seven, or eight gRNA sequences. In some embodiments, the transgene comprises three, four or five gRNA sequences. In some embodiments, each gRNA sequence is operably linked to an expression control sequence (such as a promoter or enhancer).
In some embodiments, an AAV expression cassette comprises a first ITR, a first expression control sequence (such as a promoter or enhancer), a first gRNA sequence, a second expression control sequence (such as a promoter or enhancer), a second gRNA sequence, and a second ITR.
In some embodiments, an AAV expression cassette comprises a first ITR, a first expression control sequence (such as a promoter or enhancer), a first gRNA sequence, a second expression control sequence (such as a promoter or enhancer), a second gRNA sequence, a third expression control sequence (such as a promoter or enhancer), a third gRNA sequence, and a second ITR.
In some embodiments, an AAV expression cassette comprises a first ITR, a first expression control sequence (such as a promoter or enhancer), a first gRNA sequence, a second expression control sequence (such as a promoter or enhancer), a second gRNA sequence, a third expression control sequence (such as a promoter or enhancer), a third gRNA sequence, a fourth expression control sequence (such as a promoter or enhancer), a fourth gRNA sequence, and a second ITR.
In some embodiments, an AAV expression cassette comprises a first ITR, a first expression control sequence (such as a promoter or enhancer), a first gRNA sequence, a second expression control sequence (such as a promoter or enhancer), a second gRNA sequence, a third expression control sequence (such as a promoter or enhancer), a third gRNA sequence, a fourth expression control sequence (such as a promoter or enhancer), a fourth gRNA sequence, a fifth expression control sequence (such as a promoter or enhancer), a fifth gRNA sequence, and a second ITR.
In some embodiments, all of the gRNA sequences are the same. In some embodiments, two or more of the gRNA sequences are different. In some embodiments, all of the gRNA sequences are different. In some embodiments, the AAV expression cassette further comprises a stuffer sequence. In some embodiments, the AAV expression cassette further comprises a polyadenosine (polyA) sequence.
In some embodiments, an AAV expression cassette comprises sequences encoding a first ITR, a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising a second spacer sequence, a third promoter, a third gRNA comprising a third spacer sequence; and a second ITR. At least one of the first, second, and third spacer sequences may target a sequence of the DMD gene (e.g., exon 43, exon 44, exon 46, exon 50 or exon 53 of the DMD gene). In some embodiments, the first, second, and third spacer sequences are each individually selected from any one of the gRNA spacer sequences in Table 2, or a sequence at least 95% identical thereto. In some embodiments, at least two of the first, second, and third spacer sequences are different. In some embodiments, the first, second, and third spacer sequences are the same. In some embodiments, the first, second, and/or third spacer sequences have a sequence that is at least 95% identical or 100% identical to the sequence of any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617. In some embodiments, the first, second, and/or third spacer sequences have a sequence that is at least 95% identical or 100% identical to the sequence of any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617.
In some embodiments, an AAV expression cassette comprises a first gRNA comprising a first spacer sequence, a second gRNA comprising a second spacer sequence, a third gRNA comprising a third spacer sequence, and a fourth gRNA comprising a fourth spacer sequence. In some embodiments, two, three, or four of the gRNAs are the same. In some embodiments, two, three, or four of the gRNAs are different. In some embodiments, an AAV expression cassette comprises a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising a second spacer sequence, a third promoter, a third gRNA comprising a third spacer sequence, a fourth promoter, and a fourth gRNA comprising a fourth spacer sequence. In some embodiments, an AAV expression cassette comprises a first ITR, a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising a second spacer sequence, a third promoter, a third gRNA comprising a third spacer sequence, a fourth promoter, a fourth gRNA comprising a fourth spacer sequence, and a second ITR. In some embodiments, the expression cassette further comprises a stuffer sequence.
In some embodiments, an AAV expression cassette comprises a first gRNA comprising a first spacer sequence, a second gRNA comprising a second spacer sequence, a third gRNA comprising a third spacer sequence, a fourth gRNA comprising a fourth spacer sequence, and a fifth gRNA comprising a fifth spacer sequence. In some embodiments, two, three, four, or five of the gRNAs are the same. In some embodiments, two, three, four or five of the gRNAs are different. In some embodiments, an AAV expression cassette comprises a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising a second spacer sequence, a third promoter, a third gRNA comprising a third spacer sequence, a fourth promoter, a fourth gRNA comprising a fourth spacer sequence, a fifth promoter, and a fifth gRNA comprising a fifth spacer sequence. In some embodiments, an AAV expression cassette comprises a first ITR, a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising a second spacer sequence, a third promoter, a third gRNA comprising a third spacer sequence, a fourth promoter, a fourth gRNA comprising a fourth spacer sequence, a fifth promoter, a fifth gRNA comprising a fifth spacer sequence, and a second ITR. In some embodiments, the expression cassette further comprises a stuffer sequence.
In some embodiments, an AAV expression cassette comprises sequences encoding a first inverted terminal repeat (ITR), a first promoter, a first gRNA comprising a first spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153); and a second ITR.
In some embodiments, an AAV expression cassette comprises sequences encoding a first inverted terminal repeat (ITR), a first promoter, a first gRNA comprising a first spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153), a second promoter, a second gRNA comprising a second spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153); and a second ITR.
In some embodiments, an AAV expression cassette comprises sequences encoding a first inverted terminal repeat (ITR), a first promoter, a first gRNA comprising a first spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153), a second promoter, a second gRNA comprising a second spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153), a third promoter, a third gRNA comprising a third spacer sequence (e.g., a sequence at least 95% or 100% identical to any one of SEQ ID NOs: 135-146, 170-260, 337-339, or 617) and a scaffold sequence (e.g., a scaffold sequence at least 95% or 100% identical to any of SEQ ID NO: 147 to 153); and a second ITR.
In some embodiments, an AAV expression cassette comprises a first inverted terminal repeat (ITR), a first promoter, a nucleic acid comprising a gRNA targeting a sequence of the DMD gene, such as a sequence of Exon 43, 44, 45, 50, or 53 of the DMD gene, and a second ITR. In some embodiments, the AAV expression cassette further comprises a polyadenosine (polyA) sequence.
In some embodiments, one or both of the first ITR and the second ITR are isolated or derived from any one of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, and ovine AAV. In some embodiments, the expression cassette comprises multiple copies of the gRNA, such as 2, 3, 4, or 5 copies of the gRNA.
In some embodiments, an AAV expression cassette comprises a sequence to make the AAV vector less immunogenic (e.g., a “cloaking” sequence). In some embodiments, the sequence is isolated or derived from a telomere sequence. In some embodiments, the nucleotide sequence binds to a toll-like receptor, such as TLR9.
In some embodiments, an AAV expression cassette comprises sequences encoding a first ITR, a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising the first spacer sequence, a third promoter, a third gRNA comprising the first spacer sequence, and a second ITR.
In some embodiments, an AAV expression cassette comprises sequences encoding a first ITR, a first promoter, a first gRNA comprising a first spacer sequence, a second promoter, a second gRNA comprising the first spacer sequence, a third promoter, a third gRNA comprising the first spacer sequence, (optionally) a first stuffer sequence, and a second ITR. The first spacer sequence may target the DMD gene, for example it may target exon 43, exon 44, exon 46, exon 50 or exon 53 of the DMD gene. In some embodiments, the first spacer sequence is selected from any one of the gRNA sequences in Table 2, or a sequence at least 95% identical thereto.
The AAV expression cassettes described herein may be incorporated into an AAV vector. In some embodiments, an AAV vector comprises an AAV expression cassette encapsidated by an AAV capsid protein.
In some embodiments, the AAV vector is based on one or more of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVRh74, AAV2i8, AAVRh10, AAV39, AAV43, AAVRh8, avian AAV, bovine AAV, canine AAV, equine AAV, or ovine AAV. In some embodiments, the AAV vector is based on a modified AAV, comprising one or more non-naturally occurring sequences. In some embodiments, the AAV vector is based on a chimeric AAV. The AAV vector may be replication-defective or conditionally replication defective.
Adenoviruses. “Adenovirus expression vector” is meant to include those constructs containing adenovirus sequences sufficient to (a) support packaging of the construct and (b) to express an antisense polynucleotide that has been cloned therein. In this context, expression does not require that the gene product be synthesized.
The expression vector comprises a genetically engineered form of adenovirus. Knowledge of the genetic organization of adenovirus, a 36 kB, linear, double-stranded DNA virus, allows substitution of large pieces of adenoviral DNA with foreign sequences up to 7 kB. In contrast to retrovirus, the adenoviral infection of host cells does not result in chromosomal integration because adenoviral DNA can replicate in an episomal manner without potential genotoxicity. Also, adenoviruses are structurally stable, and no genome rearrangement has been detected after extensive amplification. Adenovirus can infect virtually all epithelial cells regardless of their cell cycle stage. So far, adenoviral infection appears to be linked only to mild disease such as acute respiratory disease in humans.
Adenovirus is particularly suitable for use as a gene transfer vector because of its mid-sized genome, ease of manipulation, high titer, wide target cell range and high infectivity. Both ends of the viral genome contain 100-200 base pair inverted repeats (ITRs), which are cis elements necessary for viral DNA replication and packaging. The early (E) and late (L) regions of the genome contain different transcription units that are divided by the onset of viral DNA replication. The E1 region (E1A and E1B) encodes proteins responsible for the regulation of transcription of the viral genome and a few cellular genes. The expression of the E2 region (E2A and E2B) results in the synthesis of the proteins for viral DNA replication. These proteins are involved in DNA replication, late gene expression and host cell shut-off. The products of the late genes, including the majority of the viral capsid proteins, are expressed only after significant processing of a single primary transcript issued by the major late promoter (MLP). The MLP, (located at 16.8 m.u.) is particularly efficient during the late phase of infection, and all the mRNAs issued from this promoter possess a 5′-tripartite leader (TPL) sequence which makes them preferred mRNA's for translation.
In one system, recombinant adenovirus is generated from homologous recombination between shuttle vector and provirus vector. Due to the possible recombination between two proviral vectors, wild-type adenovirus may be generated from this process. Therefore, it is critical to isolate a single clone of virus from an individual plaque and examine its genomic structure.
Generation and propagation of the current adenovirus vectors, which are replication deficient, depend on a unique helper cell line, designated 293, which was transformed from human embryonic kidney cells by Ad5 DNA fragments and constitutively expresses E1 proteins. Since the E3 region is dispensable from the adenovirus genome, the current adenovirus vectors, with the help of 293 cells, carry foreign DNA in either the E1, the D3 or both regions. In nature, adenovirus can package approximately 105% of the wild-type genome, providing capacity for about 2 extra kb of DNA. Combined with the approximately 5.5 kb of DNA that is replaceable in the E1 and E3 regions, the maximum capacity of the current adenovirus vector is under 7.5 kb, or about 15% of the total length of the vector. More than 80% of the adenovirus viral genome remains in the vector backbone and is the source of vector-borne cytotoxicity. Also, the replication deficiency of the E1-deleted virus is incomplete.
Helper cell lines may be derived from human cells such as human embryonic kidney cells, muscle cells, hematopoietic cells or other human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may be derived from the cells of other mammalian species that are permissive for human adenovirus. Such cells include, e.g., Vero cells or other monkey embryonic mesenchymal or epithelial cells. As stated above, the preferred helper cell line is 293.
The adenovirus may be of any of the 42 different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material in order to obtain the conditional replication-defective adenovirus vector for use as described herein. This is because Adenovirus type 5 is a human adenovirus about which a great deal of biochemical and genetic information is known, and it has historically been used for most constructions employing adenovirus as a vector.
As stated above, the typical vector according to the present disclosure is replication defective and will not have an adenovirus E1 region. Thus, it will be most convenient to introduce the polynucleotide encoding the gene of interest at the position from which the E1-coding sequences have been removed. However, the position of insertion of the construct within the adenovirus sequences is not critical. The polynucleotide encoding the gene of interest may also be inserted in lieu of the deleted E3 region in E3 replacement vectors, or in the E4 region where a helper cell line or helper virus complements the E4 defect.
Adenovirus is easy to grow and manipulate and exhibits broad host range in vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 109-1012 plaque-forming units per ml, and they are highly infective. The life cycle of adenovirus does not require integration into the host cell genome. The foreign genes delivered by adenovirus vectors are episomal and, therefore, have low genotoxicity to host cells. No side effects have been reported in studies of vaccination with wild-type adenovirus, demonstrating their safety and therapeutic potential as in vivo gene transfer vectors.
Retroviruses. The retroviruses are a group of single-stranded RNA viruses characterized by an ability to convert their RNA to double-stranded DNA in infected cells by a process of reverse-transcription. The resulting DNA then stably integrates into cellular chromosomes as a provirus and directs synthesis of viral proteins. The integration results in the retention of the viral gene sequences in the recipient cell and its descendants. The retroviral genome contains three genes, gag, pol, and env that code for capsid proteins, polymerase enzyme, and envelope components, respectively. A sequence found upstream from the gag gene contains a signal for packaging of the genome into virions. Two long terminal repeat (LTR) sequences are present at the 5′ and 3′ ends of the viral genome.
In order to construct a retroviral vector, a nucleic acid encoding a gene of interest is inserted into the viral genome in the place of certain viral sequences to produce a virus that is replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, and env genes but without the LTR and packaging components is constructed. When a recombinant plasmid containing a cDNA, together with the retroviral LTR and packaging sequences is introduced into this cell line (by calcium phosphate precipitation for example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted into the culture media. The media containing the recombinant retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral vectors are able to infect a broad variety of cell types. However, integration and stable expression require the division of host cells.
A novel approach designed to allow specific targeting of retrovirus vectors was recently developed based on the chemical modification of a retrovirus by the chemical addition of lactose residues to the viral envelope. This modification could permit the specific infection of hepatocytes via sialoglycoprotein receptors.
A different approach to targeting of recombinant retroviruses was designed in which biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were used. The antibodies are coupled via the biotin components by using streptavidin. Using antibodies against major histocompatibility complex class I and class II antigens, a variety of human cells that bear those surface antigens may be infected with an ecotropic virus in vitro.
Other viral vectors. Other viral vectors may be employed as expression constructs. For example, vectors derived from viruses such as vaccinia virus and herpesviruses may be employed. They offer several attractive features for various mammalian cells.
Non-viral methods. Several non-viral methods for the transfer of expression constructs into cultured mammalian cells also are contemplated. These include calcium phosphate precipitation, DEAE-dextran, electroporation, direct microinjection, DNA-loaded liposomes and lipofectamine-DNA complexes, cell sonication, gene bombardment using high velocity microprojectiles, and receptor-mediated transfection. Some of these techniques may be successfully adapted for in vivo or ex vivo use.
Once the expression construct has been delivered into the cell the nucleic acid encoding the gene of interest may be positioned and expressed at different sites. In certain embodiments, the nucleic acid encoding the gene may be stably integrated into the genome of the cell. This integration may be in the cognate location and orientation via homologous recombination (gene replacement) or it may be integrated in a random, non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or “episomes” encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle. How the expression construct is delivered to a cell and where in the cell the nucleic acid remains is dependent on the type of expression construct employed.
In yet another embodiment, the expression construct may simply consist of naked recombinant DNA or plasmids. Transfer of the construct may be performed by any of the methods mentioned above which physically or chemically permeabilize the cell membrane. This is particularly applicable for transfer in vitro but it may be applied to in vivo use as well. Polyomavirus DNA has been successfully injected in the form of calcium phosphate precipitates into liver and spleen of adult and newborn mice demonstrating active viral replication and acute infection. Direct intraperitoneal injection of calcium phosphate-precipitated plasmids, resulting in expression of the transfected genes, may also be used. It is envisioned that DNA encoding a gene of interest may also be transferred in a similar manner in vivo and express the gene product.
In still another embodiment for transferring a naked DNA expression construct into cells may involve particle bombardment. This method depends on the ability to accelerate DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes and enter cells without killing them. Several devices for accelerating small particles have been developed. One such device relies on a high voltage discharge to generate an electrical current, which in turn provides the motive force. The microprojectiles used have consisted of biologically inert substances such as tungsten or gold beads.
Selected organs including the liver, skin, and muscle tissue of rats and mice have been bombarded in vivo. This may require surgical exposure of the tissue or cells, to eliminate any intervening tissue between the gun and the target organ, i.e., ex vivo treatment. Again, DNA encoding a particular gene may be delivered via this method and still be incorporated by the instant disclosure.
In a further embodiment, the expression construct may be entrapped in a liposome. Liposomes are vesicular structures characterized by a phospholipid bilayer membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by aqueous medium. They form spontaneously when phospholipids are suspended in an excess of aqueous solution. The lipid components undergo self-rearrangement before the formation of closed structures and entrap water and dissolved solutes between the lipid bilayers. Also contemplated are lipofectamine-DNA complexes.
Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro has been very successful. A reagent known as Lipofectamine 2000™ is widely used and commercially available.
In certain embodiments, the liposome may be complexed with a hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell membrane and promote cell entry of liposome-encapsulated DNA. In other embodiments, the liposome may be complexed or employed in conjunction with nuclear non-histone chromosomal proteins (HMG-1). In yet further embodiments, the liposome may be complexed or employed in conjunction with both HVJ and HMG-1. In that such expression constructs have been successfully employed in transfer and expression of nucleic acid in vitro and in vivo, then they are applicable. Where a bacterial promoter is employed in the DNA construct, it also will be desirable to include within the liposome an appropriate bacterial polymerase.
Other expression constructs which can be employed to deliver a nucleic acid encoding a particular gene into cells are receptor-mediated delivery vehicles. These take advantage of the selective uptake of macromolecules by receptor-mediated endocytosis in almost all eukaryotic cells. Because of the cell type-specific distribution of various receptors, the delivery can be highly specific.
Receptor-mediated gene targeting vehicles generally consist of two components: a cell receptor-specific ligand and a DNA-binding agent. Several ligands have been used for receptor-mediated gene transfer. The most extensively characterized ligands are asialoorosomucoid (ASOR) and transferrin. A synthetic neoglycoprotein, which recognizes the same receptor as ASOR, may be used as a gene delivery vehicle. In some embodiments, epidermal growth factor (EGF) may be used to deliver genes.
A particular embodiment provides transgenic animals that contain mutations in the dystrophin gene. Also, transgenic animals may express a marker that reflects the production of mutant or normal dystrophin gene product.
In a general aspect, a transgenic animal is produced by the integration of a given construct into the genome in a manner that permits the expression of the transgene using methods discussed above. Methods for producing transgenic animals are generally described by Wagner and Hoppe (U.S. Pat. No. 4,873,191; incorporated herein by reference), and Brinster et al. (1985; incorporated herein by reference).
Typically, the construct is transferred by microinjection into a fertilized egg. The microinjected eggs are implanted into a host female, and the progeny are screened for the expression of the transgene. Transgenic animals may be produced from the fertilized eggs from a number of animals including, but not limited to reptiles, amphibians, birds, mammals, and fish.
RNA for microinjection can be prepared by any means known in the art. For example, RNA for microinjection can be cleaved with enzymes appropriate for removing the bacterial plasmid sequences, and the RNA fragments electrophoresed on 1% agarose gels in TBE buffer, using standard techniques. The RNA bands are visualized by staining with ethidium bromide, and the band containing the expression sequences is excised. The excised band is then placed in dialysis bags containing 0.3 M sodium acetate, pH 7.0. RNA is electroeluted into the dialysis bags, extracted with a 1:1 phenol:chloroform solution and precipitated by two volumes of ethanol. The RNA is redissolved in 1 ml of low salt buffer (0.2 M NaCl, 20 mM Tris, pH 7.4, and 1 mM EDTA) and purified on an Elutip-D® column. The column is first primed with 3 ml of high salt buffer (1 M NaCl, 20 mM Tris, pH 7.4, and 1 mM EDTA) followed by washing with 5 ml of low salt buffer. The DNA solutions are passed through the column three times to bind RNA to the column matrix. After one wash with 3 ml of low salt buffer, the RNA is eluted with 0.4 ml high salt buffer and precipitated by two volumes of ethanol. RNA concentrations are measured by absorption at 260 nm in a UV spectrophotometer. For microinjection, DNA concentrations are adjusted to 3 μg/ml in 5 mM Tris, pH 7.4 and 0.1 mM EDTA.
In an exemplary microinjection procedure, female mice six weeks of age are induced to superovulate with a 5 IU injection (0.1 cc, ip) of pregnant mare serum gonadotropin (PMSG; Sigma) followed 48 hours later by a 5 IU injection (0.1 cc, ip) of human chorionic gonadotropin (hCG; Sigma). Females are placed with males immediately after hCG injection. Twenty-one hours after hCG injection, the mated females are sacrificed by CO2 asphyxiation or cervical dislocation and embryos are recovered from excised oviducts and placed in Dulbecco's phosphate buffered saline with 0.5% bovine serum albumin (BSA; Sigma). Surrounding cumulus cells are removed with hyaluronidase (1 mg/ml). Pronuclear embryos are then washed and placed in Earle's balanced salt solution containing 0.5% BSA (EBSS) in a 37.5° C. incubator with a humidified atmosphere at 5% CO2, 95% air until the time of injection. Embryos can be implanted at the two-cell stage.
Randomly cycling adult female mice are paired with vasectomized males. C57BL/6 or Swiss mice or other comparable strains can be used for this purpose. Recipient females are mated at the same time as donor females. At the time of embryo transfer, the recipient females are anesthetized with an intraperitoneal injection of 0.015 ml of 2.5% avertin per gram of body weight. The oviducts are exposed by a single midline dorsal incision. An incision is then made through the body wall directly over the oviduct. The ovarian bursa is then torn with watchmakers forceps. Embryos to be transferred are placed in DPBS (Dulbecco's phosphate buffered saline) and in the tip of a transfer pipet (about 10 to 12 embryos). The pipet tip is inserted into the infundibulum and the embryos transferred. After the transfer, the incision is closed by two sutures.
Where clinical applications are contemplated, pharmaceutical compositions will be prepared in a form appropriate for the intended application. Generally, this will entail preparing compositions that are essentially free of pyrogens, as well as other impurities that could be harmful to humans or animals.
One will generally desire to employ appropriate salts and buffers to render drugs, proteins or delivery vectors stable and allow for uptake by target cells. Aqueous compositions of the disclosure may comprise an effective amount of the drug, vector or proteins, dissolved or dispersed in a pharmaceutically acceptable carrier or aqueous medium. The phrase “pharmaceutically or pharmacologically acceptable” refer to molecular entities and compositions that do not produce adverse, allergic, or other untoward reactions when administered to an animal or a human. As used herein, “pharmaceutically acceptable carrier” includes solvents, buffers, solutions, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like acceptable for use in formulating pharmaceuticals, such as pharmaceuticals suitable for administration to humans. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredients of the present disclosure, its use in therapeutic compositions is contemplated. Supplementary active ingredients also can be incorporated into the compositions, provided they do not inactivate the vectors or cells of the compositions.
The active compositions of the present disclosure may include classic pharmaceutical preparations. Administration of these compositions according to the present disclosure may be via any common route so long as the target tissue is available via that route, but generally including systemic administration. This includes oral, nasal, or buccal. Alternatively, administration may be by intradermal, subcutaneous, intramuscular, intraperitoneal or intravenous injection, or by direct injection into muscle tissue. Such compositions would normally be administered as pharmaceutically acceptable compositions, as described supra.
The active compounds may also be administered parenterally or intraperitoneally. By way of illustration, solutions of the active compounds as free base or pharmacologically acceptable salts can be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations generally contain a preservative to prevent the growth of microorganisms.
The pharmaceutical forms suitable for injectable use include, for example, sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. Generally, these preparations are sterile and fluid to the extent that easy injectability exists. Preparations should be stable under the conditions of manufacture and storage and should be preserved against the contaminating action of microorganisms, such as bacteria and fungi. Appropriate solvents or dispersion media may contain, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.
Sterile injectable solutions may be prepared by incorporating the active compounds in an appropriate amount into a solvent along with any other ingredients (for example as enumerated above) as desired, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the desired other ingredients, e.g., as enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation include vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient(s) plus any additional desired ingredient from a previously sterile-filtered solution thereof.
The compositions generally may be formulated in a neutral or salt form. Pharmaceutically-acceptable salts include, for example, acid addition salts (formed with the free amino groups of the protein) derived from inorganic acids (e.g., hydrochloric or phosphoric acids, or from organic acids (e.g., acetic, oxalic, tartaric, mandelic, and the like). Salts formed with the free carboxyl groups of the protein can also be derived from inorganic bases (e.g., sodium, potassium, ammonium, calcium, or ferric hydroxides) or from organic bases (e.g., isopropylamine, trimethylamine, histidine, procaine and the like.
Upon formulation, solutions are preferably administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations may easily be administered in a variety of dosage forms such as injectable solutions, drug release capsules and the like. For parenteral administration in an aqueous solution, for example, the solution generally is suitably buffered, and the liquid diluent first rendered isotonic for example with sufficient saline or glucose. Such aqueous solutions may be used, for example, for intravenous, intramuscular, subcutaneous and intraperitoneal administration. Preferably, sterile aqueous media are employed as is known to those of skill in the art, particularly in light of the present disclosure. By way of illustration, a single dose may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur depending on the condition of the subject being treated. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject. Moreover, for human administration, preparations should meet sterility, pyrogenicity, general safety and purity standards as required by FDA Office of Biologics standards.
Duchenne muscular dystrophy (DMD) is a recessive X-linked form of muscular dystrophy, affecting around 1 in 5000 boys, which results in muscle degeneration and premature death. The disorder is caused by a mutation in the gene dystrophin, located on the human X chromosome, which codes for the protein dystrophin. Dystrophin is an important component within muscle tissue that provides structural stability to the dystroglycan complex (DGC) of the cell membrane. While both sexes can carry the mutation, females are rarely affected with the skeletal muscle form of the disease.
Mutations vary in nature and frequency. Large genetic deletions are found in about 60-70% of cases, large duplications are found in about 10% of cases, and point mutants or other small changes account for about 15-30% of cases. An examination of some 7000 mutations catalogued a total of 5,682 large mutations (80% of total mutations), of which 4,894 (86%) were deletions (1 exon or larger) and 784 (14%) were duplications (1 exon or larger). There were 1,445 small mutations (smaller than 1 exon, 20% of all mutations), of which 358 (25%) were small deletions and 132 (9%) small insertions, while 199 (14%) affected the splice sites. Point mutations totaled 756 (52% of small mutations) with 726 (50%) nonsense mutations and 30 (2%) missense mutations. Finally, 22 (0.3%) mid-intronic mutations were observed. In addition, mutations were identified within the database that would potentially benefit from novel genetic therapies for DMD including stop codon read-through therapies (10% of total mutations) and exon skipping therapy (80% of deletions and 55% of total mutations).
A. Symptoms
Symptoms usually appear in boys between the ages of 2 and 3 and may be visible in early infancy. Even though symptoms do not appear until early infancy, laboratory testing can identify children who carry the active mutation at birth. Progressive proximal muscle weakness of the legs and pelvis associated with loss of muscle mass is observed first. Eventually this weakness spreads to the arms, neck, and other areas. Early signs may include pseudohypertrophy (enlargement of calf and deltoid muscles), low endurance, and difficulties in standing unaided or inability to ascend staircases. As the condition progresses, muscle tissue experiences wasting and is eventually replaced by fat and fibrotic tissue (fibrosis). By age 10, braces may be required to aid in walking but most patients are wheelchair dependent by age 12. Later symptoms may include abnormal bone development that lead to skeletal deformities, including curvature of the spine. Due to progressive deterioration of muscle, loss of movement occurs, eventually leading to paralysis. Intellectual impairment may or may not be present but if present, does not progressively worsen as the child ages. The average life expectancy for males afflicted with DMD is around 25.
The main symptom of Duchenne muscular dystrophy, a progressive neuromuscular disorder, is muscle weakness associated with muscle wasting with the voluntary muscles being first affected, especially those of the hips, pelvic area, thighs, shoulders, and calves. Muscle weakness also occurs later, in the arms, neck, and other areas. Calves are often enlarged. Symptoms usually appear before age 6 and may appear in early infancy. Other physical symptoms are:
A positive Gowers' sign reflects the more severe impairment of the lower extremities muscles. The child helps himself to get up with upper extremities: first by rising to stand on his arms and knees, and then “walking” his hands up his legs to stand upright. Affected children usually tire more easily and have less overall strength than their peers. Creatine kinase (CPK-MM) levels in the bloodstream are extremely high. An electromyography (EMG) shows that weakness is caused by destruction of muscle tissue rather than by damage to nerves. Genetic testing can reveal genetic errors in the Xp21 gene. A muscle biopsy (immunohistochemistry or immunoblotting) or genetic test (blood test) confirms the absence of dystrophin, although improvements in genetic testing often make this unnecessary.
Additional symptoms may include:
B. Causes
Duchenne muscular dystrophy (DMD) is caused by a mutation of the dystrophin gene at locus Xp21, located on the short arm of the X chromosome. Dystrophin is responsible for connecting the cytoskeleton of each muscle fiber to the underlying basal lamina (extracellular matrix), through a protein complex containing many subunits. The absence of dystrophin permits excess calcium to penetrate the sarcolemma (the cell membrane). Alterations in calcium and signaling pathways cause water to enter into the mitochondria, which then burst.
In skeletal muscle dystrophy, mitochondrial dysfunction gives rise to an amplification of stress-induced cytosolic calcium signals and an amplification of stress-induced reactive-oxygen species (ROS) production. In a complex cascading process that involves several pathways and is not clearly understood, increased oxidative stress within the cell damages the sarcolemma and eventually results in the death of the cell. Muscle fibers undergo necrosis and are ultimately replaced with adipose and connective tissue.
DMD is inherited in an X-linked recessive pattern. Females will typically be carriers for the disease while males will be affected. Typically, a female carrier will be unaware they carry a mutation until they have an affected son. The son of a carrier mother has a 50% chance of inheriting the defective gene from his mother. The daughter of a carrier mother has a 50% chance of being a carrier and a 50% chance of having two normal copies of the gene. In all cases, an unaffected father will either pass a normal Y to his son or a normal X to his daughter. Female carriers of an X-linked recessive condition, such as DMD, can show symptoms depending on their pattern of X-inactivation.
Duchenne muscular dystrophy has an incidence of 1 in 5,000 male infants. Mutations within the dystrophin gene can either be inherited or occur spontaneously during germline transmission.
C. Diagnosis
Genetic counseling is advised for people with a family history of the disorder. Duchenne muscular dystrophy can be detected with about 95% accuracy by genetic studies.
DNA test. The muscle-specific isoform of the dystrophin gene is composed of 79 exons, and DNA testing and analysis can usually identify the specific type of mutation of the exon or exons that are affected. DNA testing confirms the diagnosis in most cases.
Muscle biopsy. If DNA testing fails to find the mutation, a muscle biopsy test may be performed. A small sample of muscle tissue is extracted (usually with a scalpel instead of a needle) and a dye is applied that reveals the presence of dystrophin. Complete absence of the protein indicates the condition.
Over the past several years DNA tests have been developed that detect more of the many mutations that cause the condition, and muscle biopsy is not required as often to confirm the presence of Duchenne's.
Prenatal tests. DMD is carried by an X-linked recessive gene. Males have only one X chromosome, so one copy of the mutated gene will cause DMD. Fathers cannot pass X-linked traits on to their sons, so the mutation is transmitted by the mother.
If the mother is a carrier, and therefore one of her two X chromosomes has a DMD mutation, there is a 50% chance that a female child will inherit that mutation as one of her two X chromosomes, and be a carrier. There is a 50% chance that a male child will inherit that mutation as his one X chromosome, and therefore have DMD.
Prenatal tests can tell whether their unborn child has the most common mutations. There are many mutations responsible for DMD, and some have not been identified, so genetic testing only works when family members with DMD have a mutation that has been identified.
Prior to invasive testing, determination of the fetal sex is important; while males are sometimes affected by this X-linked disease, female DMD is extremely rare. This can be achieved by ultrasound scan at 16 weeks or more recently by free fetal DNA testing. Chorion villus sampling (CVS) can be done at 11-14 weeks, and has a 1% risk of miscarriage. Amniocentesis can be done after 15 weeks, and has a 0.5% risk of miscarriage. Fetal blood sampling can be done at about 18 weeks. Another option in the case of unclear genetic test results is fetal muscle biopsy.
D. Treatment
There is no current cure for DMD, and an ongoing medical need has been recognized by regulatory authorities. Phase 1-2a trials with exon skipping treatment for certain mutations have halted decline and produced clinical improvements in walking. Sarepta's drug Exondys 51 (eteplirsen) has recently received FDA approval. However, treatment is generally aimed at controlling the onset of symptoms to maximize the quality of life, and include the following:
1. Physical Therapy
Physical therapists are concerned with enabling patients to reach their maximum physical potential. Their aim is to:
2. Respiration Assistance
Modern “volume ventilators/respirators,” which deliver an adjustable volume (amount) of air to the person with each breath, are valuable in the treatment of people with muscular dystrophy related respiratory problems. The ventilator may require an invasive endotracheal or tracheotomy tube through which air is directly delivered, but, for some people non-invasive delivery through a face mask or mouthpiece is sufficient. Positive airway pressure machines, particularly bi-level ones, are sometimes used in this latter way. The respiratory equipment may easily fit on a ventilator tray on the bottom or back of a power wheelchair with an external battery for portability.
Ventilator treatment may start in the mid to late teens when the respiratory muscles can begin to collapse. If the vital capacity has dropped below 40% of normal, a volume ventilator/respirator may be used during sleeping hours, a time when the person is most likely to be under ventilating (“hypoventilating”). Hypoventilation during sleep is determined by a thorough history of sleep disorder with an oximetry study and a capillary blood gas (See Pulmonary Function Testing). A cough assist device can help with excess mucus in lungs by hyperinflation of the lungs with positive air pressure, then negative pressure to get the mucus up. If the vital capacity continues to decline to less than 30 percent of normal, a volume ventilator/respirator may also be needed during the day for more assistance. The person gradually will increase the amount of time using the ventilator/respirator during the day as needed.
E. Prognosis
Duchenne muscular dystrophy is a progressive disease which eventually affects all voluntary muscles and involves the heart and breathing muscles in later stages. The life expectancy is currently estimated to be around 25, but this varies from patient to patient. Recent advancements in medicine are extending the lives of those afflicted. The Muscular Dystrophy Campaign, which is a leading UK charity focusing on all muscle disease, states that “with high standards of medical care young men with Duchenne muscular dystrophy are often living well into their 30s.”
In rare cases, persons with DMD have been seen to survive into the forties or early fifties, with the use of proper positioning in wheelchairs and beds, ventilator support (via tracheostomy or mouthpiece), airway clearance, and heart medications, if required. Early planning of the required supports for later-life care has shown greater longevity in people living with DMD.
Curiously, in the mdx mouse model of Duchenne muscular dystrophy, the lack of dystrophin is associated with increased calcium levels and skeletal muscle myonecrosis. The intrinsic laryngeal muscles (ILM) are protected and do not undergo myonecrosis. ILM have a calcium regulation system profile suggestive of a better ability to handle calcium changes in comparison to other muscles, and this may provide a mechanistic insight for their unique pathophysiological properties. The ILM may facilitate the development of novel strategies for the prevention and treatment of muscle wasting in a variety of clinical scenarios.
The following examples are included to demonstrate preferred embodiments of the disclosure. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the disclosure, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the disclosure.
Study Approval. All experimental procedures involving animals in this study were reviewed and approved by the University of Texas Southwestern Medical Center's Institutional Animal Care and Use Committee.
Plasmids. The pSpCas9(BB)-2A-GFP (PX458) plasmid containing the human codon optimized SpCas9 gene with 2A-EGFP and the backbone of sgRNA was purchased from Addgene (Plasmid #48138). Cloning of sgRNA was done using Bbs I sites. The AAV TRISPR-CK8-GFP plasmid containing three sgRNAs driven by U6, H1 or 7SK promoter and GFP driven by CK8 promoter.
Human iPSCs maintenance and nucleofection. Human iPSCs were cultured in mTeSR™ 1 media (STEMCELL Technologies) and passaged approximately every 4 days (1:14 split ratio). One hour before nucleofection, iPSCs were treated with 10 μM ROCK inhibitor (Y-27632) and dissociated into single cells using Accutase (Innovative Cell Technologies, Inc.). 1×106 iPSCs were mixed with 5 μg of pX458-gRNA-2A-GFP plasmid and nucleofected using the P3 Primary Cell 4D-Nucleofector X kit (Lonza) according to manufacturer's protocol. After nucleofection, iPSCs were cultured in mTeSR™ 1 media supplemented with 10 μM ROCK inhibitor, penicillin-streptomycin (1:100) (ThermoFisher Scientific) and 100 μg/ml Primosin (InvivoGen). Three days post-nucleofection, GFP(+) and (−) cells were sorted by FACS and subjected to T7E1 assay. Single clones derived from GFP(+) iPSCs were picked and sequenced.
Genomic DNA isolation. Genomic DNA of mouse 10T1/2 fibroblasts, mouse N2a cells, human 293 and human iPSCs was isolated using DirectPCR (cell) lysis reagent (VIAGEN) according to manufacturer's protocol.
PCR amplification of genomic DNA. Genomic DNA was PCR-amplified using GoTaq DNA polymerase (Promega) with primers. PCR products were gel purified and subcloned into pCRII-TOPO vector (Invitrogen) according to the manufacturer's protocol. Individual clones were picked and the DNA was sequenced.
T7E1 analysis of PCR products. Mismatched duplex DNA was obtained by denaturing/renaturing of 25 μl of the genomic PCR product using the following conditions: 95° C. for 5 mins, 95° C. to 85° C. (−2.0° C./seconds), 85° C. to 25° C. (−0.1° C./seconds), hold at 4° C. Then 25 μl of the mismatched duplex DNA was incubated with 2.7 μl of 10× NEB buffer 2 and 0.3 μl of T7E1 (New England BioLabs) at 37° C. for 90 minutes. The T7E1 digested PCR product was analyzed by 2% agarose gel electrophoresis.
Human cardiomyocyte differentiation. Human iPSCs were cultured in mTeSR™ 1 media for 3 days until they reached 90% confluence. To differentiate the iPSCs to cardiomyocytes, the iPSCs were cultured in CDM3-C media for 2 days, followed by CDM3-59 media for 2 days, followed by CDM3 media for 6 days, followed by selective media for 10 days and lastly by basal media for 2 days. Then, the cardiomyocytes were dissociated using TrypLE media and re-plated at 2×106 per well in a 6-well dish.
Dystrophin Western blot analysis. After 30 days post-differentiation, 2×106 cardiomyocytes were harvested and lysed with lysis buffer (10% SDS, 62.5 mM Tris pH=6.8, 1 mM EDTA, and protease inhibitor). Cell lysates were passed through a 22 G syringe and then a 27 G syringe, 10 times each. Protein concentration was determined by BCA assay and 50 ug of total protein was loaded onto an acrylamide gel. After running at 100V (20 mA) for 5 hours and followed by 1 hour 20 min transfer to PVDF membrane at 35V (200 mA) at 4° C. The blot was incubated with mouse anti-dystrophin antibody (MANDYS8, Sigma-Aldrich, D8168) at 4° C. overnight and with goat anti-mouse HRP antibody (Bio-Rad Laboratories) at RT for 1 hour. The blot was developed using Western Blotting Luminol Reagent (Santa Cruz, sc-2048). The loading control was determined by blotting with mouse anti-vinculin antibody (Sigma-Aldrich, V9131).
CRISPR/Cas9-mediated exon deletion in mice. Two single-guide RNA (sgRNA) specific intronic regions surrounding exon 43, exon 45, or exon 52 sequence of the mouse Dmd locus were cloned into vector pX458 using the primers from Table 10. For the in vitro transcription of sgRNA, T7 promoter sequence was added to the sgRNA template by PCR using the primers from Table 11. The gel purified PCR products were used as template for in vitro transcription using the MEGAshortscript T7 Kit (Life Technologies). sgRNA were purified by MEGAclear kit (Life Technologies) and eluted with nuclease-free water (Ambion). The concentration of guide RNA was measured by a NanoDrop instrument (Thermo Scientific).
Genotyping of 443, 445, and 452 DMD Mice. 443, 445, and 452 DMD mice were genotyped using primers encompassing the targeted region from Table 12. Tail biopsies were digested in 100 μL of 25 mM NaOH, 0.2 mM EDTA (pH 12) for 20 min at 95° C. Tails were briefly centrifuged followed by addition of 100 μL of 40 mM Tris.HCl (pH 5) and mixed to homogenize. Two microliters of this reaction was used for subsequent PCR reactions with the primers below, followed by gel electrophoresis.
Histological analysis of muscles. Skeletal muscles from WT, Δ43 DMD, Δ45 DMD, and Δ52 DMD mice were individually dissected and cryoembedded in a 1:2 volume mixture of Gum Tragacanth powder (Sigma-Aldrich) to Tissue Freezing Medium (TFM) (Triangle Bioscience). All embeds were snap frozen in isopentane heat extractant supercooled to −155° C. Resulting blocks were stored overnight at −80° C. prior to sectioning. Eight-micron transverse sections of skeletal muscle, and frontal sections of heart were prepared on a Leica CM3050 cryostat and air-dried prior to same day staining. H&E staining was performed according to established staining protocols and dystrophin immunohistochemistry was performed using MANDYS8 monoclonal antibody (Sigma-Aldrich) with modifications to manufacturer's instructions. In brief, cryostat sections were thawed and rehydrated/delipidated in 1% triton/phosphate-buffered-saline, pH 7.4 (PBS). Following delipidation, sections were washed free of Triton, incubated with mouse IgG blocking reagent (M.O.M. Kit, Vector Laboratories), washed, and sequentially equilibrated with MOM protein concentrate/PBS, and MANDYS8 diluted 1:1800 in MOM protein concentrate/PBS. Following overnight primary antibody incubation at 4° C., sections were washed, incubated with MOM biotinylated anti-mouse IgG, washed, and detection completed with incubation of Vector fluorescein-avidin DCS. Nuclei were counterstained with propidium iodide (Molecular Probes) prior to cover slipping with Vectashield.
Δ43, Δ45, and Δ52 DMD mouse models recapitulate muscle dystrophy phenotype. To investigate CRISPR/Cas9-mediated exon skipping and reframing in vivo, three mimics of the human “hot spot” regions were generated in three mouse models by deleting the exon 43, exon 45, and exon 52, respectively, using CRISPR/Cas9 system directed by 2 single guide RNAs (sgRNA) (
The deletion of Dmd exon 43, exon 45, and exon 52 was confirmed by DNA genotyping. 1-month old mice lacking exon 43, exon 45, or exon 52 showed pronounced dystrophic muscle changes. (
Identification of optimal sgRNAs for CRISPR/Cas9 correction of DMD exon 43, exon 45, and exon 52 deletions. Skipping or reframing of exon 44, exon 46, and exon 53 would apply to about 18% of DMD patients. To restore the ORF of the DMD patient with exon 43, exon 45, or exon 53 deletion, the inventors applied single guide RNA to disrupt the splicing junction of exon 44, exon 46, and exon 53 respectively, which results in reframing of the exon downstream of the deleted exon and restoration of the protein reading frame (
DMD iPSC-derived cardiomyocytes express dystrophin after CRISPR/Cas9 mediated genome editing by exon skipping and exon reframing. The inventors then generated iPSCs from DMD patients (TX16) that have deletion of exon 52 and an isogenic iPSC line with deletion of exon 43 (Δ43 DMD) and a deletion of exon 45 (Δ45 DMD). The inventors then tested exon 44 editing sgRNAs on Δ43 and Δ45 DMD iPSCs and showed restoration of dystrophin protein expression by Western blot analysis and immunostaining of iPSC-derived cardiomyocytes (
All of the compositions and/or methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this disclosure have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and/or methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the disclosure. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the disclosure as defined by the appended claims.
The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.
This application claims priority to U.S. Provisional Application Ser. No. 62/854,131, filed May 29, 2019, and U.S. Provisional Application Ser. No. 62/688,003, filed Jun. 21, 2018, each of which is incorporated by reference herein in its entirety for all purposes.
This invention was made with government support under grant no. U54 HD 087351 awarded by National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2019/038398 | 6/21/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62854131 | May 2019 | US | |
62688003 | Jun 2018 | US |