In one aspect, the invention relates to methods and compositions for removing replicated genetic material, such as duplicated copy number variations (CNVs) and related uses. In another aspect, the invention relates to methods and compositions for the treatment or prevention of conditions relating to replicate genetic material such as duplicate CNVs, including but not limited to certain Duchenne Muscular Dystrophies and MECP2 duplication syndrome. In another aspect, the invention provides genetically engineered animals having duplicate CNVs and methods for producing same.
All references and sequence listing as noted herein are incorporated by reference. The paper copy and computer-readable form (CRF) are identical. The Sequence Listing forms part of this application.
The use of endonucleases as targeted gene editing tools has rapidly emerged with the advent of technologies such as the use of zinc finger proteins, TALENS and CRISPR/Cas9 systems. Zinc fingers are dimers comprising Zinc fingers which recognize DNA basepair in triplets and the Fokl nuclease, cutting in the spacer region between two distinct ZF target sites. A TAL effector nuclease (TALEN) is similar in principle to the ZF nuclease, comprising the components of the array which recognize individual nucleotides and the Fokl nuclease cutting in the spacer region between two distinct TALENS target sites.
Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems are adaptive immune systems used by many bacteria and archaea to fight off foreign DNA in the form of bacterial phages and/or plasmids (1). Specifically, the type II CRISPR/Cas system works through RNA-directed endonuclease cleavage of the invading genomic sequence. The invading sequence is captured and inserted directly into the genome of the host organism between CRISPR regions (2-4). Following transcription and processing of these loci, RNA guided endonucleases are made with the capability to target foreign nucleic acids based on complementarity with the RNA (5).
Since realizing the potential power of a programmable nuclease in editing mammalian genomes, the CRISPR/Cas9 system has been developed as a technology for multiple biological contexts (6, 7). Regardless of the platform, this system requires a mammalian optimized Cas9 protein and a chimeric single guide RNA (sgRNA) which is made up of CRISPR RNAs (crRNA) and a trans-activating CRISPR RNA (tracrRNA) (6-9). The guide sequences are generally 17-20-bp long (10). Target sequences must be adjacent to a protospacer adjacent motif (PAM) sequence for Streptococcus pyogenes Cas9 (SpCas9) in the form of 5′-NGG (11). Cas9 target recognition is dictated by the Watson-Crick base-pairing of an RNA guide with its DNA target (2, 3). Once expressed in cells, the Cas9 nuclease and the sgRNA form a complex, bind to the target sequence, and make a double-stranded break in the target. The break is repaired via the cellular process of non-homologous end joining (NHEJ), which may introduce insertions and deletions (indels) into the target sequence. Targeted mutations can also be introduced by co-transfecting single- or double-stranded DNA templates to promote homology directed repair (HDR) (See
Another application of the CRISPR/Cas9 tool is to regulate gene expression. This approach uses a catalytically inactive or “dead” (dCas9), which when bound to DNA elements may repress transcription by sterically hindering the RNA polymerase machinery (13), likely by stalling transcriptional elongation (See
Gene editing tools such as the site directed endonuclease technology has promise in a number of therapeutic applications in genetic disorders.
One type of genetic disorders that it has not been easily applied are conditions related to duplications of genetic material. With the development of powerful genome analysis platforms, there is growing evidence for the prevalence of copy number variations (CNVs) that are associated with numerous genetic conditions (26). Such conditions include a certain percentage of patients with Duchenne's muscular dystrophy (DMD) and MECP2 duplication syndrome.
However, to date no successful therapeutic strategies have been developed to target these large genomic rearrangements or duplication of genetic material disorders.
The present inventors have developed a method of using an engineered targeted endonuclease technology (such as CRISPR/Cas9 technology) that can be used to remove replicate, such as duplicate genetic material using one guide. In some embodiments the invention is useful to modulate expression of genes that are known to play a critical role in disease pathogenesis to therapeutically target autosomal dominant, heterozygous, gain-of-function mutations and large chromosomal rearrangements such as duplications of genetic material. In one aspect, the invention provides a method for specifically targeting duplicate genetic material, such as CNVs, and restoring proper wild type gene expression using one guide, such as one single guide RNA, that targets the replicated (such as duplicated) genetic material.
In one aspect the invention provides a method of removing replicate (such as duplicate) genetic material in vivo, ex vivo or in vitro that is present head to tail on a nucleotide sequence in a eukaryotic cell. In some embodiments the replicate material may or may not have intervening sequences between them. The method in some embodiments comprises using one guide, such as one single guide RNA, and an endonuclease, such as a Cas protein bearing effector domains to cut genetic material, such as DNA at a desired location, in vivo, ex vivo or in vitro in eukaryotic cells. In one aspect, the guide, such as the guide RNA has a region that is coupled to or complexed with the endonuclease, such as a Cas protein and a region that can bind to the target DNA, delivering the endonuclease to the target site. In some embodiments, the guide can be a Zinc finger or TALENS array coupled to or complexed to an endonuclease, such as Fok1.
In some embodiments, the method further comprises a repair or joining of the cleaved genetic material, for instance by non-homologous end-joining or homology directed repair or other suitable repair mechanisms. However, an endonuclease that creates a double stranded break and repaired by non-homologous end-joining is preferred for certain uses of the invention, such as in removing the replicate genetic material and restoring a single copy of the genetic material. In some embodiments, the method can be used to treat or prevent conditions associated with replicate (or duplicate) genetic material, such as certain types of Duchenne's muscular dystrophy (DMD), to for instance restore full-length dystrophin and a-dystroglycan expression or in the treatment of MECP2 duplication syndrome to remove large genome rearrangements.
As such, in one aspect the invention provides a method for treatment of a condition in a mammal that can benefit from deletion of replcated region of genetic material, the method comprising administering to said mammal an effective amount of:
wherein the nucleic acids of (i) and (ii) are incorporated into a vector and in a delivery vehicle suitable for delivering same to the eukaryotic cell of the mammal in a manner that enables the cell to be transfected with said nucleic acids and express same once transfected to remove the duplicate genetic material and restore wild type gene expression. In one embodiment (i) and (ii) are incorporated into the same vector. In another embodiment they are in different vectors. In one embodiment (i) and (ii) are in the same or different compositions. In one embodiment the endonuclease is exogenous to the cell. In another embodiment it is endogenous (.e.g. a transgene) and expressed in the cell or a suitable stimulus is applied to the cell to induce expression of the desired endonuclease.
In another aspect, the invention provides a composition comprising:
(a) one or more of the following:
(b) a pharmaceutically acceptable carrier.
In another aspect, the invention comprises a kit comprising a composition of the invention and optionally instructions for use.
In other aspects, the invention provides a method of making a genetically engineered animal using an engineered targeted endonuclease technology, such as a CRISPR/Cas9 technology. In one embodiment the genetically engineered animal comprises duplicate genetic material, such as a duplicate CNV, which in some embodiments can be used as an animal disease model for conditions associated with said duplicate CNV. In one embodiment the invention provides a mouse carrying Dmd exons 18-30 duplication and methods and uses thereof. In another embodiment the invention provides vectors comprising the guide and endonuclease, and the guide and endonuclease for making the engineered mouse. Although the invention is not limited to genetically engineered mice the same methodology can be used in other animals, such as rats, hamsters or other animals.
Additional aspects and advantages of the present invention will be apparent in view of the description which follows. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
The invention will now be described in relation to the drawings, in which:
The present invention provides a method for removing replicated, such as duplicated genetic material, such as genomic material, such as large duplicated genomic rearrangements, using a one guide, such as one single guide RNA, and an endonuclease, such as Cas (Cas9).
In another aspect, the invention provides a method, such as a therapeutic method, for treating conditions related to replicated, such as duplicated, genetic material, such as certain inherited disorders that are caused by duplication of genetic material.
In another aspect, the present inventors have for the first time demonstrated removal of large genomic rearrangements using an engineered targeted endonuclease technology, e.g. CRISPR/Cas9 technology. In one embodiment they have shown it in the X-chromosomal duplication including the MECP2 gene as well as a large duplication of the dystrophin gene in a patient with Duchenne muscular dystrophy. The results demonstrate that large genomic rearrangements caused by copy number variations (CNVs) are amenable to treatment strategies mediated by an engineered endonuclease technology such as the CRISPR/Cas9 methodology.
Using an adapted CRISPR/Cas9 strategy to illustrate their invention, the present inventors have successfully removed a 139 kb duplication (exons 18-30: chrX:32,552,206-32,413,149 (hg19) duplication with an AAAT insertion in the breakpoint junction) in the DMD gene leading to restoration of full-length dystrophin and a-dystroglycan expression in patient myotubes.
In summary, it is herein demonstrated that engineered targeted endonuclease systems, such as CRISPR/Cas9 can have significantly broader therapeutic implications for DMD, which includes strategies to restore full-length dystrophin expression.
An increasing number of genetic disorders are caused by chromosomal rearrangements and CNVs. However, treatments targeting the underlying cause of these disorders are currently not available. Here, the inventors developed a novel strategy employing an engineered targeted endonuclease technology (e.g. the CRISPR/Cas9 system) to remove duplicated regions within the genome. The strategy uses a one guide (e.g., sgRNA) approach, which due to the nature of a tandem (head-to-tail) duplication creates two double stranded breaks. Since the target is a sequence within a duplication, the sgRNA target will be found twice, leading to the formation of two double stranded breaks (“DSB”) and hence removing the intervening sequence which equates to the total size of the duplication. In fact the invention can be used for any replication of sequences not just duplications but also where multiple copies (more than 2) of genetic material are present to leave a single copy of the genetic material, as the guide/endonuclease complex will target each site and remove intervening sequences. Further, although the replications are head to tail, intervening sequences may be present between the replications, which will also be removed upon use of the methods of the invention and the one guide approach presented herein. There are several advantages to this strategy. First, the design of RNA guides is not limited to specific sequences near the breakpoints. This allows for a larger selection of guide RNAs that can target any portion of the duplicated sequence while minimizing off-target sites. Second, given the limited loading capacity of potential in vivo delivery vehicles such as AAV9, strategies using the least amount of CRISPR components will be critical for the development of further therapeutic applications. The inventors have herein successfully removed a large chromosomal rearrangement on the X-chromosome containing the MECP2 gene indicating that this approach can be targeted toward several chromosomal duplication syndromes. Importantly, off-target analysis showed no significant hits in the top 20 predicted sites using NGS, suggesting that the accuracy and safety of our system lends itself as a viable strategy for therapeutics.
The inventors have further herein shown broader applicability of the invention, applying the approach to patients with DMD. To date, treatments that specifically target duplications in DMD have not been extensively studied even though duplications of one or more exons comprise approximately 10% of the DMD mutation spectrum (27). Recent therapeutic strategies for DMD undertaken by other groups include gene replacement therapies, which deliver truncated but functional microdystrophin genes (36, 37). One type of gene replacement therapy is exon skipping, where antisense oligonucleotides complementary to regions of the dystrophin premature mRNA are used to induce skipping of one (38, 39) or more exons (40), hence restoring the open reading frame to produce a shorter dystrophin protein. Similarly, previous studies from other laboratories using CRISPR/Cas9 have demonstrated that this system can be utilized to restore the reading frame of large deletions in the dystrophin gene (20). However, one potential shortcoming of these approaches is that the shorter dystrophin product ameliorates the disease phenotype only to the extent of making them similar to patients with Becker muscular dystrophy, who exhibit expression of a truncated, yet functional dystrophin protein (41). Thus, the data presented herein is of particular importance as removal of a duplication leads to restoration of full-length dystrophin, which represents novel therapeutic opportunities for DMD patients with duplications.
An important consideration in establishing a treatment for DMD is determining how much dystrophin is necessary to ameliorate the disease phenotype. It is estimated that in humans about 20% of truncated dystrophin protein expression is sufficient to have a less severe phenotype and maintain ambulation (42, 43). Furthermore, studies in mdx mice suggest that approximately 5% of full-length dystrophin can improve disease pathology and >20% is needed to fully protect muscle fibers from exercise-induced damage (44-46). The present data demonstrates 4.42% expression of full-length dystrophin accompanied by restoration of components of the DGC.
Here, the inventors have developed a pipeline for genome engineering strategies using easily accessible patient cells and have demonstrated that individually tailored single RNA guides are able to remove large duplicated genomic rearrangements in two different genetic disorders.
In another embodiment, the invention provides compositions and kits comprising one or more of the components for use in the methods of the present invention and optionally instructions for use of same in the methods of the present invention.
In another aspect, the inventors have developed an animal model for studying replicative genetic disorders by successful generating a genetically engineered mouse model comprising a duplication of Dmd exons 18-30 using engineered targeted endonuclease technology. Constructs and methodologies for developing same are also provided as well as uses thereof.
“Administering to the cell(s)” as used herein means both in vitro and in vivo administration to the cells and can be direct or indirect administration, as long as the cells are at some point exposed to the substance being administered.
“Effective Amount” and “Therapeutically Effective Amount” as used herein means an amount effective, at dosages and for periods of time necessary to achieve the desired results. For example, an effective amount of a substance may vary according to factors such as the disease state, age, sex, and weight of the individual, and the ability of the substance to elicit a desired response in the individual. Dosage regimes may be adjusted to provide the optimum therapeutic response. For example, several divided doses may be administered daily or the dose may be proportionally reduced as indicated by the exigencies of the therapeutic situation.
“Pharmaceutically Acceptable Carrier” as used herein means any medium which does not interfere with the effectiveness or activity of an active ingredient and which is not toxic to the hosts to which it is administered. It includes any carrier, excipient, or vehicle, which further includes diluents, binders, adhesives, lubricants, disintegrates, bulking agents, wetting or emulsifying agents, pH buffering agents, and miscellaneous materials such as absorbants that may be needed in order to prepare a particular composition. Examples of carriers, excipient or vehicles include but are not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. Such media and agents for an active substance and uses thereof are well known in the art (e.g., “Remington: The Sciences and Practice of Pharmacy, 21st Edition”, (University of the Sciences in Philadelphia, 2005).
“Isolated” refers to a protein or nucleic acid that, if naturally occurring, is in an environment different from that in which it may naturally occur.
The terms “nucleic acid”, “polynucleotide”, “nucleotide sequence”, and “oligonucleotide” are used interchangeably and refer to a polymer of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. A polynucleotide may comprise one or more modified nucleotides (e.g. methylated nucleotides and nucleotide analogs). Non-limiting examples of nucleic acids include messenger RNA, isolated DNA of any sequence, isolated RNA of any sequence, guide RNA, recombinant polynucleotides, vectors, probes, and primers.
Herein the term “gene” includes a DNA region encoding a gene product, as well as all DNA regions regulating the production of the gene product, irrespective of whether the particular regulatory sequence is adjacent to the coding and/or transcribed sequences on a chromosome. Accordingly, a gene includes, for example, promoter sequences, terminators enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites, locus control regions, and translational regulatory sequences such as ribosome binding sites.
Herein the term “promoter” refers to a sequence of DNA, usually upstream (5’) of the coding region of a structural gene, which controls the expression of the coding region by providing recognition and binding sites for RNA polymerase and other factors which may be required for initiation of transcription.
Herein the term “gene product” includes the direct transcriptional product of a gene (e.g. mRNA, tRNA, rRNA, antisense RNA) or a protein produced by translation of an RNA transcribed from the gene. Gene products further include modified RNAs (e.g. by capping, polyadenylation, methylation, and editing) and modified proteins (e.g. by methylation, etylation, phosphorylation, ubiquitination, ADP-ribosylation, and glycosylation).
The terms “polypeptide”, “peptide”, and “protein” are used interchangeably and refer to chains or polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified, for example, by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component.
As used herein, the term “amino acid” refers to an organic acid containing both a basic amino group (NH2) and an acidic carboxyl group (COOH), and includes natural and/or unnatural or synthetic amino acids, including both the D or L optical isomers, and amino acid analogs.
Herein the term “plasmid” refers to a circular double-stranded DNA construct capable of being used as a vector for introducing DNA into a cell.
As used herein, the term “Cas protein” has its conventional meaning as used in the art where it refers to a product of a Cas gene which in nature is typically coupled to, associated with, or in the vicinity of a CRISPR locus. In one aspect, the term encompasses proteins which in their natural state have DNA cleavage activity (i.e. endonuclease activity). For instance, the term encompasses Cas9 isolated from for example Streptococcus pyogenes and/or Streptococcus thermophilus. In one aspect, the Cas protein directs cleavage of one or both strands of DNA at the location of a target sequence, such as within the target sequence and/or within the complementary sequence of the target sequence (52, 53, 55, 58). It also encompasses conservative amino acid substitutions of native Cas proteins, wherein conservative substitutions do not affect the native endonuclease activity of the Cas proteins.
“Cas proteins” may be part of a fusion protein, for instance a Cas polypeptide is part of a fusion protein comprising one or more heterologous protein domains. Non-limiting examples of protein domains which may be fused with a Cas polypeptide include reporter sequences, epitope tags, and protein domains that have one or more activities such as: transcriptional activating activity (i.e. functioning as trans-activators of transcription), transcription repression activity, methylase activity, histone modification activity, and nucleic acid binding activity. For instance, in some embodiments a “Cas protein” is a Cas9 isolated from S. pyogenes or S. thermophilus and directly fused to a transcriptional activator (e.g. VP160 and/or VP64) at its 5′ end.
Cas proteins may include amino acid deletions or substitutions which alter the sequence of the protein from its natural state. In some embodiments, changes to the amino acid sequence modify protein activity (e.g. enzymatic activity, including endonuclease activity) which in nature is associated with the protein. For instance, a plasmid encoding a Cas protein can be mutated such that when expressed the Cas protein lacks the capacity to cleave one or both strands of a nucleic acid comprising a target sequence. In another aspect the Cas protein is a S. pyogenes Cas9 protein engineered to contain an aspartic acid to alanine substitution at residue 10 (D10A) in the RuvC I catalytic domain of Cas9, which converts the endonuclease to a nickase.
“Endonuclease” is a protein that has DNA cleavage activity, for instance a Cas protein, Fok1 (for instance that complexes with a zinc finger or TALENS).
Herein the term “guide” refers to any polynucleotide sequence or protein capable of forming a complex with an endonuclease and a target nucleic acid sequence. It can refer to any type of guide useful for the present invention including but not limited to a Zinc finger protein or a TALENS array or a guide RNA or a DNA guide.
Herein the terms “guide RNA”, and “gRNA”, are used interchangeably and refer to the polynucleotide sequence capable of forming a complex with an endonuclease such as a Cas protein and has a region that is complementary to a target sequence of a polynucleotide. The guide RNA may consist of a single polynucleotide or multiple polynucleotides complexed together using for example hydrogen bonds between complementary base pairs. The guide RNA comprises the “guide sequence”, which refers to the sequence within the guide, typically about 18-20 bp in length, which hybridizes to the target site. In one embodiment, a gRNA can include a guide sequence complementary to human or mouse nucleic acid, such as the ones used herein in the Examples. The gRNA may be a chimeric or recombinant RNA. In some embodiments, one or more gRNAs are derived from a type I, type II, or type III CRISPR system (56). In some embodiments, at least a portion of one or more gRNAs is derived from an organism comprising an endogenous CRISPR system, such as Streptococcus pyogenes or Streptococcus thermophilus. In general a gRNA is characterized by sequences and structures which promote the formation of a CRISPR complex at a target sequence.
In general “gRNA” (guide RNA) is a fusion of the crRNA and tracrRNA, provides both targeting specificity and scaffolding/binding ability for an endonuclease such as Cas9 nuclease and does not exist in nature. “gRNA sequence”: in general are the nucleotides (.e.g 18-20 nucleotides) that precede the endonuclease recognition site, such as the PAM sequence, in the genomic DNA, what gets put into a gRNA expression plasmid, does not include the PAM sequence.
“tracrRNA”: the endogenous bacterial RNA that links the crRNA to the endonuclease such as, Cas9 nuclease, can bind any crRNA.
“crRNA” as used herein is the endogenous bacterial RNA that confers target specificity, and requires tracrRNA to bind to the endonuclease, such as Cas9.
Herein the terms “target sequence” and “target site” are used interchangeably and refer to a nucleic acid sequence or a polynucleotide to which a guide sequence is complementary (or sufficiently complementary) to bind. It should have, be proximate to or adjacent to an endonuclease recognition site to enable the endonuclease within a guide/endonuclease complex to bind and cleave the genetic material at the desired location. For instance in the Cas9, the endonuclease target site is a PAM sequence which is within the genomic DNA and adjacent to the target sequence for the guide but not within the guide sequence. The presence of an endonuclease recognition site is important in determining suitable target sequences and in designing and optimizing guides.
Full (100%) complementarity between the guide, e.g. gRNA, and the target sequence is not necessarily required, so long as the guide sequence and target sequence are sufficiently complementary to cause hybridization and promote formation of an endonuclease/guide/target complex, e.g. a CRISPR complex. A target sequence may comprise either DNA or RNA, and in some embodiments may be located in the nucleus, cytoplasm, or organelle (e.g. mitochondria or chloroplast) of a cell.
Herein the term “endonuclease/guide/target” encompasses a guide comprising a region capable of hybridizing or coupling with a target sequence and a region capable of forming a complex with an endonuclease complexed with the endonuclease and a hybridized to the target sequence. Thus “CRISPR complex” as used herein encompasses a guide RNA comprising a region having a guide sequence and a region capable of forming a complex with a Cas complexed to one or more Cas proteins and hybridized to a target sequence.
Herein the terms “patient”, “subject”, and “individual” are used interchangeably and refer to a vertebrate, preferably a mammal, and more preferably a human. In some aspects of the invention, tissues and cells of a biological entity such as from a patient are obtained in vivo or cultured in vitro.
Off-target effects refers to guides, such as gRNA binding to target sequences that do not match exactly, causing the endonuclease, such as Cas9 to function in an unintended location causing effects.
ORF: Open Reading Frame, the codons that make up a gene.
PAM: Protospacer Adjacent Motif, required sequence for endonuclease recognition of the CRISPR/CAS system that must immediately follow the gRNA recognition sequence but is NOT in the gRNA.
sgRNA: single guide RNA, the same as a gRNA, which is a single stranded RNA.
Herein the term “muscular dystrophy” includes all forms of dytrophin-deficient muscular dystrophy. Non-limiting examples include Duchenne muscular dystrophy and Becker muscular dystrophy. More particularly it refers to the subset of said dystrophies caused by duplicate genetic material.
Herein the term “muscle cell” refers to any type of myocyte including cardiac, skeletal, and smooth muscle cells, as well as their progenitor myoblasts, as well as muscle stem cells, called satellite cells.
In one aspect, the invention provides a method for removing replicated genetic material present head to tail on a nucleotide sequence of a cell, such as a eukaryotic cell with or without an intervening sequence in-between the replicated genetic material. In some embodiments the genetic material is duplicated, in other embodiments it may be present in triplicate or more.
In some embodiments, the method comprises:
In some embodiments the exogenous polynucleotide guide and endonuclease can be delivered to the cell by delivering their coding sequence to the cell in one (i.e. on the same vector) or more vectors, such as plasmids, phages, cosmids or protein. In other embodiments they can be delivered as formed endonuclease/guide complex, via suitable delivery means, such as liposomes, electroporation or other suitable means known in the art.
“One exogenous polynucleotide guide” as used herein refers to one type of polynucleotide guide or a guide that targets one target sequence. The guide may be present in multiple copies within the cell or multiple copies of its coding sequence may be delivered to the cell or present within the vector delivered to the cell. The present invention only requires a guide that targets only one target sequence, which can include multiple sites where that target sequence occurs, for instance in replicated genetic material.
In other embodiments, the method of further comprises the step of repairing and joining the cleaved nucleotide sequence. This can be done by non-homologous end joining or homology directed repair, but preferably non-homologous end joining and wherein the method of the invention results in at least one less copy of the genetic material or resulting in one copy of the genetic material where before there was two or more. In some embodiments, other cellular repair mechanisms may also be used, such as microhomology repair.
In some embodiments, the the exogenous polynucleotide guide is delivered to the cell by:
wherein the cell expresses the guide and the endonuclease, the guide forming a complex with the endonuclease, and wherein the vector in (i) and (ii) can be the same or different. In some embodiments, the vector is a plasmid or a viral particle or protein. In other embodiments, it is a plasmid or viral particle.
In some embodiments, the replicative or duplicate genetic material is associated with a disease, such as Duchenne's muscular dystrophy that is caused by duplicative genetic material, such as duplication of exons 18-30 or MECP2 duplication syndrome. causing variant and the method is used in the prevention or treatment of said disease by administering to the patient, or embryo or egg a guide and endonuclease or vector coding same or guide/endonuclease complex alone or together of the invention as per the methods herein before described.
In one embodiment, the invention provides a method for designing an exogenous polynucleotide guide for use in the method of the invention, said method comprising:
In another aspect, the invention provides a composition comprising:
(a) one or more of the following:
(b) a pharmaceutically acceptable carrier.
In another embodiment, the invention provides a use of the compositions in the methods and uses of the invention. In one embodiment the components or compositions of the invention can be in a kit comprising one or more of same and optionally instructions for their use in carrying out any one of the methods of the invention.
Referring to
CRISPR loci in a bacterium contain “spacers” that in type II adaptive immune systems were created from invading viral or plasmid “protospacer” DNA. On subsequent invasion, Cas9 nuclease attached to tracrRNA: crRNA is guided to the invading protospacer sequence, but Cas9 will not cleave the protospacer sequence unless there is an adjacent PAM sequence. The “spacer” in the bacterial CRISPR loci will not contain a PAM sequence, and will thus not be cut by the nuclease. But the protospacer in the invading virus or plasmid will contain the PAM sequence, and will thus be cleaved by the Cas9 nuclease. For editing mammalian genes, guideRNAs (gRNAs) are synthesized to recognize mammalian gene sequences having a PAM sequence at the 3′-end.
For Cas9 to successfully bind to DNA, the target sequence in the genomic DNA must be sufficiently complementary to the gRNA sequence for the gRNA to bind with it and must be immediately followed by the correct protospacer adjacent motif or PAM sequence. The PAM sequence is present in the DNA target sequence but not in the gRNA sequence. Any DNA sequence with the correct target sequence followed by the PAM sequence will be a Cas9 recognition and cleavage site and will be bound by Cas9. The presence of the target sequence without the PAM following it is not sufficient for Cas9 to cut. Further, the presence of the PAM sequence alone is not sufficient for Cas9 to cut.
The PAM sequence varies by the species of the bacteria from which the Cas9 was derived. The most widely used Type II CRISPR system is derived from S. pyogenes and the PAM sequence is NGG located on the immediate 3′ end of the gRNA recognition sequence. The PAM sequences of other Type II CRISPR systems from different bacterial species are listed in the Table below. It is important to note that the components (gRNA, Cas9) derived from different bacteria will generally not function together. Example: S. pyogenes (SP) derived gRNA will not function with a N. meningitidis (NM)derived Cas9.
In one embodiment, of using the CRISPR/Cas9 engineered system both the gRNA and Cas9 is expressed in target cells. The respective promoters for Cas9 and gRNA expression will ultimately determine the species specificity of a particular system. CRISPR cassettes can either contain both gRNA and Cas9 expressing cassettes on a single plasmid or they can be expressed from two separate plasmids.
In one embodiment, the present invention provides a method of restoring wild type gene expression in vivo or in vitro in a eukaryotic cell comprising using one single guide RNA and an endonuclease, such as a Cas protein bearing effector domains to modulate expression in vivo or in vitro in eukaryotic cells.
In one aspect the method in a eukaryotic cell (e.g. skeletal or cardiac muscle cell) comprises:
wherein the eukaryotic cell expresses the RNA and the endonuclease Cas protein, the endonuclease Cas protein complexes with the RNA, the RNA binds to the complementary genomic DNA of the target region of the gene (i.e. the target sequence).
In one embodiment, RNA complementary to genomic DNA of the target region of includes RNA that is complementary to all or part of a target sequence. In one aspect the RNA binds to a specific 18-20 nt long DNA sequence, however length can be different for different Cas9 species or endonucleases. In one aspect the RNA does not need to be 100% complementary to the target sequence of the target region, but is sufficiently complementary to enable base pair binding to the target sequence to enable the Cas protein fusion protein to bind to the target region.
In one embodiment, the guide RNA comprises a region that is 18-20 nt in length (i.e. the “guide sequence”; see definition above) that is complementary to genomic DNA of the target region, and a region that can interact and complex with an endonuclease Cas protein.
In one embodiment, each gRNA is a single chimeric guide RNA comprising a region having a guide sequence (e.g. a sequence complementary to a target sequence) and a region which interacts with an endonuclease Cas protein. In another embodiment the gRNA comprises of multiple RNAs (e.g. a tracrRNA and crRNA), wherein one of the RNAs comprises the guide sequence complementary to the target region and one or more of the RNAs provides a region to interact with the endonuclease Cas protein (see e.g. 9).
In one embodiment the nucleic acid encoding the guide RNA is inserted into a vector suitable for transfection or transduction into the eukaryotic cell and subsequent transcription within the cell. In one embodiment the vector is selected from the group of vectors consisting of: plasmids, recombinant adeno-associated viral (rAAV) or lentiviral particles.
The invention contemplates the use of any gRNA which contains a guide sequence complementary to a desored target polynucleotide. In one embodiment only one gRNA is used in the method.
In one embodiment the DNA encoding the endonuclease Cas protein that interacts and can complex with the gRNA. In another embodiment it is a codon-optimized version of the Cas protein, optimized for the respective eukaryotic cell. In one embodiment the Cas protein is human codon optimized.
The invention contemplates the use of any Cas protein capable of complexing with a gRNA to target a polynucleotide. In certain embodiments, the Cas protein may be a derivative of a naturally occurring Cas protein. The term “derivative” encompasses amino acid sequence variants of a polypeptide, covalent modifications, and fusions thereof. Suitable derivates of Cas polypeptide or a fragment thereof include but are not limited to fusions, mutants, and covalent modifications of a Cas protein or a fragment thereof. A Cas protein, which includes a derivative of a Cas protein, may be obtained from a cell, synthesized chemically, or by a combination of these procedures. The cell may naturally produce Cas protein, or may not naturally produce Cas protein but be engineered to produce Cas protein by one or more exogenously introduced nucleic acids. In other cases the cell may naturally produce Cas protein and be genetically engineered to produce the endogenous Cas protein at a higher expression level than naturally occurs. Non-limiting examples of Cas proteins and complexes which may be used with the present invention include forms of Cas9, Csm/Cas10, Cas3, and one or more proteins of the CRISPR-associated complex for antiviral defense (Cascade), including Cse1, Cse2, Cas7, Cas5, Cas6e, Csy1, Csy2, Csy3, and Cas6f. In some embodiments, multiple activated Cas proteins of a Cas protein complex are used in combination with one or more gRNAs.
The invention contemplates any construct that is capable of containing a coding sequence for Cas9. In one embodiment the vector is selected from the group of vectors consisting of: plasmids, rAAV, and lentiviral particles.
In one embodiment the nucleic acid encoding the single gRNA and the endonuclease Cas protein is DNA. In one embodiment they are on one vector. In another embodiment the DNA encoding the gRNA and the Cas protein are on different vectors. In one embodiment the vector comprises more than one copy of the nucleic acid encoding the gRNA and/or the endonuclease.
In one embodiment components (i) and (ii) of the method noted above are co-transfected at the same time in one composition or proximal in time in different compositions.
In yet another embodiment the components (i) and (ii) of the method noted above are inserted into a suitable delivery system or vehicle, alone or together, such as an adenovirus for transfection of the mammalian cell, rAAV, lentivirus, or nanoparticles.
In yet another embodiment the method contemplates the exogenous expression and isolation of the Cas fusion protein as herein described complexed in vitro with the gRNA as also herein described, and using a composition comprising said complex in a suitable delivery vehicle for transfecting same into the eukaryotic cell.
In one embodiment the eukaryotic cell is selected from the group of cells from humans, mice, dogs, and other species as desired. In one embodiment it is a mammalian cell. In another embodiment the mammalian cell is selected from the group of mammalian cells consisting of: human, mouse and dogs. In one embodiment the cell is a human cell.
In another embodiment the cells are mammalian cells selected from the group consisting of:
oocytes, myoblasts, myocytes and cardiomyocytes.
In one embodiment the invention comprises a cell transfected using the methods of the invention described herein.
The invention can be used in the treatment or therapy of conditions that are caused by replicative or duplicate genetic material such as CNVs, such as in certain patients with DMD or MECP2 duplication syndrome.
Muscular dystrophy is a group of muscle disorders which weaken the musculoskeletal system and impair locomotion. Muscular dystrophies are characterized by defects in muscle proteins, death of muscle cells and tissues, and progressive impairment of muscle movement. Patients severely affected by muscular dystrophy may have cognitive impairment, behavioural, vision, and speech problems (59). Muscular dystrophies are generally inherited and are associated with mutations in different genes generally encoding muscle effector proteins. There are many forms of muscular dystrophy including but not limited to Becker, limb-girdle, congenital, facioscapulohumeral, myotonic, oculopharyngeal, distal, and Emery-Dreifuss muscular dystrophy.
Duchenne muscular dystrophy (DMD) is the most common form of muscular dystrophy and is inherited in an X-linked autosomal recessive form. The disease is caused by mutations in the dystrophin gene (Dmd) that result in a lack of functional dystrophin. In some patients it is caused by duplicate genetic material.
A method of the invention contemplates the use of a single guide and an endonuclease such as gRNAs and Cas fusion proteins described herein in the treatment of a condition in a mammal that can benefit from removal of duplicate genetic material, such as DMD, the method comprising administering to said mammal an effective amount of:
wherein the nucleic acids of (i) and (ii) are incorporated into a vector (either a single vector or separate vectors) and in a delivery vehicle, as applicable, suitable for delivering same to the eukaryotic cell of the mammal in a manner that enables the cell to be transfected with said nucleic acids and express same once transfected.
In another embodiment of the invention, the method comprises administering to a subject in need cells that express or comprise the gRNA and Cas fusion proteins of the invention. In one embodiment, the method comprises harvesting cells from the patients, transfecting the cells in accordance with the present invention to obtain cells wherein expression of the duplicate gene to be restored to wild type expression and administering said modified cells to said patient.
In another embodiment the patient is monitored for expression of the gene in question. In another embodiment the method of the invention is repeated as needed for said patient.
In one embodiment the eukaryotic cells are myoblasts or myocytes.
In one embodiment the gRNA and the Cas fusion proteins of the invention or nucleic acids encoding same can be in one composition or multiple compositions. In another embodiment only one gRNA is used. In another embodiment the compositions of the invention comprise a suitable pharmaceutical acceptable carrier.
In one embodiment the invention provides a composition comprising:
(a) one or more of the following:
(b) optionally a pharmaceutically acceptable carrier.
In another embodiment the invention provides a kit comprising one or more compositions of said invention, and optionally instructions for their use in carrying out any one of the methods of the invention.
An advantage of the present invention is that it can be implemented to treat or prevent conditions caused by duplicate genetic material such as CNVs using only one single guide or gRNA (i.e. as opposed to more than one single guide RNA that has different targets). This makes it easier and less complex solution then using, designing and making multiple guides.
The guides and endonucleases, such as the single gRNAs and Cas fusion proteins of the invention and/or the nucleic acids encoding same can be administered by any means that produce contact of the gRNAs and Cas fusion proteins with the target elements of the cell to cut the target genetic material in vitro or at the desired sites of action in the body of a patient to produce a therapeutic effect, in particular a beneficial effect, and in one embodiment a sustained beneficial effect. The compositions of the invention comprising the one or more nucleic acids encoding the gRNAs, the nucleic acid encoding the Cas fusion proteins of the invention, and/or the Cas fusion proteins complexed with the gRNA can be administered simultaneously or sequentially and in any order at different points in time to provide the desired beneficial effects. A compound and composition of the invention can be formulated for sustained release, for delivery locally or systemically. It lies with the capability of a skilled person, such as a physician or veterinarian to select a form and route of administration that optimizes the effects of the compositions and treatments of the present invention to provide therapeutic effects, in particular beneficial effects.
In one aspect the invention includes administration of the Cas fusion protein and/or gRNA or nucleic acids encoding same to the site of action—directly or through a mode of delivery (e.g. sustained release formulations, delivery vehicles, such as liposomes) that results in delivery or site-directed delivery of the guide and endonuclease (peptide, gRNA or encoding nucleic acids for the guide and endonuclease) to a particular cell or site in the body. This would include but is not limited to the use of a polynucleotide encoding an endonuclease (such as Cas or Cas fusion protei and/or a guide (such as gRNA), e.g. via gene therapy or through an expression system in vitro, ex vivo or in vivo, as the case may be that results in expression of the endonuclease and guide, e.g., Cas fusion protein and/or gRNA, and subsequent creation of the endonuclease/guide/target complex.
The above described substances including Cas fusion protein(s), gRNAs, and nucleic acids encoding these substances may be formulated into pharmaceutical compositions for administration to subjects in a biologically compatible form suitable for administration in vivo. By “biologically compatible form suitable for administration in vivo” is meant a form of the substance to be administered in which any toxic effects are outweighed by the therapeutic effects. The substances may be administered to living organisms including humans, and animals.
Thus in one embodiment, the invention provides the use of Cas fusion protein and/or gRNAs and/or nucleic acids encoding same in the preparation of a medicament for the treatment of muscular dystrophy, such as Duchenne muscular dystrophy in that portion of patients whose disease is caused by duplicate genetic material. In one embodiment, a therapeutically effective amount of same or a pharmaceutical composition as described herein is administered to a patient in need thereof. A patient in need thereof is any animal, in one embodiment a human, that may benefit from the effect of these substances in the treatment of a condition that may benefit from removal of duplicate genetic material.
The compositions of the present invention may be administered in a convenient manner such as by injection (subcutaneous, intravenous, etc.) or oral administration. Depending on the route of administration, the active substance may be coated in a material to protect the components of the composition from the action of enzymes, acids and other natural conditions that may inactivate same. In one embodiment, the compositions of the invention are administered directly or proximate to the desired site of action, by injection or by intravenous administration.
As will also be appreciated by those skilled, administration of substances described herein may be by an inactive viral carrier, such as AAV. In one embodiment the AAV is naturally occurring. In one embodiment the AAV is selected from the group consisting of AAV6, AAV8, and AAV9. In one embodiment the AAV is recombinant. In one embodiment the AAV is a DJ serotype. In one embodiment a Cas fusion protein and/or gRNA and/or a nucleic acid encoding same can be administered in a vehicle comprising saline and acetic acid.
As such, in one embodiment, systemic injection of viral particles (AAV viral particles) comprising the vector encoding the Cas9 and the single guide RNA for transcription in vivo in tissue can be used.
Further, in one embodiment, a Cas fusion protein and/or gRNA, nucleic acids encoding same, vectors comprising same, and vehicles comprising any of the foregoing, may be administered in a form that is conjugated to another molecule or compound, including a peptide, to facilitate delivery to a desired site, or in a vehicle, e.g. a liposome or other vehicle or carrier for delivery.
To develop guide RNAs, each target sequence (about 18 to 20 bp) is followed by an S. pyogenes Cas9 specific proto-spacer adjacent motif (PAM) sequence (NGG). If the guide sequence did not begin with a 5′-G, a G nucleotide was added to the primer to optimize gRNA expression.
DNA sequences encoding the guide RNAs were inserted into vectors such as plasmids, for subsequent transfection into the target cells.
This construct can be incorporated into AAV8 viral particles which can be systemically injected in vivo in tissue to be taken up by the cells and affect genetic editing in vivo.
The following protocol can be used to study effects of systemic rAAV administration:
Taken together, these experiments can be used to evaluate rAAV-mediated genetic editing and whether it provides therapeutic benefit.
In one aspect genetically engineered animals comprising duplicative CNVs can be generated which are useful as animal models for a particular disorder related to the duplicative CNV.
The method comprises generating guides, in one embodiment at least two guides, in another embodiment four guides, towards either end of the intended duplication that are delivered together with the endonuclease to embryonic stem cells (e.g. by incorporating the coding sequence of the guides and endonuclease into plasmids using the same plasmid or separate plasmids) and electroporation into the cell or by other means), selecting successful clones that have incorporated the duplication and expanding same. Aggregating the clones into a blastocyst of a pregnant foster animal such as a mouse and crossing the resulting chimeras to establish a germline transmission of the genetically engineered mouse with the duplication. In one embodiment a dup18-30Dmd mouse was generated using four sgRNAs and the CRISPR/cas9 system. In one embodiment the guide is a nucleic acid, such as RNA and its coding sequence is incorporated into a plasmid, phage or other vehicle for expression in the cell. In one embodiment, the endonuclease can be delivered to the cell as a protein through suitable delivery means, as long as the guide and endonuclease are present together within the cell to enable it to form an endonuclease/guide complex and endonuclease/guide/target complex for creation of the duplication within the cell and incorporation of same within the genome of the cell.
As such, the invention also provides constructs for forming said genetically engineered animal comprising plasmid, phage, cosmid or other vehicle encoding a desired guide and a vehicle, the same or different as the one encoding the guide, comprising the coding sequence for the endonuclease, wherein the vehicle once administered to the cell expresses the guide and endonuclease and results in formation of the endonuclease/guide and endonuclease/guide/target complex. In another embodiment, the invention comprises compositions comprising the vehicle or vehicles expressing the guide and endonuclease. In another embodiment the invention provides a composition comprising the guide/endonuclease complex for administration to the cell. Such a complex can be delivered to the cell through a suitable delivery vehicle such as a liposome or the like. In another embodiment the invention provides kits for forming a genetically engineered mouse comprising one or more of the constructs or complexes required to generate the desired animal and optionally instructions for same.
The invention also provides methods and uses of said animal models and constructs in the formation of new genetically engineered animals, in research, drug and treatment design, including guide design and endonuclease selection and optimization, wherein the effects of any of the foregoing can be monitored by the effect on the animal, the restoration of function where there was dysfunction, and to study effects on the genome, expression of peptides or genetic material and the like.
The present invention is described in the following Examples, which are set forth to aid in the understanding of the invention, and should not be construed to limit in any way the scope of the invention as defined in the claims which follow thereafter.
Here, the inventors have established a pipeline using easily accessible patient cell lines as a proof of the methodology and tools of the present invention and to show that an engineered targeted endonuclease technology (endonuclease together with one guide) can be used to target replicative genetic material, such as a large tandem X-chromosomal duplication including the MECP2 gene and a large duplication of the dystrophin gene in muscle cells of a patient with Duchenne Muscular Dystrophy (DMD), to remove the duplication and restore wild-type function of the affected gene.
More specifically, the examples shows the use of CRISPR technology for large genomic copy number variations (CNV's)(50, 51), however, the invention is not limited to the use of CRISPR technology or the Cas9 endonuclease, it is used as an example. The examples show that one can utilize the CRISPR/Cas9 system to successfully remove a large 278 kb tandem duplication in fibroblasts of a patient with MECP2 duplication syndrome. Furthermore, it is demonstrated that a modified CRISPR/Cas9 strategy involving a single guide approach successfully removes a 145 kb (exons 18-30) duplication in the DMD gene leading to restoration of full-length dystrophin expression in patient myotubes. This is illustrative and proof that the methods of the claim to remove a replicative genetic material or nucleotide sequence using an engineered targeted endonuclease technology works.
Taken together, the studies show that a targeted engineered endonuclease technology such as the CRISPR/Cas9 system can be adapted to target a variety of inherited disorders caused by large genomic rearrangements.
Cell Culture. Fibroblasts from a healthy individual and a patient harbouring a 18-30 exon duplication in DMD (Patient 1) and MECP2 duplication syndrome (Patient 2) were obtained from skin biopsies (skin tags) and established at SickKids pathology laboratory. They were maintained in High Glucose Dulbecco's Modified Eagle's medium (DMEM) (Life Technologies) supplemented with 10% FBS (Life Technologies), L-Glutamine (Life Technologies), 1X penicillin/streptomycin (Life Technologies) at 37° C. with 5% CO2 incubation. The research ethics boards of each institution approved all of the experiments.
Duplication Junction Mapping. A series of probes near the junction were designed and qRT-PCR followed by sequencing was used to map out the exact break point of the duplication. MECP2 duplication forward primer 5′-CCCACAGAGTAGAGTGGAGCAG-3′ (P1, mecp2 dup common) and reverse 5′-TTAGACAGAGTCTCACTCCATCACC-3′(P3, mecp2 dup out). DMD Dup 18-30 primer forward 5′-CAGCATCATGACCTGTTTCAATC-3′ (P1, dup 18 -30 common f) and reverse 5′-TTGTTAGAGGGCAGCAAGTTTGT-3′ (P3, dup18-30 bound r). (See Table 2 and
sgRNA Design. All intronic regions within the duplication were analyzed to find the computationally predicted the most active sgRNAs (23). All sgRNAs with a predicted activity score greater than 0.75 were next analyzed using CRISPR Design Tool (26) and ranked according to the least possible number of potential off-target sites. The 3 best predicted sgRNAs (Table 1, also see Table 3 where the extended table of off target analysis is provided) were then subcloned into the Cas9 nuclease plasmid pSpCas9(BB)-2A-GFP (PX458) (Addgene #48138) (46) or LentiCRISPR 2 vector (Addgene #52961) (48). Each plasmid contained a single locus-specific sgRNA in conjunction with SpCas9.
In one method, fibroblasts can be nucleoporated for instance, using a total of 3 μg of DNA using program U-023 on the Amaxa system and the Primary Fibroblast Kit (Lonza). Cells can then be Fluorescence-activated cell sorted 4 days after nucleoporation and DNA was collected on the same day.
Lentivirus production. Lentiviral particles were generated by transfecting 293T-AAV cells (Agilent) with 10 ug of lentiCRISPR 2 plasmid containing the indicated guides, 7.5 ug of psPAX2 (Addgene #12260), and 5 ug pCMV-VSV-G (Addgene #8454). Viral supernatants were collected 60 hours post transfection, centrifuged at 3000 rpm for 10 minutes, and filtered through 0.45 um low-binding filter (48).
Transduction. Two duplication removal strategies were tested. In the first strategy, Cas9 is guided by a single sgRNA, and in the second two different guides one is specific to the junction of the duplication, and the other is just outside of the duplication (
10 ng of DNA was amplified using Multiplex PCR mix (Qiagen) using three different primers. Primers 1 and 3 (P1, P3) are universal to the region of interest, whereas Primer 2 (P2) is specific to the junction of the duplicated region. PCR amplification using combination of P1, P2, and P3 in patient DNA will results in two bands, which correspond to P1-P2- and P2-P3-derived amplicons, respectively. In contrast, only P1-P2-derived amplicon will be observed in healthy control. Primer sequences are shown in Table 2.
The cells were lysed using RIPA buffer (50 mM Tris HCl pH 7.4, 150 nM NaCl, 1 mM EDTA, 1% deoxycholate, 1% NP40 and 1% Triton X-100 supplemented with Phosphatase and Protease inhibitor cocktails (Roche) and the protein concentration was measured using the BCA assay. 25 μg of of protein lysates were resolved on 3-8% Tris-acetate gels (Novex, Invitrogen) and transferred onto nitrocellulose membranes overnight. The membrane was then incubated in Ponceau S to check for equal loading, blocked in 5% milk/TBST for 1 hour, and probed for dystrophin (MAB1692, Millipore) and β-tubulin (Cat # 05-661, Millipore) overnight at the concentration of 1:200 and 1:2500, respectively. Secondary anti-mouse and -rabbit IgG HRP antibodies were used at the concentration of 1:2500. The signal was detected using SuperSignal West Femto ECL at 1:5 dilution (Life Technologies) and imaged using Bio-Rad Gel Doc imaging system and probed for dystrophin (MAB1692, Millipore), β-dystroglycan (MANDAG clone 7D11, DSHB), α-dystroglycan (kindly provided by Kevin Campbell) and β-tubulin (SantaCruz).
Off-target analysis was conducted as follows for all lenti-based delivery gene editing experiments. Primers targeting loci corresponding to each sgRNA's top 20 off-target hits, as computed by the CRISPR Design Tool (26), were designed and used to amplify DNA from each gene editing experiments using GeneRead DNAseq Targeted Panels (Qiagen). ˜200 bp amplicons were purified using magnetic beads and library preparation was conducted at The Centre for Applied Genomics (TCAG) at The Hospital for Sick Kids with sample-specific barcodes using the IonTorrent Library preparation kit (Life Technologies). Sequencing was performed using the Ion Torrent Proton. Each potential off-target site was evaluated after aligning corresponding sequencing reads to the Human reference genome (hg19). The proportion of reads that match the reference genome versus those with insertions, deletions and substitution near predicted cleavage sites will be used to estimate the off-target activity of a corresponding single sgRNA.
Given the successful removal of a large chromosomal duplication as per Example 4 below, a similar, single sgRNA approach was applied to large exonic duplications in the DMD gene. Duplications of one or more exons comprise approximately 10% of the DMD mutation spectrum (27).
In DMD patient 1, the inventors determined that the exons 18-30 duplication is a chrX:32,552,206-32,413,149 (hg19) direct tandem repeat in head to tail orientation and sequencing of the duplication junction revealed that introns 17 and 30 are joined together via AAAT insertion (
Three days later, puromycin was added to enrich for cells containing the lentiCRISPR 2-guide constructs and DNA was collected 7 days post puromycin selection. The cells were further differentiated towards myotubes and proteins were collected 7 days post-differentiation. The experiment was performed in triplicate. To determine if there was evidence of duplication removal on a molecular level a three-way PCR strategy outlined in
As seen in
The data demonstrate that CRISPR/Cas9-mediated removal of duplications leads to restoration of full-length protein expression in myotubes, which for the first time opens up entirely new treatment strategies for DMD patients with duplications.
Using a similar approach it was demonstrated that the approach could be a therapeutic option for patients with genetic disorders other than DMD that also arise from duplication mutations, namely the MECP2 duplication syndrome. Methyl CpG Binding Protein 2 (MECP2) duplication syndrome, is a rare condition associated with intellectual disability and macrocephaly. It is caused by a variably sized CNV on the X-chromosome that includes a duplication of the MECP2 gene.
In order to explore whether the CRISPR/Cas9 technology could be utilized to target this CNV, it was first determined the exact orientation and breakpoint junction sequence of the duplication in a male patient with this disorder to be chrX:153,420,649-153,142,419 (hg19) duplication with nucleotides CA inserted at the breakpoint junction (
sgRNAs were designed and selected based on the fewest predicted off-target sites, designated as guides 75 and 80. These target regions outside of any known genes or regulatory elements in a 278 kb X-chromosomal duplication that included the MECP2 gene (Table 3). As a proof-of-concept, this 114 kb fragment was deleted by co-nucleofecting Cas9 nuclease plasmids (Addgene #48138) containing the sgRNA guides (
Knowing that the guides were active Strategy 1 was employed, as previously tested in DMD duplication removal, to primary dermal fibroblasts from a male patient with MECP2 duplication syndrome. These cells were transduced with lentiCRISPR containing guide 75 or guide 80 (sgRNA 1 or 2), and DNA was collected at various time points during puromycin selections (
The results establish that a single sgRNA approach provides a novel, highly efficient therapeutic strategy to remove chromosomal duplications that can be explored for a number of different disorders caused by CNVs.
The following examples were conducted to test whether the duplication removal strategy is suitable for in vivo applications.
I. Generation of Dmd Exons 1840 Duplication (dup18-30 Dmd) Mouse Model
The dup18-30 Dmd mouse model was generated using 4 sgRNAs, 2 towards either end of the intended duplication, in intron 17 (i17-1 and i-17-2) and intron 30 (i30-1 and i-30-2) (
II. In Vivo sgRNA Design and Testing
The guides targeting the exons18-30 duplication were designed based on a comultative ranking of the least possible number of potential off-target sites as analyzed by the CRISPR Design tool (28) and predicted activity according to Doench et al. (64) The best predicted sgRNAs (Table 5) were then subcloned into the lentiCRISPR v.2 vector (Addgene 52961)(48).
To experimentally test for the most active guide Neuro-2A cells were transfected with each plasmid using Lipofectamine and DNA collected 3 days later. Cell lysis, genomic PCR, and detection of cleavage were conducted using a GeneArt Genomic Cleavage Detection Kit (Life Technologies) according to the manufacturer's instructions. Briefly, 50 μl of cell lysis buffer and 2 μl of protein degrader were added to cell pellets and lysed in a thermal cycler (68° C. for 15 min and 95° C. for 10 min), genomic PCR was carried out using 1 μl of cell lysates and the following primers: for screening sgRNA i21 primers i21F 5′-AGGATTGCAGATTGCTTCAG-3′ (SEQ. ID. NO. 27) and i21R 5′-GGTGGAGAGAAACCAGATGC-3′ (SEQ. ID. NO. 28) were used, sgRNA i26: i26 F 5′-CATTTCACTGCTCTAGTTTTAATCCTG-3′(SEQ. ID. NO. 29) and i26R 5′-AAACACGCTTGAACTCAAATGCTAC-3′(SEQ. ID. NO. 30), sgRNA i27F: 5′-AGTGAGGTGCTCTATGGGAAATG-3′(SEQ. ID. NO. 32) and i27R 5′-CCATTTAAGAGTGGGTAACCAAGG-3′(SEQ. ID. NO. 31). The PCR products were subjected to re-annealing and a cleavage assay according to the manufacturer's instructions and then analyzed by electrophoresis in 2% agarose gels and ethidium bromide staining. Guide i26 was chosen for in vivo experimentation as it showed the highest degree of cleavage activity.
III. Lentiviral Vector Production
10 cm petri dish of 293T cells (ATCC) at 80% confluency were transfected with 10 μg of transfer LentiCRISPR plasmid containing GFP or sgRNA i26; 5 μg of the envelope (pCMV-VSV-G) (Addgene #8454) plasmid; and 7.5 pg of packaging (psPAX2) (Addgene 12260) plasmid using Calcium Phosphate transfection method. 60 hours post transfection, supernatant was collected, centrifuged at 3000 rpm for 10 minutes and filtered through 0.45 um low-binding filter (Whatman). Viral particles were aliquoted and stored in −80° C. until further use. The LentiCRISPR plasmid also comprisesSpCas9, as such the virus comprises both i26 gRNA and SpCas9 coding regions (SEQ. ID. NO. 25)
IV. Viral Injection
3 months old dup18-30 Dmd mice were anesthetized with isofluorane and injected via intramuscular route with 40 μl of viral particles into the right tibialis anterior (TA). The injection was repeated three days later, and the mice were euthanized 7 days after the last injection. Non-injected, contralateral TA muscles serve as controls.
V. Tissue Isolation and Cryosectioning
TA muscles were isolated and placed on cryomold in an upward orientation. The tissues were then snap frozen in liquid nitrogen-chilled isopentane for 2 minutes, wrapped in tin foil and stored at −80° C. freezer until further use. 8 μm thin cryosections were prepared from three different areas representing proximal, medial and distal (relative to the tendon) areas of the muscles. The slides were stored in −80° C. until further use.
VI. Immunofluorescence Staining for Dystrophin
Cryosectioned muscles were fixed in ice-cold methanol for 10 minutes and blocked with goat blocking buffer containing 3% BSA, 1% normal goat serum, and 0.3% Triton X-100 for 1 hour. The slides were then stained with undiluted mouse monoclonal antibody against dystrophin C-terminus (Novocastra, NCL-DYS2) for 1 hour at room temperature. Alexa 488-conjugated goat-anti mouse antibody (Thermo Fisher) was used as secondary reagent at a concentration of 1:500 for 30 minutes and nuclei were counter-stained with DAPI (Thermo Fisher) at a concentration of 2 ng/μl for 15 minutes. The slides were sealed with coverslips and Prolong Gold mounting media (Thermo Fisher). Images were taken using Zeiss Epifluorescence inverted microscope and analyzed using Volocity software.
VII. Results
After tissue collection, cross sections of the right TA muscle (injected) and left TA muscle (non-injected) were stained with dystrophin specific antibodies. Dystrophin immunoreactivity was only detected in the right TA muscles indicating that the duplication removal strategy is successful and leads to the re-expression of dystrophin protein in treated muscles (
More particularly,
8 μm-cross section of tibialis anterior (TA) muscle isolated from wildtype (C57BI/6J) and Dmd dup18-30 mice were stained for expression of dystrophin and counterstained with DAPI to identify the nuclei. Expression of dystrophin is detected in the right TA (TA-R) of Dmd dup18-30 mice (
As such, the in vivo tests show that the methods of the invention can be used in vivo as illustrated in
ATATCTTCTTAAATACCCGA (SEQ.
GGG
GCTTGGCCATCTAAGTTTA (SEQ.
CGG
GAGTTGTTTGGGTTAAACC (SEQ.
TGG
This application claims priority from United States Provisional Patent Application, U.S. 62/212/934, filed Sep. 1, 2015, the entirety of which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2016/051041 | 9/1/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62212934 | Sep 2015 | US |