Retinal degenerations are a group of disorders which include Leber's congenital amaurosis (LCA), retinitis pigmentosa (RP), and glaucoma, among others. LCA is a heritable form of retinal degeneration characterized by severe retinal dysfunction and severe visual impairment during the first months of life. LCA is an orphan disease (one that affects fewer than 200,000 Americans), but the 18 subtypes of LCA are together the most common cause of inherited blindness. The subtype designated LCA10, which is the most common subtype, accounting for >20% of all LCA cases. Some forms of LCA are amenable to treatment by recombinant adeno-associated viruses (AAVs) engineered to deliver a functional copy of the defective cellular gene. In 2008, a transgene that complemented the mutation in RPE65 was successfully delivered by AAV to LCA2 patients in a Phase I Clinical trial (Maguire A M et al. N Engl J Med. 2008; 358(21): 2240-2248). Some responses were noted, but these were not durable because transgene expression was eventually lost (Schimmer J et al. Hum Gene Ther Clin Dev. 2015; 26(4): 208-210; Azvolinsky A. Nat Biotechnol. 2015; 33(7): 678-678). Furthermore, some of the genes that cause the different LCA subtypes are simply too large for AAV delivery. These subtypes of LCA therefore remain untreatable.
The ADRP constitutes approximately 30-40% of all cases of RP, and among ADRP patients the most commonly mutated RP associated gene is the one that encodes the rod visual pigment rhodopsin (Dryja, T. P. et al. The New England journal of medicine 323, 1302-1307 (1990); Dryja, T. P. et al. Nature 343, 364-366 (1990)). At the moment, there are no FDA approved treatments for ADRP patients; however, a number of approaches are being developed. Most of these approaches are variations on the theme of “suppression and replacement.” In this approach, one knocks down expression of the gene responsible for degeneration, for example knocking down levels of rhodopsin RNA with a ribozyme or via RNA interference (RNAi) (both shRNAs and siRNA methodologies are being explored), and then replaces expression of the endogenous alleles with a “hardened” gene that is not susceptible to knock down by the ribozyme or RNAi agent. The variant of this theme that is perhaps closest to the clinic is the RhoNova agent being developed by Genable Technologies Limited. RhoNova employs an siRNA to knock down endogenous rhodopsin expression (both mutant and wild-type) combined with an AAV-delivered cDNA that encodes a modified but functional rhodopsin that is not susceptible to siRNA knock down (http://www.genable.net).
Glaucoma, the leading cause of irreversible blindness worldwide (Levkovitch-Verbin H et al. iovsorg 44, 3388-3393 (2003)), is an optic neuropathy in which progressive damage of retinal ganglion cell (RGC) axons at the lamina cribosa of the optic nerve head leads to axon degeneration and cell death (Howell G R et al. J Cell Biol 179, 1523-1537 (2007)). Currently, the only treatment, whether by eye drops, lasers or incisional surgery, is to lower intraocular pressure (IOP) and reduce the injury at the optic nerve head. Unfortunately, this is difficult in some patients while in others, the disease can continue to worsens despite aggressive IOP-lowering. The field has long needed an alternative therapeutic strategy that could complement IOP-lowering by mitigating the RGC response to residual axon injury. Moreover, the NEI has listed optic nerve regeneration amongst its Audacious Goals, and any regenerative therapy necessarily needs to tackle the issue of axotomized RGC survival. To this end, there is a great need to develop a neuroprotective that might directly interfere with the active genetic programs of RGC axon degeneration and/or axon injury-related cell death (Adalbert R et al. Science (2012), doi:10.1126/science.1223899; Yang J et al. Cell 160, 161-176 (2015); Welsbie D S et al. Proc Nat Acad Sci USA 110, 4045-4050 (2013); Watkins T A et al. Proc Nat Acad Sci USA 110, 4039-4044 (2013)).
Thus there is a great need for novel and improved therapies for treating retinal degenerations, like LCA, ADRP, and glaucoma.
The practice of the present invention will typically employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant nucleic acid (e.g., DNA) technology, immunology, and RNA interference (RNAi) which are within the skill of the art. Non-limiting descriptions of certain of these techniques are found in the following publications: Ausubel, F., et al., (eds.), Current Protocols in Molecular Biology, Current Protocols in Immunology, Current Protocols in Protein Science, and Current Protocols in Cell Biology, all John Wiley & Sons, N.Y., edition as of December 2008; Sambrook, Russell, and Sambrook, Molecular Cloning. A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2001; Harlow, E. and Lane, D., Antibodies—A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1988; Freshney, R. I., “Culture of Animal Cells, A Manual of Basic Technique”, 5th ed., John Wiley & Sons, Hoboken, N.J., 2005. Non-limiting information regarding therapeutic agents and human diseases is found in Goodman and Gilman's The Pharmacological Basis of Therapeutics, 11th Ed., McGraw Hill, 2005, Katzung, B. (ed.) Basic and Clinical Pharmacology, McGraw-Hill/Appleton & Lange 10thed. (2006) or 11th edition (July 2009). Non-limiting information regarding genes and genetic disorders is found in McKusick, V. A.: Mendelian Inheritance in Man. A Catalog of Human Genes and Genetic Disorders. Baltimore: Johns Hopkins University Press, 1998 (12th edition) or the more recent online database: Online Mendelian Inheritance in Man, OMIM™. McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University (Baltimore, Md.) and National Center for Biotechnology Information, National Library of Medicine (Bethesda, Md.), as of May 1, 2010, available on the World Wide Web: http://www.ncbi.nlm.nih.gov/omim/ and in Online Mendelian Inheritance in Animals (OMIA), a database of genes, inherited disorders and traits in animal species (other than human and mouse), available on the World Wide Web: http://omia.angis.org.au/contact.shtml. All patents, patent applications, and other publications (e.g., scientific articles, books, websites, and databases) mentioned herein are incorporated by reference in their entirety. In case of a conflict between the specification and any of the incorporated references, the specification (including any amendments thereof, which may be based on an incorporated reference), shall control. Standard art-accepted meanings of terms are used herein unless indicated otherwise. Standard abbreviations for various terms are used herein.
Described herein are methods for treating retinal degenerations, such as optic neuropathies including Leper's congenital amaurosis (e.g., Leber's congenital amaurosis 10 CEP290 mutation (LCA)), retinitis pigmentosa (e.g., Rhodopsin R135 mutations), or glaucoma. The methods use a modified nuclease system, such as Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) (e.g. CRISPR associated (Cas) 9 (CRISPR-Cas9, non-Cas9 CRISPR systems, CRISPR-Cpf-1 system, and the like), to cut and/or repair genomic DNA or RNA (e.g., Cas13a/C2c2 system). The CRISPR-system-based gene editing can be used to inactivate or correct gene mutations causing optic neuropathies and retinal degenerations (e.g., LCA and rhodopsin mutations), thereby providing a gene therapy approach for these groups of diseases. In some embodiments, the CRISPR system is used to introduce a mutation that will inactivate a normal gene (e.g. DLK and/or LZK) causing retinal degeneration (e.g. glaucoma). Because these genes play roles in damage-sensing and cell-survival, the resulting effect is cell survival. The “neuroprotective” approach is not a mutually exclusive approach as there are genetic mutations that could lead to glaucoma as well, and these would be the same as the retinal degenerations. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS.
Thus, one aspect of the invention relates to a method for treating a disorder (e.g., retinal degenerations) affecting a retina area of a subject, the method comprising administering to the retina area of the subject a therapeutically effective amount of a nuclease system comprising a genome targeted nuclease and a guide DNA comprising at least one targeted genomic sequence.
Another aspect of the invention provides methods for treating retinal degenerations utilize a composition comprising a modification of a non-naturally occurring CRISPR system previously described in WO2015/195621 (herein incorporated by reference in its entirety). Such a modification uses certain gRNAs that target retinal degeneration-related genes, such as, but not limited, to LCA10 CEP290 gene, rhodopsin, Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA. In some embodiments, the composition comprises (a) a non-naturally occurring nuclease system (e.g., CRISPR) comprising one or more vectors comprising: i) a promoter (e.g., bidirectional H1 promoter) operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease (e.g., Cas9 protein), wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression of the one or more gene products. In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle. In some embodiments, the promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease.
Another aspect of the invention provides methods of altering expression of one or more gene products in a eukaryotic cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a modified non-naturally occurring CRISPR system previously described in WO2015/195621 (herein incorporated by reference in its entirety). Such a modification uses certain gRNAs that target retinal degeneration-related genes, such as, but not limited, to LCA10 CEP290 gene, rhodopsin, Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA. In some embodiments, the method comprising introducing into the cell a composition comprising (a) a non-naturally occurring nuclease system (e.g., CRISPR) comprising one or more vectors comprising: i) a promoter (e.g., bidirectional H1 promoter) operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease (e.g., Cas9 protein), wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression of the one or more gene products. In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle. In some embodiments, the promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease.
One aspect of the invention, relates to a method for treating a retinal degeneration in a subject in need thereof, the method comprising: (a) providing a non-naturally occurring nuclease system comprising one or more vectors comprising: i) a promoter operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease, wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression of the one or more gene products; and (b) administering to the retinal area of the subject a therapeutically effective amount of the system.
In some embodiments, the system is CRISPR.
In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle.
In some embodiments, the system inactivates one or more gene products.
In some embodiments, the nuclease system excises at least one gene mutation.
In some embodiments, the promoter comprises a bidirectional promoter. In some embodiments, the promoter comprises a nucleotide sequence having at least 80%, 85%, 90%, 95%, 98%, 99%, or 100% identity to the nucleotide sequence selected from the group consisting of SEQ ID NOs: 739-787. In some embodiments, the promoter comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 739-787.
In some embodiments, the bidirectional promoter is H1 (SEQ ID NO: 787). The H1 promoter is both a pol II and pol III promoter. In some embodiments, the promoter is orthologous to the H1 promoter.
In some embodiments, the orthologous H1 promoter is derived from eutherian mammals.
In some embodiments, the orthologous H1 promoter is derived from Ailuropoda melanoleuca, Bos taurus, Callithrix jacchus, Canis familiaris, Cavia porcellus, Chlorocebus sabaeus, Choloepus hofmanni, Dasypus novemcinctus, Dipodomys ordii, Equus caballus, Erinaceus europaeus, Felis catus, Gorilla gorilla, Homo sapiens, Ictidomys tridecemlineatus, Loxodonta africana, Macaca mulatta, Mus musculus, Mustela putorius furo, Myotis lucifugus, Nomascus leucogenys, Ochotona princeps, Oryctolagus cuniculus, Otolemur garnettii, Ovis aries, Pan troglodytes, Papio anubis, Pongo abelii, Procavia capensis, Pteropus vampyrus, Rattus norvegicus, Sus scrofa, Tarsius syrichta, Tupaia belangeri, Tursiops truncatus, Vicugna pacos.
In some embodiments, the orthologous H1 promoter is derived from mouse or rat.
In some embodiments, the orthologous H1 promoter comprises a nucleotide sequence having at least 80%, 85%, 90%, 95%, 98%, 99%, or 100% identity to the nucleotide sequence set forth in SEQ ID NOs: 752-786.
In some embodiments, the orthologous H1 promoter comprises a nucleotide sequences set forth in the group consisting of SEQ ID NOs: 752-786.
In some embodiments, the H1 promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease.
In some embodiments, the genome-targeted nuclease is Cas9 protein.
In some embodiments, the Cas9 protein is codon optimized for expression in the cell.
In some embodiments, the promoter is operably linked to at least one, two, three, four, five, six, seven, eight, nine, or ten gRNA.
In some embodiments, the retinal area is the retina.
In some embodiments, the cell is a retinal photoreceptor cell.
In some embodiments, the cell is a retinal ganglion cell.
In some embodiments, the retinal degeneration is selected from the group consisting of Leber's congenital amaurosis (LCA), retinitis pigmentosa (RP), and glaucoma.
In some embodiments, the retinal degeneration is LCA1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18.
In some embodiments, the retinal degeneration is LCA10.
In some embodiments, the target sequence is in the LCA10 CEP290 gene.
In some embodiments, the target sequence is a mutation in the CEP290 gene.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 1-109, 164-356, 735-738, or combinations thereof.
In some embodiments, the target sequence comprises SEQ ID NOs: 1, 2, 3, and 4 operably linked.
In some embodiments, the vector comprises the nucleotide sequence set forth in SEQ ID NO: 110.
In some embodiments, the retinal degeneration is an autosomal dominant form of retinitis pigmentosa (ADRP).
In some embodiments, the one or more gene products are rhodopsin.
In some embodiments, the target sequence is a mutation in the rhodopsin gene.
In some embodiments, the target sequence is a mutation at R135 of the rhodopsin gene.
In some embodiments, the mutation at R135 is selected from the group consisting of R135G, R135W, R135L.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 111-126, or combinations thereof.
In some embodiments, the gRNA sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 127-142, or combinations thereof.
In some embodiments, the retinal degeneration is glaucoma.
In some embodiments, the one or more gene products are Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), ATF2, JUN, sex determining region Y (SRY)-box 11 (SOX11), myocyte enhancer factor 2A (MEF2A), JNK1-3, MKK4, MKK7, SOX11, or PUMA, or combinations thereof.
In some embodiments, the one or more gene product are members of the DLK/LZK, MKK4/7, JNK1/2/3 or SOX11/ATF2/JUN/MEF2A pathway.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 143-163, or combinations thereof.
In some embodiments, administering to the subject occurs by implantation, injection, or virally.
In some embodiments, administering to the subject occurs by subretinal injection.
In some embodiments, the subject is human.
Another aspect of the invention relates to a method of altering expression of one or more gene products in a cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a non-naturally occurring nuclease system comprising one or more vectors comprising: a) a promoter operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of the DNA molecule; and
In some embodiments, the system is CRISPR.
In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle.
In some embodiments, the system inactivates one or more gene products.
In some embodiments, the nuclease system excises at least one gene mutation.
In some embodiments, the promoter is a bidirectional promoter.
In some embodiments, the bidirectional promoter is H1. The H1 promoter is both a pol II and pol III promoter.
In some embodiments, the H1 promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease.
In some embodiments, the genome-targeted nuclease is Cas9.
In some embodiments, the Cas9 protein is codon optimized for expression in the cell.
In some embodiments, the promoter is operably linked to at least one, two, three, four, five, six, seven, eight, nine, or ten gRNA.
In some embodiments, the cell is a eukaryotic or non-eukaryotic cell.
In some embodiments, the eukaryotic cell is a mammalian or human cell.
In some embodiments, the cell is a retinal photoreceptor cell.
In some embodiments, the cell is a retinal ganglion cell.
In some embodiments, the one or more gene products are LCA10 CEP290.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 1-109, 164-356, 735-738, or combinations thereof.
In some embodiments, the target sequence comprises SEQ ID NOs: 1, 2, 3, and 4 operably linked.
In some embodiments, the vector comprises the nucleotide sequence set forth in SEQ ID NO: 110.
In some embodiments, the one or more gene products are rhodopsin.
In some embodiments, the target sequence is a mutation in the rhodopsin gene.
In some embodiments, the target sequence is a mutation at R135 of the rhodopsin gene.
In some embodiments, the mutation at R135 is selected from the group consisting of R135G, R135W, R135L.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 111-126, or combinations thereof.
In some embodiments, the gRNA sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 127-142, or combinations thereof.
In some embodiments, the one or more gene products are Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), ATF2, JUN, sex determining region Y (SRY)-box 11 (SOX11), myocyte enhancer factor 2A (MEF2A), JNK1-3, MKK4, MKK7, SOX11, or PUMA, or combinations thereof.
In some embodiments, the one or more gene product are members of the DLK/LZK, MKK4/7, JNK1/2/3 or SOX11/ATF2/JUN/MEF2A pathway.
In some embodiments, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 143-163, or combinations thereof.
In some embodiments, the expression of the one or more gene products is decreased.
Certain aspects of the presently disclosed subject matter having been stated hereinabove, which are addressed in whole or in part by the presently disclosed subject matter, other aspects will become evident as the description proceeds when taken in connection with the accompanying Examples and Figures as best described herein below.
Having thus described the presently disclosed subject matter in general terms, reference will now be made to the accompanying Figures, which are not necessarily drawn to scale, and wherein:
The presently disclosed subject matter now will be described more fully hereinafter with reference to the accompanying Figures, in which some, but not all embodiments of the presently disclosed subject matter are shown. Like numbers refer to like elements throughout. The presently disclosed subject matter may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will satisfy applicable legal requirements. Indeed, many modifications and other embodiments of the presently disclosed subject matter set forth herein will come to mind to one skilled in the art to which the presently disclosed subject matter pertains having the benefit of the teachings presented in the foregoing descriptions and the associated Figures. Therefore, it is to be understood that the presently disclosed subject matter is not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims.
Genome-editing technologies such as zinc fingers nucleases (ZFN) (Porteus, and Baltimore (2003) Science 300: 763; Miller et al. (2007) Nat. Biotechnol. 25:778-785; Sander et al. (2011) Nature Methods 8:67-69; Wood et al. (2011) Science 333:307) and transcription activator-like effectors nucleases (TALEN) (Wood et al. (2011) Science 333:307; Boch et al. (2009) Science 326:1509-1512; Moscou and Bogdanove (2009) Science 326:1501; Christian et al. (2010) Genetics 186:757-761; Miller et al. (2011) Nat. Biotechnol. 29:143-148; Zhang et al. (2011) Nat. Biotechnol. 29:149-153; Reyon et al. (2012) Nat. Biotechnol. 30:460-465) have empowered the ability to generate targeted genome modifications and offer the potential to correct disease mutations with precision. While effective, these technologies are encumbered by practical limitations as both ZFN and TALEN pairs require synthesizing large and unique recognition proteins for a given DNA target site. Several groups have recently reported high-efficiency genome editing through the use of an engineered type II CRISPR/Cas9 system that circumvents these key limitations (Cong et al. (2013) Science 339:819-823; Jinek et al. (2013) eLife 2:e00471; Mali et al. (2013) Science 339:823-826; Cho et al. (2013) Nat. Biotechnol. 31:230-232; Hwang et al. (2013) Nat. Biotechnol. 31:227-229). Unlike ZFNs and TALENs, which are relatively time consuming and arduous to make, the CRISPR constructs, which rely upon the nuclease activity of the Cas9 protein coupled with a synthetic guide RNA (gRNA), are simple and fast to synthesize and can be multiplexed. However, despite the relative ease of their synthesis, CRISPRs have technological restrictions related to their access to targetable genome space, which is a function of both the properties of Cas9 itself and the synthesis of its gRNA.
Cleavage by the CRISPR system requires complementary base pairing of the gRNA to a 20-nucleotide DNA sequence and the requisite protospacer-adjacent motif (PAM), a short nucleotide motif found 3 to the target site (Jinek et al. (2012) Science 337: 816-821). One can, theoretically, target any unique N20-PAM sequence in the genome using CRISPR technology. The DNA binding specificity of the PAM sequence, which varies depending upon the species of origin of the specific Cas9 employed, provides one constraint. Currently, the least restrictive and most commonly used Cas9 protein is from S. pyogenes, which recognizes the sequence NGG, and thus, any unique 21-nucleotide sequence in the genome followed by two guanosine nucleotides (N20NGG) can be targeted. Expansion of the available targeting space imposed by the protein component is limited to the discovery and use of novel Cas9 proteins with altered PAM requirements (Cong et al. (2013) Science 339: 819-823; Hou et al. (2013) Proc. Natl. Acad. Sci. U.S.A., 110(39):15644-9), or pending the generation of novel Cas9 variants via mutagenesis or directed evolution. The second technological constraint of the CRISPR system arises from gRNA expression initiating at a 5′ guanosine nucleotide. Use of the type III class of RNA polymerase III promoters has been particularly amenable for gRNA expression because these short non-coding transcripts have well-defined ends, and all the necessary elements for transcription, with the exclusion of the 1+nucleotide, are contained in the upstream promoter region. However, since the commonly used U6 promoter requires a guanosine nucleotide to initiate transcription, use of the U6 promoter has further constrained genomic targeting sites to GN19NGG (Mali et al. (2013) Science 339:823-826; Ding et al. (2013) Cell Stem Cell 12:393-394). Alternative approaches, such as in vitro transcription by T7, T3, or SP6 promoters, would also require initiating guanosine nucleotide(s) (Adhya et al. (1981) Proc. Natl. Acad. Sci. U.S.A. 78:147-151; Melton et al. (1984) Nucleic Acids Res. 12:7035-7056; Pleiss et al. (1998) RNA 4:1313-1317).
The presently disclosed subject matter relates to the modification of a CRISPR/Cas9 system to target retinal degenerations, which uses the H1 promoter to express guide-RNAs (gRNA or sgRNA) (WO2015/19561, herein incorporated by reference in its entirety) that target retinal degeneration-related genes, such as, but not limited, to LCA10 CEP290 gene, rhodopsin, Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA. Such a modified CRISPR/Cas9 system can precisely target the pathogenic mutations in these retinal degenerations with greater efficacy, safety, and precision. Moreover, this modification comprising gRNAs retain the compact nature of the CRISPR/Cas9 H1 promoter system that allows for higher-resolution targeting of retinal degenerations over existing CRISPR, TALEN, or Zinc-finger technologies.
In some embodiments, the presently disclosed methods for treating retinal degenerations utilize a composition comprising a modification of a non-naturally occurring CRISPR system previously described in WO2015/195621 (herein incorporated by reference in its entirety). Such a modification uses certain gRNAs that target retinal degeneration-related genes, such as, but not limited, to LCA10 CEP290 gene, rhodopsin, Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA. In some embodiments, the composition comprises (a) a non-naturally occurring nuclease system (e.g., CRISPR) comprising one or more vectors comprising: i) a promoter (e.g., bidirectional H1 promoter) operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease (e.g., Cas9 protein), wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression of the one or more gene products. In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle. In some embodiments, the adeno-associated virus (AAV) may comprise any of the 51 human adenovirus serotypes (e.g., serotypes 2, 5, or 35). In some embodiments, the system inactivates one or more gene products. In some embodiments, the nuclease system excises at least one gene mutation. In some embodiments, the promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease. In some embodiments, the Cas9 protein is codon optimized for expression in the cell. In some embodiments, the promoter is operably linked to at least one, two, three, four, five, six, seven, eight, nine, or ten gRNA. In some embodiments, the target sequence is a mutation in the CEP290 gene (e.g., LCA10 CEP290 gene). In some embodiments, the target sequence for CEP290 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 1-109, 164-356, 735-738, or combinations thereof. In some embodiments, the target sequence comprises SEQ ID NOs: 1, 2, 3, and 4 operably linked. In some embodiments, the vector comprises the nucleotide sequence set forth in SEQ ID NO: 110. In some embodiments, the one or more gene products are rhodopsin. In some embodiments, the target sequence is a mutation in the rhodopsin gene. In some embodiments, the target sequence is a mutation at R135 of the rhodopsin gene (e.g., R135G, R135W, R135L). In some embodiments, the target sequence for rhodopsin R135 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 111-126, or combinations thereof. In some embodiments, the gRNA sequence for rhodopsin R135 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 127-142, or combinations thereof. In some embodiments, the one or more gene products are Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA or combinations thereof. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS. In some embodiments, the target sequence for glaucoma is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 143-163, or combinations thereof.
In some embodiments, the presently disclosed methods for treating retinal degenerations utilize a composition comprising a non-naturally occurring CRISPR system comprising one or more vectors comprising: a) an H1 promoter operably linked to at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and b) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a Cas9 protein, wherein components (a) and (b) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule to alter expression of the one or more gene products.
In some embodiments, the presently disclosed methods for treating retinal degenerations utilizes a composition comprising a non-naturally occurring CRISPR system comprising one or more vectors comprising: a) an H1 promoter operably linked to at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a eukaryotic cell, and wherein the DNA molecule encodes one or more gene products expressed in the eukaryotic cell; and b) a regulatory element operable in a eukaryotic cell operably linked to a nucleotide sequence encoding a Type-II Cas9 protein, wherein components (a) and (b) are located on the same or different vectors of the system, whereby the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule, and whereby expression of the one or more gene products is altered. In one aspect, the target sequence can be a target sequence that starts with any nucleotide, for example, N20NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence GN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence CN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence TN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG or GN19NGG. In another aspect, the Cas9 protein is codon optimized for expression in the cell. In another aspect, the Cas9 protein is codon optimized for expression in the eukaryotic cell. In a further aspect, the eukaryotic cell is a mammalian or human cell. In yet another aspect, the expression of the one or more gene products is decreased.
In some embodiments, the presently disclosed methods for treating retinal degenerations utilizes a composition comprising a non-naturally occurring CRISPR system comprising a vector comprising a bidirectional H1 promoter, wherein the bidirectional H1 promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a eukaryotic cell, and wherein the DNA molecule encodes one or more gene products expressed in the eukaryotic cell; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a Type-II Cas9 protein, whereby the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule, and whereby expression of the one or more gene products is altered. In one aspect, the target sequence can be a target sequence that starts with any nucleotide, for example, N20NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence GN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence CN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence TN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG or GN19NGG. In another aspect, the Cas9 protein is codon optimized for expression in the cell. In another aspect, the Cas9 protein is codon optimized for expression in the eukaryotic cell. In a further aspect, the eukaryotic cell is a mammalian or human cell. In yet another aspect, the expression of the one or more gene products is decreased.
In some embodiments, the CRISPR complex comprises one or more nuclear localization sequences of sufficient strength to drive accumulation of the CRISPR complex in a detectable amount in the nucleus of a cell (e.g., eukaryotic cell). Without wishing to be bound by theory, it is believed that a nuclear localization sequence is not necessary for CRISPR complex activity in eukaryotes, but that including such sequences enhances activity of the system, especially as to targeting nucleic acid molecules in the nucleus. In some embodiments, the CRISPR enzyme is a type II CRISPR system enzyme. In some embodiments, the CRISPR enzyme is a Cas9 enzyme. In some embodiments, the Cas9 enzyme is S. pneumoniae, S. pyogenes, or S. thermophilus Cas9, and may include mutated Cas9 derived from these organisms. The enzyme may be a Cas9 homolog or ortholog.
In general, and throughout this specification, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
Recombinant expression vectors can comprise a nucleic acid of the presently disclosed subject matter in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed.
Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
The term “regulatory element” is intended to include promoters, enhancers, internal ribosomal entry sites (IRES), and other expression control elements (e.g. transcription termination signals, such as polyadenylation signals and poly-U sequences). Such regulatory elements are described, for example, in Goeddel (1990) Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. Regulatory elements include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). A tissue-specific promoter may direct expression primarily in a desired tissue of interest, such as muscle, neuron, bone, skin, blood, specific organs (e.g. liver, pancreas), or particular cell types (e.g. lymphocytes). Regulatory elements may also direct expression in a temporal-dependent manner, such as in a cell-cycle dependent or developmental stage-dependent manner, which may or may not also be tissue or cell-type specific.
In some embodiments, a vector comprises one or more pol III promoters, one or more pol 11 promoters, one or more pol I promoters, or combinations thereof. Examples of pol ITT promoters include, but are not limited to, U6 and H1 promoters. Examples of pol IT promoters include, but are not limited to, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) (e.g., Boshart et al. (1985) Cell 41:521-530), the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter.
Also encompassed by the term “regulatory element” are enhancer elements, such as WPRE; CMV enhancers; the R-U5′ segment in LTR of HTLV-I (Takebe et al. (1988) Mol. Cell. Biol. 8:466-472); SV40 enhancer; and the intron sequence between exons 2 and 3 of rabbit β-globin (O'Hare et al. (1981) Proc. Natl. Acad. Sci. USA. 78(3):1527-31). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression desired, etc. A vector can be introduced into host cells to thereby produce transcripts, proteins, or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., clustered regularly interspersed short palindromic repeats (CRISPR) transcripts, proteins, enzymes, mutant forms thereof, fusion proteins thereof, etc.). Advantageous vectors include lentiviruses and adeno-associated viruses, and types of such vectors can also be selected for targeting particular types of cells.
The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
In aspects of the presently disclosed subject matter the terms “chimeric RNA”, “chimeric guide RNA”, “guide RNA”, “single guide RNA” and “synthetic guide RNA” are used interchangeably and refer to the polynucleotide sequence comprising the guide sequence. The term “guide sequence” refers to the about 20 bp sequence within the guide RNA that specifies the target site and may be used interchangeably with the terms “guide” or “spacer”.
As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.
As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.
The terms “non-naturally occurring” or “engineered” are used interchangeably and indicate the involvement of the hand of man. The terms, when referring to nucleic acid molecules or polypeptides mean that the nucleic acid molecule or the polypeptide is at least substantially free from at least one other component with which they are naturally associated in nature and as found in nature.
“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 700%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part 1, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y.
“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson Crick base pairing, Hoogstein binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi stranded complex, a single self hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.
As used herein, “expression” refers to the process by which a polynucleotide is transcribed from a DNA template (such as into and mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.
The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component.
As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics.
The practice of the present presently disclosed subject matter employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art (Sambrook, Fritsch and Maniatis (1989) Molecular Cloning: A Laboratory Manual, 2nd edition; Ausubel et al., eds. (1987) Current Protocols in Molecular Biology); MacPherson et al., eds. (1995) Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach); Harlow and Lane, eds. (1988) Antibodies, A Laboratory Manual; Freshney, ed. (1987) Animal Cell Culture).
Several aspects of the presently disclosed subject matter relate to vector systems comprising one or more vectors, or vectors as such. Vectors can be designed for expression of CRISPR transcripts (e.g. nucleic acid transcripts, proteins, or enzymes) in prokaryotic or eukaryotic cells. For example, CRISPR transcripts can be expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression vectors), yeast cells, or mammalian cells. Suitable host cells are discussed further in Goeddel (1990) Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
Vectors may be introduced and propagated in a prokaryote. In some embodiments, a prokaryote is used to amplify copies of a vector to be introduced into a eukaryotic cell or as an intermediate vector in the production of a vector to be introduced into a eukaryotic cell (e.g. amplifying a plasmid as part of a viral vector packaging system). In some embodiments, a prokaryote is used to amplify copies of a vector and express one or more nucleic acids, such as to provide a source of one or more proteins for delivery to a host cell or host organism. Expression of proteins in prokaryotes is most often carried out in Escherichia coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins.
Fusion vectors add a number of amino acids to a protein encoded therein, such as to the amino terminus of the recombinant protein. Such fusion vectors may serve one or more purposes, such as: (i) to increase expression of recombinant protein; (ii) to increase the solubility of the recombinant protein; and (iii) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Example fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E binding protein, or protein A. respectively, to the target recombinant protein.
Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amrann et al. (1988) Gene 69:301-315) and pET 11d (Studier et al. (1990) Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif.).
In some embodiments, a vector is a yeast expression vector. Examples of vectors for expression in yeast Saccharomyces cerevisae include pYepSec1 (Baldari, et al. (1987) EMBO J. 6: 229-234), pMFa (Kuijan and Herskowitz (1982) Cell 30: 933-943), pJRY88 (Schultz et al. (1987) Gene 54: 113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.).
In some embodiments, a vector is capable of driving expression of one or more sequences in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed (1987) Nature 329: 840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6; 187-195). When used in mammalian cells, the expression vector's control functions are typically provided by one or more regulatory elements. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus, simian virus 40, and others disclosed herein and known in the art. For other suitable expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y..
In some embodiments, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 8: 729-733) and immunoglobulins (Baneiji et al. (1983) Cell 33: 729-740: Queen and Baltimore (1983) Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA 86: 5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230: 912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, e.g., the murine box promoters (Kessel and Gruss (1990) Science 249: 374-379) and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3: 537-546).
In some embodiments, a regulatory element is operably linked to one or more elements of a CRISPR system so as to drive expression of the one or more elements of the CRISPR system. In general, CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats), also known as SPIDRs (SPacer Interspersed Direct Repeats), constitute a family of DNA loci that are usually specific to a particular bacterial species. The CRISPR locus comprises a distinct class of interspersed short sequence repeats (SSRs) that were recognized in E. coli (Ishino et al. (1987) J. Bacteriol., 169:5429-5433; and Nakata et al. (1989) J. Bacteriol., 171:3553-3556), and associated genes. Similar interspersed SSRs have been identified in Haloferax mediterranei, Streptococcus pyogenes, Anabaena, and Mycobacterium tuberculosis (Groenen et al. (1993) Mol. Microbiol., 10:1057-1065; Hoe et al. (1999) Emerg. Infect. Dis., 5:254-263; Masepohl et al. (1996) Biochim. Biophys. Acta 1307:26-30; and Mojica et al. (1995) Mol. Microbiol., 17:85-93). The CRISPR loci typically differ from other SSRs by the structure of the repeats, which have been termed short regularly spaced repeats (SRSRs) (Janssen et al. (2002) OMICS. J. Integ. Biol., 6:23-33; and Mojica et al. (2000) Mol. Microbiol., 36:244-246) In general, the repeats are short elements that occur in clusters that are regularly spaced by unique intervening sequences with a substantially constant length (Mojica et al. (2000) Mol. Microbiol., 36:244-246). Although the repeat sequences are highly conserved between strains, the number of interspersed repeats and the sequences of the spacer regions typically differ from strain to strain (van Embden et al. (2000) J. Bacteriol., 182.2393-2401). CRISPR loci have been identified in more than 40 prokaryotes (e.g., Jansen et al. (2002) Mol. Microbiol., 43:1565-1575, and Mojica et al. (2005) J. Mol. Evol. 60:174-82) including, but not limited to Aeropyrum, Pyrobaculum, Sulfolobus, Archaeoglobus, Halocarcula, Methanobacterium, Methanococcus, Methanosarcina, Methanopyrus, Pyrococcus, Picrophilus, Thermoplasma, Corynebacterium, Mycobacterium, Streptomyces, Aquifex, Porphyromonas, Chlorobium, Thermus, Bacillus, Listeria, Staphylococcus, Clostridium, Thermoanaerobacter, Mycoplasma, Fusobacterium, Azarcus, Chromobacterium, Neisseria, Nitrosomonas, Desulfovibrio, Geobacter, Myrococcus, Campylobacter, Wolinella, Acinetobacter, Erwinia, Ascherichia, Legionella, Alethylococcus, Pasteurella, Photobacterium, Salmonella, Xanthomonas, Yersinia, Treponema, and Thermotoga.
In general, “CRISPR system” refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas”) genes, including sequences encoding a Cas gene, a guide sequence (also referred to as a “spacer” in the context of an endogenous CRISPR system), or other sequences and transcripts from a CRISPR locus. In some embodiments, one or more elements of a CRISPR system is derived from a type 1, type II, or type Ill CRISPR system. In some embodiments, one or more elements of a CRISPR system is derived from a particular organism comprising an endogenous CRISPR system, such as Streptococcus pyogenes. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system).
In the context of formation of a CRISPR complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a CRISPR complex. Full complementarity is not necessarily required, provided there is sufficient complementarity to cause hybridization and promote formation of a CRISPR complex. A target sequence may comprise any polynucleotide, such as DNA or RNA polynucleotides. In some embodiments, a target sequence is located in the nucleus or cytoplasm of a cell. In some embodiments, the target sequence may be within an organelle of a eukaryotic cell, for example, mitochondrion or chloroplast. A sequence or template that may be used for recombination into the targeted locus comprising the target sequences is referred to as an “editing template” or “editing polynucleotide” or “editing sequence”. In aspects of the presently disclosed subject matter, an exogenous template polynucleotide may be referred to as an editing template. In an aspect of the presently disclosed subject matter the recombination is homologous recombination.
In some embodiments, a vector comprises one or more insertion sites, such as a restriction endonuclease recognition sequence (also referred to as a “cloning site”). In some embodiments, one or more insertion sites (e.g. about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more insertion sites) are located upstream and/or downstream of one or more sequence elements of one or more vectors. When multiple different guide sequences are used, a single expression construct may be used to target CRISPR activity to multiple different, corresponding target sequences within a cell. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell.
In some embodiments, a vector comprises a regulatory element operably linked to an enzyme-coding sequence encoding a CRISPR enzyme, such as a Cas protein. Non-limiting examples of Cas proteins include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, Cpf1, C2c1, Cas13a, C2c2, C2c3, homologs thereof, or modified versions thereof. These enzymes are known; for example, the amino acid sequence of S. pyogenes Cas9 protein may be found in the SwissProt database under accession number Q99ZW2. In some embodiments, the unmodified CRISPR enzyme has DNA cleavage activity, such as Cas9. In some embodiments the CRISPR enzyme is Cas9, and may be Cas9 from S. pyogenes or S. pneumoniae.
In some embodiments, the CRISPR enzyme directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence In some embodiments, the CRISPR enzyme directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence. In some embodiments, a vector encodes a CRISPR enzyme that is mutated to with respect to a corresponding wild-type enzyme such that the mutated CRISPR enzyme lacks the ability to cleave one or both strands of a target polynucleotide containing a target sequence.
In some embodiments, an enzyme coding sequence encoding a CRISPR enzyme is codon optimized for expression in particular cells, such as eukaryotic cells. The eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human primate. In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database”, and these tables can be adapted in a number of ways. See Nakamura et al. (2000) Nucl. Acids Res. 28:292. Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding a CRISPR enzyme correspond to the most frequently used codon for a particular amino acid.
In general, a guide sequence is any polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a CRISPR complex to the target sequence. In some embodiments, the degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences, non-limiting example of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g. the Burrows Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies, ELAND (Illumina, San Diego, Calif.), SOAP (available at soap.genomics.org.cn), and Maq (available at maq.sourceforge.net). In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20, 15, 12, or fewer nucleotides in length.
The ability of a guide sequence to direct sequence-specific binding of a CRISPR complex to a target sequence may be assessed by any suitable assay. For example, the components of a CRISPR system sufficient to form a CRISPR complex, including the guide sequence to be tested, may be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the CRISPR sequence, followed by an assessment of preferential cleavage within the target sequence, such as by Surveyor assay as described herein. Similarly, cleavage of a target polynucleotide sequence may be evaluated in a test tube by providing the target sequence, components of a CRISPR complex, including the guide sequence to be tested and a control guide sequence different from the test guide sequence, and comparing binding or rate of cleavage at the target sequence between the test and control guide sequence reactions. Other assays are possible, and will occur to those skilled in the art.
A guide sequence may be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome. For example, in some embodiments, the target sequence of LCA is selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, and 6; or combinations thereof. SEQ ID NO: 1, 2, 3, 4, 5, and 6; or combinations thereof can result in ˜1 kb deletion removing the cryptic Exon D from CEP290. Combining SEQ ID NO: 1, 2, 3, 4, 5, and 6 with a modified form of Cas9, such as D10A nickase, may provide a safe and effective therapeutic approach. An exemplary saCas9 construct with four gRNAs is set forth in SEQ ID NO: 110. Additional target sequences of LCA may be selected from the nucleotide sequences set forth in SEQ ID NOs: 7-109. In some embodiments, the target sequences of ADRP is selected from the group consisting of SEQ ID NO: 111-126, or combinations thereof. In some embodiments, the gRNA sequences of ADRP is selected from the group consisting of SEQ ID NO: 127-142, or combinations thereof. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS. In some embodiments, the target sequences of glaucoma is selected from the group consisting of SEQ ID NO: 143-163, or combinations thereof. In some embodiments for the methods to treat glaucoma, the non-naturally occurring CRISPR system comprises H1 promoter to express an mCherry-histone 2b fusion in the Pol II direction in combination with at least one gRNA, e.g., directed to the Dlk, Lzk, or other upstream or downstream components of RGC survival pathway identified using the screening methods provided here.
In some embodiments, the target sequence may be 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% homologous to the nucleotide sequences set forth in SEQ ID NO: 1-738, or 788-1397.
The term “homologous” refers to the “% homology” and is used interchangeably herein with the term “% identity” herein, and relates to the level of nucleic acid sequence identity when aligned using a sequence alignment program.
For example, as used herein, 80% homology means the same thing as 80% sequence identity determined by a defined algorithm, and accordingly a homologue of a given sequence has greater than 80% sequence identity over a length of the given sequence. Exemplary levels of sequence identity include, but are not limited to about, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or more sequence identity to the nucleotide sequences set forth in SEQ ID NO: 1-1400.
In some embodiments, the CRISPR enzyme is part of a fusion protein comprising one or more heterologous protein domains (e.g. about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more domains in addition to the CRISPR enzyme). A CRISPR enzyme fusion protein may comprise any additional protein sequence, and optionally a linker sequence between any two domains. Examples of protein domains that may be fused to a CRISPR enzyme include, without limitation, epitope tags, reporter gene sequences, and protein domains having one or more of the following activities: methylase activity, demethylase activity, transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, RNA cleavage activity and nucleic acid binding activity. Non-limiting examples of epitope tags include histidine (His) tags, VS tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Examples of reporter genes include, but are not limited to, glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP). A CRISPR enzyme may be fused to a gene sequence encoding a protein or a fragment of a protein that bind DNA molecules or bind other cellular molecules, including but not limited to maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD) fusions, GAL4A DNA binding domain fusions, and herpes simplex virus (HSV) BP16 protein fusions. Additional domains that may form part of a fusion protein comprising a CRISPR enzyme are described in US20110059502, incorporated herein by reference. In some embodiments, a tagged CRISPR enzyme is used to identify the location of a target sequence.
In an aspect of the presently disclosed subject matter, a reporter gene which includes but is not limited to glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP), may be introduced into a cell to encode a gene product which serves as a marker by which to measure the alteration or modification of expression of the gene product. In a further embodiment of the presently disclosed subject matter, the DNA molecule encoding the gene product may be introduced into the cell via a vector. In a preferred embodiment of the presently disclosed subject matter the gene product is luciferase. In a further embodiment of the presently disclosed subject matter the expression of the gene product is decreased.
Generally, promoter embodiments of the present presently disclosed subject matter comprise: 1) a complete Pol III promoter, which includes a TATA box, a Proximal Sequence Element (PSE), and a Distal Sequence Element (DSE); and 2) a second basic Pol III promoter that includes a PSE and TATA box fused to the 5′ terminus of the DSE in reverse orientation. The TATA box, which is named for its nucleotide sequence, is a major determinant of Pol III specificity. It is usually located at a position between nt. −23 and −30 relative to the transcribed sequence, and is a primary determinant of the beginning of the transcribed sequence. The PSE is usually located between nt. −45 and −66. The DSE enhances the activity of the basic Pol III promoter. In the H1 promoter, there is no gap between the PSE and the DSE.
Bidirectional promoters consists of: 1) a complete, conventional, unidirectional Pol III promoter that contains 3 external control elements: a DSE, a PSE, and a TATA box; and 2) a second basic Pol III promoter that includes a PSE and a TATA box fused to the 5′ terminus of the DSE in reverse orientation. The TATA box, which is recognized by the TATA binding protein, is essential for recruiting Pol III to the promoter region. Binding of the TATA binding protein to the TATA box is stabilized by the interaction of SNAPc with the PSE. Together, these elements position Pol III correctly so that it can transcribe the expressed sequence. The DSE is also essential for full activity of the Pol Ill promoter (Murphy et al. (1992) Mol. Cell Biol. 12:3247-3261; Mittal et al. (1996) Mol. Cell Biol. 16:1955-1965; Ford and Hernandez (1997) J. Biol. Chem., 272:16048-16055; Ford et al. (1998) Genes, Dev., 12:3528-3540; Hovde et al. (2002) Genes Dev. 16:2772-2777). Transcription is enhanced up to 100-fold by interaction of the transcription factors Oct-1 and/or SBF/Staf with their motifs within the DSE (Kunkel and Hixon (1998) Nucl. Acid Res., 26:1536-1543). Since the forward and reverse oriented basic promoters direct transcription of sequences on opposing strands of the double-stranded DNA templates, the positive strand of the reverse oriented basic promoter is appended to the 5′ end of the negative strand of the DSE. Transcripts expressed under the control of the H1 promoter are terminated by an unbroken sequence of 4 or 5 T's.
In the H1 promoter, the DSE is adjacent to the PSE and the TATA box (Myslinski et al. (2001) Nucl. Acid Res. 29:2502-2509). To minimize sequence repetition, this promoter was rendered bidirectional by creating a hybrid promoter, in which transcription in the reverse direction is controlled by appending a PSE and TATA box derived from the U6 promoter. To facilitate construction of the bidirectional H1 promoter, a small spacer sequence may also inserted between the reverse oriented basic promoter and the DSE.
In some embodiments, bidirectional promoters include, but not limited to, H1, RPPH1-PARP2 (Human), SRP-RPS29, 7sk1-GSTA4, SNAR-G-1-CGB1, SNAR-CGB2, RMRP-CCDC 107, tRNA(Lys)-ALOXE3, RNU6-9-MED 16: tRNA (Gly)-DPP9, RNU6-2-THEM259, or SNORD13-C8orf41.
In some embodiments, the H1 promoter comprise the nucleotide sequence set forth in SEQ ID NO: 787.
In some embodiments, orthologous bidirectional promoters include, but not limited to, RPPH1-PARP2 (Mouse) or RPPH1-PARP2 (Rat), or those derived from Ailuropoda melanoleuca, Bos taurus, Callithrix jacchus, Canis familiaris, Cavia porcellus, Chlorocebus sabaeus, Choloepus hoffmanni, Dasypus novemcinctus, Dipodomys ordii, Equus caballus, Erinaceus europaeus, Felis catus, Gorilla gorilla, Homo sapiens, Ictidomys tridecemlineatus, Loxodonta africana, Macaca mulatta, Mus musculus, Mustela putorius furo, Myotis lucifugus, Nomascus leucogenys, Ochotona princeps, Oryctolagus cuniculus, Otolemur garnettii, Ovis aries, Pan troglodytes, Papio anubis, Pongo abelii, Procavia capensis, Pteropus vampyrus, Rattus norvegicus, Sus scrofa, Tarsius syrichta, Tupaia belangeri, Tursiops truncatus, Vicugna pacos.
homo_sapiens
In some embodiments, the presently disclosed subject matter also provides a method of altering expression of one or more gene products in a eukaryotic cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a modified non-naturally occurring CRISPR system previously described in WO2015/195621 (herein incorporated by reference in its entirety). Such a modification uses certain gRNAs that target retinal degeneration-related genes, such as, but not limited, to LCA10 CEP290 gene, rhodopsin, Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA. In some embodiments, the method comprising introducing into the cell a composition comprising (a) a non-naturally occurring nuclease system (e.g., CRISPR) comprising one or more vectors comprising: i) a promoter (e.g., bidirectional H1 promoter) operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease (e.g., Cas9 protein), wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression of the one or more gene products. In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle. In some embodiments, the adeno-associated virus (AAV) may comprise any of the 51 human adenovirus serotypes (e.g., serotypes 2, 5, or 35). In some embodiments, the system inactivates one or more gene products. In some embodiments, the nuclease system excises at least one gene mutation. In some embodiments, the promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease. In some embodiments, the Cas9 protein is codon optimized for expression in the cell. In some embodiments, the promoter is operably linked to at least one, two, three, four, five, six, seven, eight, nine, or ten gRNA. In some embodiments, the target sequence is a mutation in the CEP290 gene (e.g., LCA10 CEP290 gene). In some embodiments, the target sequence for CEP290 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 1-109, 164-356, 735-738, or combinations thereof. In some embodiments, the target sequence comprises SEQ ID NOs: 1, 2, 3, and 4 operably linked. In some embodiments, the vector comprises the nucleotide sequence set forth in SEQ ID NO: 110. In some embodiments, the one or more gene products are rhodopsin. In some embodiments, the target sequence is a mutation in the rhodopsin gene. In some embodiments, the target sequence is a mutation at R135 of the rhodopsin gene (e.g., R135G, R135W, R135L). In some embodiments, the target sequence for rhodopsin R135 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 111-126, or combinations thereof. In some embodiments, the gRNA sequence for rhodopsin R135 is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 127-142, or combinations thereof. In some embodiments, the one or more gene products are Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4, MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA, or combinations thereof. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS. In some embodiments, the target sequence for glaucoma is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 143-163, or combinations thereof.
In some embodiments, the presently disclosed subject matter also provides a method of altering expression of one or more gene products in a cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a non-naturally occurring CRISPR system comprising one or more vectors comprising: a) an H1 promoter operably linked to at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of the DNA molecule; and b) a regulatory element operable in the cell operably linked to a nucleotide sequence encoding a Cas9 protein, wherein components (a) and (b) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule to alter expression of the one or more gene products.
In some embodiments, the presently disclosed subject matter also provides a method of altering expression of one or more gene products in a eukaryotic cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a non-naturally occurring CRISPR system comprising one or more vectors comprising: a) an H1 promoter operably linked to at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of the DNA molecule; and b) a regulatory element operable in the eukaryotic cell operably linked to a nucleotide sequence encoding a Type-II Cas9 protein, wherein components (a) and (b) are located on the same or different vectors of the system, whereby the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule, and whereby expression of the one or more gene products is altered. In one aspect, the target sequence can be a target sequence that starts with any nucleotide, for example, N20NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence GN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence CN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence TN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG or GN19NGG. In another aspect, the Cas9 protein is codon optimized for expression in the cell. In yet another aspect, the Cas9 protein is codon optimized for expression in the eukaryotic cell. In a further aspect, the eukaryotic cell is a mammalian or human cell. In another aspect, the expression of the one or more gene products is decreased.
The presently disclosed subject matter also provides a method of altering expression of one or more gene products in a eukaryotic cell, wherein the cell comprises a DNA molecule encoding the one or more gene products, the method comprising introducing into the cell a non-naturally occurring CRISPR system comprising a vector comprising a bidirectional H1 promoter, wherein the bidirectional H1 promoter comprises: a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a CRISPR system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of the DNA molecule; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a Type-II Cas9 protein, whereby the gRNA targets and hybridizes with the target sequence and the Cas9 protein cleaves the DNA molecule, and whereby expression of the one or more gene products is altered. In one aspect, the target sequence can be a target sequence that starts with any nucleotide, for example, N20NGG. In some embodiments, the target sequence comprises the nucleotide sequence AN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence GN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence CN19NGG. In some embodiments, the target sequence comprises the nucleotide sequence TN19NGG. In another aspect, the target sequence comprises the nucleotide sequence AN19NGG or GN19NGG. In another aspect, the Cas9 protein is codon optimized for expression in the cell. In yet another aspect, the Cas9 protein is codon optimized for expression in the eukaryotic cell. In a further aspect, the eukaryotic cell is a mammalian or human cell. In another aspect, the expression of the one or more gene products is decreased.
In some aspects, the presently disclosed subject matter provides methods comprising delivering one or more polynucleotides, such as or one or more vectors as described herein, one or more transcripts thereof, and/or one or proteins transcribed therefrom, to a host cell. In some aspects, the presently disclosed subject matter further provides cells produced by such methods, and organisms (such as animals, plants, or fungi) comprising or produced from such cells. In some embodiments, a CRISPR enzyme in combination with (and optionally complexed with) a guide sequence is delivered to a cell. Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in mammalian cells or target tissues Such methods can be used to administer nucleic acids encoding components of a CRISPR system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson (1992) Science 256:808-813; Nabel and Felgner (1993) TIBTECH 11:211-217; Mitani and Caskey (1993) TIBTECH 11:162-166; Dillon (1993) TIBTECH 11:167-175; Miller (1992) Nature 357.455-460; Van Brunt (1998) Biotechnology 6(10): 1149-1154; Vigne (1995) Restorative Neurology and Neuroscience 8:35-36; Kremer and Perricaudet (1995) British Medical Bulletin 51(1l):31-44; Haddada et al. (1995) Current Topics in Microbiology and Immunology. Doerfler and Bohm (eds); and Yu et al. (1994) Gene Therapy 1:13-26.
Methods of non-viral delivery of nucleic acids include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (e.g., Crystal (1995) Science 270:404-410; Blaese et al. (1995) Cancer Gene Ther. 2:291-297: Behr et al. (1994) Bioconjugate Chem. 5:382-389; Remy et al. (1994) Bioconjugate Chem. 5:647-654; Gao et al. (1995) Gene Therapy 2:710-722; Ahmad et al. (1992) Cancer Res. 52:4817-4820; U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).
The use of RNA or DNA viral based systems for the delivery of nucleic acids take advantage of highly evolved processes for targeting a virus to specific cells in the body and trafficking the viral payload to the nucleus. Viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof (e.g., Buchscher et al. (1992) J. Virol. 66:2731-2739; Johann et al. (1992) J. Virol. 66:1635-1640; Sommnerfelt et al. (1990) J. Virol. 176:58-59; Wilson et al. (1989) J. Virol. 63:2374-2378; Miller et al. (1991) J. Virol. 65:2220-2224; PCT/US94/05700). In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system. Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (e.g., West et al. (1987) Virology 160:38-47; U.S. Pat. No. 4,797,368; WO 93/24641; Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) J. Clin. Invest. 94:1351. Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-3260; Tratschin et al. (1984) Mol. Cell. Biol. 4:2072-2081; Hermonat and Muzyczka (1984) Proc. Natl. Acad. Sci. U.S.A. 81:6466-6470; and Samulski et al. (1989) J. Virol. 63:03822-3828.
Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and ψ2 cells or PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell line may also be infected with adenovirus as a helper. The helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference.
In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors described herein. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line. A wide variety of cell lines for tissue culture are known in the art. Examples of cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD-3, NHDF, HeLa-S3, Huh1, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panel, PC-3, TF1, CTLL-2, C1R, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calu1, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHI-231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep G2, HeLa B, HeLa T4, COS, COS-1, COS-6, COS-M6A, BS-C-1 monkey kidney epithelial, BALB/3T3 mouse embryo fibroblast, 3T3 Swiss, 3T3-L1, 132-d5 human fetal fibroblasts; 10.1 mouse fibroblasts, 293-T, 3T3, 721, 9L, A2780, A2780ADR, A2780cis, A172, A20, A253, A431, A-549, ALC, B16, B35, BCP-1 cells, BEAS-2B, bEnd.3, BHK-21, BR 293, BxPC3, C3H-10T1/2, C6/36, Cal-27, CHO, CHO-7, CHO-IR, CHO-K1, CHO-K2, CIO-T, CHO Dhfr−/−, COR-L23, COR-L23/CPR, COR-L23/5010, COR-L23/R23, COS-7, COV-434, CML T1, CMT, CT26, D17, DH82, DU145, DuCaP, EL4, EM2, EM3, EMT6/AR1, EMT6/AR10.0, FM3, H1299, H69, HB54, HB55, HCA2, HEK-293, HeLa, Hepa1c1c7, HL-60, HMEC, HT-29, Jurkat, JY cells, K562 cells, Ku812, KCL22, KG1, KYO1, LNCap, Ma-MeI 1-48, MC-38, MCF-7, MCF-10A, MDA-MB-231, MDA-MB-468, MDA-MB-435, MDCK II, MDCK II, MOR/0.2R, MONO-MAC 6, MTD-1 A, MyEnd, NCI-H69/CPR, NCI-H69/LX10, NCI-H69/LX20, NCI-H69/LX4, NIH-3T3, NALM-1, NW-145, OPCN/OPCT cell lines, Peer, PNT-1A/PNT 2, RenCa, RIN-5F, RMA/RMAS, Saos-2 cells, Sf-9, SkBr3, T2, T-47D, T84, THP1 cell line, U373, U87, U937, VCaP, Vero cells, WM39, WT-49, X63, YAC-1, YAR, and transgenic varieties thereof. Cell lines are available from a variety of sources known to those with skill in the art (see, e.g., the American Type Culture Collection (ATCC) (Manassas, Va.)). In some embodiments, a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences. In some embodiments, a cell transiently transfected with the components of a CRISPR system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of a CRISPR complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence. In some embodiments, cells transiently or non-transiently transfected with one or more vectors described herein, or cell lines derived from such cells are used in assessing one or more test compounds.
In some embodiments, one or more vectors described herein are used to produce a non-human transgenic animal. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. In certain embodiments, the organism or subject is a plant. Methods for producing transgenic animals are known in the art, and generally begin with a method of cell transfection, such as described herein.
In one aspect, the presently disclosed subject matter provides for methods of modifying a target polynucleotide in a eukaryotic cell, which may be in vivo, ex vivo or in vitro. In some embodiments, the method comprises sampling a cell or population of cells from a human or non-human animal, and modifying the cell or cells. Culturing may occur at any stage ex vivo. The cell or cells may even be re-introduced into the non-human animal.
In one aspect, the presently disclosed subject matter provides for methods of modifying a target polynucleotide in a eukaryotic cell. In some embodiments, the method comprises allowing a CRISPR complex to bind to the target polynucleotide to effect cleavage of the target polynucleotide thereby modifying the target polynucleotide, wherein the CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within the target polynucleotide.
In one aspect, the presently disclosed subject matter provides a method of modifying expression of a polynucleotide in a eukaryotic cell. In some embodiments, the method comprises allowing a CRISPR complex to bind to the polynucleotide such that the binding results in increased or decreased expression of the polynucleotide; wherein the CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within the polynucleotide.
In one aspect, the presently disclosed subject matter provides methods for using one or more elements of a CRISPR system. The CRISPR complex of the presently disclosed subject matter provides an effective means for modifying a target polynucleotide. The CRISPR complex of the presently disclosed subject matter has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target polynucleotide in a multiplicity of cell types. As such the CRISPR complex of the presently disclosed subject matter has a broad spectrum of applications in, e.g., gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within the target polynucleotide.
The target polynucleotide of a CRISPR complex can be any polynucleotide endogenous or exogenous to the eukaryotic cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of the eukaryotic cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or a junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM (protospacer adjacent motif); that is, a short sequence recognized by the CRISPR complex. The precise sequence and length requirements for the PAM differ depending on the CRISPR enzyme used, but PAMs are typically 2-5 base pair sequences adjacent the protospacer (that is, the target sequence). Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given CRISPR enzyme.
Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.
Embodiments of the presently disclosed subject matter also relate to methods and compositions related to knocking out genes, amplifying genes and repairing particular mutations associated with retinal disorders (Robert D. Wells, Tetsuo Ashizawa, Genetic Instabilities and Neurological Diseases, Second Edition, Academic Press, Oct. 13, 2011-Medical). Specific aspects of tandem repeat sequences have been found to be responsible for more than twenty human diseases (McIvor et al. (2010) RNA Biol. 7(5):551-8). The CRISPR system may be harnessed to correct these defects of genomic instability.
In yet another aspect of the presently disclosed subject matter, the CRISPR system may be used to correct retinal defects that arise from several genetic mutations further described in Traboulsi, ed. (2012) Genetic Diseases of the Eye, Second Edition, Oxford University Press.
The presently disclosed subject matter also provides methods for treating retinal degenerations, such as LCA, ADRP, or glaucoma. In some embodiments, the presently disclosed subject matter provides method for treating a retinal degeneration in a subject (e.g., human) in need thereof. The method comprises the steps of: (a) providing a non-naturally occurring nuclease system (e.g., CRISPR) comprising one or more vectors comprising: i) a promoter (e.g., bidirectional H1 promoter) operably linked to at least one nucleotide sequence encoding a nuclease system guide RNA (gRNA), wherein the gRNA hybridizes with a target sequence of a DNA molecule in a cell (e.g., retinal photoreceptor or ganglion cell) of the subject, and wherein the DNA molecule encodes one or more gene products expressed in the cell; and ii) a regulatory element operable in a cell operably linked to a nucleotide sequence encoding a genome-targeted nuclease (e.g., Cas9), wherein components (i) and (ii) are located on the same or different vectors of the system, wherein the gRNA targets and hybridizes with the target sequence and the nuclease cleaves the DNA molecule to alter expression or inactivates of the one or more gene products; and (b) administering to the retinal area of the subject a therapeutically effective amount of the system. In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle (e.g., AAV, AAV2, AAV9, and the like). In some embodiments, the nuclease system excises at least one gene mutation. In some embodiments, the H1 promoter comprises a) control elements that provide for transcription in one direction of at least one nucleotide sequence encoding a gRNA; and b) control elements that provide for transcription in the opposite direction of a nucleotide sequence encoding a genome-targeted nuclease. In some embodiments, the promoter is operably linked to at least one, two, three, four, five, six, seven, eight, nine, or ten gRNA. In some embodiments, the retinal degeneration is selected from the group consisting of LCA1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, and 18. In some embodiments, the retinal degeneration is LCA10. In some embodiments of treating LCA, the target sequence is selected from LCA10 CEP290 gene. In some embodiments of treating LCA, the target sequences is located in the LCA10 CEP290 gene and selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 1-109, 164-356, 735-738, or combinations thereof (e.g., SEQ ID NOs: 1, 2, 3, and 4 operably linked). In some embodiments of treating LCA, the vector comprises the nucleotide sequence set forth in SEQ ID NO: 110. In some embodiments, the retinal degeneration is an ADRP. In some embodiments of treating ADRP, the target sequence is a mutation in the rhodopsin gene. In some embodiments of treating ADRP, the target sequence is a mutation at R135 of the rhodopsin gene. In some embodiments, the mutation at R135 is selected from the group consisting of R135G, R135W, R135L. In some embodiments of treating ADRP, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 111-126, or combinations thereof. In some embodiments of treating ADRP, the gRNA sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 127-142, or combinations thereof. In some embodiments, the retinal degeneration is glaucoma. In some embodiments of treating glaucoma, the one or more gene products are Dual Leucine Zipper Kinase (DLK), Leucine Zipper Kinase (LZK), JNK1-3, MKK4 and MKK7, ATF2, JUN, MEF2A, SOX11, or PUMA, or combinations thereof. In some embodiments of treating glaucoma, the one or more gene products are identified using the RNA-based screens described in Example 4 infra. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS. In some embodiments of treating glaucoma, the target sequence is selected from the group consisting of the nucleotide sequences set forth in SEQ ID NO: 143-163, or combinations thereof. In some embodiments, administering to the subject occurs by implantation, injection (e.g., subretinal), or virally.
The CRISPR system may be used facilitate targeted genome editing in eukaryotic cells, including mammalian cells, such as human cells. To facilitate genome editing, the cell to be modified is co-transfected with an expression vector encoding Cas9 or the Cas9 protein, DNA, or RNA itself, along with a guide-RNA molecule itself, or an expression vector comprising a nucleic acid molecule encoding the guide-RNA molecule. For example, in certain embodiments, the introduction of Cas9 can be done by transfecting in Cas9 as a protein, RNA, DNA, or expression vector comprising a nucleic acid that encodes Cas9. In certain embodiments, the guide DNA can itself be administered directly as an RNA molecule (gRNA), DNA molecule, or as expression vector comprising a nucleic acid that encodes the gRNA.
By “retinal degeneration” is meant a disease, disorder, or condition (including an optic neuropathy) associated with degeneration or dysfunction of neurons or other neural cells, such as retinal ganglion or photoreceptor cells. A retinal degeneration can be any disease, disorder, or condition in which decreased function or dysfunction of neurons, or loss or neurons or other neural cells, can occur.
Such diseases, disorders, or conditions include, but are not limited to, glaucoma, amyotrophic lateral sclerosis (ALS), trigeminal neuralgia, glossopharyngeal neuralgia, Bell's Palsy, myasthenia gravis, muscular dystrophy, progressive muscular atrophy, primary lateral sclerosis (PLS), pseudobulbar palsy, progressive bulbar palsy, spinal muscular atrophy, inherited muscular atrophy, invertebrate disk syndromes, cervical spondylosis, plexus disorders, thoracic outlet destruction syndromes, peripheral neuropathies, porphyria, Alzheimer's disease, Huntington's disease, Parkinson's disease, Parkinson's-plus diseases, multiple system atrophy, progressive supranuclear palsy, corticobasal degeneration, dementia with Lewy bodies, frontotemporal dementia, demyelinating diseases, Guillain-Barre syndrome, multiple sclerosis, Charcot-Marie-Tooth disease, prion diseases, Creutzfeldt-Jakob disease, Gerstmann-Straussler-Scheinker syndrome (GSS), fatal familial insomnia (FFI), bovine spongiform encephalopathy (BSE), Pick's disease, epilepsy, and AIDS demential complex.
Other diseases, disorders, or conditions include, but not limited to, Alexander's disease, Alper's disease, ataxia telangiectasia, Batten disease (also known as Spielmeyer-Vogt-Sjogren-Batten disease), Canavan disease, Cockayne syndrome, diabetic neuropathy, frontotemporal lobar degeneration, HIV-associated dementia, Kennedy's disease, Krabbe's disease, neuroborreliosis, Machado-Joseph disease (Spinocerebellar ataxia type 3), wet or dry macular degeneration, Niemann Pick disease, Pelizaeus-Merzbacher Disease, photoreceptor degenerative diseases, such as retinitis pigmentosa and associated diseases, Refsum's disease, Sandhoffs disease, Schilder's disease, subacute combined degeneration of spinal cord secondary to pernicious anemia, Spielmeyer-Vogt-Sjogren-Batten disease (also known as Batten disease), spinocerebellar ataxia (multiple types with varying characteristics), Steele-Richardson-Olszewski disease, tabes dorsalis, lattice dystrophy, retinitis pigmentosa, age-related macular degeneration (AMD), photoreceptor degeneration associated with wet or dry AMD, other retinal degeneration such as retinitis pigmentosa (RP), optic nerve drusen, optic neuropathy, and optic neuritis, such as optic neuritis resulting from multiple sclerosis.
Non-limiting examples of different types of glaucoma that can be prevented or treated according to the presently disclosed subject matter include primary glaucoma (also known as primary open-angle glaucoma, chronic open-angle glaucoma, chronic simple glaucoma, and glaucoma simplex), low-tension glaucoma, primary angle-closure glaucoma (also known as primary closed-angle glaucoma, narrow-angle glaucoma, pupil-block glaucoma, and acute congestive glaucoma), acute angle-closure glaucoma, chronic angle-closure glaucoma, intermittent angle-closure glaucoma, chronic open-angle closure glaucoma, pigmentary glaucoma, exfoliation glaucoma (also known as pseudoexfoliative glaucoma or glaucoma capsulare), developmental glaucoma (e.g., primary congenital glaucoma and infantile glaucoma), secondary glaucoma (e.g., inflammatory glaucoma (e.g., uveitis and Fuchs heterochromic iridocyclitis)), phacogenic glaucoma (e.g., angle-closure glaucoma with mature cataract, phacoanaphylactic glaucoma secondary to rupture of lens capsule, phacolytic glaucoma due to phacotoxic meshwork blockage, and subluxation of lens), glaucoma secondary to intraocular hemorrhage (e.g., hyphema and hemolytic glaucoma, also known as erythroclastic glaucoma), traumatic glaucoma (e.g., angle recession glaucoma, traumatic recession on anterior chamber angle, postsurgical glaucoma, aphakic pupillary block, and ciliary block glaucoma), neovascular glaucoma, drug-induced glaucoma (e.g., corticosteroid induced glaucoma and alpha-chymotrypsin glaucoma), toxic glaucoma, and glaucoma associated with intraocular tumors, retinal detachments, severe chemical burns of the eye, and iris atrophy. In certain embodiments, the neurodegenerative disease, disorder, or condition is a disease, disorder, or condition that is not associated with excessive angiogenesis, for example, a glaucoma that is not neovascular glaucoma. In some embodiments, the mutation targets of glaucoma include, but not limited to, OPTN, TBK1, TMCO1, PMM2, GMDS, GAS7, FNDC3B, TXNRD2, ATXN2, CAV1/CAV2, p16INK4a, SIX6, ABCA1, AFAP1 and CDKN2B-AS.
As used herein, the term “disorder” in general refers to any condition that would benefit from treatment with a compound against one of the identified targets, or pathways, including any disease, disorder, or condition that can be treated by an effective amount of a compound against one of the identified targets, or pathways, or a pharmaceutically acceptable salt thereof.
As used herein, the term “treating” can include reversing, alleviating, inhibiting the progression of, preventing or reducing the likelihood of the disease, disorder, or condition to which such term applies, or one or more symptoms or manifestations of such disease, disorder or condition (e.g., a disease or disorder that causes dysfunction and/or death of retinal ganglion or photoreceptor cells). In some embodiments, the treatment reduces the dysfunction and/or death of retinal ganglion or photoreceptor cells. For example, the treatment can reduce the dysfunction and/or death of retinal ganglion or photoreceptor cells by at least 5%, 10%, 15%, 20%, 25%, 30%, 33%, 35%, 40%, 45%, 50%, 55%, 60%, 66%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more as compared to the dysfunction and/or death of retinal ganglion or photoreceptor cells in a subject before undergoing treatment or in a subject who does not undergo treatment. In some embodiments, the treatment completely inhibits dysfunction and/or death of retinal photoreceptor or ganglion cells in the subject. As used herein, a “retinal ganglion or photoreceptor cell” is a specialized type of neuron found in the retina that is capable of phototransduction. In some embodiments, at least one gene product is rhodopsin.
In some embodiments, the system is packaged into a single adeno-associated virus (AAV) particle before administering to the subject. In some embodiments, administering to the subject occurs by subretinal injection. The treatment, administration, or therapy can be consecutive or intermittent. Consecutive treatment, administration, or therapy refers to treatment on at least a daily basis without interruption in treatment by one or more days. Intermittent treatment or administration, or treatment or administration in an intermittent fashion, refers to treatment that is not consecutive, but rather cyclic in nature. Treatment according to the presently disclosed methods can result in complete relief or cure from a disease, disorder, or condition, or partial amelioration of one or more symptoms of the disease, disease, or condition, and can be temporary or permanent. The term “treatment” also is intended to encompass prophylaxis, therapy and cure.
The term “effective amount” or “therapeutically effective amount” refers to the amount of an agent that is sufficient to effect beneficial or desired results. The therapeutically effective amount may vary depending upon one or more of: the subject and disease condition being treated, the weight and age of the subject, the severity of the disease condition, the manner of administration and the like, which can readily be determined by one of ordinary skill in the art. The term also applies to a dose that will provide an image for detection by any one of the imaging methods described herein. The specific dose may vary depending on one or more of: the particular agent chosen, the dosing regimen to be followed, whether it is administered in combination with other compounds, timing of administration, the tissue to be imaged, and the physical delivery system in which it is carried.
The term “inhibit” or “inhibits” means to decrease, suppress, attenuate, diminish, arrest, or stabilize the development or progression of a disease, disorder, or condition, the activity of a biological pathway, or a biological activity, e.g., by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or even 100% compared to an untreated control subject, cell, biological pathway, or biological activity or compared to the target, in a subject before the subject is treated. By the term “decrease” is meant to inhibit, suppress, attenuate, diminish, arrest, or stabilize a symptom of a retinal disease, disorder, or condition. It will be appreciated that, although not precluded, treating a disease, disorder or condition does not require that the disease, disorder, condition or symptoms associated therewith be completely eliminated.
The phrase “pharmaceutically-acceptable carrier” as used herein means a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, or solvent encapsulating material, involved in carrying or transporting the subject compound from one organ, or portion of the body, to another organ, or portion of the body. Each carrier must be “acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. Some examples of materials which can serve as pharmaceutically-acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol; (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) pH buffered solutions; (21) polyesters, polycarbonates and/or polyanhydrides; and (22) other non-toxic compatible substances employed in pharmaceutical formulations.
“Pharmaceutically-acceptable salts” refers to the relatively non-toxic, inorganic and organic acid addition salts of compounds.
The terms “prevent,” “preventing,” “prevention,” “prophylactic treatment,” and the like refer to reducing the probability of developing a disease, disorder, or condition in a subject, who does not have, but is at risk of or susceptible to developing a disease, disorder, or condition.
The terms “subject” and “patient” are used interchangeably herein. The subject treated by the presently disclosed methods in their many embodiments is desirably a human subject, although it is to be understood that the methods described herein are effective with respect to all vertebrate species, which are intended to be included in the term “subject.” Accordingly, a “subject” can include a human subject for medical purposes, such as for the treatment of an existing condition or disease or the prophylactic treatment for preventing the onset of a condition or disease, or an animal subject for medical, veterinary purposes, or developmental purposes. Suitable animal subjects include mammals including, but not limited to, primates, e.g., humans, monkeys, apes, and the like; bovines, e.g., cattle, oxen, and the like; ovines, e.g., sheep and the like; caprines, e.g., goats and the like; porcines, e.g., pigs, hogs, and the like; equines, e.g., horses, donkeys, zebras, and the like; felines, including wild and domestic cats; canines, including dogs; lagomorphs, including rabbits, hares, and the like; and rodents, including mice, rats, and the like. An animal may be a transgenic animal. In some embodiments, the subject is a human including, but not limited to, fetal, neonatal, infant, juvenile, and adult subjects. Further, a “subject” can include a patient afflicted with or suspected of being afflicted with a condition or disease.
The term “therapeutic effect” refers to a local or systemic effect in animals, particularly mammals, and more particularly humans, caused by a pharmacologically active substance.
The terms “therapeutically-effective amount” and “effective amount” as used herein means that amount of a composition of the present invention which is effective for producing some desired therapeutic effect in at least a sub-population of cells in an animal at a reasonable benefit/risk ratio applicable to any medical treatment.
Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this presently described subject matter belongs.
Following long-standing patent law convention, the terms “a,” “an,” and “the” refer to “one or more” when used in this application, including the claims. Thus, for example, reference to “a subject” includes a plurality of subjects, unless the context clearly is to the contrary (e.g., a plurality of subjects), and so forth.
Throughout this specification and the claims, the terms “comprise,” “comprises,” and “comprising” are used in a non-exclusive sense, except where the context requires otherwise. Likewise, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing amounts, sizes, dimensions, proportions, shapes, formulations, parameters, percentages, parameters, quantities, characteristics, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about” even though the term “about” may not expressly appear with the value, amount or range. Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are not and need not be exact, but may be approximate and/or larger or smaller as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art depending on the desired properties sought to be obtained by the presently disclosed subject matter. For example, the term “about,” when referring to a value can be meant to encompass variations of, in some embodiments, ±100% in some embodiments ±50%, in some embodiments ±20%, in some embodiments ±10%, in some embodiments ±5%, in some embodiments ±1%, in some embodiments ±0.5%, and in some embodiments ±0.1% from the specified amount, as such variations are appropriate to perform the disclosed methods or employ the disclosed compositions.
Further, the term “about” when used in connection with one or more numbers or numerical ranges, should be understood to refer to all such numbers, including all numbers in a range and modifies that range by extending the boundaries above and below the numerical values set forth. The recitation of numerical ranges by endpoints includes all numbers, e.g., whole integers, including fractions thereof, subsumed within that range (for example, the recitation of 1 to 5 includes 1, 2, 3, 4, and 5, as well as fractions thereof, e.g., 1.5, 2.25, 3.75, 4.1, and the like) and any range within that range.
The following Examples have been included to provide guidance to one of ordinary skill in the art for practicing representative embodiments of the presently disclosed subject matter. In light of the present disclosure and the general level of skill in the art, those of skill can appreciate that the following Examples are intended to be exemplary only and that numerous changes, modifications, and alterations can be employed without departing from the scope of the presently disclosed subject matter. The synthetic descriptions and specific examples that follow are only intended for the purposes of illustration, and are not to be construed as limiting in any manner to make compounds of the disclosure by other methods.
Human embryonic kidney (HEK) cell line 293T (Life Technologies, Grand Island, NY) was maintained at 37° C. with 5% CO2/20% O2 in Dulbecco's modified Eagle's Medium (DMEM) (Invitrogen) supplemented with 10% fetal bovine serum (Gibco, Life Technologies, Grand Island, NY) and 2 mM GlutaMAX (Invitrogen).
gRNAs targeting Rhodopsin (see Ranganathan, V and Zack, D J. grID: A CRISPR guide RNA Database and Resource for Gene-Editing. Submitted (2015)) was generated by overlapping oligos that were annealed and amplified by PCR using two-step amplification Phusion Flash DNA polymerase (Thermo Fisher Scientific, Rockford, IL), and subsequently purified using Zymo DNA clean and concentrator columns. The purified PCR products were then resuspended in H2O and quantitated using a NanoDrop 1000 (Thermo Fisher Scientific). The gRNA-expressing constructs were generated using the Gibson Assembly (New England Biolabs, Ipswich, MA) (Gibson et al. Nature Methods 6:343-345(2009)) with slight modifications. The total reaction volume was reduced from 20 μl to 2 μl. Clones were verified by Sanger sequencing.
HEK293 cells were co-transfected with Cas9 (unmodified, or cell-cycle regulated versions) and the gRNA construct targeting rhodopsin. 48 hrs post transfection, genomic DNA was harvested and the sequence surrounding the target cut sites were amplified according to the primers listed in the Appendix. The PCR products were then purified and quantitated before performing the T7 Endo I assay. Briefly, 200 ng of PCR product was denatured and then slowly re-annealed to allow for the formation of heteroduplexes, T7 Endonuclease I was added to the PCR products and incubated at 37° C. for 25 minutes to cleave heteroduplexes, the reaction was quenched in loading dye, and finally, the reaction was run on a 6% TBE PAGE gel to resolve the products. The gel was stained with SYBR-Gold, visualized, and quantitated using ImageJ. NHEJ frequencies were calculated using the binomial-derived equation:
where the values of “a” and “b” are equal to the integrated area of the cleaved fragments after background subtraction and “c” is equal to the integrated area of the un-cleaved PCR product after background subtraction.
While still in its infancy, the CRISPR system has revolutionized genome-editing technology, transformed biological research, and ushered in a new era for genetic medicine. Directed against sequences in the human genome, CRISPR-based editing systems can be customized to disrupt any gene or regulatory element, or to delete and replace genomic DNA sequences in a highly-specific fashion. And although numerous studies across the world have demonstrated that disease mutations can be efficiently targeted in vitro, the development of CRISPR-based therapeutics for in vivo use has been significantly hampered by delivery constraints.
Adeno-associated virus (AAV) vectors are the most frequently used viral vectors in gene therapy with several attractive features: the virus is non-pathogenic, it infects both dividing and non-dividing cells, expression can persist for long periods of time, as well as a noteworthy history of safety, efficacy, and a general lack of toxicity in clinical trials. Additionally, combinations of variant AAV serotypes are effective at targeting specific cell types. While AAV vectors provide a safe means of delivering therapeutic CRISPR components, there is one major technical obstacle that limits their utility—their size. Wild type AAV genomes are ˜4.7 kb in length and recombinant viruses can package up to 5.0 kb. This packaging capacity defines the upper limit of the DNA that can be used for a single viral vector.
The CRISPR/Cas9 system is composed of the nuclease (Cas9) and a guide RNA (gRNA), which serves to direct the nuclease to a specific region in a genome. The most commonly used Cas9 protein is from S. pyogenes (SpCas9), which is encoded by a 4104 bp open reading frame. It has been assumed that due to the large size of SpCas9, delivery of both CRISPR components, including promoters and terminator sequences necessary for expression, is limited by the AAV packaging capacity. With standard promoter elements in place, the SpCas9 and gRNA cassettes exceed the packaging capacity of AAV by a significant margin.
A solution to the packaging capacity problem was disclosed in WO2015/195621 (herein incorporated by reference in its entirety) that greatly expands the potential range of applications for therapeutic CRISPRs. At the core of this new approach to CRISPR delivery is a compact bidirectional promoter that is highly active in human cells. The H1 promoter is a remarkably unique genetic element that is recognized by RNA polymerases Pol II and Pol III. Introduced into an AAV vector, the H1 promoter can efficiently express both Cas9 and gRNA. Because the optimized H1 promoter is only ˜230 bp in length, the use of this cassette allows the packaging of SpCas9 and multiple hybrid gRNAs in a single recombinant AAV.
A complete “all-in-one” AAV vector that includes SpCas9, a short poly-A (SPA) sequence, two separately customizable gRNA scaffolds and the H1 promoter element is ˜4640 kb. This is nearly the size of the wild type AAV genome, and still well below the maximum packaging capacity of 5.2 kb. Any of the Cas9 genes can be incorporated into this basic structure. This platform thus provides the capability to target many more genomic sites and mutations, in vivo, than any existing technology (
Another viable approach for clinical delivery, highlighted by the Cas9 from S. aureus (SaCas9), is the use of smaller Cas9 orthologs that can be packaged into AAV vectors. While, alternative Cas9 proteins have alternative targeting specificities, and are likely to be more restrictive than the SpCas9 protein, the use of the smaller saCas9 protein in combination with the H1 system offers significant advantages over the existing approach.
The CRISPR-Cas9 system functions by inducing a dsDNA break, however, a single point mutation in Cas9 can generate a ssDNA “nickase”. While use of a nickase requires twice as many gRNAs to generate a dsDNA break it is universally accepted as being safer. Importantly, the AAV-H1-CRISPR platform can accommodate SaCas9 and over four gRNAs, and can thus safely generate DNA breaks without off-target mutations. Current delivery approaches lack this capability.
Leber's congenital amaurosis (LCA) is comprised of a group of early-onset childhood retinal degenerations that are characterized by severe retinal dysfunction and severe visual impairment, or blindness during the first months of life. LCA, an orphan disease, is the most common cause of inherited blindness constituting as much as 5% of all known hereditary retinal degenerative diseases. Specifically, LCA10—a disease for which no FDA approved therapy exists—is caused by an intronic mutation in the CEP290 gene. This A-to-G mutation results in the creation of a de novo strong splice-donor site and the inclusion of a cryptic exon (Exon X) into the CEP290 mRNA and subsequent photoreceptor or ganglion degeneration. This mutation is a particularly attractive as a therapeutic target.
It was successfully demonstrated that prototypic AAV-H1-CRISPRs robustly express both Cas9 and a gRNA in human cells, and can thereby efficiently direct gene targeting in vitro. Using a mouse model of retinal disease, it was demonstrated that H1-AAV-CRISPR can be employed to precisely target pathogenic mutations in vivo, by subretinal delivery (
An AAV-based strategy to treat the underlying cause of LCA10 with CRISPR is currently being developed for clinical use by Editas Medicine. Their approach to solving the AAV packaging capacity problem is to employ a smaller Cas9 ortholog from S. aureus (SaCas9), which is encoded by a ˜3.2 kb transcript. The compact size of the SaCas9 gene allows it to be packaged into a single AAV vector along with two gRNA cassettes, however our AAV technology provides additional room, which can be used for additional gRNAs.
In terms of safety concerns for CRISPR-based therapeutics, the most significant is undoubtedly off-target mutagenesis; this can occur if Cas9 cleaves DNA at an unintended location. Fortunately, this risk can be nearly eliminated by employing a modified form of Cas9 (known as the D10A nickase) that only cleaves one DNA strand. By separately engaging two gRNAs to generate two closely opposed nicks on opposite strands, the Cas9 nickase can efficiently generate a double strand break. However, an off-target break can only be generated by the nickase if the two gRNAs recognize off-targets that occur in close proximity and on opposite strands elsewhere in the genome, an occurrence that is statistically improbable (Church G. Nature 2015 Dec. 3; 528(7580):S7) and consequently not observed; nicked DNA is efficiently repaired but normal cellular proteins. For this reason, introducing a Cas9(D10A) with four gRNAs (a pair on each side to delete the targeted mutation) for correction of the CEP290 mutation would be viewed universally and unequivocally as the safest approach—including among regulatory agencies. (Shen et al. Nature Methods 11, 399-402 (2014)). The H1 element uniquely endows the space to use this capability.
However, due to the size limitations with the current system (i.e. Editas: saCas9 with the non-H1 system), correction of the CEP290 mutation, which involves dropping out approximately 1 kb of intronic DNA, cannot be done through the safer nickase approach. Four gRNA have been identified that would allow us to delivery a safer therapeutic to the clinic:
However, Editas has constrained their target sites to 5′ G targeting sites due to the U6 promoter. (Friedland et al. Genome Biology 16:257 (2015))
There is an spCas9 targeting site that overlaps with the A-to-G mutation. Using a previously disclosed H1 system and the spCas9, the CEP290 mutation can be directly targeted without need for generating a large deletion. This method would also be expected to be safer than the current Editas system.
The reduced sites from the 5′G initiation requirement illustrates the utility in using the H1 to express the gRNAs over U6. Additionally, the total number of targeting sites shows that spCas9 outnumbers saCas9, as computationally predicted (Ranganathan et al. Nat Commun 2014 Aug. 8; 5:4516).
Provided herein is a AAV-based approach to the treatment of LCA. In contrast to treatments that involve gene transfer, the approach uses CRISPR genome editing technology. CRISPR has been developed in recent years and has very rapidly revolutionized biological research and ushered in a new era for genetic medicine (4, 5). CRISPRs are composed of a bacterial endonuclease and a short RNA that guides this nuclease to a specific cleavage site in the genome. With a customized guide RNA (gRNA), a CRISPR can be programmed to disrupt any human gene or regulatory element, or to delete and replace genomic DNA sequences in a highly-specific fashion. A CRISPR approach to LCA will facilitate the permanent removal of the mutation from the cells at risk for degeneration, and thus cure the disease.
There is one major technical obstacle to employing AAV for CRISPR applications—their size. AAV are small viruses that can package up to 5.2 kb of DNA. A standard CRISPR exceeds this packaging capacity by a significant margin. CRISPRs are composed of the bacterial endonuclease Cas9 and at least one gRNA. The most commonly used Cas9 protein, from S. pyogenes is alone encoded by a 4.1 kb gene. It is impossible to package both CRISPR components—with standard promoters and terminator sequences necessary for expression—into a single virus.
WO2015195621, herein incorporated by reference, discloses a solution to the packaging capacity problem. At the core of this new approach to CRISPR delivery is a compact bidirectional promoter known as H1. A single H1 promoter can efficiently express both Cas9 and gRNA. This unique genetic element allows an assembly composed of any Cas9 gene and multiple gRNAs, optimally four, to be packaged in a single recombinant AAV (
Provided herein is a safe and durable treatment for LCA10 by a singly-administered AAV-H1-CRISPR therapeutic. LCA10 is caused by mutations in the gene CEP290. This subtype of LCA affects about 2,000 individuals in the western world. This therapeutic contrasts with the current technology being developed by Editas Medicine. Their solution to the packaging capacity problem is to employ a smaller Cas9 gene. However, their vector configuration can engage fewer genomic targets than ours, and lacks a critical feature: the use of four gRNA to provide exquisite targeting sensitivity (as mentioned above), that has been shown to prevent off-target mutagenesis, a significant safety issue (6, 7). The AAV-H1-CRISPR retains this critical feature. The rate of off-target mutations caused by our LCA10 vector is expected to be negligible.
Provided herein is an alternative approach to treating ADRP through the development of a CRISPR/Cas9 gene editing technology (Doudna, J. A. et al. Science 346, 1258096 (2014); Hsu, P D., et al. Cell 157, 1262-1278 (2014)), in which the RNA guided endonuclease is used in conjunction with customizable small guide RNAs (gRNAs) to target and cleave the mutant rhodopsin allele, which through error-prone non-homologous end joining (NHEJ) will specifically knock out expression of the mutant allele, without affecting the normal allele. Although this approach may result in expression of only 50% of the wild-type level of rhodopsin, animal data suggests that this should be sufficient to provide clinically useful rod function (Liang, Y. et al. The Journal of biological chemistry 279: 48189-48196 (2004)).
R135 mutations in rhodopsin result in the most aggressive and rapidly progressing form of retinitis pigmentosa (RP). Affected individuals, have night blindness during childhood with visual field losses before the second decade of life. Disease progression is unusually high, with an average 50% loss per year of baseline ERG amplitude and visual field area. By the fourth decade of life macular function is severely compromised (OMIM: http://www.omim.org/entry/180380).
By some estimates, R135 mutations account for the second most common rhodopsin mutations worldwide. The R135 mutations are particularly amenable to correction through NHEJ, as premature stop codons will likely result in degraded transcripts through non-sense mediated decay, relieving the dominant negative effect of this mutation. The most prevalent mutation, P347, occurs in exon 5 of rhodopsin, which presents additional challenges for correction by CRISPR genome-editing. Premature stop codons in the last exon of a gene are not susceptible to non-sense mediated decay.
CCTGGCCATCGAGCGGTACGGTTTTAGAGCTAGAAATAGCAAGTTAAAA
GGCCATCGAGCGGTACGTGGGTTTTAGAGCTAGAAATAGCAAGTTAAAA
CCACGTACCGCTCGATGGCCGTTTTAGAGCTAGAAATAGCAAGTTAAAA
CACCACCACGTACCGCTCGAGTTTTAGAGCTAGAAATAGCAAGTTAAAA
CCTGGCCATCGAGGGGTACGGTTTTAGAGCTAGAAATAGCAAGTTAAA
GGCCATCGAGGGGTACGTGGGTTTTAGAGCTAGAAATAGCAAGTTAAA
CCACGTACCCCTCGATGGCCGTTTTAGAGCTAGAAATAGCAAGTTAAA
CACCACCACGTACCCCTCGAGTTTTAGAGCTAGAAATAGCAAGTTAAAA
CCTGGCCATCGAGTGGTACGGTTTTAGAGCTAGAAATAGCAAGTTAAA
GGCCATCGAGTGGTACGTGGGTTTTAGAGCTAGAAATAGCAAGTTAAA
CCACGTACCACTCGATGGCCGTTTTAGAGCTAGAAATAGCAAGTTAAA
CACCACCACGTACCACTCGAGTTTTAGAGCTAGAAATAGCAAGTTAAAA
CCTGGCCATCGAGCTGTACGGTTTTAGAGCTAGAAATAGCAAGTTAAA
GGCCATCGAGCTGTACGTGGGTTTTAGAGCTAGAAATAGCAAGTTAAA
CCACGTACAGCTCGATGGCCGTTTTAGAGCTAGAAATAGCAAGTTAAA
CACCACCACGTACAGCTCGAGTTTTAGAGCTAGAAATAGCAAGTTAAAA
Glaucoma, the leading cause of irreversible blindness worldwide (1), is an optic neuropathy in which progressive damage of retinal ganglion cell (RGC) axons at the lamina cribosa of the optic nerve head leads to axon degeneration and cell death (2). Currently, the only treatment, whether by eye drops, lasers or incisional surgery, is to lower intraocular pressure (IOP) and reduce the injury at the optic nerve head. Unfortunately, this is difficult in some patients while in others, the disease can continue to worsens despite aggressive IOP-lowering. The field has long needed an alternative therapeutic strategy that could complement IOP-lowering by mitigating the RGC response to residual axon injury. Moreover, the NEI has listed optic nerve regeneration amongst its Audacious Goals, and any regenerative therapy necessarily needs to tackle the issue of axotomized RGC survival. To this end, there is a great need to develop a neuroprotective that might directly interfere with the active genetic programs of RGC axon degeneration and/or axon injury-related cell death (3-6).
In order to identify, in a comprehensive and unbiased manner, genes that could serve as novel drug targets for neuroprotective glaucoma therapy, the entire mouse kinome (a druggable subset of the genome) was screened for kinases whose inhibition promotes RGC survival. For this screen, a high-throughput method was developed for knocking down gene expression in primary RGCs, from male and female C57Bl/6 mice (7), with small interfering RNA oligonucleotides (siRNAs) and coupled it with a quantitative assay of RGC survival (CellTiter-Glo, Promega) (5). Using this approach, an arrayed library of 1869 siRNAs was screened, targeting 623 kinases,
for the ability to increase the survival of RGCs three days after immunopanning (which necessarily axotomizes the cells). The top two hits were dual-leucine zipper kinase (DLK/MAP3K12) and its only known substrate, mitogen-activated protein kinase kinase 7 (MKK7). Using male and female floxed Dlk mice (8) and intravitreal injections of a Cre-expressing, capsid-modified (9, 10), adeno-associated virus (AAV2), it was shown that targeted disruption of Dlk rendered RGCs highly resistant to axon injury-induced cell death in the mouse optic nerve crush model (5). Moreover, since JUN N-terminal kinases (JNK) 1-3, typically activated by one or more upstream MAP3Ks, have been shown to play a central role in RGC cell death (11, 12), it was tested whether DLK (MAP3K12) was a relevant upstream trigger. Indeed, whereas wildtype mice intravitreally injected with AAV2-Cre responded to optic nerve injury with a robust upregulation of JNK signaling in the RGC soma and axon, floxed Dlk mice had a reduced activation of the pathway (5). Finally, using published kinase inhibitor profiling data (13, 14), the protein kinase inhibitor tozasertib was identified as an inhibitor of DLK and showed that it protected RGCs in both the rat optic nerve transection and glaucoma models (5). In some embodiments, target sequences for DLK/MAP3K12 may comprise a nucleotide sequence selected from the group consisting of SEQ ID NOs: 788-1023.
It was noticed that tozasertib consistently improved RGC survival in vitro more than DLK knockdown or knockout alone (5). This suggested that one or more additional kinase targets of tozasertib (of which there are more than 150) might cooperate with DLK to promote cell death and that simultaneous inhibition of both was to promote maximal RGC survival. In order to identify these other targets, the kinome screen was modified to include Dlk siRNA in every well and sensitize it to those library siRNAs that synergized with DLK knockdown to further increase RGC survival (
combined knockdown of both DLK and LZK recapitulated the survival produced by the drug and further addition of tozasertib had no effect (as one would expect if all key drug targets were already genetically disrupted) (
While the evidence for LZK as a key mediator of RGC cell death was substantial, it was entirely based on individual siRNAs. It is important to note that the phenotype produced by any given siRNA is the biological sum of the canonical “on-target” silencing mediated by all 21 nucleotides and the promiscuous “off-target” silencing mediated by a six-to-seven nucleotide seed sequence (17, 18). The latter can silence hundreds of targets, unfortunately causing it to dominate phenotypes and siRNA screen results (19). Thus, two complementary approaches were chosen to validate the LZK finding, siPOOLs and clustered regularly interspaced short palindromic repeats (CRISPR)-mediated gene knockouts. siPOOLs are pools of 30 siRNAs targeting a common gene. Because each of the 30 component siRNAs has a different set of off-targets but shares a common on-target, the siPOOL provides a 30-fold enrichment for on-target versus off-target silencing (20). As expected, while the Lzk siPOOL only minimally improved primary RGC survival, the combination of Dlk and Lzk siPOOLs was highly synergistic in terms of both survival (
CellTiter-Glo assay as similar results were obtained with conventional stains of viability (
It was next sought to identify non-kinase members of the injury signaling pathway and thus we expanded our platform to screen a whole-genome library of 16,877 siRNA minipools (four per gene). In order to improve the signal-to-noise of the assay and facilitate screening at the whole-genome scale, the fact that LZK inhibition does little to improve baseline survival but highly sensitizes cells to further DLK inhibition was leveraged. Thus, in addition to the standard library siRNA minipool, each well also received the Lzk siPOOL. To analyze the results, the fact that genome-scale screens sample siRNAs with every possible seed combination multiple times was taken advantaged, thus permitting two distinct bioinformatics approaches to address pervasive off-target effects. First, the survival effect of a given seed can be tracked as it appears dozens of times throughout the library. This can then be used to generate a correction factor which helps to subtract out the off-target contribution to the phenotype and reveal the component mediated by the on-target (19). Secondly, in addition to knowing the survival/toxicity produced by each seed, one can also predict all its possible off-targets. Then, using a Haystack analysis to search for commonly-targeted genes by similarly-behaving seeds, one can actually try to deconvolute the off-target effects and identify the genes whose off-target silencing affects survival (22).
When the results using the first method (seed-corrected, on-target) were analyzed, it was found that DLK was the top gene amongst all 16,877 genes (
By this point, four different transcription factors (i.e. ATF2, JUN, SOX11, MEF2A) were identified, each which had a partially protective effect when disrupted. Given the lesson of redundancy from DLK and LZK, it was considered the possibility that these four transcription factors might need to be simultaneously inhibited for maximal neuroprotection. Thus, wildtype RGCs were transfected with Mef2a, Jun, Sox11 or Atf2 siPOOLs, alone or in combination, and survival was measured 96 hours later. The results showed that inhibition of the four transcription factors is highly synergistic, increasing survival as much as simultaneous DLK/LZK inhibition (
Targeting DLK as a neuroprotective strategy for glaucoma has several attractive features. As mentioned above, it has a robust, evolutionarily-conserved phenotype (5, 31), it was identified using an unbiased method and it is readily druggable with small molecule inhibitors. In addition, it has been validated
independently and DLK inhibition, rather than simply delaying cell death, seems to prevent it altogether (6). Moreover, the cells kept alive with DLK inhibition remain electrophysiologically active in vitro (5) and, in vivo, retain a relatively healthy gene expression pattern, despite their previous injury (6). Finally, some (4, 6, 8) but not all (32) data suggest that DLK plays a role in the axon degeneration program triggered by axon injury. On the other hand, DLK inhibition retards axon regeneration (6), a potential limitation with respect to the generation of neuroregenerative strategies (33-35). This underscores the need to dissect out the pathway members responsible for cell death decisions from those that are responsible for axon regeneration. Furthermore, despite their central role in cell death (and regeneration), the mechanisms by which DLK and LZK are activated in response to axonal injury and by which they activate the downstream transcription factors has yet to be elucidated.
As disclosed herein, functional genomic screening was integrated with a CRISPR/AAV therapeutic platform in order to further probe the DLK/LZK cell death pathway and seamlessly interdict the activity of the various candidate neuroprotective targets with a viable gene therapy vector.
In some embodiments, an enhanced set of RNA-based screening paradigms, using sgRNAs and networks of siRNAs, will be used in order to identify novel DLK/LZK pathway members, including the as-yet-unidentified upstream activator(s) and targets that might dissociate cell death and regeneration. In some embodiments, a CRISPR/AAV vector will be developed that allows for rapid validation of hits from SA1 in rodent models of optic neuropathy. Variants will be created of these neuroprotective viruses which are suitable for gene therapy applications, thereby expanding the space of potential targets to include non-druggable gene products and even networks of genes. In some embodiments, a combination of proteomics and loss-of-function/gain-of-function experiments will be used to probe the mechanism by which DLK regulates the downstream mediator, MEF2A and determine if MEF2A inhibition can serve as a therapeutic strategy to promote RGC survival without preventing regeneration.
From a mechanism standpoint, the screening platform uniquely combines a disease-relevant primary neuron (i.e. RGCs) with arrayed, whole genome-scale, RNA-based screens that can be completed in a few weeks, to give us an unbiased and comprehensive tool to better understand RGC cell death signaling. Indeed, SOX11, critical for RGC development, is a perfect example of how such an approach can make novel and unexpected discoveries about genes involved in cell death. The modifications proposed below (i.e. to screen gene networks and to use CRISPR technology) will open up the space of potential neuroprotective targets that can be identified.
From a therapeutic standpoint, CRISPR editing in RGCs will be utilized to develop AAV/CRISPR therapeutics in vivo. In some embodiments, a, AAV/CRISPR packaging of two separate cistrons (i.e. one expressing the gRNA and the other expressing spCas9) into a single AAV particle, overcomes a major limitation in the use of CRISPRs for gene therapy. This creates the potential for a gene therapy-based approach to both validate hits from our screens with dramatically improved throughput and to be able to target these pathway members therapeutically, including those that would not be readily druggable with small molecules. Overall, both the DLK target pathway itself and the approaches constitute an innovative strategy for the development of novel neuroprotective therapies for glaucoma and other forms of optic nerve disease.
Use enhanced RNA-based screens in primary RGCs to identify the novel DLK/LZK pathway members, including the as-yet-unidentified upstream activators.
The mechanism by which DLK and LZK are activated in response to optic nerve injury remains to be determined. In D. melanogaster and C. elegans, an E3 ligase (called highwire and RPM-1, respectively), suppresses the basal protein levels of the DLK homolog (called Wallenda and DLK-1, respectively). In response
to axon injury, this suppression is relaxed, leading to DLK accumulation and cell death. Interestingly, the role for the Highwire/RPM-1 homolog in mouse, PHR1, is less clear. As would be expected, disruption of Phr1 in dorsal root ganglion cells increases the level of DLK (36, 37), but in RGCs, knockdown of PHR1 has no effect on DLK levels or survival (data not shown) and the conditional Phr1 knockout mouse has normal RGC survival (38, 39). The intrinsic calcium-sensing motif described in C. elegans DLK-1 (40) does not appear to be functionally conserved as replacement of endogenous mouse LZK (the only mouse DLK-1 homolog which contains the motif) with either wildtype human LZK or a mutant lacking the hexapeptide motif leads to equivalent cell death (
1. Screen an Arrayed sgRNA Library Targeting the Mouse Kinome to Identify Genes Whose Targeted Disruption Improves RGC Survival.
To address the first limitation mentioned above, CRISPR-based screening will be performed, somewhat analogous to the iCRISPR system (42). A colony of spCas9-expressing mice was scaled and robust survival was demonstrated when we isolate RGCs from these mice and transfect them with sgRNAs instead of siRNAs (
For the screen, RGCs will be immunopanned from spCas9-knockin mice and seeded at 1,000 cells per well in 384-well plates. The library of sgRNAs will be reverse transfected with NeuroMag (Oz Biosciences) at 30 ng per well, in duplicates. Since 1 nM Lzk siPOOL worked well to sensitize a whole-genome siRNA library to DLK inhibition, the same reagent will be added to all wells of this screen. Each plate will contain negative (tracrRNA) and positive (Dlk) control sgRNAs that serve as quality controls and normalization standards (to allow for plate-to-plate comparisons). After 48 hours, each well will be treated with 1 μM colchicine to promote DLK/LKZàJNK signaling and lower background survival (15, 47). Finally, after an
additional 48 hours, survival will be measured using CellTiter-Glo and normalized to the median negative control wells on each plate. Genes will be scored and ranked based on the mean and median activity for the three sgRNAs. Active sgRNAs will be retested in spCas9 and wildtype RGCs to ensure that the activity is spCas9-dependent. As with siRNAs, secondary screening will be performed by synthesizing new sgRNAs with sequences not tested in the initial screen.
Given that the platform of transfecting primary RGCs with RNA and measuring survival has been extensively tested, any issues with the screen itself are not anticipated. Moreover, given the results from the whole-genome siRNA screen which used Lzk siPOOL to sensitize the system to DLK inhibition and yielded the Dlk siRNA minipool as the top hit amongst 16,877 genes, the strategy will be able to identify DLK pathway members. The main issue will be the labor involved in synthesizing the sgRNA library. Pooling templates and in vitro transcribing pools of three sgRNAs are further contemplated. Multiple sgRNAs targeting the same gene should lessen the chance that a given locus escapes targeting by having all the indels fortuitously end up as a multiple-of-three nucleotides. However, at least with LZK and DLK, increased efficacy with the pooling strategy have not been observed (
2. Screen an siRNA Library, Grouped to Target Kinase Networks, in Order to Identify Genes Whose Knockdown Improves RGC Survival.
Even if a kinase plays a key role in RGC cell death, its phenotype when inhibited could be masked by redundant/compensatory kinases. The sensitized screens were one approach to circumvent this limitation, but relied on already knowing one member of the redundant pair (e.g. LZK in the genome-wide screen). As an alternative, more unbiased approach, an siRNA library can be screened in which every well simultaneously targets multiple kinases, grouped because they may be part of a redundant signaling network. The Sigma kinome siRNA library will be initially utilized, condensed into 623 minipools (three siRNAs per gene), and then combining the minipools in novel combinations. As it is not feasible to try all pairwise kinase combinations (6232), three distinct bioinformatics approaches will be utilized to
create the pools-of-minipools. First, and conceptually the simplest, siRNAs targeting kinases with the highest degrees of homology (e.g. JNK1, 2 and 3) will be grouped together. Based on the ENSEMBL database, 488 genes in the Sigma kinome siRNA library shared significant homology with another member, leading to 1626 pairs (as a given gene could be homologous to more than one other member). To make the numbers more manageable, these pairs were further clustered into 111 groups with a median of four members per group. Second, kinases with overlapping spectra of substrates might be redundant. Phosphorylation reactions with 289 human kinases were performed on protein microarrays composed of 4,191 unique, full-length human proteins and identified 3,656 kinase-substrate relationships (48). Of the 198 genes in the Sigma kinome siRNA library which were included in the analysis, there were 2,089 pairs with significant overlap of their substrate spectra. As kinases can have overlapping spectra with more than one other kinase, this led to 3,386 potential groups with a median of 11 genes per group. Finally, the fact that all 121 kinases, and all of their pairwise combinations, in S. cerevisiae have been assayed for genetic interactions in order to produce a comprehensive epistasis map (E-MAP) was taken advantage (49). It was reasoned that some of these pairwise genetic interactions might be conserved in mouse and, in some cases, indicate redundant/compensatory pathway members (the genetic substrate of the synthetic lethality interaction). 1,674 pairwise kinase interactions were identified amongst the 223 kinases in the Sigma library which have a yeast homolog, giving rise to 1,003 groups with a median size of two genes. In total, 4,500 siRNAs groups, with the largest group containing 19 kinases. Finally, a pilot experiment was performed to ensure that the pools with the most targets (e.g. 19 kinases), most of which are likely to be inactive, would not dilute out an active pair. It was reassuring to note that RGCs transfected with Dlk and Lzk siRNA minipools produced a detectable phenotype even when diluted 64-fold with an inactive siRNA (
We have confirmed that our three approaches to generating kinase combinations yielded lists that included at least one known interacting kinases (e.g. phospho-networks—INK1/2/3 and MKK4/7; homologs—JNK1/2/3, MKK4/7, and DLK/LZK; yeast E-MAP—DLK/LZK). Also, because screens can be done in a matter of weeks, the process can be iterative, with the results of the screen leading to modifications in the grouping algorithms. The actual construction of the library should be relatively easy given access to a Labcyte Echo 550 acoustic liquid
handler which can assemble all 4,500 groups in a matter of hours, without the use of pipet tips. One obstacle may be the combination of so many siRNAs in a single well could lead to numerous off-target interactions. As a complementary or alternative strategy, siTOOLS (Munich, Germany) may be utilized to build a mouse kinome library using siPOOLs. This would have the advantage of enriching for on-target effects by a factor of 30. Indeed, when A439 cells were transfected with siRNAs versus siPOOLs targeting kinases and assayed for survival, siRNAs targeting the same gene had an R2 of only 0.19 (
3. Determine if Validated Hits from Section 1 or 2 Above Affect DLK/LZK Signaling.
As was done with DLK and LZK (
upstream activators of the DLK/LZK. Those that promote survival without affecting MKK/JNK phosphorylation must be reducing the sensitivity to DLK, as it is sufficient to trigger RGC cell death.
While the effects of siRNA are easy to detect on unselected, whole populations, sgRNAs are more difficult because knockouts do not occur in 100% of the cells. To address this issue, RGCs will be transfected with the sgRNAs and subjected to the standard 96-hour protocol with colchicine in order to provide a strong selection pressure and enrich for knockout cells. Then, depending on the number of cells remaining, JNK pathway status will be measured by immunoblotting for JNK phosphorylation or immunofluorescence using antibodies against phosphorylated-JUN.
4. Determine if Validated Hits from Section 1 or 2 Genetically Function Upstream or Downstream of DLK and LZK.
RGCs will be transfected as in Section 3 and then, after sufficient time for selection, the remaining cells will be challenged with adenovirus overexpressing wildtype rat DLK or human LZK. Genes that are upstream of DLK/LZK should have their knockdown/knockout phenotype reversed with DLK/LZK overexpression. As a complementary approach, adenoviruses will be produced overexpressing the cDNA for the candidate kinases (including any known mutants which render the kinase constitutively active) and tested whether their overexpression promotes RGC cell death and whether it can be blocked with DLK/LZK inhibitors and/or Dlk/Lzk siRNA (as would be expected if they function upstream of DLK/LZK).
Reversing the survival phenotype with our readily-available, wildtype rat DLKexpressing adenovirus and not a true constitutively active mutant may be performed. If the survival phenotype can be reversed, a DLK mutant (S584A/T659A) will be subcloned and expressed which is thought to have increased activity (50). Finally, although the main purpose of Section 1 above is to identify upstream activators of DLK and LZK, kinases will be pursued that appear to be functioning downstream/independent of DLK/LZK.
5. Determine if Validated Hits from Section 1 or 2 Promotes Survival without Affecting Axon Regeneration.
RGCs will be transfected with siRNAs, siPOOLs or sgRNAs (from Section 1 and 2) targeting validated kinases, alone or in combination with Lzk siPOOL (to sensitize the cells to pathway inhibition) and, after 72 hours, stained with calcein AM. A Cellomics automated microscope will be used to image the cells and calculate the overall neurite length. Ideally, genes will be identified whose inhibition promotes survival without affecting neurite outgrowth.
To confirm that the neurite-outgrowth assay might be predictive of regeneration in vivo, we have previously shown that primary RGCs treated with DLK/LZK inhibitors and/or siRNAs have dramatically-reduced neurite length compared to control-treated cells (
6. Develop a CRISPR System, Compatible with AAV Delivery, in Order to Knockout Target Gene Function in RGCs In Vivo.
The screening work has previously identified multiple europrotective targets in RGCs and the approaches outlined in Sections 1 and 2 will further expand the list of candidates. Additional steps include: the validation of these hits in vivo in rodent models of optic neuropathy, the prioritization of the validated hits, and the subsequent development of specific inhibitors of the prioritized targets. A potentially ideal approach would be a gene therapy-based strategy that could both quickly validate hits (i.e. knockout genes in vivo without having to create/obtain knockout mice) and later serve as the therapeutic itself. As demonstrated in
To test this system, an AAV construct was generated that used the H1 promoter to express an mCherry-histone 2B fusion in the Pol II direction and the Dlk #4 gRNA (5′GNNNNNNNNNNNAGATCTNNNGG 3′ (SEQ ID NO: 163)), which conveniently targets a BglII site (
To help with the analysis of RGCs after transduction, a flow cytometry was developed based method for quantifying and characterizing mouse RGCs. Retinas are isolated, dissociated and fixed with acetone (to preserve nucleic acid quality) and then stained with antibodies against the RGC antigens γ-synuclein (SNCG) and β-TH-tubulin (TUJ1). The SNCG+/TUJ1+ double-positive cells represent the RGCs, as confirmed by the findings that they also stain for the additional RGC markers, Thy1.2 and NeuN (
7. Knockout Dlk as Proof-of-Principle that a scAAV/CRISPR-Based Approach can be Used to Target Genes In Vivo in the spCas9-Knockin Mouse.
Using the virus that we have validated in vitro (
Finally, genomic DNA will be purified and assayed for Dlk disruption, as measured by the loss of BglII digestion, in order to determine the optimal titer (i.e. greatest amount of cutting without causing RGC toxicity). In the final step, mice expressing spCas9 will be intravitreally injected in one eye with the optimal dose of scAAV2-Dlk gRNA:H1:H2B-mCherry or a control virus that expresses a nontargeting gRNA in place of the Dlk gRNA. After two weeks, the eyes will undergo a three second optic nerve crush (so as not to compromise the vasculature) or
the sham control procedure and survival in the four groups (n=10 retinas per group) will be measured by SNCG/TUJ1 flow cytometry after an additional two weeks. In the future, whether this system is compatible will be tested with intravenous administration of AAV9 vectors, as this approach has been shown to effectively transduce RGCs (57, 58).
Based upon the already-described floxed Dlk mouse experiments, it is known that targeted disruption of Dlk improves RGC survival (5). Moreover, it is known that CRISPR-based knockout works in vitro. Thus, it seems highly likely that an increase in survival will be detected in those RGCs transduced with the Dlk gRNA. If >50% cutting or a survival phenotype is not seen, mCherry-expressing, AAV-transduced cells will be analyzed. This should be readily achievable because the flow cytometry technique currently works with as many as four channels (
8. Use the scAAV/CRISPR System to Validate that the Hits Identified in SA1 are Mediators of RGC Cell Death in the Mouse Optic Nerve Crush Model.
Identified genes whose knockdown or knockout improves RGC survival in vitro will be targeted in vitro with gRNAs and tested as described for DLK in Section 7. Priority will be given to those genes that appear to function upstream of DLK or that do not seem to affect regeneration. As was done with DLK, multiple gRNAs for each target gene will be tested in primary RGCs to identify the most efficacious ones to move forward into the AAV system. Finally, targets hypothesized to be upstream of DLK will be validated to determine if they suppress the pathway in vivo by immunostaining retinal flatmounts for phosphorylated JUN and DLK (5).
Genes arising from Section 1 should be straightforward to test in vivo as the screen identifies the active gRNAs. However, since the screen was sensitized with Lzk siPOOL, it is conceivable that the phenotype in vivo will require LZK or DLK inhibition. Fortunately, when the Lzk conditional knockout was made, a strategy was used that first produced a null allele. Moreover, unlike the situation with the Dlk-null mice (59), Lzk-null mice were viable and fertile. So, the Lzk-null mice were crossed with the spCas9-knockin line to produce Rosa26Cas9/Cas9Lzk−/− mice, the more appropriate line to confirm Section 1 hits.
Genes identified in Section 2 may be more challenging to validate in vivo as they will likely require simultaneous disruption of multiple genes. For those, the small size of the construct will be utilized, and the fact that an additional H1 driving expression of another gRNA can be accommodated, and still keep the same scAAV design. This will allow knockout of two genes with one viral vector (discussed more in Section 11). Future effort, could attempt to use more than two gRNAs (in place of mCherry or in a single-stranded (ss) AAV design) in order to target networks of genes.
9. Use a Modified scAAV2/CRISPR Approach to Validate that Hits Identified in Section 5 do not Affect Regeneration.
Basal levels of RGC regeneration after optic nerve crush are exceedingly low but can be enhanced by inhibition of PTEN, although not when PTEN and DLK are simultaneously inhibited (6). As one approach to test if inhibition of the candidates identified in Section 5 promotes RGC survival without affecting regeneration (unlike DLK), knockout mice were obtained for each candidate gene and breed them onto the Ptenfl/fl background—a prohibitive amount of time and effort. Instead, the in vivo knockout approach will be used to generate scAAV that express a gRNA targeting the gene-of-interest from the standard bidirectional H1 promoter in the Pol III direction and mCherry without the H2B fusion (in order to visualize axons) in the Pol II direction. The second H1 promoter will express a Pten gRNA. Mice expressing spCas9 will be intravitreally injected in one eye with the optimal dose of scAAV2-X gRNA:H1:mCherry; H1:Pten gRNA where X is either a nontargeting gRNA (negative control for preventing regeneration), the Dlk #4 gRNA (positive control for preventing regeneration) or a gRNA targeting a candidate from Section 5. After two weeks, the eyes will undergo a three second optic nerve crush (so as not to compromise the vasculature). Finally, after an additional two weeks, optic nerves from the three groups (n=10 optic nerves per group) will be harvested, longitudinally sectioned and the number of mCherry-positive axons extending beyond the crush site will be quantified.
Should the regeneration phenotype of Pten deletion be insufficient, the effect with Socs3 can be potentiated (33, 35, 60). These experiments are proposed as alternatives because the seminal work on the role of DLK in regeneration used Pten (and not combined Pten/Socs3) deletion (6). Multiallelic disruption is ambitious and may occur at a low frequency. Fortunately, the compact H1 promoters leave nearly 1 kb of packaging space, even with the two H1 promoters and a scAAV design, allowing one to engineer in fluorescent protein-based reporters for CRISPR editing (61).
10. Knockout Dlk as the Proof-of-Principle that a Therapeutically-Suitable, ssAAV/CRISPR Based Approach can be Used to Target Genes in a Wildtype Mouse.
The experiments outlined in Sections 7 and 8 take advantage of the robustness of scAAV and allow for quick validation of biology, but are not therapeutically useful as they rely on the spCas9 knockin mouse. As a complementary approach, the bidirectional feature of the H1 promoter will be used to drive the expression of Cas9 and gRNA, allowing both cassettes to be packaged into a single viral vector. Although the strategy allows for an insert that is comparable in size to wildtype AAV, it is too large for scAAV. Thus, ssAAV2-Dlk gRNA:H1:spCas9 particles will be prepared and tested as described in Section 7, except that the animals will not express spCas9 (since it is now being provided by the virus). The virus will be tested in a glaucoma model and include functional measures of vision. Importantly, while Dlk and RGCs have been the focus, these data would have an even more far-reaching implication—the use of a single AAV/CRISPR virus to modify the genome and increase the resistance to a disease state.
The percentage of cells undergoing biallelic disruption may be insufficient to reliably detect a survival increase in the optic nerve crush model. To date, nearly 20 Dlk target sites have been sampled using the in vitro model and have not found sites that work much better than #4 (used for Section 6). Even monoallelic disruption of Dlk can increase survival, albeit less than the robust protection conferred by biallelic disruption (37). To sensitize the cells to this monoallelic Dlk disruption, the experiment can be repeated using the Lzk-null mice. A complementary strategy would be to extend the time between optic nerve crush to RGC quantification. Although, the experiment is typically stopped at two weeks (when ˜75-80% of RGCs have died), ongoing loss occurs over the next 1-2 weeks, eventually reaching a plateau of ˜90-95% cell death. Even if only 5% of the cells have biallelic disruption of Dlk, that could produce a 50-100% relative increase in the number of surviving cells at one month (compared to the 20-25% increase at two weeks). Finally, although the Dlk #4 target site is preferred (because of the BglII site), the experiment could be repeated with #5 which has the exact same sequence in mice and humans and thus allows for the animal model to test a human therapeutic.
11. Measure the Off-Target Produced by the AAV/CRISPR Therapeutic Developed in Section 10. It is Well-Established that spCas9 Tolerates Mismatches Better in the Portion of the Protospacer that is Distal to the PAM (62, 63), Thus the Most Likely Off-Targets have a Conserved 3′ Sequence and 1-2 Mismatches in the 5′ End.
Since the BglII site is located near the cut site (3′), nearly all likely off-targets for the #4 gRNA retain the BglII site. The genomic DNA obtained in Section 10 can be used and the BglII assay described in
While the specificity in vitro (
12. Develop an AAV/CRISPR Therapeutic that Delivers Cas9 and Two gRNAs in a Single Vector.
By using the H1 promoter, the Dlk gRNA:H1:spCas9 has a size of 4.5 kb, well under the 4.7-4.8 kb size of the wildtype virus and the 5.2 kb AAV packaging limit. This affords an opportunity to modify the construct proposed in Section 10 with an additional H1-gRNA cassette (˜0.2 kb). Expressing two gRNAs from a single virus (in addition to spCas9) opens up several possibilities, including the use of the more specific “nickase” mutant spCas9 (67), the generation of large deletions in a single gene and, relevant to the theme of our work, the simultaneous targeting of two compensatory genes. To test this, the H1:Lzk gRNA #1 will be added to expression cassette to the construct described in Section 10 and generate ssAAV2-Dlk gRNA4:H1:spCas9;H1:Lzk gRNA5 viral particles. These will be tested as described in Section 10, this time comparing viruses that target Dlk alone or in combination with Lzk (n=10 retinas per group).
The size of this construct pushes the limit of the packaging capacity of AAV (although remains less than 0.1 kb larger than wildtype virus). If this becomes an issue, the smaller mouse H1 promoter (which also functions bidirectionally) could be used or saCas9, which is nearly 1 kb smaller than spCas9 (52). The latter approach would require different target sites as neither the #4 nor #5 site is compatible with the saCas9 PAM requirement.
13. Explore the Mechanism by which DLK Regulates MEF2A to Promote RGC Cell Death.
MEF2A has a well-studied role in muscle differentiation and cardiovascular physiology (68). In neurons, MEF2A has been shown to play a critical role in promoting survival (27) and, although the brain-specific conditional knockout of Mef2a does not affect gross neuronal survival, the combined knockout of Mef2a/c/d shows a clear increase in neuronal apoptosis leading to early post-natal lethality (30). In response to excitotoxic stimuli, cortical neurons utilize two mechanisms to inhibit MEF2A signaling and promote apoptosis: a caspase catalyzed cleavage of MEF2A that results in dominant-negative activity (69) and a cyclin-dependent kinase 5 (CDK5) phosphorylation of S408 of MEF2A that leads to inactivation of the transactivation domain (70) or, especially in the case of dendritic/synaptic physiology, the subsequent sumoylation at K403 and the formation of a transcriptionally-repressive form of MEF2A (29, 71). In each of these cases, MEF2A promotes neuronal survival and inhibition of MEF2A leads to cell death. In contrast, MEF2A was identified as a mediator of RGC cell death in vitro and in vivo (
In order to identify the mechanism by which DLK/LZK regulates MEF2A, MEF2A was purified from floxed Dlk/Lzk primary RGCs transduced with adeno-Cre (in order to ablate DLK/LZK signaling), adeno-GFP (which have active DLK/LZK signaling), wildtype rat DLK (which superactivates the pathway) or kinase-dead rat DLK (which functions as a dominant-negative). Phosphorylated residues and the relative abundance of phosphorylation at each of these sites will be determined by tandem mass tag (TMT, Thermo Scientific) quantification and liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis on one of several Orbitrap mass spectrometers (Thermo Scientific) available to our group. This approach also permits an analysis of other post-translational modifications like sumoylation and acetylation, already known to play a role in MEF2A signaling (29, 71). Changes that appear dependent on the presence of DLK/LZK and especially those that increase with superactivation of the pathway will be followed up in Section 15.
The biggest challenge may be the amount of starting material, given that MEF2A post-translational modifications will be assayed in the context of our disease relevant, primary cell system. To circumvent the problem, 3-5,000,000 RGCs can be routinely isolated, which should be more than enough given the low-complexity of the IP (i.e. only looking at one protein). If not enough MEF2A protein can be obtained, overexpression of an epitope-tagged MEF2A may be used. Second, the sensitivity and accuracy of MS analysis can be increased via multiple reaction monitoring (MRM) approaches targeting potential MEF2A phosphorylation sites—both known and unknown—after first optimizing an MRM assay for these species (72, 73). If phosphorylation at multiple simultaneous sites is critical for MEF2A regulation by DLK/LZK, a top-down proteomics approach can be employed on intact non-trypsinized MEF2A to determine multiplicity of phosphorylation.
Posttranslational modifications identified in Section 14 will be tested in the primary RGC system. Mutants for each of the residues (e.g. S408A) will be engineered into the mouse Mef2a cDNA and shuttled into an adenoviral vector as described multiple times above. RGCs will then be isolated from the floxed Mef2a mouse and transduced with adenovirus-Cre (MOI 1000) to promote survival and ablate endogenous MEF2A. After 48 hours, adenovirus will be used to reintroduce wildtype MEF2A or the unmodifiable mutants. If a post-translational modification is necessary for MEF2A-dependent killing, the mutant is expected to be a loss-of-function. Moreover, overexpression of wildtype DLK (which superactivates the pathway and accelerates RGC cell death) will be used in combination with the mutants to see if they possess dominant-negative activity (i.e. as would be expected if the modification were part of the mechanism by which DLK signaling activates MEF2A). Finally, at least in the case of phosphosites, phosphomimetic mutations (e.g. S408E) will be engineered and tested whether these gain constitutive activity and promote cell death even in the setting of combined DLK/LZK inhibition (using siRNA).
16. Use a CRISPR/AAV Strategy to Test Whether Targeted Disruption of Mef2a In Vivo Promotes RGC Survival and without Affecting Axonal Regeneration.
Although DLK plays a key role in both cell death and axon regeneration, four downstream transcription factors were identified that mediate the cell death signal. While at least two of them (i.e. SOX11 and JUN) have known roles in regeneration, the role of MEF2A is unknown. Thus, MEF2A could potentially represent a neuroprotective target that dissociates survival and regeneration. Indeed, in vitro, MEF2A knockdown does not appear to grossly affect neurite outgrowth (data not shown). Normally, this line of investigation would be of limited utility since MEF2A, being a transcription factor, is not readily druggable. However, the strategy outlined in Section 10 allows us to therapeutically target genes like MEF2A. Moreover, the approach described in Section 9, in which multiple genes were knocked out, and even
selectively label the cells that have active CRISPR editing, provides an opportunity to easily test MEF2A biology in vivo. Thus, spCas9-2A-GFP knockin mice will be transduced with the virus described in Section 9, modified to target Mef2a, Dlk (positive control for preventing regeneration) or a non-targeting control (negative control for preventing regeneration), and assayed as described in Section 9.
In summary, described herein are sets of experiments aimed at further exploring the upstream and downstream mechanism by which DLK and LZK promote RGC cell death. Moreover, using CRISPR/AAV therapeutics, a platform has been developed that allows for rapid validation in vivo and, most importantly, a seamless transition to a highly-specific gene therapy-based therapeutic. Indeed, while there is tremendous excitement about CRISPR-based therapeutics for genome-editing, the field is limited by the fact that spCas9 and the gRNA cannot be simultaneously packaged into a single viral vector. Using the H1 promoter, this obstacle has been overcome. Using DLK as an example, genome modification can be used therapeutically in optic neuropathies.
HEK293 cells were seeded in T25 flasks and grown to semi-confluence in DMEM supplemented with 10% fetal bovine serum and antibiotics. Two days after plating, cells in one flask were transfected with the plasmid pAAV-CEP290. A transfection mixture containing 4 ug plasmid DNA, 20 ul Lipofectamine 3000 (ThermoFisher/Invitrogen) and 10 ul of the P3000 reagent (ThermoFisher/Invitrogen) in a total of 500 ul additive-free OptiMEM medium was added to the cell culture medium. Cells in a second flask were infected with 100 ul of a packaged and purified stock of AAV2-CEP290 (2.06e10 viral genomes). A third T25 flask was untreated and served as a control. Cells were harvested in trypsin-EDTA 48 h following transfection or infection, and pelleted by centrifugation. For each sample, the cell pellet was resuspended in 200 ul PBS and processed with the Qiagen DNA mini kit as per manufacturer's protocol. Genomic DNA was eluted in 200 ul water.
For T7 EndoI analysis, genomic DNA was extracted by resuspending cells in QuickExtract solution (Epicentre, Madison, WI), incubating at 65° C. for 20 min, and then at 98° C. for 20 min. The extract solution was used directly or cleaned using DNA Clean and Concentrator (Zymo Research, Irvine, CA), and quantitated by NanoDrop (Thermo 30 Fisher Scientific). The genomic region surrounding the CRISPR target site was amplified from ˜100 ng of genomic DNA using Phusion DNA polymerase (New England Biolabs). Multiple independent PCR reactions were pooled and purified using DNA Clean and Concentrator (Zymo Research, Irvine, CA). A 25 μl volume containing 150 ng of the PCR product in 10 mM Tris-HCl, 50 mM NaCl, 10 mM MgCl2 and 100 μg/ml BSA was denatured and slowly reannealed to allow for the formation of heteroduplexes: 95° C. for 10 min, 95° C. to 85° C. ramped at −1.0° C./sec, 85° C. for 1 sec, 85° C. to 75° C. ramped at −1.0° C./sec, 75° C. for 1 sec, 75° C. to 65° C. ramped 5 at −1.0° C./sec, 65° C. for 1 sec, 65° C. to 55° C. ramped at −1.0° C./sec, 55° C. for 1 sec, 55° C. to 45° C. ramped at −1.0° C./sec, 45° C. for 1 sec, 45° C. to 35° C. ramped at −1.0° C./sec, 35° C. for 1 sec, 35° C. to 25° C. ramped at −1.0° C./sec, and then held at 4° C. 1 μl of T7 EndoI (New England Biolabs) were added to each reaction, incubated at 37° C. for 30 min, and then immediately placed on ice. For gel analysis, 3 μl of the reaction was mixed with 3 μl 2× Loading dye (New England Biolabs), loaded onto a 6% TBE-PAGE gel, and stained with SYBR Gold (1:10,000) for ˜15 minutes prior to visualization. Gels were visualized on a Gel Logic 200 Imaging System (Kodak, 15 Rochester, NY), and quantitated using ImageJ v. 1.46. NHEJ frequencies were calculated using the binomial-derived equation:
where the values of “a” and “b” are equal to the integrated area of the cleaved fragments after background subtraction and “c” is equal to the integrated area of the un-cleaved PCR product after background subtraction.
For restriction analysis, genomic DNA was extracted by resuspending cells in QuickExtract solution (Epicentre, Madison, WI), incubating at 65° C. for 20 min, and then at 98° C. for 20 min. The extract solution was used directly or cleaned using DNA Clean and Concentrator (Zymo Research, Irvine, CA), and quantitated by NanoDrop (Thermo 30 Fisher Scientific). The genomic region surrounding the CRISPR target site was amplified from ˜100 ng of genomic DNA using Phusion DNA polymerase (New England Biolabs). Multiple independent PCR reactions were pooled and purified using DNA Clean and Concentrator (Zymo Research, Irvine, CA). A 25 μl volume containing 150 ng of the PCR product in 10 mM Tris-HCl, 50 mM NaCl, 10 mM MgCl2 and 100 μg/ml BSA was digested at 37° C. for 1 hr. For gel analysis, 3 μl of the reaction was mixed with 3 μl 2× Loading dye (New England Biolabs), loaded onto a 6% TBE-PAGE gel, and stained with SYBR Gold (1:10,000) for ˜15 minutes prior to visualization. Gels were visualized on a Gel Logic 200 Imaging System (Kodak, 15 Rochester, NY), and quantitated using ImageJ v. 1.46. NHEJ frequencies were calculated using the equation:
where the values of “a” and “b” are equal to the integrated area of the cleaved fragments after background subtraction and “c” is equal to the integrated area of the un-cleaved PCR product after background subtraction.
Leber's congenital amaurosis (LCA) is comprised of a group of early-onset childhood retinal dystrophies that are characterized by severe retinal dysfunction and severe visual impairment, or blindness during the first months of life. LCA, an orphan disease, is the most common cause of inherited blindness constituting as much as 5% of all known hereditary retinal degenerative diseases. The most common cause for LCA10—a disease for which no FDA approved therapy exists—is a deep intronic mutation in the CEP290 gene (den Hollander, A. I. et al. (2006) American journal of human genetics 79, 556-561) (
Some forms of LCA are potentially amenable to treatment by recombinant adeno-associated viruses (AAV) engineered to deliver a functional copy of the defective cellular gene. In 2008, a transgene that complemented the mutation in RPE65 was successfully delivered by AAV to LCA2 patients in a Phase I Clinical trial (Maguire, A. M. et al. (2008) The New England journal of medicine 358, 2240-2248). Some responses were noted, but unfortunately, the effects were not durable, potentially due to losing transgene expression (Azvolinsky, A. (2015) Nat Biotechnol 33, 678; Schimmer, J. et al. (2015) Human gene therapy. Clinical development 26, 208-210). Furthermore, mutations in some genes that cause the different LCA subtypes, like CEP290, are simply too large for AAV delivery, and therefore remain untreatable by gene therapy approaches. Moreover, photoreceptors are highly sensitive to protein levels (Olsson, J. E. et al. (1992) Neuron 9, 815-830; Tan, E. et al. (2001) Investigative ophthalmology & visual science 42, 589-600; Seo, S. et al. (2013) Investigative ophthalmology & visual science 54), and transgenic experiments have demonstrated photoreceptor toxicity from CEP290 overexpression (Burnight, E. R. et al. (2014) Gene therapy 21, 662-672). Given the following properties, we believe that the CEP290 mutation is a particularly attractive as a therapeutic target for CRISPR-Cas9 technology.
The development of CRISPR-Cas9 technology has revolutionized the field of gene-editing and offers a profoundly new approach to treating genetic diseases. The CRISPR-Cas9 system is composed of a guide RNA (gRNA) that targets the Cas9 nuclease in a sequence-specific fashion. Cleavage by the CRISPR system requires complementary base pairing of the gRNA to a DNA sequence and the requisite
protospacer-adjacent motif (PAM), a short nucleotide motif found 3′ to the target site (Doudna, J. A. et al. (2014) Science 346, 1258096; Hsu, P. D. et al. (2014) Cell 157, 1262-1278). Currently, the least restrictive and most commonly used Cas9 protein is from S. pyogenes, which recognizes the sequence NGG, and thus, the CRISPR targeting sequence is N20NGG. While numerous studies have shown that disease mutations can be efficiently targeted in vitro, the development of CRISPR-Cas9-based therapeutics for in vivo use is been hampered by safety concerns and delivery constraints.
While CRISPR targeting of disease mutations has been shown to be effective in numerous in vitro settings, and as well in vivo through mouse and other animal studies, all current approaches are still far from clinical use due in large part to delivery constraints. AAV vectors are the most frequently and successfully used viral vectors in ocular gene therapy injection (Dalkara, D. et al. (2014) Comptes rendus biologies 337, 185-192; Day, T. P. et al. (2014) Advances in experimental medicine and biology 801, 687-693; Willett, K. et al. (2013) Frontiers in immunology 4, 261; Dinculescu, A. et al. (2005) Human gene therapy 16, 649-663). Several features make AAV the most attractive choice: the virus is nonpathogenic, it infects both dividing and non-dividing cells, expression can persist for long periods of time, and it is particularly noteworthy for its history of safety, efficacy and a general lack of toxicity in clinical trials. Additionally, specific AAV serotypes are effective in targeting photoreceptor cells after subretinal injection. While AAV vectors provide a safe means of delivering therapeutic CRISPR components, there is one major technical obstacle that limits their utility—their size. Wild type AAV genomes are ˜4.7 kb in length and recombinant viruses can package up to ˜5.0 kb (Dong, B. et al. (2010) Molecular therapy: the journal of the American Society of Gene Therapy 18, 87-92; Wu, Z. et al. (2010) Molecular therapy: the journal of the American Society of Gene Therapy 18, 80-86). This packaging capacity defines the upper limit of the DNA that can be used for a single viral vector.
The DNA required to express Cas9 and the gRNA, by conventional methods, exceeds 5.2 kb: Pol II promoter (˜0.5 kb), SpCas9 (˜4.1 kb), Pol II terminator (˜0.2 kb), U6 promoter (˜0.3 kb), and the gRNA (˜0.1 kb). One approach to AAV delivery challenge is a two-vector approach: one AAV vector to deliver the Cas9, and another AAV vector for the gRNA (Swiech, L. et al. (2015) Nat Biotechnol 33, 102-106). However, the double AAV approach utilizes the small mouse Mecp2 promoter, a gene that has been found to be expressed in retinal cells—with the critical exception of rods (Song, C. et al. (2014) Epigenetics & chromatin 7, 17; Jain, D. et al. (2010) Pediatric neurology 43, 35-40)—suggesting that, aside from the potential toxicity due to increased viral delivery load, the co-delivery approach would likely fail to target the vast majority of LCA mutations apriori. While this is a potentially viable approach for other gene therapy-mediated genomic editing, we are instead proposing a single vector approach for retinal gene editing that should increase efficiency, target photoreceptors specifically, and reduce potential toxicity from viral load delivery.
We recently reported that use of the H1 promoter, rather than the more traditionally used U6 promoter, to direct gRNA transcription allows an approximate doubling of the available CRISPR gene targeting space (Ranganathan, V. et al. (2014) Nature communications 5, 4516). Notably, we also detected a lower propensity for off-target cutting suggesting that the H1 promoter would be more favorable for therapeutic approaches. During these studies, we noticed the presence of a protein-coding gene (PARP-2) in close genomic proximity to the endogenous H1RNA gene (Myslinski, E. et al. (2001) Nucleic Acids Res 29, 2502-2509; Baer, M. et al. (1990) Nucleic Acids Res 18, 97-103). The sequence between the start of the H1RNA (a pol III RNA transcript) and the PARP-2 gene (a pol II transcript) is 230 bp (
We will develop two CRISPR/AAV therapeutics in parallel based on orthogonal Cas9 systems. First is the development of an LCA10 therapeutic mediated by the co-delivery of the SpCas9 and a guide RNA through a single AAV vector. Second is the development of an LCA10 therapeutic mediated by the co-delivery of the SaCas9 nickase and four guide RNAs through a single AAV vector. Both strategies are being developed for eventual clinical use and thus safety is paramount. Lastly, we will generate isogenic human stem cells lines containing the LCA10 for the characterization and development of novel therapeutics.
Development of an LCA10 Therapeutic Mediated by the Co-Delivery of the S. pyogenes Cas9 (SpCas9) and a Guide RNA Through a Single AAV Vector.
Background and Rationale. Although numerous studies have shown that disease mutations can be efficiently targeted in vitro, the development of SpCas9-based therapeutics for in vivo use has been hampered by delivery constraints. Using the compact bidirectional promoter system, we have demonstrated a clinically viable platform for the co-delivery of SpCas9 and a gRNA through a single AAV vector. SpCas9 offers several advantages: it is the most commonly used, most versatile, and best understood CRISPR system. Its PAM requirement (NGG) is considerably less stringent than other Cas9 proteins, which in turn means that more genes and more mutations can be directly targeted. Importantly for clinical therapeutic approaches, recent advances in protein engineering of SpCas9 have developed multiple high-specificity/high-fidelity variants with as little as zero detectable genome-wide off-target effects (Kleinstiver, B. P. et al. (2016) Nature doi:10.1038/nature16526; Slaymaker, I. M. et al. (2016) Science 351, 84-88). Indeed, unlike other Cas9 ortholog targeting sites, the intronic CEP290 mutation falls within an SpCas9 site (
By customizing the gRNA sequence, we can direct SpCas9 (or SpCas9 variant) to the CEP290 mutation, causing a double strand-DNA break near the splice-donor site. Cellular response to DNA breaks occurs primarily through one of two competing pathways: nonhomologous end-joining (NHEJ), or Homology Directed Repair (HDR). NHEJ, the more dominant pathway for DNA repair, is an error-prone pathway that results in deletions and insertions near the break point, commonly +/−˜15 nt (Mali, P. et al. (2013) Science 339, 823-826). Thus, by using the cells normal DNA repair machinery, we can disrupt the sequences near the splice-donor site, prevent inclusion of the exon X, and restore normal CEP290 splicing and function. Importantly, many mutations in this intronic region would be expected to restore in normal splicing.
Our approach will involve modifying and optimizing CRISPR-Cas9 methodology so all the needed components can be delivered to photoreceptors by a single AAV5, an AAV serotype with documented performance in mammalian rods. By swapping Cas9 for GFP in using our H1-AAV system, we have been able to demonstrate efficient delivery of GFP to photoreceptors using AAV5 (
1. Computational selection and analysis of gRNAs. Since the constructs are being developed with the goal of eventual clinical use, it is essential to carefully monitor them for potential off-target activity (Wu, X. et al. (2014) Quantitative biology 2, 59-70). For this purpose, we will pursue several complementary approaches. Taking a bioinformatics approach, we determined all the potential CRISPR sites in the human genome using a custom Perl script written to search both strands and overlapping occurrences of the SpCas9 targeting site; for example, in the human genome there are 137,409,562 CRISPR sites after filtering out repetitive sequences. We have computationally determined the propensity for each site to exhibit off-target effects using Bowtie (Langmead, B. et al. (2009) Genome biology 10, R25) to realign each CRISPR site back onto the genome allowing up to 3 base mismatches throughout the targeting sequence. Our analysis of the LCA10 SpCas9 site identifies 13 potential off-target loci that we will test for spurious targeting: 1 site with 2 mismatches, and 12 sites with 3 mismatches (computational data is available at http://crispr.technology). PCR primers flanking the on-target and predicted off-target sites will be used with a high-fidelity polymerase (NEB, Phusion) to amplify the genomic sequence that will then be tested by the T7EI assay. This will allow us to monitor the targeting accuracy for our optimization experiments both in vitro and in vivo.
2. In vitro evaluation in human cell lines. We have developed several SpCas9 targeting plasmids that contain a unique restriction site for simple target gRNA insertion, and flanking NotI sites to allow for easy subcloning into the ITR containing vector required for AAV production. In addition, we will generate constructs containing the two recently reported high-fidelity Cas9 variants SpCas9-HF and
eSpCas926,27 (
3. High-throughput sequencing for on-target/off-target mutagenesis. We aim to perform site-specific deep sequencing analysis of on-target and off-target sites. Genomic sequences flanking the CRISPR target site and predicted off-target sites will be amplified using high-fidelity polymerase (NEB, Phusion) for 15 cycles, and then purified (Zymo, DNA Clean & Concentrator-5). Purified PCR products will be amplified for 5 cycles to attach Illumina P5 adapters and sample-specific barcodes, purified again, and then quantitated by SYBR green fluorescence, analyzed on a Bioanalyzer, and finally pooled in an equimolar ratio prior to sequencing with a MiSeq Personal Sequencer. To analyze the sequencing data, 300 bp paired-end MiSeq reads will be de-multiplexed using Illumina MiSeq Reporter software, followed by
adapter and quality trimming of raw reads. Alignments will be performed on all reads to the wild-type sequence and NHEJ frequency will be calculated by: 100*(number of indel reads/number of indel reads+number of WT reads).
4. AAV Virus Production. High titer GMP-like preclinical AAV5 vector will be generated by our collaborator (Dr. William Hauswirth, University of Florida) in their independent vector production facility using the helper-free, plasmid transfection method developed by their lab. Vectors are purified by iodixanol gradient centrifugation followed by Q-column FPLC chromatography, and to establish the
GLP-like purity of the AAV vector stocks, each vector will be subjected to a standardized battery of physical and biological assays including assessment of purity, bioburden, sterility, DNA containing particle titer, infectious titer, particle-to-infectivity ratio and potential contamination by replication competent AAV, each a critical element for the clinical trial IND CMC section.
Development of an LCA10 Therapeutic Mediated by the Co-Delivery of the S. aureus Cas9 (SaCas9) Nickase and Four Guide RNAs Through a Single AAV Vector.
Background and Rationale. Another promising approach that recently emerged to deliver CRISPR by AAV is the use of smaller orthogonal Cas9 proteins.
Highlighted by the S. aureus Cas9 (SaCas9), which is encoded by a ˜3.2 kb transcript (Ran, F. A. et al. (2015) Nature 520, 186-191). However, one limitation in using the SaCas9 is due to its PAM requirement (NNGRRT). The number of unique genomic targeting sites is ˜4-fold less than SpCas9 due to the longer the PAM sequence, and there are no SaCas9 sites that fall on the LCA10 A>G mutation. The specific mutation cannot be targeted by SpCas9 (as described above), thus our approach is to employ a deletion strategy to remove a small region surrounding the cryptic exon. The compact size of the SaCas9 gene allows it to be packaged into a single AAV vector along with one, two, or potentially three, gRNA cassettes using standard promoter and terminator elements32. In terms of safety concerns for CRISPR-based therapeutics, the most significant is undoubtedly off-target mutagenesis; this can occur if Cas9 cleaves DNA at an unintended location. Fortunately, this risk can be reduced by several orders of magnitude by employing a point mutation of Cas9, known as a nickase, which only cleaves one DNA strand. By separately engaging two gRNAs to generate two closely opposed nicks on opposite strands, the Cas9 nickase approach can efficiently generate a double-strand break (
1. Computational selection of gRNAs and construct generation. Similar to our bioinformatics described above, we have identified every SaCas9 targeting site in the genome, and databased the information (data available at http://crispr.technology). In order to target a deletion using the SaCas9 nickase system, we identified four candidate gRNA sites from our computational analysis with favorable properties for generating a deletion using the nickase (Friedland, A. E. et al. (2015) Genome biology 16, 257; Mali, P. et al. (2013) Nat Biotechnol, doi:10.1038/nbt.2675; Ran, F. A. et al. (2013) Cell, doi:10.1016/j.cell.2013.08.021) (
2. In vitro evaluation in human cell lines. We have constructed SaCas9 targeting plasmids that contain flanking NotI sites to allow for easy subcloning into the ITR containing vector required for AAV production. Like the SpCas9 targeting plasmids, these contain unique restriction sites for rapid cloning of specific gRNA sequences. To ensure that we have efficient targeting at each site, we will first assay each site for dsDNA cutting in HEK293 cells. Using PCR primers flanking the target site we use a high-fidelity polymerase (NEB, Phusion) to amplify the genomic sequence that will then be tested for cleavage activity by the T7EI assay; we routinely identify targeting efficiencies between 30-75%. After these sites have been verified for their ability to induce cleavage, they will be cloned into the SaCas9 nickase targeting plasmid we constructed that contains four H1 promoter cassettes for the expression of four gRNAs. Subsequently, deletion analysis by PCR will be performed in HEK293 cells to assess the efficiency of our targeting constructs. Off-target loci predicted from our bioinformatics will be assessed for spurious mutagenesis.
3. High-throughput sequencing for on-target/off-target mutagenesis. As described above, we will also perform site-specific deep sequencing analysis for our SaCas9 targeting experiments. Given that we are using the nickase approach, we do not expect to detect off-target mutagenesis, however, we have determined alternative SaCas9 targeting sites that are potential alternatives for generating an intronic deletion of the cryptic exon.
4. AAV Virus Production. This will be performed as described above.
4 Background and Rationale. While mice can serve as excellent models for retinal degenerations, the rd16 mouse, which is caused by a mutation in CEP290, is not an apt model for testing the CRISPR/AAV therapeutics. Unlike the human point mutation, the mouse degeneration phenotype is caused by a large (897 bp) homozygous in-frame deletion36. Thus, in order to better reflect the human disease, we are currently using two approaches to generate the LCA10 cells: 1) engineering the IVS26 c.2991+1655 A>G mutation into lines from the H7 human embryonic stem cells, and 2) isogenic iPS cell lines derived from the LCA10 patient fibroblasts.
1. Gene edited hESC lines. We have had success using CRISPR-Cas9 to engineer point mutations using ˜150 base oligo donors (˜75 bases of flanking homology). Plasmids encoding Cas9, the gRNA sequence describe above, and a donor oligo will be introduced by electroporation (Ranganathan, V. et al. (2014) Nature communications 5, 4516). Transfection efficiency will be monitored by fluorescence 24-hrs post-electroporation, and bulk gene targeting will be assessed by T7EI assay. Finally, recombinant clones will be screened for the desired insertion by PCR, and then verified by Sanger sequencing. Multiple homozygous and heterozygous lines carrying the A>G mutation will be isolated and assessed for off-target mutagenesis using the bioinformatics described above.
2. Patient-derived iPSC lines. In collaboration with the Johns Hopkins Wilmer Retinal Degeneration Clinic, we are currently searching for patient fibroblasts harboring the CEP290 mutation. Upon identification, skin biopsies will be collected after informed consent under an existing IRB approved protocol with the JHMI-SOM Institutional Review Board. LCA10 patient-derived iPSCs will be made using established protocols (Galluzzi, L. et al. (2012) Cell death and differentiation 19, 107-120; Yu, J. et al. (2007) Science 318, 1917-1920; Takahashi, K. et al. (2006) Cell 126, 663-676; Ludwig, T. E. et al. (2006) Nat Biotechnol 24, 185-187). Using the techniques described above, we will generate mutation corrected isogenic cell lines in parallel, and the predicted off-target loci will be Sanger sequenced to verify the absence of extraneous mutations.
Bioethics. We will be using the human Embryonic Stem Cell line H7 (WiCell); use of these cell lines for the proposed experiments is approved by JHU ISCRO committee (application ISCRO00000023). In addition, we will be using hiPSC generated in our lab under informed consent under the auspices of the JHU Institutional review board (IRB #NA_00047271). Although not considered an ES cell line, we still follow all policies and regulations dictated by the JHU Embryonic Stem Cell Research Oversight committee.
3. in vitro editing of gene edited hESC lines and/or editing of patient derived iPSC lines. In order to thoroughly assess the various lead viral vectors developed in above, cells from either patient derived iPSC lines or gene edited hESC lines will be tested; the constructs that target the LCA10 mutation directly with WT SpCas9, eSpCas9, or the SpCas9-HF, and the construct that uses the SaCas9 nickase approach with four gRNA (
AAV5 was delivered by sub-retinal injection to P0.5 mice. Following either 14 days of 28 days, the retina was harvested and a T7 Endo I assay was performed. (Note: AAV5 targets rod photoreceptors, and the assay was performed on total retina).
For T7 EndoI analysis, genomic DNA was extracted by resuspending cells in QuickExtract solution (Epicentre, Madison, WI), incubating at 65° C. for 20 min, and then at 98° C. for 20 min. The extract solution was used directly or cleaned using DNA Clean and Concentrator (Zymo Research, Irvine, CA), and quantitated by NanoDrop (Thermo 30 Fisher Scientific). The genomic region surrounding the CRISPR target site was amplified from ˜100 ng of genomic DNA using Phusion DNA polymerase (New England Biolabs). Multiple independent PCR reactions were pooled and purified using DNA Clean and Concentrator (Zymo Research, Irvine, CA). A 25 μl volume containing 150 ng of the PCR product in 10 mM Tris-HCl, 50 mM NaCl, 10 mM MgCl2 and 100 μg/ml BSA was denatured and slowly reannealed to allow for the formation of heteroduplexes: 95° C. for 10 min, 95° C. to 85° C. ramped at −1.0° C./sec, 85° C. for 1 sec, 85° C. to 75° C. ramped at −1.0° C./sec, 75° C. for 1 sec, 75° C. to 65° C. ramped 5 at −1.0° C./sec, 65° C. for 1 sec, 65° C. to 55° C. ramped at −1.0° C./sec, 55° C. for 1 sec, 55° C. to 45° C. ramped at −1.0° C./sec, 45° C. for 1 sec, 45° C. to 35° C. ramped at −1.0° C./sec, 35° C. for 1 sec, 35° C. to 25° C. ramped at −1.0° C./sec, and then held at 4° C. 1 μl of T7 EndoI (New England Biolabs) were added to each reaction, incubated at 37° C. for 30 min, and then immediately placed on ice. For gel analysis, 3 μl of the reaction was mixed with 3 μl 2× Loading dye (New England Biolabs), loaded onto a 6% TBE-PAGE gel, and stained with SYBR Gold (1:10,000) for ˜15 minutes prior to visualization. Gels were visualized on a Gel Logic 200 Imaging System (Kodak, 15 Rochester, NY), and quantitated using ImageJ v. 1.46. NHEJ frequencies were calculated using the binomial-derived equation:
% gene modification=100*(1−(SQRT(1−((a+b)/(a+b+c)))))
where the values of “a” and “b” are equal to the integrated area of the cleaved fragments after background subtraction and “c” is equal to the integrated area of the un-cleaved PCR product after background subtraction.
All publications, patent applications, patents, and other references mentioned in the specification are indicative of the level of those skilled in the art to which the presently disclosed subject matter pertains. All publications, patent applications, patents, and other references are herein incorporated by reference to the same extent as if each individual publication, patent application, patent, and other reference was specifically and individually indicated to be incorporated by reference. It will be understood that, although a number of patent applications, patents, and other references are referred to herein, such reference does not constitute an admission that any of these documents forms part of the common general knowledge in the art.
Although the foregoing subject matter has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be understood by those skilled in the art that certain changes and modifications can be practiced within the scope of the appended claims.
This application claims the benefit of U.S. Provisional Application No. 62/358,337, filed Jul. 5, 2016, the entirety of which is hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
62358337 | Jul 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16315462 | Jan 2019 | US |
Child | 18380920 | US |