Incorporated herein by reference in its entirety is the Sequence Listing submitted via EFS-Web as a text file named PeranteauSequenceListing.txt, created Oct. 5, 2021 and having a size of 24,575 bytes.
This invention relates to the fields of genetic disease and gene editing technology. More specifically, the invention provides compositions and methods for correcting gene sequences in utero, thereby curing or ameliorating symptoms of genetic lung disease before or after birth.
Several publications and patent documents are cited throughout the specification in order to describe the state of the art to which this invention pertains. Each of these citations is incorporated by reference herein as though set forth in full.
Congenital genetic lung diseases, such as inherited surfactant protein (SP) syndromes, cystic fibrosis and alpha-1 antitrypsin deficiency, are a source of morbidity and mortality for which no definitive treatment options exist (1-4). These disorders present with a spectrum of severity and timing of onset. Some, such as cystic fibrosis and alpha-1 antitrypsin deficiency, present in late childhood or early adulthood with subsequent disease progression and shortened life expectancy (5, 6). Alternatively, mutations in SP genes can cause respiratory failure at birth and perinatal death or chronic diffuse lung disease. Genetic mutations in one of three surfactant system genes, SFTPB, SFTPC, or ATP-binding cassette protein member 3 (ABCA3), result in either a loss of function phenotype through disruption of surfactant metabolism or its biophysical activity (SP deficiency syndrome) or a toxic gain of function phenotype from disrupted lung development or diffuse parenchymal lung disease perpetrated by cytosolic accumulation of abnormal SP conformers in alveolar type 2 (AT2) cells. Heterozygous mutations in SFTPC are a primary cause of children's interstitial lung disease (chILD). The age of onset and severity of disease due to SFTPC mutations depends on the specific mutation and varies from severe respiratory failure in neonates to idiopathic pulmonary fibrosis in adulthood (7, 8). Unlike surfactant deficiency of prematurity, the inherited forms of SP disease do not respond to exogenous surfactant, anti-inflammatory, or anti-fibrotic therapies. Treatment options for patients presenting with neonatal respiratory failure are limited to palliative care or pediatric lung transplant which is limited by organ availability (9, 10). Thus, there is an urgent need for novel therapies for early correction of lethal genetic lung disorders including SP syndromes.
In accordance with the present invention, compositions and a CRISPR-Cas system-mediated genome editing method are disclosed. An exemplary method comprises introducing into a eukaryotic cell containing and expressing a DNA molecule having a target sequence and encoding at least one mutated gene product in the lung, an engineered, non-naturally occurring CRISPR-Cas system comprising one or more vectors comprising a) a first regulatory element operable in a eukaryotic cell operably linked to at least one nucleotide sequence encoding a CRISPR-Cas system guide RNA that hybridizes with the target sequence, and b) a second regulatory element operable in a eukaryotic cell operably linked to a nucleotide sequence encoding a Cas9 protein or variant thereof, wherein components (a) and (b) are located on same or different vectors of the system or are affixed molecules which effect nucleic acid delivery into mammalian cells, whereby expression of the at least one gene product is altered through the CRISPR-Cas system acting via the DNA molecule comprising the guide RNA directing sequence-specific binding of the CRISPR-Cas system, causing genome editing to remove one or more undesired mutations; and, wherein the Cas9 protein and the guide RNA do not naturally occur together, and said guide strand targets a gene in fetal or post-natal lung selected from the group consisting mutated SFTPB, SFTPC, ABCA3, SERPINA1, and CFTR. In certain embodiments of the method, the CRISPR-Cas system further comprises one or more nuclear localization sequence(s) and, or a tracr sequence.
In some embodiments that the Cas9 protein is codon optimized for expression in a human lung cell.
The invention also provides a method of treating a monogenic lung disease in a subject in need thereof comprising editing a gene in a lung cell of the subject using the CRISPR-Cas system described above. The subject may be selected from a fetal, post-natal, pediatric or adult subject.
Also disclosed is a CRISPR/Cas nuclease comprising a single guide RNA that binds to a target site in a mutated gene causing monogenic lung disease, wherein the nuclease cleaves and inactivates the mutated gene. In one aspect, the gene is SFTPC. Mammalian cells comprising the nuclease are also within the scope of the invention.
In another embodiment, a method of inactivating an endogenous gene causing monogenic disease in a lung cell is provided. An exemplary method comprises the steps of: administering to the cell a CRISPR/Cas nuclease described above, wherein the nuclease cleaves and inactivates a gene causing lethal monogenic lung disease. In preferred embodiments the CRISPR-Cas system is provided such that is it present transiently in the subject.
Also disclosed are guide strands useful for targeting surfactant protein C and the CFTR gene. Kits for practicing the methods disclosed are also within the scope of the invention.
Recent improvements in gene editing technology, including advances in CRISPR-Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats [CRISPR]-CRISPR-associated 9) technology, offer an unprecedented opportunity for therapeutic correction of monogenic disorders (11-15). Standard CRISPR-Cas9 gene editing uses a single guide RNA (sgRNA) to instigate a double strand DNA break (DSB) in a site-specific fashion. Normal cellular mechanisms repair the DSB via nonhomologous-end joining (NHEJ), or if a donor repair template is provided, homology-directed repair (HDR) can be accomplished at low efficiency. Studies in postnatal mouse models have demonstrated the therapeutic potential of in vivo CRISPR-Cas9 gene editing to correct monogenic diseases (16-21).
Although postnatal in vivo gene editing studies are encouraging, some diseases, such as SP syndromes, result in morbidity and mortality at the time of or shortly after birth, precluding a postnatal approach. Several examples of early zygote gene editing have been described and could prove useful where mutation detection at very early developmental time points is possible (22-24). However, de novo mutations that occur later in development may not be treatable by early zygote gene editing. Using CRISPR-Cas9 gene editing to correct lung diseases during later prenatal developmental stages has the potential to reverse such genetic abnormalities before transition to postnatal life when pulmonary function becomes essential. In utero gene editing also provides the opportunity to take advantage of the normal developmental properties of the fetus to accomplish efficient gene editing. Specifically, the small size and immunologic immaturity of the fetus allow for the optimization of the CRISPR-Cas9 “dose” per recipient weight while avoiding a potential immune response to the bacterial Cas9 protein or delivering viral vector (25-28). Additionally, the target cell population for gene editing may be more accessible in the fetus. In the postnatal lung, immune and physical barriers including mucus and glycocalyx proteins limit access to pulmonary epithelial cells including alveolar type 2 (AT2) cells, the target cell population for SP disorders (29, 30). These immune and physical barriers are not as significant in the fetus, and multiple murine studies have demonstrated efficient gene transfer to pulmonary epithelial cells following prenatal viral vector delivery via intra-amniotic injection to take advantage of fetal breathing movements for lung targeting (31-33).
Monogenic lung diseases that are caused by mutations in surfactant genes of the pulmonary epithelium are marked by perinatal lethal respiratory failure or chronic diffuse parenchymal lung disease with few therapeutic options. Using a unique CRISPR fluorescent reporter system, we demonstrate that precisely timed in utero intra-amniotic delivery of CRISPR-Cas9 gene editing reagents during fetal development results in targeted and specific gene editing in fetal lungs. Pulmonary epithelial cells are predominantly targeted in this approach, with alveolar type 1, alveolar type 2, and airway secretory cells exhibiting high and persistent gene editing. We then used this in utero technique to evaluate a therapeutic approach to reduce the severity of the lethal interstitial lung disease observed in a mouse model of the human SFTPCI73T mutation. Embryonic expression of SftpcI73T alleles is characterized by severe diffuse parenchymal lung damage and rapid demise of mutant mice at birth. Following in utero CRISPR-Cas9 mediated inactivation of the mutant SftpcI73T gene, fetuses and postnatal mice showed improved lung morphology and increased survival. These studies demonstrate that in utero gene editing is a novel and promising approach for treatment and rescue of monogenic lung diseases that are lethal at birth in animals and humans.
The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms. As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from the wild type or a comprises non naturally occurring components.
The terms “non-naturally occurring” or “engineered” are used interchangeably and indicate the involvement of the hand of man. The terms, when referring to nucleic acid molecules or polypeptides mean that the nucleic acid molecule or the polypeptide is at least substantially free from at least one other component with which they are naturally associated in nature and as found in nature.
“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%. 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y.
“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson Crick base pairing, Hoogstein binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi stranded complex, a single self hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.
As used herein, “expression” refers to the process by which a polynucleotide is transcribed from a DNA template (such as into and mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.
The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics.
The terms “subject,” “individual,” and “patient” are used interchangeably herein to refer to a vertebrate, preferably a mammal, more preferably a human. Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
The terms “therapeutic agent”, “therapeutic capable agent” or “treatment agent” are used interchangeably and refer to a molecule or compound that confers some beneficial effect upon administration to a subject. The beneficial effect includes enablement of diagnostic determinations; amelioration of a disease, symptom, disorder, or pathological condition; reducing or preventing the onset of a disease, symptom, disorder or condition; and generally counteracting a disease, symptom, disorder or pathological condition.
As used herein, “treatment” or “treating,” or “palliating” or “ameliorating” are used interchangeably. These terms refer to an approach for obtaining beneficial or desired results including but not limited to a therapeutic benefit and/or a prophylactic benefit. By therapeutic benefit is meant any therapeutically relevant improvement in or effect on one or more diseases, conditions, or symptoms under treatment. For prophylactic benefit, the compositions may be administered to a subject at risk of developing a particular disease, condition, or symptom, or to a subject reporting one or more of the physiological symptoms of a disease, even though the disease, condition, or symptom may not have yet been manifested.
The term “effective amount” or “therapeutically effective amount” refers to the amount of an agent that is sufficient to effect beneficial or desired results. The therapeutically effective amount may vary depending upon one or more of: the subject and disease condition being treated, the weight and age of the subject, the severity of the disease condition, the manner of administration and the like, which can readily be determined by one of ordinary skill in the art. The term also applies to a dose that will provide an image for detection by any one of the imaging methods described herein. The specific dose may vary depending on one or more of: the particular agent chosen, the dosing regimen to be followed, whether it is administered in combination with other compounds, timing of administration, the tissue to be imaged, and the physical delivery system in which it is carried.
The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Sambrook, Fritsch and Maniatis, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd edition (1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds., (1987)); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.): PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)), Harlow and Lane, eds. (1988) ANTIBODIES, A LABORATORY MANUAL, and ANIMAL CELL CULTURE (R. I. Freshney, ed. (1987)).
Several aspects of the invention relate to vector systems comprising one or more vectors, or vectors as such. Vectors can be designed for expression of CRISPR transcripts (e.g. nucleic acid transcripts, proteins, or enzymes) in prokaryotic or eukaryotic cells. For example, CRISPR transcripts can be expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression vectors), yeast cells, or mammalian cells. Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press. San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
In some embodiments, a vector is capable of driving expression of one or more sequences in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al., 1987. EMBO J. 6: 187-195). When used in mammalian cells, the expression vector's control functions are typically provided by one or more regulatory elements. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus, simian virus 40, and others disclosed herein and known in the art. For other suitable expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al., MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
In some embodiments, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et al., 1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 8: 729-733) and immunoglobulins (Baneiji, et al., 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477), pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and mammary gland-specific promoters (e.g., milk whey promoter, U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) and the α-fetoprotein promoter (Campes and Tilghman, 1989. Genes Dev. 3: 537-546).
In general, “CRISPR system” refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas”) genes, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a “direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a “spacer” in the context of an endogenous CRISPR system), or other sequences and transcripts from a CRISPR locus. In some embodiments, one or more elements of a CRISPR system is derived from a type I, type II, or type III CRISPR system. In some embodiments, one or more elements of a CRISPR system is derived from a particular organism comprising an endogenous CRISPR system, such as Streptococcus pyogenes. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system). In the context of formation of a CRISPR complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a CRISPR complex. Full complementarity is not necessarily required, provided there is sufficient complementarity to cause hybridization and promote formation of a CRISPR complex. A target sequence may comprise any polynucleotide, such as DNA or RNA polynucleotides. In some embodiments, a target sequence is located in the nucleus or cytoplasm of a cell. In some embodiments, the target sequence may be within an organelle of a eukaryotic cell, for example, mitochondrion or chloroplast. A sequence or template that may be used for recombination into the targeted locus comprising the target sequences is referred to as an “editing template” or “editing polynucleotide” or “editing sequence”. In aspects of the invention, an exogenous template polynucleotide may be referred to as an editing template. In an aspect of the invention the recombination is homologous recombination.
In some embodiments, one or more vectors driving expression of one or more elements of a CRISPR system are introduced into a host cell such that expression of the elements of the CRISPR system direct formation of a CRISPR complex at one or more target sites. For example, a Cas enzyme, a guide sequence linked to a tracr-mate sequence, and a tracr sequence could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements expressed from the same or different regulatory elements, may be combined in a single vector, with one or more additional vectors providing any components of the CRISPR system not included in the first vector. CRISPR system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a CRISPR enzyme and one or more of the guide sequence, tracr mate sequence (optionally operably linked to the guide sequence), and a tracr sequence embedded within one or more intron sequences (e.g. each in a different intron, two or more in at least one intron, or all in a single intron). In some embodiments, the CRISPR enzyme, guide sequence, tracr mate sequence, and tracr sequence are operably linked to and expressed from the same promoter.
In some embodiments, a vector comprises one or more insertion sites, such as a restriction endonuclease recognition sequence (also referred to as a “cloning site”). In some embodiments, one or more insertion sites (e.g. about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more insertion sites) are located upstream and/or downstream of one or more sequence elements of one or more vectors. In some embodiments, a vector comprises an insertion site upstream of a tracr mate sequence, and optionally downstream of a regulatory element operably linked to the tracr mate sequence, such that following insertion of a guide sequence into the insertion site and upon expression the guide sequence directs sequence-specific binding of a CRISPR complex to a target sequence in a eukaryotic cell. In some embodiments, a vector comprises two or more insertion sites, each insertion site being located between two tracr mate sequences so as to allow insertion of a guide sequence at each site. In such an arrangement, the two or more guide sequences may comprise two or more copies of a single guide sequence, two or more different guide sequences, or combinations of these. When multiple different guide sequences are used, a single expression construct may be used to target CRISPR activity to multiple different, corresponding target sequences within a cell. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell.
In some embodiments, a vector comprises a regulatory element operably linked to an enzyme-coding sequence encoding a CRISPR enzyme, such as a Cas protein. Non-limiting examples of Cas proteins include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologs thereof, or modified versions thereof. These enzymes are known; for example, the amino acid sequence of S. pyogenes Cas9 protein may be found in the SwissProt database under accession number Q99ZW2. In some embodiments, the unmodified CRISPR enzyme has DNA cleavage activity, such as Cas9. In some embodiments the CRISPR enzyme is Cas9, and may be Cas9 from S. pyogenes or S. pneumoniae. In some embodiments, the CRISPR enzyme directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the CRISPR enzyme directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.
In some embodiments, a vector encodes a CRISPR enzyme that is mutated to with respect to a corresponding wild-type enzyme such that the mutated CRISPR enzyme lacks the ability to cleave one or both strands of a target polynucleotide containing a target sequence. For example, an aspartate-to-alanine substitution (D10A) in the RuvC I catalytic domain of Cas9 from S. pyogenes converts Cas9 from a nuclease that cleaves both strands to a nickase (cleaves a single strand). Other examples of mutations that render Cas9 a nickase include, without limitation, H840A, N854A, and N863A. In aspects of the invention, nickases may be used for genome editing via homologous recombination.
In some embodiments, an enzyme coding sequence encoding a CRISPR enzyme is codon optimized for expression in particular cells, such as eukaryotic cells. The eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human primate. In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database”, and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding a CRISPR enzyme correspond to the most frequently used codon for a particular amino acid.
In general, a guide sequence is any polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a CRISPR complex to the target sequence. In some embodiments, the degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences, non-limiting example of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g. the Burrows Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies, ELAND (Illumina, San Diego, Calif.), SOAP (available at soap.genomics.org.cn), and Maq (available at maq.sourceforge.net).
In general, a tracr mate sequence includes any sequence that has sufficient complementarity with a tracr sequence to promote one or more of: (1) excision of a guide sequence flanked by tracr mate sequences in a cell containing the corresponding tracr sequence; and (2) formation of a CRISPR complex at a target sequence, wherein the CRISPR complex comprises the tracr mate sequence hybridized to the tracr sequence. In general, degree of complementarity is with reference to the optimal alignment of the tracr mate sequence and tracr sequence, along the length of the shorter of the two sequences. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the tracr sequence or tracr mate sequence.
In some embodiments, the CRISPR enzyme is part of a fusion protein comprising one or more heterologous protein domains (e.g. about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more domains in addition to the CRISPR enzyme). A CRISPR enzyme fusion protein may comprise any additional protein sequence, and optionally a linker sequence between any two domains. Examples of protein domains that may be fused to a CRISPR enzyme include, without limitation, epitope tags, reporter gene sequences, and protein domains having one or more of the following activities: methylase activity, demethylase activity, transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, RNA cleavage activity and nucleic acid binding activity. Non-limiting examples of epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Examples of reporter genes include, but are not limited to, glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP). A CRISPR enzyme may be fused to a gene sequence encoding a protein or a fragment of a protein that bind DNA molecules or bind other cellular molecules, including but not limited to maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD) fusions, GAL4 DNA binding domain fusions, and herpes simplex virus (HSV) BP16 protein fusions. Additional domains that may form part of a fusion protein comprising a CRISPR enzyme are described in US20110059502, incorporated herein by reference. In some embodiments, a tagged CRISPR enzyme is used to identify the location of a target sequence.
In an aspect of the invention, a reporter gene which includes but is not limited to glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP), may be introduced into a cell to encode a gene product which serves as a marker by which to measure the alteration or modification of expression of the gene product. In a further embodiment of the invention, the DNA molecule encoding the gene product may be introduced into the cell via a vector. In a preferred embodiment of the invention the gene product is luciferase. In a further embodiment of the invention the expression of the gene product is decreased.
In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as or one or more vectors as described herein, one or more transcripts thereof, and/or one or proteins transcribed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms (such as animals, plants, or fungi) comprising or produced from such cells. In some embodiments, a CRISPR enzyme in combination with (and optionally complexed with) a guide sequence is delivered to a cell. Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in mammalian cells or target tissues. Such methods can be used to administer nucleic acids encoding components of a CRISPR system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology Doerfler and Bihm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).
Methods of non-viral delivery of nucleic acids include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Feigner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).
The use of RNA or DNA viral based systems for the delivery of nucleic acids take advantage of highly evolved processes for targeting a virus to specific cells in the body and trafficking the viral payload to the nucleus. Viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommnerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700). In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system. Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994). Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:03822-3828 (1989).
Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and ψ2 cells or PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell line may also be infected with adenovirus as a helper. The helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV.
In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors described herein. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.
In one aspect, the invention provides for methods of modifying a target polynucleotide in a eukaryotic cell, which may be in vivo, ex vivo or in vitro. In some embodiments, the method comprises sampling a cell or population of cells from a human or non-human animal, and modifying the cell or cells. Culturing may occur at any stage ex vivo. The cell or cells may be re-introduced into the human or non-human animal.
In one aspect, the invention provides for methods of modifying a target polynucleotide in a eukaryotic cell. In some embodiments, the method comprises allowing a CRISPR complex to bind to the target polynucleotide to effect cleavage of said target polynucleotide thereby modifying the target polynucleotide, wherein the CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within said target polynucleotide, wherein said guide sequence is linked to a tracr mate sequence which in turn hybridizes to a tracr sequence.
In one aspect, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. In some embodiments, the kit comprises a vector system or components for an alternative delivery system such as those described above and instructions for using the kit. In some embodiments, the vector or delivery system comprises (a) a first regulatory element operably linked to a tracr mate sequence and one or more insertion sites for inserting a guide sequence upstream of the tracr mate sequence, wherein when expressed, the guide sequence directs sequence-specific binding of a CRISPR complex to a target sequence in a eukaryotic cell, wherein the CRISPR complex comprises a CRISPR enzyme complexed with (1) the guide sequence that is hybridized to the target sequence, and (2) the tracr mate sequence that is hybridized to the tracr sequence; and/or (b) a second regulatory element operably linked to an enzyme-coding sequence encoding said CRISPR enzyme comprising a nuclear localization sequence. Elements may be provided individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages, for example in more than one language.
In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises a homologous recombination template polynucleotide.
In one aspect, the invention provides methods for using one or more elements of a CRISPR system. The CRISPR complex of the invention provides an effective means for modifying a target polynucleotide. The CRISPR complex of the invention has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target polynucleotide in a multiplicity of cell types in methods of gene therapy. An exemplary CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within the target polynucleotide. The guide sequence is linked to a tracr mate sequence, which in turn hybridizes to a tracr sequence.
The materials and methods are provided below to facilitate the practice of the present invention.
The feasibility and efficiency of prenatal lung gene editing after intra-amniotic delivery of CRISPR-Cas9 via adenoviral vector was evaluated in order to demonstrate that prenatal pulmonary gene editing can alter the phenotype of a perinatal lethal monogenic lung disease. Experimental animals were fetuses injected with viral vectors containing SpyCas9 and an sgRNA. Control animals were fetuses injected with viral vectors containing SpyCas9 and no sgRNA, Cre recombinase, or noninjected fetuses. Sample size was determined by availability and previous experience with in utero gene and cellular therapy experiments in the mouse model. No outliers were excluded from the study. A minimum of 3 animals per group were used for studies involving statistical analyses, and the n for individual experiments is indicated in the figure legends. Pregnant mice were randomly allocated to experimental and control groups. Intra-amniotic injections and dissections were conducted in a non-blinded fashion. Blinding was performed during data collection and analysis, when possible given the survival and morphology differences in treated and untreated groups. For each experiment, sample size reflects the number of independent biological replicates.
Selection of Single Guide RNAs (sgRNAs)
sgRNAs for the R26mTmG/+ and SftpcI73T mouse models were chosen based on high on-target efficiency and low off-target effects using the online tool at crispr.mit.edu (12). For R26mTmG/+ mouse experiments, sgRNAs were designed to target both the loxP sites flanking the mT-tdTomato and stop cassette, causing the edited cells to express EGFP. For SftpcI73T mouse experiments, sgRNAs were designed to target the 5′ and 3′ ends of Sftpc gene. The sgRNAs targeting the Sftpc gene were screened by Surveyor assay in vitro. Briefly, the Sftpc sgRNAs were cloned into plasmid pSpyCas9(BB)-2A-GFP (PX458; a gift from Feng Zhang; Addgene plasmid #48138) (12), which was used to transfect mouse Neuro-2a cells (N2a). Genomic DNA was extracted using DNeasy blood and tissue kit (QIAGEN) 48 hours after transfection. Indel efficiency of each sgRNA was assessed by Surveyor nuclease assay (IDT) as previously described after amplifying with primers flanking the target site (12). The protospacer and PAM sequences screened and the PCR primers used in the Surveyor assay are listed in tables 1 and 2.
The mTmG sgRNA was cloned into plasmid pX330-U6-Chimeric_BB-CBh-hSpyCas9 (a gift from Feng Zhang; Addgene plasmid #42230) (50). The Sftpc sgRNAs (1A-targeting exon 1 and 5B-targeting exon 5) which were noted to have activity on Surveyor assay during in vitro screening were used for the in vivo experiments. The plasmids pX330-U6-Chimeric_BB-CBh-hSpyCas9 and pSpyCas9(BB)-2A-GFP (PX458) in which no sgRNA was cloned served as the negative control for the R26mTmG/+ and SftpcI73T mouse experiments, respectively. Vector Biolabs used these constructs to generate recombinant adenovirus type 5 particles. Premade adenovirus (Ad) type 5 particles containing Cre recombinase under a CMV promoter were purchased from Penn Vector Core. Ad viral vectors are referred to as Ad.mTmG, Ad.Sftpc.GFP, Ad.Cre, Ad.Null, and Ad.Null.GFP. The final viral titer used for experiments ranged from 0.6×1010-1.2×1011 PFU/ml.
C57Bl/6, B6.129(Cg)-Gt(ROSA)26Sortm4(ACTB-tdTomato,-EGFP)Luo/J (called R26mTmG/+; stock #007676), and B6.129S4-Gt(ROSA)26Sortm2(FLP*)Sor/J (called Flp-O mice; stock #12930) were purchased from Jackson Laboratories. SftpcI73T mice were created and provided by Dr. Michael Beers (35). Animals were housed in the Laboratory Animal Facility of the Abramson Research Center and the Colket Translational Research Building at The Children's Hospital of Philadelphia (CHOP). The experimental protocols were approved by the Institutional Animal Care and Use Committee at CHOP and followed guidelines set forth in the National Institutes of Health's Guide for the Care and Use of Laboratory Animals.
Intra-amniotic in utero injections were performed as previously described (data not shown) (32). Briefly, the amniotic cavity of fetuses of time-dated mice was injected at gestational day (E) 16, a time during murine fetal development at which fetal breathing movements are optimal. Under isoflurane anesthesia and after providing local anesthetic (0.25% bupivacaine subcutaneously), a midline laparotomy was made and the uterine horn exposed. Under a dissecting microscope, 10 μL of virus combined with 10 μL of theophylline (1.6 mg/ml) were injected into the amniotic sac of each fetus. The uterus was then returned to the abdominal cavity and the laparotomy incision was closed in a single layer with 4-0 Vicryl suture. After recovery from anesthesia, pregnant dams were placed in a chamber containing 10% CO2 for 1 hour. Theophylline injection and maternal CO2 exposure was performed to enhance fetal respiratory drive to more efficiently target the fetal lung (40).
The Gt(ROSA)26Sortm4(ACTB-tdTomato,-EGFP)Luo (R26mTmG/+) mouse model is a fluorescent reporter mouse model that consists of a membrane bound tdTomato (mT) and 3′ stop codon that is flanked by loxP sites. Downstream to the distal lox P site, is the membrane-bound green fluorescent protein (mG-EGFP). All cells at baseline express tdTomato. Expression of Cre recombinase causes deletion of the mT-tdTomato cDNA along with a transcriptional stop cassette and expression of the mG-EGFP (34). R26mTmG/+ fetuses were injected intra-amniotically with either Ad.mTmG, Ad.Cre, or Ad.Null at E16, and the injected fetuses were analyzed at E19, postnatal day 7 (P7), P30, and 6 months of age. At the time of analysis, the lungs and other organs of injected mice were assessed for EGFP expression by fluorescent stereomicroscope (MZ16FA; Leica). For larger mice at P30—6 months of age, the right lung was fixed in 2% paraformaldehyde for immunohistochemistry (IHC) analysis, and the left lung was used to extract genomic DNA with DNeasy blood and tissue kit (QIAGEN) and for fluorescence-activated cell sorting (FACS). At E19 and P7, the right lung was used for IHC and the left lung was used to extract DNA from a single mouse. Both the right and left lungs from another mouse were used for FACS. For the experimental group, all fetuses from a single dam were injected with Ad.mTmG. Ad.Cre was injected at comparable titers to all fetuses from another dam as a positive control, and Ad.Null without a sgRNA was injected at comparable titers for negative control experiments.
The SftpcI73T mouse line has targeted alleles containing an HA-tagged mouse SP-CI3T sequence knocked into the endogenous mouse Sftpc locus. Heterozygous mutant mice accumulate mistrafficked mutant SP-C3T pro-protein within AT2 cells, causing arrest of lung morphogenesis and death within 6 hours of birth. An intronic FRT-PGK-neo-FRT cassette results in a homomorphous phenotype and enables mice to survive to adulthood (35). In this study, SftpcI73T/I73T/Neo+/+ mice were crossed with FlpO+/+ mice to produce SftpcI73T fetuses. Two Ad-5 vectors with one virus expressing spyCas9, a sgRNA targeting the 5′ end of Sftpc gene, and EGFP and the other virus expressing spyCas9, a sgRNA targeting the 3′ end of Sftpc gene, and EGFP were injected into E16 SftpcI73T/WT or C57BL/6 fetuses. Fetuses were harvested at E19 for analysis as described above. For survival analysis of SftpcI73T/WT injected fetuses, pups were allowed to be born and fostered with Balb/c dams until P7, at which time they were euthanized by decapitation for morphological and IHC analysis.
Primers 5′ and 3′ to the sgRNA target sites in the mTmG-loxP and Sftpc gene were used for PCR analysis to detect gene editing (table 3). PCR amplification with mTmG primers results in a 2951 bp band for the unedited mTmG sequence and a 545 bp band for the edited mTmG sequence. Similarly, PCR amplification with Sftpc primers results in a 3828 bp band for the unedited SftpcWT gene, a 3950 bp band for the unedited SftpcI73T gene, and a 605 bp band for the edited Sftpc gene. For quantification of gene-edited Sftpc alleles, quantitative real-time PCR was performed on a QuantiStudio 7 Flex using SYBR green reagents and primers specific for the unedited and edited alleles (table 4).
Lungs were harvested and processed into single-cell suspension using a dispase (Collaborative Biosciences)/collagenase (Life Technologies)/DNase solution as previously described (51). For the R26mTmG experiments, lung epithelial, endothelial, and mesenchymal cell populations were assessed using a MoFlo Astrios EQ (Beckman Coulter) flow cytometer with antibody staining for DAPI, EpCAM-APC (eBioscience), CD31-PECy7 (eBioscience), and CD45-ef450 (eBioscience). Cells were negatively gated for DAPI and CD45 channels to exclude dead cells and lymphohematopoietic cells. Pulmonary epithelial (EpCAM+CD31−), endothelial (EpCAM−CD31+), and mesenchymal (EpCAM−CD31−) cells were evaluated for EGFP expression to determine the percentage of editing within each cell type (
Lungs were directly fixed in 2% paraformaldehyde. Lungs that were harvested for morphological analyses were inflation-fixed with 20 cm H2O at E19 and 30 cm H2O at P7 or later. After serial dehydration, tissue was embedded in paraffin and sectioned. Hematoxylin and eosin staining was performed for tissue morphology. IHC to detect proteins was performed using the following antibodies on paraffin sections: GFP (goat, Abcam, 1:100), GFP (chicken, Ayes, 1:500), RFP (rabbit, Rockland, 1:250), SFTPC (rabbit, Santa Cruz, 1:250), SFTPB (rabbit, Abcam 1:500), AQP5 (rabbit, Abcam, 1:100), SCGB1A1 (goat, Santa Cruz, 1:20), FOXJ1 (mouse, Santa Cruz, 1:250), HA (mouse, Abcam, 1:4000); HOPX (mouse, Santa Cruz, 1:50).
Confocal microscopy using a Leica TCS SP8 confocal scope was used to capture images. For each mouse, confocal z stack images were taken in 5 or 10 random airway and alveolar areas, respectively, and analyzed using ImageJ software. The specific cell types that were EGFP+ in the R26mTmG experiments or HA+ in SftpcI73T experiments were manually counted using the Cell Counter plug-in for ImageJ.
For quantification of mean linear intercept, 10 pictures for each sample were taken with a 40× objective lens for E19 lungs and at 20× for P7 and adult lungs. The images were viewed under a field of equally spaced horizontal lines using ImageJ, and MLI was calculated as the average of total length of lines divided by the total intercepts of alveolar septa from each lung. For quantification of AT1 cell spreading, 5 pictures from each lung sample were taken with a 40× objective, and average distance between HOPX-stained AT1 cells was measured using ImageJ as previously described (52).
Off-target sites for Sftpc were predicted using CRISPOR (http://crispor.tefor.net), and the top twenty sites, as ranked by the CFD off-target score (41), were assessed by next-generation DNA sequencing at the Massachusetts General Hospital CCIB DNA Core (CRISPR Sequencing Service; https://dnacore.mgh.harvard.edu/new-cgi-bim.site/pages/crispr_sequencing_main.jsp). Please refer to tables 5 and 6 for the predicted off-target sites and the PCR primers used for off-target NGS analysis. The number of paired-end reads typically exceeded 50,000 per target site per sample. Off-target indel mutagenesis rates were determined as previously described (18).
At least three mice were used for experimental and control groups undergoing statistical analyses, with the n values indicated in the figure legends. All animals that inhaled the virus after intra-amniotic delivery, as represented by EGFP+ lungs, were included for the final analysis. Animals that had EGFP− lungs were considered as technical failure and excluded from final analysis. All data points used in statistical analyses are represented as the mean±one standard deviation (SD). For histologic analyses, all data points were means of technical replicates and presented as percentages or means±one SD. A two-tailed Student's t-test was used for experiments involving the comparison of two groups in which data were normally distributed, as determined by the Shapiro-Wilk test of normality. A one-way ANOVA followed by Tukey's multiple comparison tests was used for statistical analyses of experiments involving the comparison of more than two groups. Survival analysis of gene-edited SftpcI73T mice was performed using survival proportions and by log-rank (Mantel-Cox) test for comparison of survival curves. P<0.05 was considered significant. Statistical analyses were performed with GraphPad Prism 7.
The following examples are provided to illustrate certain embodiments of the invention. They are not intended to limit the invention in any way.
In the present example we demonstrate the feasibility, efficiency, and specificity of prenatal CRISPR-Cas9 mediated gene editing of the lung in two mouse models. We first developed a targeting strategy for a commercially-available fluorescent reporter mouse model (34), to demonstrate efficient and persistent gene editing, and found that our approach predominantly restricted gene editing to pulmonary epithelial cells following intra-amniotic delivery of CRISPR-Cas9 reagents. We then evaluated the therapeutic role of fetal lung gene editing utilizing a mouse model expressing a chILD-causing mutation, SftpcI73T(35). When expressed in mice during embryogenesis, the SP-CI73T proprotein arrests lung development leading to rapid perinatal death. We show that CRISPR-Cas9 induced excision of the mutant SftpcI73T gene can rescue the lung from toxic accumulation of the disease-associated protein and improve lung development in SftpcI73T mutant mice, leading to their increased survival. Our proof-of-concept study demonstrates that in utero gene editing provides a new therapeutic approach for treatment of congenital lung diseases caused by defects in the pulmonary epithelium.
We established a model to define the efficiency and persistence of prenatal lung gene editing that could be easily monitored and quantified. Gt(ROSA)26Sortm4(ACTB-tdTomato,-EGFP)Luo mice (referred to as R26mTmG) have a two-color fluorescent cassette (mT-tdTomato: cell membrane-bound red; mG-EGFP: cell membrane-bound green) that can be differentially activated by Cre recombinase (34). mT-tdTomato red fluorescence is constitutively expressed in the plasma membrane of all cells, including pulmonary epithelial, endothelial, and mesenchymal cells. Upon Cre expression, the mT-tdTomato cDNA along with a transcriptional stop cassette is deleted and the mG-EGFP cassette is subsequently expressed. We chose Streptococcus pyogenes Cas9 (SpyCas9) to perform in utero CRISPR-Cas9 gene editing because SpyCas9 remains the most efficient version of the enzyme. Due to the large size of SpyCas9 (˜4.2 kb), we chose an adenovirus (Ad) to deliver this enzyme to the developing fetus. As previous work has demonstrated that fetal breathing movements combined with theophylline and mild maternal hypercarbia treatment to stimulate respiratory drive can promote efficient and fairly specific delivery of viral vectors into the fetal lung after intra-amniotic injection (32, 33), we delivered Ad vectors containing SpyCas9 and a sgRNA targeting the loxP sites flanking the mT/stop cassette (Ad.mTmG) into the amniotic cavity of E16 R26mTmG/+ fetuses (
Since various lung epithelial cell types are affected in congenital monogenic lung diseases, we evaluated the efficiency of gene editing in individual pulmonary cell lineages (epithelial, endothelial, and mesenchymal cells) using the R26mTmG/+ model. The distribution of EGFP+ cells was confined to the epithelial lining throughout the lung section with sparing of the blood vessels and subepithelial regions (
We next assessed the efficiency of gene editing in several important lung epithelial lineages including AQP5+ alveolar epithelial type 1 cells (AT1), SFTPC+ type 2 alveolar epithelial cells (AT2), SCGB1A1+ secretory airway epithelial cells, and FOXJ1+ ciliated airway epithelial cells (
Pulmonary Epithelial Cell Editing is Stable after Prenatal CRISPR Delivery
Although the lung is considered to be a fairly quiescent organ, there is a slow steady-state turnover of cells in the postnatal period. Using the R26mTmG/+ model, we assessed the persistence of gene-edited pulmonary cells over time using flow cytometry and IHC at gestational day (E) 19, postnatal day (P) 7, P30, and 6 months after intra-amniotic injection of Ad.mTmG at E16 (
Prenatal Gene Editing in SftpcI73T Mice Decreases Mutant SP-CI73T Proprotein and Improves Lung Alveolarization
Because our data demonstrated effective and persistent gene editing in the developing lung, we next tested whether prenatal gene editing can rescue a clinically relevant monogenic human lung disease model. Among SFTPC variants associated with clinical ILD, the missense substitution (g.1286T>C), resulting in a change of isoleucine to threonine at position 73 in the SP-C proprotein (“SP-CI73T”), is the most common known SFTPC mutation in humans (7, 37). Functionally, expression of SFTPCI73T in vitro results in a toxic cellular response initiated by the markedly altered intracellular trafficking of the SP-CI73T proprotein to the plasma membrane (38, 39). The SftpcI73T knock-in mouse, which models human SFTPCI73T, has shown that intracellular accumulation of mutant proprotein triggers an aberrant injury-repair response resulting in fibrotic lung remodeling in adult mice (35). Under appropriate conditions, SftpcI73T mice can be induced to show an allele-dependent arrest of lung morphogenesis in late sacculation with no live births. Because previous studies have shown that Sftpc null mice have normal growth and lung function in homeostasis (40), we hypothesized that excision of the SftpcI73T gene would reduce the synthesis of mistrafficked SP-CI73T proprotein, thereby correcting the dysfunctional AT2 cell phenotype and improving survival in gene-edited SftpcI73T mice (
sgRNAs were designed to target the 5′ and 3′ ends of the Sftpc gene and screened for efficient DNA cutting (
The founder SftpcI73T-Neo mouse line has a targeted allele containing an HA-tagged mouse SftpcI73T sequence knocked into the endogenous mouse Sftpc locus (35). This allele contains an intronic FRT flanked PGK/neo cassette producing a milder phenotype. Deletion of the neo cassette using a homozygous FlpO deleter line results in increased expression of SftpcI73T and a more severe phenotype characterized by abnormalities in sacculation, prenatal arrest of lung development, and perinatal death (
To assess improvement in lung alveolarization after rescue, lungs of E19 fetuses were inflation-fixed for morphometric analysis. Ad.Sftpc.GFP-treated mice demonstrated decreased mean linear intercept (MLI) compared to Ad.Null.GFP treated fetuses, indicating improved lung sacculation (
Because CRISPR-mediated excision of the SftpcI73T gene decreased the synthesis of the mutant SP-CI73T pro-protein, improved AT2 and AT1 cell morphology and function, and improved lung maturation, we next tested if gene-edited mice exhibited improved survival. E16 SftpcI73T/WT fetuses were injected intra-amniotically with Ad.Null.GFP or Ad.Sftpc.GFP and assessed for survival up to one week of age (
In this example, we demonstrate that CRISPR-Cas9 can be used to perform gene editing during tissue development through in utero intra-amniotic delivery to rescue a perinatal lethal monogenic lung disease. This approach targets the lung, with pulmonary epithelial cells including AT1, AT2, and secretory airway epithelial cells being preferentially edited. We show that in utero gene editing can ameliorate the phenotype of a congenital lung disease caused by the SftpcI73T mutation and improves survival of rescued mice. This study supports an important application of CRISPR-Cas9 to rescue viability at birth due to a lethal genetic mutation.
The design of the R26mTmG/+ model allowed for the tracking of cell type specificity, efficiency, and long-term persistence of gene editing. Our results demonstrate gene editing occurring predominantly in the lung and persisting up to 6 months of life, the last point of analysis. Using the intra-amniotic route of delivery, we were able to achieve approximately 20% editing in the lung epithelium, including the distally located AT1 and AT2 cells, at birth. Increased pulmonary epithelial cell editing compared to pulmonary endothelial and mesenchymal cell editing is likely due to direct contact between epithelial cells and the “inhaled” amniotic fluid as well as the location of adenovirus receptors on pulmonary epithelial cells that facilitates transduction (33, 42, 43).
In addition to pulmonary cell editing, we identified a few clusters of gene-edited cells in the proximal gastrointestinal tract after “swallowing” the amniotic fluid containing the viral vector, as previously demonstrated (33). Lower gastric compared to pulmonary cell editing might be due to the fairly rapid amniotic fluid inhalation, which was promoted by the administration of theophylline and maternal hypercarbia to enhance fetal breathing movements. Intra-amniotic injection might also be expected to target the skin. The lack of skin gene editing is likely explained by the skin barrier, formed initially by the periderm at E13 and completed by E17 after keratinization, which prevents viral vector transduction and thus epidermal editing after intra-amniotic delivery at E16 (31, 33). Lung-targeted gene editing is an advantage for genes that specifically cause lung disease, although they may be expressed in other organs. Thus, a targeted approach may minimize the exposure of other organs to potentially deleterious on- and off-target effects. Although lung-specific gene editing is beneficial for SP disease in the current study, alternative delivery approaches, including the intravenous route, may allow for efficient prenatal editing of other organs to address congenital genetic disorders that cause morbidity and mortality before or shortly after birth (27). Finally, although the use of theophylline and CO2 to increase respiratory drive favored lung targeting in fetal mice via the intra-amniotic route, a more directed fetoscopic intra-tracheal approach could be performed in large animal models and in humans (44, 45).
Another advantage of in utero gene editing is the relatively uniform targeting of most of the major pulmonary epithelial cell types, including both proximal and distal lineages. In general, the inhalational route of drug delivery to postnatal lungs results in a differential distribution, with peripheral regions of the lungs receiving lower amounts compared to proximal and central regions (46). The efficiency of inhalational drug distribution is further impaired in the injured lung due to heterogeneity of lung disease, with some regions of the lung being overinflated and other regions collapsed. Thus, particularly for more complex lung disease, the more uniform distribution of vector delivery observed via an in utero intervention may provide an advantage in future therapies.
Given that many congenital lung diseases such as cystic fibrosis and inherited SP disease are generally caused by monogenic mutations, they should be ideal candidates for gene editing technologies. In mice, Sftpc expression is not required for survival and lung function at normal physiologic conditions (40), and thus a simple deletion of the mutant SftpcI73T gene was sufficient to improve mortality in our mouse model. Future therapeutic approaches in patients will likely require more targeted modifications in DNA. In humans, correction of the SFTPCI73T mutation would be more desirable than excision of the mutated gene for treating the disease. However, our study demonstrates the feasibility of targeting the lung for gene editing before birth and presents evidence that even lethal mutations can be mitigated through prenatal gene editing techniques.
The use of Ad vectors is exemplified herein. Other delivery techniques, including AAV and/or lipid nanoparticles and smaller Cas9 genes can also be employed (18, 48, 49).
Although prenatal gene editing has the potential to take advantage of normal developmental properties to enhance editing efficiency and treat perinatal lethal diseases before birth, additional points must be considered that are not present for postnatal gene editing. Any prenatal intervention involves the possibility of affecting not only the fetus, but the mother who is an immunocompetent and often disease-free “bystander”. Thus, injection techniques and gene editing delivery vehicles can be optimized to avoid exposure to the mother. Given the potential maternal risk, initial disease targets should include those which cause major morbidity and/or mortality before or shortly after birth and for which no adequate treatments exist. Prenatal gene editing can involve mid to late gestation gene editing as detailed in the current study or early embryo gene editing. Early embryo gene editing can be performed ex vivo followed by implantation into the mother, thus avoiding maternal exposure to gene editing technology. In addition, early embryo gene editing may allow for more efficient correction of a larger number of cells in multiple organs, with the possibility of correcting germline cells. However, later gestation gene editing may allow editing to be more specific for a target organ or cell population, including avoiding germ cell editing, and would allow for the possibility of treating de novo mutations diagnosed later in pregnancy.
With the rapid pace at which CRISPR technology is advancing towards clinical translation, techniques that improve the efficiency and specificity of gene editing for targeting specific organs or tissues associated with specific diseases provide a new avenue for treatment of such disorders. Our studies demonstrate the feasibility of prenatal gene editing with high specificity for the lung represent a promising approach to address the unmet need for therapeutic approaches to congenital lung diseases that are fatal at birth.
The results presented in the previous example, indicate that gene editing could have a significant impact on monogenic lung disease in larger animal subjects and humans. In this example, we describe compositions and methods for targeting the developing lung in a large animal model (e.g., fetal sheep) which is relevant to human treatment. Fetal sheep were injected via the intratracheal route with an adenoviral vector containing SpCas9 and a guide RNA targeting the ovine SPC gene using an open technique. This approach entails performing a maternal laparotomy and opening the uterus. The fetal trachea was then identified and injected with a biologically compatible solution comprising the viral vector, in this example, an adenovirus. Fetal organs were harvested at 7-10 days post injection and DNA was assessed by surveyor and next generation sequencing for editing at the target sites. This demonstrated editing efficiencies of approximately 5% of all pulmonary cells (
In another approach, other vectors may be used to carry the gene therapy or gene editing material. These vectors include, without limitation, adeno-associated viruses, retroviral constructs, and nanoparticle technology. Additionally, a minimally invasive approach of fetoscopic access to the fetus may be used throughout gestation with subsequent cannulation of the trachea and injection of the therapy after which temporary closure of the trachea would be performed or, alternatively, no closure of the trachea would be performed.
The results presented in the previous examples, indicate that gene editing could have a significant impact on monogenic lung disease in human subjects. Potential target diseases include without limitation, Cystic fibrosis, Surfactant protein deficiencies including for example, surfactant protein C deficiency, surfactant protein B deficiency, and ABCA3 deficiency, and Alveolar capillary dysplasia, and Alpha-1-antitrypsin disease.
Hereditary surfactant protein B (SP-B) deficiency is an autosomal recessive disorder that causes fatal respiratory failure in the neonatal period. Full-term infants born with SF-B deficiency have respiratory failure and the disease is fatal by 3-6 months of age. Currently, the only treatment for SP-B deficiency is a lung transplant. Carriers of the disease are asymptomatic. Although, more than 40 distinct mutations in the SP-B gene have been identified, two thirds of the mutant disease-causing alleles result from the 121ins2 mutation (Ref SNP: rs35328240) in exon 4. The mutation consists of a net 2-base pair insertion in exon 4 of the SFTPB gene (375C-GAA change) resulting in a frameshift and premature termination of the protein.
Using SP-B deficiency as an example, it is clear that clinical application of in utero gene editing entailing the use of targeted CRISPR-Cas9 mediated homology directed repair to correct the common 121ins2 mutation in the SFTPB gene (which causes lethal respiratory failure shortly after birth), is feasible. Parents can be initially screened for the disease-causing mutations in order to identify carriers of the mutated gene. In pregnancies from carriers identified as having of the mutant allele, CVS or other diagnostic modalities can be offered to identify the presence of a homozygous mutation in the fetus. Once an affected fetus is identified, in utero gene editing can be offered to the parents.
In one approach, during mid-gestation (e.g., between 20 and 28 weeks of pregnancy, preferably at 20 weeks, 22 weeks, 24 weeks, or 26 weeks of gestation), a fetoscope can be used to introduce into the fetal airway a catheter comprising an insufflated balloon with an injection port distal to the balloon. The balloon can be deployed to block the airway to prevent the escape of the gene editing system which is injected as a bolus immediately after balloon deployment. After approximately 1 week following delivery of the system, a second procedure can be performed to remove the balloon, thereby eliminating obstruction of the airway to preclude any issues at birth and to allow for continued development of the lungs without tracheal occlusion. This is significant as tracheal occlusion is known to affect normal development of the lung and is associated with abnormal surfactant production. Follow up clinical studies can be performed to ensure that the mutated gene has been corrected and normal phenotypes restored.
In another approach, fetuses harboring the delta 508 mutation in CF or the I73T mutation associated with surfactant protein C deficiency, or the myriad of other, less prevalent mutations present in CF and the surfactant protein deficiencies can be treated using gene editing systems comprising reagents suitable for correcting these genetic mutations. In yet other approaches, post-natal infants can be treated employing a vector system described herein. In on aspect of this method, the vector is delivered in aerosolized form, or via an inhaler/nebulizer. Another method entails direct injection of vector containing biologically compatible liquids directly into the airways via a bronchoscopy
Several different approaches are available to the person of skill in this art area for delivering the genetic editing systems described herein. These include without limitation:
1. Viral vectors such as adenovirus, lentivirus, AAV virus (including the multiple different serotypes), lentivirus
2. Non-viral delivery techniques, e.g., loaded exosomes, nanofiber and nanoparticle delivery approaches described above.
Human target genes and GenBank Reference numbers of relevant gene sequences include for example, ABCA3—NG_011790.1; SFTPC—NG_016968.1; SFTPB—NG_016967.1; CFTR—NG_016465.4. CFTR and SERPINA1—NP_000286.3.
Guide strands useful in the methods described for editing the Surfactant protein C can be selected from:
Guide strands useful in the methods described for editing the CFTR can be selected from
In preferred embodiments, non-integrating viral vectors or nanoparticles are employed to deliver the CRISPR-Cas9 system, eliminating the need for removal of the system as expression of the system should be transient.
As an alternative approach, base editing, a form of gene editing, can be used to correct disease causing mutations in congenital genetic lung diseases. For surfactant protein C deficiency, the most common disease-causing mutation is a T→C base change resulting in a gain-of-function mutation. Using cytosine deaminase base editors (CBE), which can change a C→T, we have identified guide RNAs and CBEs that can correct the mutation in a mouse model of Surfactant protein C deficiency that has the most common human mutation. The human SPC gene sequence has been screened and multiple gRNAs have been identified that have the potential to change the disease causing C→T mutation with CBE (Table 7).
While certain of the preferred embodiments of the present invention have been described and specifically exemplified above, it is not intended that the invention be limited to such embodiments. It will be apparent to one skilled in the art that various changes and modifications can be made therein without departing from the scope of the present invention, as set forth in the following claims.
This application is a § 371 of International Application No, PCT/US2020/026949, filed Apr. 6, 2020, which claims priority to U.S. Provisional Application No. 62/830,032 filed on Apr. 5, 2019, the entire contents of each being incorporated herein by reference.
This invention was made with government support under Grant Numbers 1U01HL134745, UL1-TR001878 and HL119436 awarded by the National Institutes of Health and Grant Number 1I01BX001176 awarded by the US Dept. of Veteran Affairs. The US government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/026949 | 4/6/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62830032 | Apr 2019 | US |