Cystic fibrosis (CF) is the most common autosomal recessive disease in the Caucasian population. It causes severe damage to the lungs, pancreas, liver, intestines, sinuses, and other organs of the body. Various known mutations within the CF transmembrane conductance regulator (CFTR) gene cause CF. While technological advances have increased the life expectancy of CF patients, there is still no effective cure for the disease. It is therefore of great interest to develop new therapies for CF.
The present disclosure is based, at least in part, on the development of efficient gene editing systems for correcting mutations in a CF transmembrane conductance regulator (CFTR) gene. In some embodiments, the gene editing system relies on the identification of effective editing positions in intron 10 of the CFTR gene (e.g., those disclosed herein) for effective insertion of a nucleic acid encoding exons 11 to 27 of the CFTR gene, which would result in the correction of 99% of the most frequent CFTR non-responsive alleles.
As such, in some aspects, the disclosure relates to gene-editing systems for modifying a cystic fibrosis transmembrane regulator (CFTR) gene. Such a gene-editing system may comprise: (a) a first polynucleotide, which comprises a first nucleotide sequence encoding exons 11 to 27 of the CFTR gene; (b) a second polynucleotide, which comprises a second nucleotide sequence encoding an RNA-guided DNA endonuclease, or the RNA-guided DNA endonuclease; and (c) a third polynucleotide, which comprises a third nucleotide sequence encoding a guide RNA (gRNA), wherein the gRNA directs cleavage by the RNA-guided DNA endonuclease at a target site, which is position 1220, 2068, 3821, 4262, 5041, 5052, 5278, 5343, 5538, or 6150 of intron 10 in the CFTR gene. In some embodiments, the first nucleotide of the first polynucleotide of (a) is free of intron sequences.
In some embodiments, the first polynucleotide of (a) may further comprise a 5′ homologous arm upstream to the first nucleotide sequence and/or a 3′ homologous arm downstream to the first nucleotide sequence. The 5′ homologous arm may comprise a nucleic acid sequence that is homologous to a region upstream to the target site. Alternatively or in addition, the 3′ homologous arm may comprise a nucleic acid sequence that is homologous to a region downstream to the target site. In some embodiments, the 5′ homologous arm and the 3′ homologous arm comprise nucleotide sequences selected from the group consisting of:
(i) SEQ ID NO: 17 and SEQ ID NO: 18, respectively;
(ii) SEQ ID NO: 19 and SEQ ID NO: 20, respectively;
(iii) SEQ ID NO: 21 and SEQ ID NO: 22, respectively;
(iv) SEQ ID NO: 23 and SEQ ID NO: 24, respectively;
(v) SEQ ID NO: 25 and SEQ ID NO: 345, respectively;
(vi) SEQ ID NO: 346 and SEQ ID NO: 347, respectively;
(vii) SEQ ID NO: 348 and SEQ ID NO: 349, respectively;
(viii) SEQ ID NO: 350 and SEQ ID NO: 351, respectively;
(ix) SEQ ID NO: 352 and SEQ ID NO: 353, respectively; and
(x) SEQ ID NO: 354 and SEQ ID NO: 355, respectively.
In some embodiments, the first nucleotide sequence in (a) may further comprise a first fragment upstream to the first nucleotide sequence and downstream to the 5′ homologous arm, and wherein the first fragment contains an acceptor splice site. For example, the first fragment may comprise the nucleotide sequence of TATACACTTCTGCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGA TAATGACCTAATAATGATGGGTTTTATTTCCAG (SEQ ID NO: 1), wherein the 3′ end AG is the splicing acceptor site.
In some embodiments, the second nucleotide sequence encoding the RNA-guided DNA endonuclease in (b) may further comprise a nucleotide sequence encoding a nuclear localization signal (NLS), which is fused in-frame with the RNA-guided DNA endonuclease. In some embodiments, the NLS is a SV40 NLS. In some embodiments, the RNA-guided DNA endonuclease can be a Cas9 endonuclease, for example, a Staphylococcus aureus Cas9 enzyme (saCas9).
In some embodiments, the third nucleotide sequence in (c), which encodes the gRNA, may comprise one of the following nucleotide sequences:
It should be understood that because the third nucleotide sequence encoding the gRNA can be either DNA sequences or RNA sequences, any of the thymines (T) in the sequences may be replaced with a uracil (U). In some embodiments, the third nucleotide sequence in (c) may further comprise a scaffold sequence, which in some examples may comprise the nucleotide sequence of
In some embodiments, (a), (b), and (c) may be located on the same vector. In other embodiments, (a), (b), and/or (c) may be located on different vectors. For example, (a) and (c) may be located on a first vector, and (b) may be located on a second vector which is different from the first vector. Alternatively, (a) may be located on a first vector, and (b) and (c) may be located on a second vector which is different from the first vector. In some embodiments, the vector(s) is a viral vector(s), for example an adeno-associated viral (AAV) vector(s).
Also within the scope of the present disclosure are viral particles or sets of viral particles, which collectively comprise any of the gene-editing systems disclosed herein. In some embodiments, the viral particle is, or set of viral particles are, AAV particle(s).
In yet other aspects, the disclosure relates to methods of editing a CFTR gene, the method comprises contacting a cell with (i) any of the gene-editing systems disclosed herein or (ii) a viral particle or a set of viral particles, which collectively comprise the gene-editing system. In some embodiments, the cell may comprise a mutation in one or more of exons 11-27 of the CFTR gene. Example mutations include, but are not limited to, F508del, I507del, G542X, S549N, G551D, R553X, D1152H, N1303K, W1282X, 2789+5G>A, or 3849+10kbC>T.
In some embodiments, the contacting step is performed by administering the gene-editing system or the viral particle(s) to a subject in need thereof. In some examples, the gene-editing system or the viral particle(s) is administered to the respiratory tract of the subject. In some embodiments, the subject is a human patient having cystic fibrosis. In some examples, the human patient is a child.
In some embodiments, the cell is a stem cell, for example an iPSC cell or a bronchioalveolar stem cell.
In some examples, any of the methods described herein may further comprise administering the cell with the edited CFTR gene to a subject in need thereof. In some examples, the cell with the edited CFTR gene is administered to the respiratory tract of the subject (e.g., a human patient having cystic fibrosis). In some examples, the human patient may be a child.
In other aspects, the present disclosure relates to nucleic acids, which may be viral vectors such as AAV vectors. The nucleic acid may comprise: (a) a first nucleotide sequence encoding exons 11 to 27 of a CFTR gene; (b) a 5′ homologous arm upstream to the first nucleotide sequence, wherein the 5′ homologous arm comprises a nucleic acid sequence that is homologous to a region upstream to a target position in intron 10 of the CFTR gene; and (c) a 3′ homologous arm downstream to the first nucleotide sequence, wherein the 3′ homologous arm comprises a nucleic acid sequence that is homologous to a region downstream to a target position in intron 10 of the CFTR gene; wherein the target position is selected from the group consisting of position 1220, 2068, 3821, 4262, 5041, 5052, 5278, 5343, 5538, or 6150 of intron 10 in the CFTR gene. In some embodiments, the first nucleotide sequence of (a) is free of intron sequences.
In other embodiments, the nucleic acid may comprise: (a) a first nucleotide sequence encoding an RNA-guided DNA endonuclease, and (b) a second nucleotide sequence encoding a guide RNA (gRNA), wherein the gRNA directs cleavage by the RNA-guided DNA endonuclease at a target position of a CFTR gene, wherein the target position is selected from the group consisting of position 1220, 2068, 3821, 4262, 5041, 5052, 5278, 5343, 5538, or 6150 of intron 10 in the CFTR gene; wherein each of the first nucleotide sequence and the second nucleotide sequence is in operable linkage to a promoter.
In some embodiments, the first nucleotide sequence in (a) may further comprise a first fragment linked to the 5′ end of the nucleotide sequence encoding exons 11 to exon 27 of the CFTR gene, wherein the first fragment comprises a acceptor splice site (e.g., those disclosed herein).
In some embodiments, the nucleic acid may further comprise a second nucleotide sequence encoding a guide RNA (gRNA), wherein the gRNA directs cleavage by the RNA-guided DNA endonuclease at any of the target positions disclosed herein. Exemplary gRNAs are also provided in the present disclosure. See page 14 below.
In other aspects, the present disclosure relates to genetically edited lung cells or precursor cells, thereof. The genetically edited lung cell or the precursor cell, thereof may comprise a genetically edited endogenous CFTR gene, in which an exogenous nucleic acid is inserted into intron 10 of the endogenous CFTR gene, wherein the exogenous nucleic acid comprises a first nucleotide sequence encoding exons 11 to 27 of a CFTR gene. In some embodiments, the first nucleotide sequence is free of intron sequences. In some embodiments, the exogenous nucleic acid further comprises a second nucleotide sequence linked to the 5′ end of the first nucleotide sequence, the second nucleotide sequence comprising an acceptor slice site. In some embodiments, the second nucleotide sequence comprises SEQ ID NO: 1. In some embodiments, the edited lung cell or precursor cell thereof is a human cell. In some embodiments, the precursor cell is an iPSC cell or a bronchioalveolar stem cell.
Also within the scope of the present disclosure are uses of any of the gene-editing systems described herein, components thereof, or any of the genetically engineered cells described herein for treating CF, as well as uses thereof for manufacturing a medicament for the intended medical treatment.
The details of one or more embodiments of the disclosure are set forth in the description below. Other features or advantages of the present disclosure will be apparent from the detailed description of several embodiments and also from the appended claims.
Gene editing (including genomic editing) is a type of genetic engineering in which nucleotide(s)/nucleic acid(s) is/are inserted, deleted, and/or substituted in a DNA sequence, such as in the genome of a targeted cell. Targeted gene editing enables insertion, deletion, and/or substitution at pre-selected sites in the genome of a targeted cell (e.g., in a targeted gene or targeted DNA sequence). When a sequence of an endogenous gene is edited, for example by deletion, insertion or substitution of nucleotide(s)/nucleic acid(s), the endogenous gene comprising the affected sequence may be knocked-out or knocked-down due to the sequence alteration. Therefore, targeted editing may be used to disrupt endogenous gene expression. Alternatively or in addition, a desired nucleic acid may be inserted into a target site in a DNA sequence (e.g., in an endogenous gene), which is known as targeted integration. “Targeted integration” refers to a process involving insertion of one or more exogenous sequences, with or without deletion of an endogenous sequence at the insertion site. Targeted integration can result from targeted gene editing when a donor template containing an exogenous sequence is present.
The present disclosure is based, at least in part, on the development of efficient gene editing systems for correcting a mutation(s) in a CF transmembrane conductance regulator (CFTR) gene, e.g., mutations in regions encoded by one of exons 11-27 and cause CF. As described herein, cell survival, indel rates, and indel patterns were determined at 120 previously uncharacterized target positions in intron 10 of the CFTR gene and a number of effective editing positions were identified based on the results. Accordingly, the gene-editing systems described herein rely on the identification of candidate target positions (e.g., those disclosed herein) that facilitate effective insertion (as determined by cell survival, indel rates, and indel patterns) of a nucleic acid encoding exons 11 to 27 of the CFTR gene, which would result in the correction of 99% of the most frequent CFTR non-responsive alleles.
Accordingly, provided herein are gene-editing systems for efficient modification of CFTR genes and uses thereof for correcting mutations in the CFTR gene, thereby treating CF. Components of the gene-editing systems and genetically modified cells resulting from application of the gene-editing systems are also within the scope of the present disclosure.
In some aspects, the disclosure relates to gene-editing systems for modifying a cystic fibrosis transmembrane regulator (CFTR) gene. A “gene-editing system” refers to a combination of components for genetic editing a target gene (e.g., CFTR), or one or more agents for producing such components. For example, a gene-editing system may comprise: (a) a nuclease, or an agent (e.g., a nucleic acid encoding the nuclease) for producing such; (b) a guide RNA (gRNA), or an agent for producing such (e.g., a vector capable of expressing the gRNA); and/or (c) a donor template, or an agent for producing such (e.g., a vector capable of producing the donor template).
The gene-editing systems as described herein may exhibit one or more advantageous in modifying a CFTR gene. For example, it would achieve a high gene editing rates, such as homology-directed repair rates (e.g., at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 11%, at least 12%, at least 13%, at least 14%, at least 15%, at least 16%, at least 17%, at least 18%, at least 19%, at least 20%, at least 21%, at least 22%, at least 23%, at least 24%, at least 25%, at least 30%, at least 35%, or at least 40% as assessed by methods known in the art (e.g., by methods as described herein)). Further, cells edited by the gene-editing system disclosed herein would have a high survival rate (e.g., at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 85%, at least 90%, at least 95%, or at least 99%) relative to an unedited control. Alternative or in addition, cells edited by the gene-editing system would exhibit an increased CFTR activity (e.g., by at least 30%, 50%, 100%, 2-fold, 5-fold, or 10-fold) relative to the unedited control.
In one exemplary embodiment, a gene-editing system as described herein may comprise: (a) a first polynucleotide, which comprises a nucleotide sequence encoding the donor template; (b) a second polynucleotide, which comprises a nucleotide sequence encoding the RNA-guided DNA endonuclease; and (c) a third polynucleotide, which comprises a nucleotide sequence encoding the gRNA. In another exemplary embodiment, a gene-editing system may comprise: (a) a first polynucleotide, which comprises a nucleotide sequence encoding the donor template; (b) a polypeptide, which comprises the RNA-guided DNA endonuclease; and (c) a third polynucleotide, which comprises a nucleotide sequence encoding the gRNA.
A. Nuclease
Targeted gene editing can be achieved either through a nuclease-independent approach, or through a nuclease-dependent approach. In the nuclease-independent targeted editing approach, homologous recombination is guided by homologous sequences flanking an exogenous polynucleotide to be introduced into an endogenous sequence through the enzymatic machinery of the host cell. The exogenous polynucleotide may introduce deletions, insertions or replacement of nucleotides in the endogenous sequence.
Alternatively, the nuclease-dependent approach can achieve targeted editing through the introduction of double strand breaks (DSBs) at specific locations using sequence-specific nucleases (e.g., endonucleases). Such nuclease-dependent targeted editing also utilizes DNA repair mechanisms, for example, non-homologous end joining (NHEJ), which occurs in response to DSBs. DNA repair by NHEJ often leads to random insertions or deletions (indels) of a small number of endogenous nucleotides. In contrast to NHEJ mediated repair, repair can also occur by a homology directed repair (HDR). When a donor template containing exogenous genetic material flanked by a pair of homology arms is present, the exogenous genetic material can be introduced into the genome by HDR, which results in targeted integration of the exogenous genetic material.
A nuclease of a gene-editing system may be provided in the form of a polypeptide (i.e., an enzymatic form). Alternatively, a gene-editing system may comprise an agent for the production of a nuclease. For example, a gene-editing system may comprise a nucleotide sequence encoding the sequence of a nuclease and an additional nucleotide sequence that facilitates expression/production of the nuclease as a polypeptide.
Available nucleases capable of introducing specific and targeted DSBs include, but not limited to, zinc-finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN), and RNA-guided endonucleases (e.g., CRISPR-Cas9 or CRISPR/Cas9; Clustered Regular Interspaced Short Palindromic Repeats Associated 9 nucleases). Additional examples of targeted nucleases suitable for use as provided herein include, but are not limited to, Bxbl, phiC31, R4, PhiBT1, and Wβ/SPBc/TP901-1, whether used individually or in combination. Other non-limiting examples of targeted nucleases include naturally-occurring and recombinant nucleases, e.g., CRISPR/Cas9, restriction endonucleases, meganucleases homing endonucleases, and the like.
(i) Zinc Finger Nucleases (ZFNs)
ZFNs are targeted nucleases comprising a nuclease fused to a zinc finger DNA binding domain (ZFBD), which is a polypeptide domain that binds DNA in a sequence-specific manner through one or more zinc fingers. A zinc finger is a domain of about 30 amino acids within the zinc finger binding domain whose structure is stabilized through coordination of a zinc ion. Examples of zinc fingers include, but not limited to, C2H2 zinc fingers, C3H zinc fingers, and C4 zinc fingers. A designed zinc finger domain is a domain not occurring in nature whose design/composition results principally from rational criteria, e.g., application of substitution rules and computerized algorithms for processing information in a database storing information of existing ZFP designs and binding data. See, for example, U.S. Pat. Nos. 6,140,081; 6,453,242; and 6,534,261; see also WO 98/53058; WO 98/53059; WO 98/53060; WO 02/016536 and WO 03/016496. A selected zinc finger domain is a domain not found in nature whose production results primarily from an empirical process such as phage display, interaction trap or hybrid selection. ZFNs are described in greater detail in U.S. Pat. Nos. 7,888,121 and 7,972,854. The most recognized example of a ZFN is a fusion of the FokI nuclease with a zinc finger DNA binding domain.
(ii) TALEN Nucleases
A TALEN is a targeted nuclease comprising a nuclease fused to a TAL effector DNA binding domain. A “transcription activator-like effector DNA binding domain”, “TAL effector DNA binding domain”, or “TALE DNA binding domain” is a polypeptide domain of TAL effector proteins that is responsible for binding of the TAL effector protein to DNA. TAL effector proteins are secreted by plant pathogens of the genus Xanthomonas during infection. These proteins enter the nucleus of the plant cell, bind effector-specific DNA sequences via their DNA binding domain, and activate gene transcription at these sequences via their transactivation domains. TAL effector DNA binding domain specificity depends on an effector-variable number of imperfect 34 amino acid repeats, which comprise polymorphisms at select repeat positions called repeat variable-diresidues (RVD). TALENs are described in greater detail in US Patent Application No. 2011/0145940. The most recognized example of a TALEN in the art is a fusion polypeptide of the FokI nuclease to a TAL effector DNA binding domain.
(iii) RNA-Guided Endonucleases
RNA-guided endonucleases are enzymes that utilize RNA:DNA base-pairing to target and cleave a polynucleotide. RNA-guided endonuclease may cleave single-stranded polynucleic acids or at least one strand of a double-stranded polynucleotide. A gene editing-system may comprise one RNA-guided endonuclease. Alternatively, a gene-editing system may comprise at least two (e.g., two, three, four, five, six, seven, eight, nine, ten, or more than ten) RNA-guided endonucleases.
The CRISPR-Cas9 system is a naturally-occurring defense mechanism in prokaryotes that has been repurposed as a RNA-guided DNA-targeting platform used for gene editing. It relies on the DNA nuclease Cas9, and two noncoding RNAs—crisprRNA (crRNA) and trans-activating RNA (tracrRNA)—to target the cleavage of DNA. crRNA drives sequence recognition and specificity of the CRISPR-Cas9 complex through Watson-Crick base pairing typically with a 20 nucleotide (nt) sequence in the target DNA. Changing the sequence of the 5′ 20 nt in the crRNA allows targeting of the CRISPR-Cas9 complex to specific loci. The CRISPR-Cas9 complex only binds DNA sequences that contain a sequence match to the first 20 nt of the crRNA if the target sequence is followed by a specific short DNA motif (with the sequence NGG) referred to as a protospacer adjacent motif (PAM). TracrRNA hybridizes with the 3′ end of crRNA to form an RNA-duplex structure that is bound by the Cas9 endonuclease to form the catalytically active CRISPR-Cas9 complex, which can then cleave the target DNA.
Once the CRISPR-Cas9 complex is bound to DNA at a target site, two independent nuclease domains within the Cas9 enzyme each cleave one of the DNA strands upstream of the PAM site, leaving a double-strand break (DSB) where both strands of the DNA terminate in a base pair (a blunt end).
A gene-editing system may comprise a CRISPR endonuclease (e.g., a CRISPR associated protein 9 or Cas9 nuclease). In some embodiments, the endonuclease is from Streptococcus aureus (e.g., saCas9), although other CRISPR homologs may be used. It should be understood that a Cas9 may be substituted with another RNA-guided endonuclease known in the art, such as Cpf1. Finally, it should be understood, that a wild-type RNA-guided endonuclease may be used or modified versions may be used (e.g., evolved versions of Cas9, Cas9 orthologues, Cas9 chimeric/fusion proteins, or other Cas9 functional variants). For example, in some embodiments, the RNA-guided endonuclease is modified to comprise a nuclear localization signal (NLS), such as an SV40 NLS or a NucleoPlasmine NLS. Examples of other nuclear localization signals are known to those having skill in the art. In some embodiments, the NLS comprises an SV40 NLS and a NucleoPlasmine NLS.
B. Guide RNA
The present disclosure provides a genome-targeting nucleic acid, or an agent for producing such (e.g., a polynucleotide comprising a nucleotide sequence encoding a gRNA), that can direct the activities of an associated polypeptide (e.g., an RNA-guided endonuclease) to a specific target sequence within a target nucleic acid. The genome-targeting nucleic acid can be an RNA. A genome-targeting RNA is referred to as a “guide RNA” or “gRNA” herein. In some embodiments, a gene-editing systems comprises one gRNA. In other embodiments, a gene-editing system comprises at least two gRNAs (e.g., two, three, four, five, six, seven, eight, nine, ten, or more than ten gRNAs).
A gRNA of a gene-editing system may be provided in a synthesized form. For example, a guide RNA may be synthesized by chemical means, as illustrated below and described in the art. While chemical synthetic procedures are continually expanding, purifications of such RNAs by procedures such as high performance liquid chromatography (HPLC, which avoids the use of gels such as PAGE) tends to become more challenging as polynucleotide lengths increase significantly beyond a hundred or so nucleotides. One approach used for generating RNAs of greater length is to produce two or more molecules that are ligated together. Much longer RNAs are more readily generated enzymatically. Various types of RNA modifications can be introduced during or after chemical synthesis and/or enzymatic generation of RNAs, e.g., modifications that enhance stability, reduce the likelihood or degree of innate immune response, and/or enhance other attributes, as described in the art.
Alternatively, a gene-editing system may comprise an agent for the production of a gRNA. For example, a gene-editing system may comprise a nucleotide sequence encoding the nucleotide sequence of a gRNA and an additional nucleotide sequence that facilitates expression/production of the gRNA.
A gRNA may be a double-molecule guide RNA. A double-molecule gRNA comprises two strands of RNA. The first strand may comprise in the 5′ to 3′ direction, an optional spacer extension sequence, a spacer sequence and a scaffold sequence a minimum CRISPR repeat sequence. The second strand comprises a minimum tracrRNA sequence (complementary to the minimum CRISPR repeat sequence), a 3′ tracrRNA sequence, and an optional tracrRNA extension sequence.
Alternatively, a gRNA may be a single-molecule guide RNA comprising a spacer sequence and a scaffold sequence. A single-molecule guide RNA may further comprise an optional spacer extension.
(i) gRNA Spacer
As is understood by the person of ordinary skill in the art, each gRNA is designed to include a spacer sequence complementary to its genomic target sequence. See Jinek et al., Science, 337, 816-821 (2012) and Deltcheva et al., Nature, 471, 602-607 (2011). A spacer sequence is a sequence (e.g., a 20 nucleotide sequence) that defines the target sequence (e.g., a DNA target sequences, such as a genomic target sequence) of a target nucleic acid of interest. The gRNA can comprise a variable length spacer sequence with 17-30 nucleotides at the 5′ end of the gRNA sequence. In some embodiments, the spacer sequence is 15 to 30 nucleotides. In some embodiments, the spacer sequence is 15, 16, 17, 18, 19, 29, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, a spacer sequence is 20 nucleotides.
The “target sequence” is adjacent to a PAM sequence and is the sequence modified by an RNA-guided nuclease (e.g., Cas9). The “target nucleic acid” is a double-stranded molecule: one strand comprises the target sequence and is referred to as the “PAM strand,” and the other complementary strand is referred to as the “non-PAM strand.” One of skill in the art recognizes that the gRNA spacer sequence hybridizes to the reverse complement of the target sequence, which is located in the non-PAM strand of the target nucleic acid of interest. Thus, the gRNA spacer sequence is the RNA equivalent of the target sequence. For example, if the target sequence is 5′-AGAGCAACAGTGCTGTGGCC-3′ (SEQ ID NO: 13), then the gRNA spacer sequence is 5′-AGAGCAACAGUGCUGUGGCC-3′ (SEQ ID NO: 14). The spacer of a gRNA interacts with a target nucleic acid of interest in a sequence-specific manner via hybridization (i.e., base pairing). The nucleotide sequence of the spacer thus varies depending on the target sequence of the target nucleic acid of interest.
The spacer sequence is designed to hybridize to a region of the target nucleic acid that is located 5′ of a PAM of the Cas9 enzyme used in the system. The spacer may perfectly match the target sequence or may have mismatches. Each Cas9 enzyme has a particular PAM sequence that it recognizes in a target DNA. For example, S. pyogenes Cas9 recognizes in a target nucleic acid a PAM that comprises the sequence 5′-NRG-3′, where R comprises either A or G, where N is any nucleotide and N is immediately 3′ of the target nucleic acid sequence targeted by the spacer sequence.
In some embodiments, the target nucleic acid sequence comprises 20 nucleotides. In some embodiments, the target nucleic acid comprises less than 20 nucleotides. In some embodiments, the target nucleic acid comprises more than 20 nucleotides. In some embodiments, the target nucleic acid comprises at least: 5, 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30 or more nucleotides. In some embodiments, the target nucleic acid comprises at most: 5, 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30 or more nucleotides. In some embodiments, the target nucleic acid sequence comprises 20 bases immediately 5′ of the first nucleotide of the PAM. For example, in a sequence comprising 5′-NNNNNNNNNNNNNNNNNNNNNRG-3′, the target nucleic acid comprises the sequence that corresponds to the Ns, wherein N is any nucleotide, and the underlined NRG sequence is the S. aureus PAM.
In some embodiments, a gRNA for use in the gene-editing system disclosed herein may direct cleavage by the RNA-guided endonuclease to a target site at position 1220, 2068, 3821, 4262, 5041, 5052, 5278, 5343, 5538, or 6150 of intron 10 in the CFTR gene, wherein the designated number corresponds to the nucleotide positon within intron 10 of the hsCFTR gene (i.e., the first nucleotide of the intron is given a value of 1, the second a value of 2, and so forth). Exemplary gRNAs may comprise one of the following spacer sequences:
(ii) gRNA Scaffold
In some embodiments, the gRNA further comprises a scaffold sequence. A scaffold sequence may comprise the sequence of a minimum CRISPR repeat sequence, a single-molecule guide linker, a minimum tracrRNA sequence, a 3′ tracrRNA sequence, and/or an optional tracrRNA extension sequence. In some embodiments, the scaffold sequence comprises the nucleotide sequence of GUUUUAGUACUCUGGAAACAGAAUCUACUAAAACAAGGCA AAAUGCCGUGUUUAUCUCGUCAACUUGUUGGCGAGAUUUU (SEQ ID NO: 12). The scaffold sequence may be connected to the 5′ and/or 3′ end of a spacer sequence. In some embodiments the scaffold sequence is connected to the 3′ end of the spacer sequence. In other embodiments, the scaffold sequence is connected to the 5′ end of the spacer sequence.
(iii) Exemplary gRNA Sequences
A gRNA for use in the gene-editing system disclosed herein may comprise, consist essentially of (e.g., contain up to 20 extra nucleotides at the 5′ end and/or the 3′ end of the following sequences), or consist of one of the following nucleotide sequences:
It is understood that because the gRNA sequences described above are RNA sequences. Any T (thymine) in the sequences referring to gRNAs would refer to U (or uracil) in the context of RNA molecules. Sequences containing T (thymine) herein would encompass both DNA molecules and RNA molecules (wherein T refers to U).
Moreover, the single-molecule gRNA can comprise no uracil at the 3′ end of the gRNA sequence. The gRNA can comprise one or more uracil at the 3′ end of the gRNA sequence. For example, the gRNA can comprise 1 uracil (U) at the 3′ end of the gRNA sequence. The gRNA can comprise 2 uracil (UU) at the 3′ end of the gRNA sequence. The gRNA can comprise 3 uracil (UUU) at the 3′ end of the gRNA sequence. The gRNA can comprise 4 uracil (UUUU) at the 3′ end of the gRNA sequence. The gRNA can comprise 5 uracil (UUUUU) at the 3′ end of the gRNA sequence. The gRNA can comprise 6 uracil (UUUUUU) at the 3′ end of the gRNA sequence. The gRNA can comprise 7 uracil (UUUUUUU) at the 3′ end of the gRNA sequence. The gRNA can comprise 8 uracil (UUUUUUUU) at the 3′ end of the gRNA sequence.
It is further understood that the nucleotides of the gRNAs described above may comprise modified nucleic acids at any nucleotide position. Accordingly, a gRNA can be unmodified or modified. For example, modified gRNAs can comprise one or more 2′-O-methyl phosphorothioate nucleotides. Examples of additional modified nucleic acids are known to those having skill in the art.
(iv) Ribonucleoprotein Complexes
In some instances, the gene-editing system disclosed herein may comprise a ribonucleoprotein complex (RNP), in which a gRNA and a nuclease (e.g., as described above) form a complex. As used herein, the term “ribonucleoprotein” or “RNP” refers to a protein that is structurally associated with a nucleic acid (either DNA or RNA). For example, in some embodiments, a Cas9 RNA-guided endonuclease and a gRNA of a gene-editing system are in the form of an RNP.
C. Donor Template
A donor template comprises a nucleic acid sequence that is to be inserted into a target site in a DNA sequence (e.g., in an endogenous gene). Accordingly, in some embodiments, a donor template of a gene-editing system may comprise a CFTR mini-gene containing exons 11 to 27 of a CFTR gene. The donor template may further comprise the nucleotide sequence of an acceptor splice site and/or one or more homologous arms.
A donor template of a gene-editing system may be provided in a synthesized form. Alternatively, a gene-editing system may comprise an agent (e.g., a nucleic acid such as a vector) for the production of a donor template. For example, a gene-editing system may comprise a nucleic acid (e.g., a vector) for producing the donor template.
(i) CFTR Mini-gene Comprising Exons 11 to 27 of CFTR
The donor template may comprise a CFTR mini-gene coding for exons 11 to 27 of a CFTR gene (e.g., hsCFTR). The CFTR mini-gene may contain one or more of intron 11 to intron 26 as in a wild-type CFTR gene. For example, the CFTR mini-gene may comprise an intron nucleotide sequence located 3′ to exon 11 of the CFTR gene and/or an intron nucleotide sequence between one or more of exons 11 and 12, exons 12 and 13, exons 13 and 14, exons 14 and 15, exons 15 and 16, exons 16 and 17, exons 17 and 18, exons 18 and 19, exons 19 and 20, exons 20 and 21, exons 21 and 22, exons 22 and 23, exons 23 and 24, exons 24 and 25, exons 25 and 26, and exons 26 and 27. In some instances, the CFTR mini-gene lacks at least one of introns 11-26.
If the CFTR mini-gene contains an intron nucleotide sequence between exon 11 and exon 27, such an intron sequence may be a native CFTR intron. For example, the intron 3′ to exon 11 of CFTR may be a wild-type CFTR intron 10. Likewise, wild-type CFTR intron 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, and/or 26 (or any combination thereof) may be positioned between one or more of exons 11 and 12, exons 12 and 13, exons 13 and 14, exons 14 and 15, exons 15 and 16, exons 16 and 17, exons 17 and 18, exons 18 and 19, exons 19 and 20, exons 20 and 21, exons 21 and 22, exons 22 and 23, exons 23 and 24, exons 24 and 25, exons 25 and 26, and/or exons 26 and 27 (or any combination thereof).
Alternatively, the CFTR mini-gene may contain a synthetic intron (i.e., non-endogenous to the CFTR gene) between exon 11-27. In some embodiments, the first nucleotide sequence comprises a synthetic intron between exons 11 and 12, exons 12 and 13, exons 13 and 14, exons 14 and 15, exons 15 and 16, exons 16 and 17, exons 17 and 18, 18 and 19, 19 and 20, 20 and 21, 21 and 22, 22 and 23, 23 and 24, 24 and 25, 25 and 26, and/or 26 and 27 (or any combination thereof). In some embodiments, a synthetic intron comprises a modified CFTR intron nucleotide sequence (i.e., a native CFTR intron that has been modified by addition or deletion of one or more nucleotides).
In some embodiments, the CFTR mini-gene may be free from any intron sequence (e.g., between exon 11-27). In some instances, the CFTR mini-gene encodes a fragment of a wild-type human CFTR, which is encoded by exon 11-27 of a wild-type human CFTR gene. For example, the CFTR mini-gene may comprise a nucleotide sequence encoding the following CFTR fragment (SEQ ID NO: 36):
In one example, the CFTR mini-gene comprises the nucleotide sequence of SEQ ID NO: 37, which encodes the above-noted CFTR fragment (SEQ ID NO: 36).
Alternatively, the CFTR mini-gene may comprise a nucleotide sequence that shares at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, or at least 95 percent identity with SEQ ID NO: 37 and encodes a functional CFTR fragment (e.g., SEQ ID NO: 36).
(ii) Acceptor Splice Site
A donor template may further comprise a nucleotide fragment located 5′ to the CFTR mini-gene to provide an acceptor splice site such that the 5′ end of the CFTR mini-gene may be connected accurately to the upstream exon 9 via RNA splicing after being inserted into intron 10 of the CFTR gene. In some examples, the nucleotide fragment carrying the splice acceptor site may be a 3′ end fragment of intron 10 of a native CFTR gene. In some embodiments, the acceptor splice site may be the native acceptor splice site of intron 10 of the native CFTR gene. In other embodiments, the acceptor splice site can be a synthetic (i.e., non-native) splice acceptor site. Examples of nucleotide sequences comprising synthetic acceptor splice sites are known to those having skill in the art. Any of the nucleotide fragment carrying a slice acceptor site may be of 50-200 nucleotides in length (e.g., 80-150 or 100-150 nucleotides in length).
For example, in some embodiments, the first fragment encodes for a synthetic acceptor splice site comprises, from 5′ to 3′: TATACACTTCTGCTTAGGATGATAATTGGAGGCAA GTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTTCCAG (SEQ ID NO: 1), wherein the 3′ end AG is the slicing acceptor site.
(iii) Homologous Arms
The donor template may further comprise one or more homologous arms flanking the CFTR mini-gene to allow for efficient homology dependent recombination (HDR) at a genomic location of interest (e.g., intron 10 of CFTR). The length of a homologous arm may vary. For example, a homologous arm may be at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 950, or at least 1000 nucleotides in length. Likewise, a homologous arm may be 50 to 100, 50 to 200, 50 to 300, 50 to 400, 50 to 500, 50 to 600, 50 to 700, 50 to 800, 50 to 900, 50 to 1000, 100 to 200, 100 to 300, 100 to 400, 100 to 500, 100 to 600, 100 to 700, 100 to 800, 100 to 900, 100 to 1000, 200 to 300, 200 to 400, 200 to 500, 200 to 600, 200 to 700, 200 to 800, 200 to 900, 200 to 1000, 300 to 400, 300 to 500, 300 to 600, 300 to 700, 300 to 800, 300 to 900, 300 to 1000, 400 to 500, 400 to 600, 400 to 700, 400 to 800, 400 to 900, 400 to 1000, 500 to 600, 500 to 700, 500 to 800, 500 to 900, 500 to 1000, 600 to 700, 600 to 800, 600 to 900, 600 to 1000, 700 to 800, 700 to 900, 700 to 1000, 800 to 900, 800 to 1000, or 900 to 1000 nucleotides in length. In particular, a homologous arm may be 500 nucleotides in length.
For example, in some embodiments a donor template comprises a 5′ homologous arm (i.e., positioned upstream to the first nucleotide sequence) and a 3′ homologous arm (i.e., positioned downstream to the first nucleotide sequence), wherein the 5′ homologous arm comprises a nucleic acid sequence that is homologous to a region upstream to the genomic location of interest, and wherein the 3′ homologous arm comprises a nucleic acid sequence that is homologous to a region downstream to the genomic location of interest. In some embodiments, the 5′ homologous arm and the 3′ homologous arm comprise nucleotide sequences selected from the group consisting of:
(i) SEQ ID NO: 17 and SEQ ID NO: 18, respectively;
(ii) SEQ ID NO: 19 and SEQ ID NO: 20, respectively;
(iii) SEQ ID NO: 21 and SEQ ID NO: 22, respectively;
(iv) SEQ ID NO: 23 and SEQ ID NO: 24, respectively;
(v) SEQ ID NO: 25 and SEQ ID NO: 345, respectively;
(vi) SEQ ID NO: 346 and SEQ ID NO: 347, respectively;
(vii) SEQ ID NO: 348 and SEQ ID NO: 349, respectively;
(viii) SEQ ID NO: 350 and SEQ ID NO: 351, respectively;
(ix) SEQ ID NO: 352 and SEQ ID NO: 353, respectively; and
(x) SEQ ID NO: 354 and SEQ ID NO: 355, respectively.
In other embodiments, the donor template may comprise a 5′ homologous arm and lack a 3′ homologous arm. In yet other embodiments, the donor template may comprise a 3′ homologous arm and lack a 5′ homologous arm. A 5′ or 3′ homologous arm may comprise the nucleotide sequence of any one of SEQ ID NOs: 17-25 and SEQ ID NOs: 345-355, respectively.
Alternatively, a donor template may lack homologous arms. For example, in some instances, a donor template may be integrated by NHEJ-dependent end joining following cleavage at the target site.
In some instances, the donor template may comprise, from 5′ end to 3′ end, a 5′ homologous arm, a nucleotide fragment containing a splice acceptor site, a CFTR mini-gene, and a 3′ end homologous arm. These components may be linked directly. Alternatively, they may be linked via a nucleotide linker.
A donor template can be DNA or RNA, single-stranded and/or double-stranded, and can be introduced into a cell in linear or circular form. If introduced in linear form, the ends of the donor sequence can be protected (e.g., from exonucleolytic degradation) by methods known to those of skill in the art. For example, one or more dideoxynucleotide residues are added to the 3′ terminus of a linear molecule and/or self-complementary oligonucleotides are ligated to one or both ends. See, for example, Chang et al., (1987) Proc. Natl. Acad. Sci. USA 84:4959-4963; Nehls et al., (1996) Science 272:886-889. Additional methods for protecting exogenous polynucleotides from degradation include, but are not limited to, addition of terminal amino group(s) and the use of modified internucleotide linkages such as, for example, phosphorothioates, phosphoramidates, and O-methyl ribose or deoxyribose residues.
A donor template can be introduced into a cell as part of a vector molecule having additional sequences such as, for example, replication origins, promoters and genes encoding antibiotic resistance. Moreover, a donor template can be introduced as naked nucleic acid, as nucleic acid complexed with an agent such as a liposome or poloxamer, or can be delivered by viruses (e.g., adenovirus, AAV, herpesvirus, retrovirus, lentivirus and integrase defective lentivirus (IDLV)).
A donor template, in some embodiments, is inserted so that its expression is driven by the endogenous promoter, such as the promoter that drives expression of the endogenous gene into which the donor is inserted.
Furthermore, exogenous sequences may also include transcriptional or translational regulatory sequences, for example, promoters, enhancers, insulators, internal ribosome entry sites, sequences encoding 2A peptides and/or polyadenylation signals.
D. Viral Vector/Viral Particle-Based Gene-Editing System
In some embodiments, the gene-editing system disclosed herein may comprise polynucleic acids (e.g., vectors such as viral vectors) or viral particles comprising such. The polynucleic acid(s) produces the components (e.g., a nuclease, a gRNA, and a donor template) for editing a CFTR gene as described herein.
In some examples, the gene-editing system comprises one polynucleic acid capable of producing all components of the gene-editing system, including a nuclease, a gRNA, and a donor template. In other examples, the gene-editing system comprises two polynucleic acids, one encoding the nuclease and the gRNA and the other comprising the donor template. Alternatively, the gene-editing system comprises two polynucleic acids, one encoding the nuclease and the other comprising the donor template and encoding the gRNA. In another example, the gene-editing system comprises three polynucleic acids, the first one encoding the nuclease, the second one encoding the gRNA, and the third one comprising the donor template.
The nucleic acid (or at least one nucleic acid in the set of nucleic acids) may be a vector such as a viral vector, such as a retroviral vector, an adenovirus vector, an adeno-associated viral (AAV) vector, and a herpes simplex virus (HSV) vector.
In some examples, the gene-editing system may comprise one or more viral particles that carry genetic materials for producing the components of the gene-editing system as disclosed herein. A viral particle (e.g., AAV particle) may comprise one or more components (or agents for producing one or more components) of a gene-editing system (e.g., as described herein). A viral particle (or virion) comprises a nucleic acid, which encodes the viral genome, and an outer shell of protein (i.e., a capsid). In some instances, a viral particle further comprises an envelope of lipids that surround the protein shell.
In some examples, a viral particle comprises a polynucleic acid capable of producing all components of the gene-editing system, including a nuclease, a gRNA, and a donor template. In other examples, a viral particle comprises a polynucleic acid capable of producing one or more components of the gene-editing system. For example a viral particle may comprise a polynucleic acid capable of producing the nuclease and the gRNA. Alternatively, a viral particle may comprise a polynucleic acid capable of producing the donor template and encoding the gRNA. In another example, a viral particle may comprise a polynucleic acid capable of producing only one of the nuclease, the gRNA, or the donor template.
The viral particles described herein may be derived from any viral particle known in the art including, but not limited to, a retroviral particle, an adenovirus particle, an adeno-associated viral (AAV) particle, or a herpes simplex virus (HSV) particle. In some embodiments, the viral particle is an AAV particle.
In some embodiments, a set of viral particles comprises more than one gene-editing system. In some embodiments, each viral particle in the set of viral particles is an AAV particle. In other embodiments, a set of viral particles comprises more than one type of viral particle (e.g., a retroviral particle, an adenovirus particle, an adeno-associated viral (AAV) particle, or a herpes simplex virus (HSV) particle).
E. Additional Exemplary Gene-Editing Systems
In addition, the gene-editing system disclosed herein may comprise a nuclease (e.g., a Cas9 enzyme) as disclosed herein. Such a gene-editing system may further comprise the gRNA, and the donor template. The nuclease and the gRNA, optionally in combination with the donor template, may form an RNP for delivery. Further, the gene-editing system may further comprise the gRNA and a polynucleic acid (e.g., a vector as those described herein) for producing the donor template. The nuclease and the gRNA may form an RNP complex. Alternative, the gene-editing system may further comprise one or more polynucleic acids for producing the gRNA and the donor template.
Alternatively, the gene-editing system disclosed herein may comprise an agent for produce the nuclease, for example, an expression vector such as a viral vector as disclosed herein capable of expressing the nuclease. Such a gene-editing system may further comprise the gRNA and/or the donor template, or agents for producing such.
Any other format of the gene-editing system comprising the components as disclosed herein for modifying the CFTR gene or agents producing such are within the scope of the present disclosure.
In some aspects, the disclosure relates to methods of editing a cystic fibrosis transmembrane regulator (CFTR) gene using any of the gene-editing systems disclosed herein. An editing event may correct a mutation in a CFTR gene. One or more copies (i.e., alleles) of a gene (e.g., CFTR) may comprise a mutation. In some embodiments, the methods of gene editing described herein may be used to correct at least one copy (i.e., allele) of a CFTR gene. In some embodiments, two copies (i.e., alleles) of a CFTR gene are edited.
More than 2000 mutations in CFTR have been described that confer a range of molecular and functional phenotypes. See e.g., Veit G. et al., Mol. Biol. Cell. 2016 Feb. 1; 27(3): 424-33. Examples include, but are not limited to, M1V, A46D, E56K, P67L, R74W, G85E, E92K, P99L, D110H, D110E, R117C, R117H, R170G, G178R, E193K, P205S, L206W, V232D, R334W, R334W, I336K, T338I, S341P, R347P, R347H, R352Q, L467P, S492F, ΔI507, ΔF508, V520F, G542X, S549R, S549N, G551S, G551N, G551D, A455E, S549N, R553X, A559T, R560T, R560S, R560K, A561E, Y569D, D579G, D614G, R668C, L927P, S945L, S977F, L997F, F1052V, H1054D, K1060T, L1065P, R1066C, R1066M, R1066H, A1067T, R1070Q, R1070W, F1074L, H1085R, M1101K, D1270N, D1152H, L1077P, S1235R, G1244E, S1251N, S1255P, W1282X, N1303K, G1349D, Q1411X, 2789+5G>A, and 3849+10kbC>T. In some embodiments, the methods described herein may be used to correct one or more mutation(s) listed above or otherwise known in the prior art.
A method of editing a CFTR gene may comprise contacting a cell with: a gene-editing system as described herein; a viral particle or set of viral particle comprising a gene-editing system as described herein; and/or a nucleic acid or set of nucleic acids comprising a gene-editing system as described herein. These methods may be performed, for example, on one or more cells existing within a living subject (e.g., in vivo). Alternatively or in addition, these methods may be performed on one or more cells existing in culture (e.g., in vitro). In some instances, a cell edited in culture is then administered to a subject (categorized herein as “cell-based therapy”).
A. Delivery Methods
The contacting of the cell (or subject) with the gene-editing system, viral particle or set of viral particles, and/or nucleic acid or set of nucleic acids may be performed via various delivery methods. For example, nucleases and/or donor templates may be delivered using a vector system, including, but not limited to, plasmid vectors, DNA minicircles, retroviral vectors, lentiviral vectors, adenovirus vectors, poxvirus vectors; herpesvirus vectors and adeno-associated virus vectors, and combinations thereof.
Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids encoding nucleases and donor templates in cells. Non-viral vector delivery systems include DNA plasmids, DNA minicircles, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell.
Methods of non-viral delivery of nucleic acids include, but are not limited to, electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, naked RNA, capped RNA, artificial virions, and agent-enhanced uptake of DNA. Sonoporation using, e.g., the Sonitron 2000 system (Rich-Mar) can also be used for delivery of nucleic acids.
Methods for delivery of proteins (e.g., RNA-guided endonucleases) include, but are not limited to, the use of cell-penetrating peptides and nanovehicles.
(i) Adeno-Associated Viral Delivery
The donor nucleic acid encoding exons 11 to 27 of the CFTR gene may be delivered to a cell using an adeno-associated virus (AAV). AAVs are small viruses which integrate site-specifically into the host genome and can therefore deliver a transgene, such as exons 11 to 27 of the CFTR gene. Inverted terminal repeats (ITRs) are present flanking the AAV genome and/or the transgene of interest and serve as origins of replication. Also present in the AAV genome are rep and cap proteins which, when transcribed, form capsids which encapsulate the AAV genome for delivery into target cells. Surface receptors on these capsids which confer AAV serotype, which determines which target organs the capsids will primarily bind and thus what cells the AAV will most efficiently infect. There are twelve currently known human AAV serotypes. In some embodiments, the AAV is AAV serotype 6 (AAV6).
Adeno-associated viruses are among the most frequently used viruses for gene therapy for several reasons. First, AAVs do not provoke an immune response upon administration to mammals, including humans. Second, AAVs are effectively delivered to target cells, particularly when consideration is given to selecting the appropriate AAV serotype. Finally, AAVs have the ability to infect both dividing and non-dividing cells because the genome can persist in the host cell without integration. This trait makes them an ideal candidate for gene therapy.
(ii) Homology-Directed Repair (HDR)
The donor nucleic acid encoding exons 11 to 27 of the CFTR gene may be inserted into the target genomic region of the edited cell by homology directed repair (HDR). Both strands of the DNA at the target genomic region are cut by a CRISPR Cas9 enzyme. HDR then occurs to repair the double-strand break (DSB) and insert the donor DNA. For this to occur correctly, the donor sequence is designed with flanking residues which are complementary to the sequence surrounding the DSB site in the target gene (hereinafter “homology arms”). These homology arms serve as the template for DSB repair and allow HDR to be an essentially error-free mechanism. The rate of homology directed repair (HDR) is a function of the distance between the mutation and the cut site so choosing overlapping or nearby target sites is important. Templates can include extra sequences flanked by the homologous regions or can contain a sequence that differs from the genomic sequence, thus allowing sequence editing.
(iii) Non-Homologous End Joining (NHEJ)
The NHEJ pathway may also produce, at very low frequency, inserts containing exons 11-27. Such repair should correct CFTR expression when the insert is in the sense strand orientation.
B. Cell Therapy
The methods described herein may be performed on one or more cells existing in culture (e.g., in vitro). Accordingly, in some aspects, the disclosure relates to genetically edited cells comprising an edited cystic fibrosis transmembrane regulator (CFTR) gene in which an exogenous nucleic acid is inserted into intron 10 of the endogenous CFTR gene. The exogenous sequence may comprise one or more of the components of the donor template described above (e.g., a nucleotide sequence encoding exons 11 to 27 of CFTR, a nucleotide sequence encoding an acceptor splice site, and/or a nucleotide sequence encoding one or more homologous arm).
(i) Cells with Edited CFTR Gene
Genetically-edited cells may be produced using any of the methods described herein. In some embodiments, one or more gene edits within a population of edited cells results in a phenotype associated with changes in CFTR functionality.
In some embodiments, genetically-edited cells of the present disclosure exhibit increased CFTR activity (e.g., by at least 30%, 50%, 100%, 2-fold, 5-fold, or 10-fold) relative to the unedited control. For example, the levels of CFTR activity may be increased by at least 30%, at least 50%, at least 100%, at least 200%, at least 500%, at least 1000% relative to control unedited cells. In some embodiments, the levels of CFTR activity may be increased by 30%-50%, 30%-100%, 30%-200%, 30%-500%, 30%-1000%, 50%-100%, 50%-200%, 50%-500%, 50%-1000%, 100%-200%, 100%-500%, 100%-1000%, 200%-500%, 200%-1000%, or 500%-1000% relative to control T cells.
(ii) Methods of Administration
In some instances, a genetically edited cell may be administered to a subject. The step of administering may include the placement (e.g., transplantation) of genetically engineered cells into a subject, by a method or route that results in at least partial localization of the introduced cells at a desired site, such that a desired effect(s) is produced and where at least a portion of the implanted cells or components of the cells remain viable. The period of viability of the cells after administration to a subject can be as short as a few hours, e.g., twenty-four hours, to a few days, to as long as several years, or even the life time of the subject, i.e., long-term engraftment. In some embodiments, the administration is to the respiratory tract of the subject.
Modes of administration include injection, infusion, instillation, or ingestion. Injection includes, without limitation, intravenous, intramuscular, intra-arterial, intrathecal, intraventricular, intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, intraarticular, sub capsular, subarachnoid, intraspinal, intracerebro spinal, and intrasternal injection and infusion. In some embodiments, the route is intravenous.
In some embodiments, genetically engineered cells are administered systemically, which refers to the administration of a population of cells other than directly into a target site, tissue, or organ, such that it enters, instead, the subject's circulatory system and, thus, is subject to metabolism and other like processes.
For use in the various aspects described herein, an effective amount of genetically engineered cells comprises at least 102 cells, at least 5×102 cells, at least 103 cells, at least 5×103 cells, at least 104 cells, at least 5×104 cells, at least 105 cells, at least 2×105 cells, at least 3×105 cells, at least 4×105 cells, at least 5×105 cells, at least 6×105 cells, at least 7×105 cells, at least 8×105 cells, at least 9×105 cells, at least 1×106 cells, at least 2×106 cells, at least 3×106 cells, at least 4×106 cells, at least 5×106 cells, at least 6×106 cells, at least 7×106 cells, at least 8×106 cells, at least 9×106 cells, or multiples thereof. In some examples described herein, the cells are expanded in culture prior to administration to a subject in need thereof.
C. Effective Amount
In some aspects, the disclosure relates to methods of administering an effective amount of a gene-editing system as descried herein, a viral particle or set of viral particles comprising a gene-editing system as described herein, a nucleic acid or set of nucleic acids comprising a gene-editing system as described herein, or a composition of edited cells as described herein to a subject in need thereof.
A subject may be any subject for whom diagnosis, treatment, or therapy is desired. In some embodiments, the subject is a mammal. In some embodiments, the subject is a human. In some embodiments, the subject is a human patient having cystic fibrosis. In some embodiments, the human patient is a child.
An effective amount refers to the amount of a gene-editing system, a viral particle or set of viral particles comprising a gene-editing system, a nucleic acid or set of nucleic acids comprising a gene-editing system, or a population of genetically engineered cells needed to prevent or alleviate at least one or more signs or symptoms of a medical condition (i.e., CF), and relates to a sufficient amount of a composition to provide the desired effect (i.e., to treat a subject having CF). An effective amount also includes an amount sufficient to prevent or delay the development of a symptom of the disease, alter the course of a symptom of the disease (for example but not limited to, slow the progression of a symptom of the disease), or reverse a symptom of the disease. It is understood that for any given case, an appropriate effective amount can be determined by one of ordinary skill in the art using routine experimentation.
The efficacy of a treatment comprising a composition for the treatment of a medical condition can be determined by the skilled clinician. A treatment is considered an “effective treatment,” if any one or all of the signs or symptoms of, as but one example, levels of functional target are altered in a beneficial manner (e.g., increased by at least 10%), or other clinically accepted symptoms or markers of disease (e.g., CF) are improved or ameliorated. Efficacy can also be measured by failure of a subject to worsen as assessed by hospitalization or need for medical interventions (e.g., progression of the disease is halted or at least slowed). Methods of measuring these indicators are known to those of skill in the art and/or described herein. Treatment includes any treatment of a disease in subject and includes: (1) inhibiting the disease, e.g., arresting, or slowing the progression of symptoms; or (2) relieving the disease, e.g., causing regression of symptoms; and (3) preventing or reducing the likelihood of the development of symptoms.
The present disclosure also provides kits for use of the compositions described herein. For example, the present disclosure provides kits comprising a gene-editing system as described herein; a viral particle or set of viral particle comprising a gene-editing system as described herein; a nucleic acid or set of nucleic acids comprising a gene-editing system as described herein; and/or a population of genetically-edited cells as described herein.
In some embodiments, the kit can additionally comprise instructions for use in any of the methods described herein. The included instructions may comprise a description of: (i) the delivery of a gene-editing system as described herein; a viral particle or set of viral particle comprising a gene-editing system as described herein; and/or a nucleic acid or set of nucleic acids comprising a gene-editing system as described herein; and/or (ii) the administration of a population of genetically-edited cells as described herein.
The kit may further comprise a description of selecting a subject suitable for treatment based on identifying whether the subject is in need of the treatment. The instructions may include information as to dosage, dosing schedule, and route of administration for the intended treatment. The containers may be unit doses, bulk packages (e.g., multi-dose packages) or sub-unit doses. Instructions supplied in the kits of the disclosure are typically written instructions on a label or package insert. The label or package insert indicates that the pharmaceutical compositions are used for treating, delaying the onset, and/or alleviating a disease or disorder in a subject.
The kits provided herein are in suitable packaging. Suitable packaging includes, but is not limited to, vials, bottles, jars, flexible packaging, and the like. Also contemplated are packages for use in combination with a specific device, such as an inhaler, nasal administration device, or an infusion device. A kit may have a sterile access port (for example, the container may be an intravenous solution bag or a vial having a stopper pierceable by a hypodermic injection needle). The container may also have a sterile access port.
Kits optionally may provide additional components such as buffers and interpretive information. Normally, the kit comprises a container and a label or package insert(s) on or associated with the container. In some embodiment, the disclosure provides articles of manufacture comprising contents of the kits described above.
The practice of the present disclosure will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature, such as Molecular Cloning: A Laboratory Manual, second edition (Sambrook, et al., 1989) Cold Spring Harbor Press; Oligonucleotide Synthesis (M. J. Gait, ed. 1984); Methods in Molecular Biology, Humana Press; Cell Biology: A Laboratory Notebook (J. E. Cellis, ed., 1989) Academic Press; Animal Cell Culture (R. I. Freshney, ed. 1987); Introduction to Cell and Tissue Culture (J. P. Mather and P. E. Roberts, 1998) Plenum Press; Cell and Tissue Culture: Laboratory Procedures (A. Doyle, J. B. Griffiths, and D. G. Newell, eds. 1993-8) J. Wiley and Sons; Methods in Enzymology (Academic Press, Inc.); Handbook of Experimental Immunology (D. M. Weir and C. C. Blackwell, eds.): Gene Transfer Vectors for Mammalian Cells (J. M. Miller and M. P. Calos, eds., 1987); Current Protocols in Molecular Biology (F. M. Ausubel, et al. eds. 1987); PCR: The Polymerase Chain Reaction, (Mullis, et al., eds. 1994); Current Protocols in Immunology (J. E. Coligan et al., eds., 1991); Short Protocols in Molecular Biology (Wiley and Sons, 1999); Immunobiology (C. A. Janeway and P. Travers, 1997); Antibodies (P. Finch, 1997); Antibodies: a practice approach (D. Catty., ed., IRL Press, 1988-1989); Monoclonal antibodies: a practical approach (P. Shepherd and C. Dean, eds., Oxford University Press, 2000); Using antibodies: a laboratory manual (E. Harlow and D. Lane (Cold Spring Harbor Laboratory Press, 1999); The Antibodies (M. Zanetti and J. D. Capra, eds. Harwood Academic Publishers, 1995); DNA Cloning: A practical Approach, Volumes I and II (D. N. Glover ed. 1985); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1985»; Transcription and Translation (B. D. Hames & S. J. Higgins, eds. (1984»; Animal Cell Culture (R. I. Freshney, ed. (1986»; Immobilized Cells and Enzymes (IRL Press, (1986»; and B. Perbal, A practical Guide To Molecular Cloning (1984); F. M. Ausubel et al. (eds.).
Without further elaboration, it is believed that one skilled in the art can, based on the above description, utilize the present disclosure to its fullest extent. The following specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever. All publications cited herein are incorporated by reference for the purposes or subject matter referenced herein.
Methods
Cells and Culture: Lung Progenitor Cells (LPCs) were derived from human lung donors diagnosed with Cystic fibrosis. LPC donor ID numbers 14071 and 14335 contain the CFTR genotype dF508/dF508 and dF508/G542X, respectively. LPCs were derived and expanded using BEGM™ Bronchial Epithelial Cell Growth Medium (Lonza) supplemented with Vertex's proprietary reagents.
Human Bronchial Epithelial cells (HBEs) were obtained by directed differentiation of LPCs using the Air Liquid Interface (ALI) culture format. 80,000 LPC cells were seeded per well in the apical side of an ALI-96-well plate. LPC were fed from the apical and basolateral side every other day during 5 days by using BEGM™ Bronchial Epithelial Cell Growth Medium (Lonza) supplemented with Vertex's proprietary reagents. At day 6, apical media was removed to promote Air Liquid Interface. Basolateral media was replaced with Vertex's proprietary HBE differentiation media. Cells were fed every other day by replacing HBE differentiation media in the basolateral side. Complete HBE differentiation was obtained after 5 weeks of HBE differentiation.
CRISPR-Cas9 Gene-Editing Reagents: Synthetic gRNAs were purchased from Synthego. gRNAs were HPLC (high-performance liquid chromatography) purified and contained chemically modified nucleotides (2′-O-methyl 3′-phosphorothioate) at the three terminal positions at both the 5′ and 3′ ends. gRNAs contained a 22 nucleotide long spacer sequence to specifically target intron 10 of CFTR gene (see TABLE 1) and an 80 nucleotide long scaffold sequence that allows binding to saCAS9 protein.
saCas9 mRNA was design by CRISPR Therapeutics and synthetized by TriLink Biotechnologies. saCas9 mRNA expresses a Staphylococcus aureus Cas9 (Uniprot entry code J7RUA5) with SV40 and NucleoPlasmine nuclear localization signals. saCas9 mRNA also contains a CAP1 structure and a polyadenylated signal to obtain optimal expression levels in mammalian cells. saCAS9 mRNA was HPLC purified.
CFTR Super-Exon AAV Donor Construct Design and AAV Transduction CFTR intron 10 AAV donor construct contained: (i) 500 nucleotide long left and right homology arms relative to gRNA cut site, (ii) a splice acceptor site, (iii) cDNA of wild type CFTR gene since exon 11 until 27, and (iv) a stop and polyadenylation sequence. AAV donor preparations were made by using a AAV6 serotype, purified and titrated by Vector Biolabs. AAV titration was reported as viral genomes per mL. AAV transductions were performed by adding AAV6 vector into cells at specified vector genome copies per cell for 36 hours at 37° C.
Electroporation: Electroporations were performed by using the Lonza 4D-Nucleofector™ System coupled to 96-well shuttle system. 1.8×105 LPC cells were resuspended in 20 μL of P4 Electroporation buffer Lonza (V4SP-4096). 20 μL of cell mixture was combined with 2 μL of CRISPR-Cas9 reagent mix containing 1 μg of saCAS9 mRNA and 1 μg of gRNA. 20 μL of cell and CRISPR-Cas9 mixture was transferred into one well of a 96-well electroporation plate. Cells were electroporated by using program CM-138. A fraction of electroporated LPC cells were transferred into a well of a 384-well plate or an ALI-96-well plate containing LPCs culture media. Cells were incubated at 37° C. in a 5% CO2 incubator.
Genomic DNA Isolation: Genomic DNA was isolated by incubating cells for 30 min at 37° C. with 50 μL and 15 μL of DNA Quick extract solution (Epicentre) per well of a 96-well and a 384-well plate, respectively. Cellular extract was mixed and transferred into a 96-well PCR plate and then incubated for 6 min at 65° C. and 2 min at 98° C. Genomic DNA was immediately use in downstream applications or it was store at 4° C.
Measurement of INDEL Rates for CFTR Intron gRNA Screening: Long-range PCR PrimeSTAR kit (R045B, TAKARA) was used to amplify a 12 Kb DNA fragment corresponding to CFTR intron 10. PCR reactions were carried out following manufacturer instructions. In brief, 2.5 μL of genomic DNA solution was combined with 47.5 μL of PrimeSTAR master mix containing dNTPS, 5×GXL buffer, GXL Polymerase and the corresponding CFTR Intron 10 forward and reverse primers (CFTR 12 Kb F3: GCTACCAGTGTGATGGAGTAG (SEQ ID NO: 160) and CFTR 12 Kb R3: AGCCAGGGATACAATATCTTCACAA (SEQ ID NO: 161) at 250 nM each). PCR reactions were performed using the following thermal cycling protocol: 1) 94° C. 30 s; 2), 94° C. 10 s; 3), 68° C. 10 min; 4) repeat steps 2-3 32 times.
PCR reactions for the VEGFA positive control were performed using Phire Green Hot Start II PCR Master Mix kit (F126L, Thermo Scientific) following manufacturer instructions. In brief, 1.5 μL of genomic DNA solution was combined with 18.5 μL of Phire master mix with the corresponding VEGFA forward and reverse primer pair (VEGFA-F1: CCAGTCACTGACTAACCCCG (SEQ ID NO: 162) and VEGFA-R1: ACTCTGTCCAGAGACACGCG (SEQ ID NO: 163) at 625 nM each). PCR reactions were performed using the following thermal cycling protocol: 1) 98° C. 30 s; 2) 98° C. 5 s; 3) 60° C. 5 s; 472°) C. 10 s 5) repeat steps 2-4 35 times; 6), 72° C. 4 min.
The PCR products were enzymatically purified, and CFTR intron 10 PCR products were amplified using Rolling Cycle Amplification at GENEWIZ. DNA samples were Sanger sequenced using sequencing primers that bind near the cut site of sgRNA tested (TABLE 2). Each sequencing chromatogram was analyzed using TIDE software against reference sequences (tide.nki.nl). References sequences were obtained from mock-electroporated samples. Tide parameters were set to cover an indel spectrum of +/−30 nucleotides of gRNA cut site and the decomposition window was set to cover the largest possible window with high-quality traces. Total indel (insertion and deletions) rates were obtained directly from TIDE plots. GraphPad Prism 7 software was used to make Graphs and to calculate the all Statistical information.
Measurement of CFTR Super-Exon Insertion Rates by Droplet Digital PCR (ddPCR): HDR-mediated insertion of CFTR super-exon 10 was assessed by ddPCR (QX200, Bio-Rad Laboratories, Inc.). Multiplex ddPCR assays were performed to specifically determine the amount of HDR-edited alleles relative to total alleles present in the sample. GAPDH allele count was used to determine the total number of alleles amplified by ddPCR. ddPCR were performed (1× assay: 900 nM primers, and 250 nM each probe) by using 2 ddPCR probes per assay (IDT, Inc.). One FAM-labeled probe was specific for HDR-edited alleles with one primer positioned outside of the template matching region of CFTR super-exon to prevent amplification of donor template. The second HEX-labeled probe and pair of primers were specific for GAPDH. ddPCR assays were assembled and droplets were generated by an Automated Droplet Generator (Bio-Rad). ddPCR assays were performed by using ddPCR supermix for probes (no dUTP) using the following thermal cycling protocol: 1) 95° C. 5 min; 2), 94° C. 30 s; 3), 58° C. 1 min; 4), 72° C. 3 min; 5) repeat steps 2-4 39 times; 6), 98° C. 5 min, with all the steps ramped by 2° C./s. QuantaSoft was used for quantification (Bio-Rad). The fraction of positive droplets corresponding to FAM and HEX fluorescent channels determines the amount of HDR-edited and GAPDH alleles, respectively. Percentage of HDR-edited alleles was calculated relative to GAPDH alleles which represent 100% of amplified alleles in the sample. The confidence intervals for each well were calculated by QuantaSoft based on Poisson distribution. Primers and probes sequences are shown in TABLE 3 and TABLE 4.
Measurement of LPC Cell Survival Rates: Cells were incubated with 5 μg/mL of Hoechst 33342 (Life technologies: H3570) and 0.5 μg/mL of Propidium Iodide (PI; Life technologies: P3566) in culture media for 1 hour at 37° C. Cells were imaged to measure Hoescht positive events (Live and death cells) and PI positive events (Death cells) by using a High-Content Imaging System (Molecular devices). Relative cell survival rate was calculated as follows: [(Hoescht+events −PI+events) of Sample]/(Hoescht+events −PI+events) of Control]*100. Control was Mock transfected cells and its cell survival rate was set arbitrarily as 100%.
Measurement of CFTR Function in HBEs: Using chamber experiments were performed on polarized airway epithelial cells expressing dF508del to assess the functional efficacy of gene-edited cells. LPC-derived HBEs were grown on Costar® Snapwell™ cell culture inserts and mounted in an Ussing chamber (Physiologic Instruments, Inc., San Diego, Calif.). The transepithelial resistance and short-circuit current in the presence of a basolateral to apical chloride gradient (Isc) were measured using a voltage-clamp system (Department of Bioengineering, University of Iowa. IA). Briefly, LPC-derived HBEs were examined under voltage-clamp recording conditions (Vhold=0 mV) at 37° C. The basolateral solution contained (in mM) 145 NaCl, 0.83 K2HPO4, 3.3 KH2PO4, 1.2 MgCl2, 1.2 CaCl2, 10 Glucose, 10 HEPES (pH adjusted to 7.35 with NaOH) and the apical solution contained (in mM) 145 NaGluconate, 1.2 MgCl2. 1.2 CaCl2, 10 glucose, 10 HEPES (pH adjusted to 7.35 with NaOH). Positive controls for CFTR function were dF508del cells treated with CFTR modulators cocktail (TC) on the basolateral side during 18-24 h prior to assay. Negative controls were cells treated with DMSO. Forskolin (10 μM) was added to the apical side during the assay to stimulate CFTR-mediated chloride transport. A CFTR-inhibitor cocktail (30 μM) was subsequently added to the apical side during the assay to inhibit CFTR-mediated chloride transport. CFTR function is expressed as μA/cm2 and it is calculated by using the following formula: Maximum Forskolin-induced current Minimal current in the presence of the CFTR inhibitor cocktail.
Nucleotide Sequences of Exemplary sgRNA and Donor Template Pairs
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTATTGAGGAAATAAATTTAA
AGACATGAAAGAATCAAATTAGAGATGAGAAAGAGCTTTCTAGTATTAGAATGGGCTAAAG
GGCAATAGGTATTTGCTTCAGAAGTCTATAAAATGGTTCCTTGTTCCCATTTGATTGTCATTT
TAGCTGTGGTACTTTGTAGAAATGTGAGAAAAAGTTTAGTGGTCTCTTGAAGCTTTTCAAAA
TACTTTCTAGAATTATACCGAATAATCTAAGACAAACAGAAAAAGAAAGAGAGGAAGGAA
GAAAGAAGGAAATGAGGAAGAAAGGAAGTAGGAGGAAGGAAGGAAGGAAAGAAGGAAGG
AAGTAAGAGGGAAGCAGTGCTGCTGCTGTAGGTAAAAATGTTAATGAAAATAGAAATTAAG
AAAGACTCCTGAAAGGCAATTATTTATCAATATCTAAGATGAGGAGAACCATATTTTGAAG
AATTGAATATGAGACTTGGGAAACAAAATGCCACAAAAAATTTCCACTCAATAA
TATACACT
TCTGCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGAT
GGGTTTTATTTCCAGacttactgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttatat
gacatgataagatacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgcttta
tttgtaaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
AT
TTGGTGTCAGGCTGGGTGCAGTGGCTCACACTTGTAATCCTAGCACTTTTGGAGGCAGAGGC
AGGTGAATTGCTTGAGTCCAGGAGTTTGAGACCAGCGTGGGCAACATGGCAAACCCCACCT
CTACAAAAAACACAAACAAAAGAAAATAGCTGGGTGTGGTGGTGTGTGCCTGTAGTCCCAG
CTACTTGGGAGGCTGAGGTGGGAGGATCACCTGAGCCTGAGAAGTGGAGGCTGCAGTGAGC
CATGATTGCACCACTGTACCCTAGCCTAGGTGATAGGCTCAAAAAAAAAAAAAATTGGTGT
TTGCAATGCTAATAATACAATTTGGTTGTTTCTCTCTCCAGTTGTTTTCCTACATACGAAACA
GCTTTTAAAACAAAATAGCTGGAATTGTGCATTTTTTCTTACAAAAACATTTTCTTTCTTAAA
ATGTTATTATTTTTCTTTTATATCTTGTATATTATTACTAGCAGTGTTCACTATTAAAAAATTA
TAGGTAACCACGTGCGGACCGAGGCTGCAGCGTCGTCCTCCCTAGGAACCCCTAGTGATG
GAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGG
TCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCT
GCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 17 and SEQ ID NO: 18, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 2068
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTCAGCTTTTAAAACAAAATA
GCTGGAATTGTGCATTTTTTCTTACAAAAACATTTTCTTTCTTAAAATGTTATTATTTTTCTTT
TATATCTTGTATATTATTACTAGCAGTGTTCACTATTAAAAAATTATACTATAGGAGGGGCT
GATACTAAATAAGTTAGCAATGGTCTAAACAAGGATGTTTATTTATGAAAAGGTAGTAATTG
TGTTTCATAGAATTTTTAAAATTAATTCTGCGTATGTCTTCAAGATCAATTCTATGATAGATG
TGCAAAAATAGCTTTGGAATTACAAATTCCAAGACTTACTGGCAATTAAATTTCAGGCAGTT
TTATTAAAATTGATGAGCAGATAATTACTGGCTGACAGTGCAGTTATAGCTTATGAAAAGCA
GCTATGAAGGCAGAGTTAGAGGAAGGCAGTGGTCCCTTGGGAATATTTAAACACTTCTGAG
AAACGGAGTTTACTAACTCAATCTAGGAGGCTGCCTTTTAGTAGT
TATACACTTCTGCTTAGGA
TGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTT
CCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgcagtcagttctcatg
cattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattata
agctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
ATTAGGAATGG
AACACTTTATAGTTTTTTTTGGACAAAAGATCTAGCTAAAATATAAGATTGAATAATTGAAA
ATATTAACATTTTAAGTTAAATCTTACCCACTCAATACAATTTGGTAATTTGTATCAGAAGCT
TAAAAGATAACCTAATAGTTCTTCTACTTCTATAACTTACCCAAATATGTTTGCAGAGATCTT
ATGTAAAGCTCTTCATTATAACACTGCTTTCAGGAGCCAAAAATTGGGTGGGGGAGCCCCAT
AAATGTTGAATAATAGGGGTTTGATTAGATAAATTTTGGTGTAGTTCTATAATGGCGTGTTA
TTCAGCCAATAAAAGGTTTGTTAAAGAATGACTGTGACGGATGTATATGATATACTCTTAAG
TGAATAAAGAGTTACAAAATGTTATGTACAAGTTACAAAATGTATGTACATTATGATCCATT
TTTCATAAAATCATATGTATGTATATATGTGTGTCTGGAAGGATAAATTTATCGGTAACCAC
TCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGC
CCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 19 and SEQ ID NO: 20, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 3821
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTTTAGAGATTAGGTCTTACT
CTGTCACCCAGGCTGAACTTCAGTGGTGTGATCATAGCTCACTGTAACCTTGAACTCCTGGG
CTCAATTGACCTTTCCGCTTCAGCCTCCCAAAGTGCTGGGTTTATAGGCATGAGCCACTGTGT
CTGGTCCAATATGCATATATATATTTTTAACCTGGATTATCAGAGCTATATTGTGTTTAGGTT
TATAAAGCTGTACTATGTGAAAATATCACTTCTAGGTTTAATTTTGTACAAAGGAATTTTATA
TAGAAATGAGGTAATTCAGATTTTTTCCCATGTAATAAGAATTGTAAAATTTACTGAAACAA
ACATCAAAAAGATATCTGTTACATGACCTTCCTTTCTTTTGAATATATTTCAGGTGATATTAT
TTATTAAAATTTAAAAATGAAAATTAAAATATATAAAAAGTTGAAAATTATTCCTTTCTTTA
CTGTCTCTCATCTGTCCATTTTCCATTCTCCTGCATTCCCTCA
TATACACTTCTGCTTAGGATGAT
AATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTTCCAG
tgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattataagctgc
aataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggttttttTCCAACCAAGGTAGC
CAATCCAGGTAACTTTTTTTAGTATCTTCCCAGAGATGTTTCTCTCTATATATATAATCAATA
TACATTTTTTATTATTCCCCACCTCTCTTTTTATGTAACAATATGCAGAGTTTTGCTTCTTGCT
TTTCCCACTATCTTGGACAACTTTCCATATTCAAAGCACAGAGGACTTGCACATATGTTCAG
ACTGCTGAATATTTCTGTCTCTCCCCTGCCATTCATATGTTGAAATCCTAATTCCCAAGGTGA
TGGTATTGCAGGGTGGGGCCTTTGGGAGGTGATTAGTCCATGAGGGTGAAGTCTTTAGTAAA
TGAGATTAGTGTCTTTATAAAAGAAACCTTAGAGAGACCCTCACACCTTAGAGAGACCCTCA
CCCCTTTCTGCCATGTGAGAACACAGCAGGAAGACAGCTGGCTATCCAGGATTCAGGAGTCT
CTTAGCAGACCCAAATCTGCTGGCACCTTGATCTTGGACTTCCCAGCGGTAACCACGTGCGG
TCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGG
CTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 21 and SEQ ID NO: 22, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 4262
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTTTATTCCTTTCTTTACTGTC
TCTCATCTGTCCATTTTCCATTCTCCTGCATTCCCTCATCCAACCAAGGTAGCCAATCCAGGT
AACTTTTTTTAGTATCTTCCCAGAGATGTTTCTCTCTATATATATAATCAATATACATTTTTTA
TTATTCCCCACCTCTCTTTTTATGTAACAATATGCAGAGTTTTGCTTCTTGCTTTTCCCACTAT
CTTGGACAACTTTCCATATTCAAAGCACAGAGGACTTGCACATATGTTCAGACTGCTGAATA
TTTCTGTCTCTCCCCTGCCATTCATATGTTGAAATCCTAATTCCCAAGGTGATGGTATTGCAG
GGTGGGGCCTTTGGGAGGTGATTAGTCCATGAGGGTGAAGTCTTTAGTAAATGAGATTAGTG
TCTTTATAAAAGAAACCTTAGAGAGACCCTCACACCTTAGAGAGACCCTCACCCCTTTCTGC
CATGTGAGAACACAGCAGGAAGACAGCTGGCTATCCAGGA
TATACACTTCTGCTTAGGATGAT
AATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTTCCAG
tgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattataagctgc
aataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
TTCAGGAGTCTCTTA
GCAGACCCAAATCTGCTGGCACCTTGATCTTGGACTTCCCAGCCTCCAGAACTGTGAGAAAT
AAATTCCTGTTGTTTATAAGCCACACAGTTCATGGTATTTTGTTATAGCAGCCTGAACAAGG
ACACACACACACACACACACACATGCACACACATTTAAATAGATGCATAGTATTCTATCATA
TGGATGGATATTCTATGATATAATGAATCACTATTGATTGACATTTGGGTTGTTTCCAATATT
TTGTTAACACAAAGAACAACACTACAAATAACTTTATATACATATCATTTAGCACATCTGCA
ATTGTATCAGTAGGCTTCCTATAAGTGGTCAAGCATTTGTGTACTTGTGATTTTGGTAGATGT
TGTCAAATGTCCTTCCCTGAAATTTGTACCAATTCGTACTCATGCCATACACTCTAAATAGAG
TGCTGATTTCCCCACAGCATTACTAACAGATGATATTATCTAATTTAAGGTAACCACGTGCG
CTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGG
GCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 23 and SEQ ID NO: 24, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 5041
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTACAAAGAACAACACTACA
AATAACTTTATATACATATCATTTAGCACATCTGCAATTGTATCAGTAGGCTTCCTATAAGTG
GTCAAGCATTTGTGTACTTGTGATTTTGGTAGATGTTGTCAAATGTCCTTCCCTGAAATTTGT
ACCAATTCGTACTCATGCCATACACTCTAAATAGAGTGCTGATTTCCCCACAGCATTACTAA
CAGATGATATTATCTAATTTAAAAAGTTTCTCATCTTATAGGGAAAATAGTATGTCAATGTA
TTCTTAACTTGCATTTCTTTTATTATAAGTAGTGTAAAATATCATTTCAACTTATACACAGGA
GGAATTTCTCTCTATATAAAGTGATCCTAGAATCATAATGAAAAATATCACCAACTCATTAG
GAAAATGTACAAAGGATTGAATAGATATCTCATCAAAAATAAAAATATAAGTGGCCTTTAA
ACATTGAAAGGTAACATTTGAACAAAGACTTGCAGGAGGTGAGGGA
TATACACTTCTGCTTAG
GATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTAT
TTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgcagtcagttctca
cattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattata
agctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
TTAGGGAATGC
AGACTCTGGGAAGAGTCTTCCAAGTAGCAGGTGAAGCAAGTGCAAAGCTTTCAGATGGGAC
TGACTATACCTGTCTGGTTTGAAGAACAGTAAGGAGGTCACTGAGGCTGGCATAGAGTAAG
ACAGGGAGGGTAGAATACTGTCAGAGAAGTAATCGGCGGTGGAGGTAGGGGGTAAACCAT
AAAGTGCTCGTAAAGACTAAGGCTTATTTCTCTGGGTGAGATTAGAGGCCACTGGAGAGTTT
TAAACAGAAGTAACAGGGCCACTTTGGCTAATGTTTTTAGGCTATTCTGTAGGGAGACAAGG
GAGGAAGCAAGGAGATGAGTTAGGAGTCTATTGTGCCAGTTCAGGCAAGTGATGATGGTGG
CTTGATCCAGGTAGTAGTGGAAGTAGTATAGTAGGAAGTGATCAGATTCAGGACATGCTTTG
AAGGAAGATCCAATAGGATTAATGGATAAGTTGAACAATGGCATATGAGAAAAGTCACAGG
TTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCG
CCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCC
TGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 25 and SEQ ID NO: 345, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 5052
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTAATAACTTTATATACATAT
CATTTAGCACATCTGCAATTGTATCAGTAGGCTTCCTATAAGTGGTCAAGCATTTGTGTACTT
GTGATTTTGGTAGATGTTGTCAAATGTCCTTCCCTGAAATTTGTACCAATTCGTACTCATGCC
ATACACTCTAAATAGAGTGCTGATTTCCCCACAGCATTACTAACAGATGATATTATCTAATT
TAAAAAGTTTCTCATCTTATAGGGAAAATAGTATGTCAATGTATTCTTAACTTGCATTTCTTT
TATTATAAGTAGTGTAAAATATCATTTCAACTTATACACAGGAGGAATTTCTCTCTATATAA
AGTGATCCTAGAATCATAATGAAAAATATCACCAACTCATTAGGAAAATGTACAAAGGATT
GAATAGATATCTCATCAAAAATAAAAATATAAGTGGCCTTTAAACATTGAAAGGTAACATTT
GAACAAAGACTTGCAGGAGGTGAGGGATTAGGGAATGCAGACTCT
TATACACTTCTGCTTAG
GATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTAT
TTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgcagtcagttctca
cattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattata
agctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
GGGAAGAGTCT
TCCAAGTAGCAGGTGAAGCAAGTGCAAAGCTTTCAGATGGGACTGACTATACCTGTCTGGTT
TGAAGAACAGTAAGGAGGTCACTGAGGCTGGCATAGAGTAAGACAGGGAGGGTAGAATAC
TGTCAGAGAAGTAATCGGCGGTGGAGGTAGGGGGTAAACCATAAAGTGCTCGTAAAGACTA
AGGCTTATTTCTCTGGGTGAGATTAGAGGCCACTGGAGAGTTTTAAACAGAAGTAACAGGG
CCACTTTGGCTAATGTTTTTAGGCTATTCTGTAGGGAGACAAGGGAGGAAGCAAGGAGATG
AGTTAGGAGTCTATTGTGCCAGTTCAGGCAAGTGATGATGGTGGCTTGATCCAGGTAGTAGT
GGAAGTAGTATAGTAGGAAGTGATCAGATTCAGGACATGCTTTGAAGGAAGATCCAATAGG
ATTAATGGATAAGTTGAACAATGGCATATGAGAAAAGTCACAGAGGAGTCAAAGATGATTC
GTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTC
GCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGC
CTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 346 and SEQ ID NO: 347, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 5278
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTTAGGGAAAATAGTATGTCA
ATGTATTCTTAACTTGCATTTCTTTTATTATAAGTAGTGTAAAATATCATTTCAACTTATACA
CAGGAGGAATTTCTCTCTATATAAAGTGATCCTAGAATCATAATGAAAAATATCACCAACTC
ATTAGGAAAATGTACAAAGGATTGAATAGATATCTCATCAAAAATAAAAATATAAGTGGCC
TTTAAACATTGAAAGGTAACATTTGAACAAAGACTTGCAGGAGGTGAGGGATTAGGGAATG
CAGACTCTGGGAAGAGTCTTCCAAGTAGCAGGTGAAGCAAGTGCAAAGCTTTCAGATGGGA
CTGACTATACCTGTCTGGTTTGAAGAACAGTAAGGAGGTCACTGAGGCTGGCATAGAGTAA
GACAGGGAGGGTAGAATACTGTCAGAGAAGTAATCGGCGGTGGAGGTAGGGGGTAAACCA
TAAAGTGCTCGTAAAGACTAAGGCTTATTTCTCTGGGTGAGATTAGAGGCCA
TATACACTTCT
GCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGG
GTTTTATTTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgca
tgataagatacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgt
aaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
CTGG
AGAGTTTTAAACAGAAGTAACAGGGCCACTTTGGCTAATGTTTTTAGGCTATTCTGTAGGGA
GACAAGGGAGGAAGCAAGGAGATGAGTTAGGAGTCTATTGTGCCAGTTCAGGCAAGTGATG
ATGGTGGCTTGATCCAGGTAGTAGTGGAAGTAGTATAGTAGGAAGTGATCAGATTCAGGAC
ATGCTTTGAAGGAAGATCCAATAGGATTAATGGATAAGTTGAACAATGGCATATGAGAAAA
GTCACAGAGGAGTCAAAGATGATTCCAAGCTTTCTGGACTGAGTAACTGGAAGGATAAATG
TGCCGTTTACTAGAAAGATAATGGGAGAAACAGGTTTTGGATGGAGCTTGGTTTGGGAATAT
TAAGTTTGAAATGCCTATTTGACATCCAAATAGAGATGTTAGTTGGATGTACAAGTCTAGTT
TCAAGGAAGAGGGGGCTGGTAGTGTGAAGATGGGGCTGGATAAGATTCTAAAGGAAAGAG
GGTTGAGGTAACCACGTGCGGACCGAGGCTGCAGCGTCGTCCTCCCTAGGAACCCCTAGTG
ATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAA
AGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCA
GCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 348 and SEQ ID NO: 349, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 5343
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTTATCATTTCAACTTATACAC
AGGAGGAATTTCTCTCTATATAAAGTGATCCTAGAATCATAATGAAAAATATCACCAACTCA
TTAGGAAAATGTACAAAGGATTGAATAGATATCTCATCAAAAATAAAAATATAAGTGGCCT
TTAAACATTGAAAGGTAACATTTGAACAAAGACTTGCAGGAGGTGAGGGATTAGGGAATGC
AGACTCTGGGAAGAGTCTTCCAAGTAGCAGGTGAAGCAAGTGCAAAGCTTTCAGATGGGAC
TGACTATACCTGTCTGGTTTGAAGAACAGTAAGGAGGTCACTGAGGCTGGCATAGAGTAAG
ACAGGGAGGGTAGAATACTGTCAGAGAAGTAATCGGCGGTGGAGGTAGGGGGTAAACCAT
AAAGTGCTCGTAAAGACTAAGGCTTATTTCTCTGGGTGAGATTAGAGGCCACTGGAGAGTTT
TAAACAGAAGTAACAGGGCCACTTTGGCTAATGTTTTTAGGCTATTCTGTAG
TATACACTTCT
GCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGG
GTTTTATTTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgca
tgataagatacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaattttgatgctattgctttatttgt
aaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
GGAG
ACAAGGGAGGAAGCAAGGAGATGAGTTAGGAGTCTATTGTGCCAGTTCAGGCAAGTGATGA
TGGTGGCTTGATCCAGGTAGTAGTGGAAGTAGTATAGTAGGAAGTGATCAGATTCAGGACA
TGCTTTGAAGGAAGATCCAATAGGATTAATGGATAAGTTGAACAATGGCATATGAGAAAAG
TCACAGAGGAGTCAAAGATGATTCCAAGCTTTCTGGACTGAGTAACTGGAAGGATAAATGT
GCCGTTTACTAGAAAGATAATGGGAGAAACAGGTTTTGGATGGAGCTTGGTTTGGGAATATT
AAGTTTGAAATGCCTATTTGACATCCAAATAGAGATGTTAGTTGGATGTACAAGTCTAGTTT
CAAGGAAGAGGGGGCTGGTAGTGTGAAGATGGGGCTGGATAAGATTCTAAAGGAAAGAGG
GTTGATAAGAAGAGAAAGGGGTGTAGGGGTTAGCCTAAGGGCATTCTAAGTATTAGAGGTT
AAGGAGGGGTAACCACGTGCGGACCGAGGCTGCAGCGTCGTCCTCCCTAGGAACCCCTAG
TGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACC
AAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCG
CAGCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 350 and SEQ ID NO: 351, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 5538
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTGGATTAGGGAATGCAGACT
CTGGGAAGAGTCTTCCAAGTAGCAGGTGAAGCAAGTGCAAAGCTTTCAGATGGGACTGACT
ATACCTGTCTGGTTTGAAGAACAGTAAGGAGGTCACTGAGGCTGGCATAGAGTAAGACAGG
GAGGGTAGAATACTGTCAGAGAAGTAATCGGCGGTGGAGGTAGGGGGTAAACCATAAAGT
GCTCGTAAAGACTAAGGCTTATTTCTCTGGGTGAGATTAGAGGCCACTGGAGAGTTTTAAAC
AGAAGTAACAGGGCCACTTTGGCTAATGTTTTTAGGCTATTCTGTAGGGAGACAAGGGAGG
AAGCAAGGAGATGAGTTAGGAGTCTATTGTGCCAGTTCAGGCAAGTGATGATGGTGGCTTG
ATCCAGGTAGTAGTGGAAGTAGTATAGTAGGAAGTGATCAGATTCAGGACATGCTTTGAAG
GAAGATCCAATAGGATTAATGGATAAGTTGAACAATGGCATATGAGAAAAGTCA
TATACACT
TCTGCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGAT
GGGTTTTATTTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttcttttt
gacatgataagatacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgcttta
tttgtaaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
CA
GAGGAGTCAAAGATGATTCCAAGCTTTCTGGACTGAGTAACTGGAAGGATAAATGTGCCGT
TTACTAGAAAGATAATGGGAGAAACAGGTTTTGGATGGAGCTTGGTTTGGGAATATTAAGTT
TGAAATGCCTATTTGACATCCAAATAGAGATGTTAGTTGGATGTACAAGTCTAGTTTCAAGG
AAGAGGGGGCTGGTAGTGTGAAGATGGGGCTGGATAAGATTCTAAAGGAAAGAGGGTTGA
TAAGAAGAGAAAGGGGTGTAGGGGTTAGCCTAAGGGCATTCTAAGTATTAGAGGTTAAGGA
GGTGGGTGAAGAAAACCCAATAAAATAAAAGTCTGAGAAGACAAAGCTAGTGAATGAATG
TGGTATCCCGGAACCCAACTGATGTCAAGCAGAAGGGTGTTATCAACTAGGTCAAATGCTC
ATTCATCAAGTAAGATGAAACTGTTATAATTAACCGGTGTCTTCTGAAATACGGAGATAACT
CGTGACTTAGGTAACCACGTGCGGACCGAGGCTGCAGCGTCGTCCTCCCTAGGAACCCCTA
GTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGAC
CAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGC
GCAGCTGCCTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 352 and SEQ ID NO: 353, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
CFTR Intron 10 Target Site 6150
CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGC
GTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAG
TGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCACGCGTGGAATATTAAGTTTGAAAT
GCCTATTTGACATCCAAATAGAGATGTTAGTTGGATGTACAAGTCTAGTTTCAAGGAAGAGG
GGGCTGGTAGTGTGAAGATGGGGCTGGATAAGATTCTAAAGGAAAGAGGGTTGATAAGAA
GAGAAAGGGGTGTAGGGGTTAGCCTAAGGGCATTCTAAGTATTAGAGGTTAAGGAGGTGGG
TGAAGAAAACCCAATAAAATAAAAGTCTGAGAAGACAAAGCTAGTGAATGAATGTGGTATC
CCGGAACCCAACTGATGTCAAGCAGAAGGGTGTTATCAACTAGGTCAAATGCTCATTCATCA
AGTAAGATGAAACTGTTATAATTAACCGGTGTCTTCTGAAATACGGAGATAACTCGTGACTT
AATGAAAGCAATAGTAGAGAAGGTCAAACTTGACCAGAATGAAATTAGAAAGAATAAGAG
GAAAGAAAAGACCAAATACAGACAACCATTGATGCCTTATTCTTTTGATATAC
TATACACTTC
TGCTTAGGATGATAATTGGAGGCAAGTGAATCCTGAGCGTGATTTGATAATGACCTAATAATGATGG
GTTTTATTTCCAGacttctctgctgatggtcatcatgggcgagctggaacccagtgaggggaagatcaaacactcaggacggatttctttttgca
tgataagatacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgt
aaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggtttttt
TCCTG
GAGTCCACTTGCTAATACAATTGACCCTTAAACAATACAGGCTTGAACTGCATGGGTCCACT
TATTTGTGAATTTTTTTTCAGTTAATACATTGGAAAATTTTTGGGGTTTTTTGACAATTTGAA
AAAACTCACAAACTGTCTAGCCTAGAAATACCGAGAAAATTAAGAAAAAGTAAGATATGCC
ATGAATGCATAAAATATATGTAGACACTAGCCTATTTTATCATTTGCTACTATAAAATATAC
ACAATCTATTATAAAAAGTTAAAATTTATCAAAACTTAACACACACTAACACCTACCCTACC
TGGCACCATTCACAGTAAAGAGAAATGTAAATAAACATAAAAATGTAGTATTAAACCATAA
TGGCATAAAACTAATTGTAGTACATATGGTACTACTGTAATAATTTGGAAGCCACTTCCTGT
TGCTATTACGGTAAGCTCAAGCATTGTGGATAGCCATTTAAAACACCACGTGATGCTAATCA
GTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTC
GCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGC
CTGCAGG.
The uppercase boldface letters correspond to sequence comprising the AAV ITRs (SEQ ID NO: 15 for the 5′ end ITR and SEQ ID NO: 16 for the 3′ ITR); the uppercase underlined letters correspond to the nucleotide sequence comprising the 5′ and 3′ homology arms (SEQ ID NO: 354 and SEQ ID NO: 355, respectively); the uppercase italicized letters correspond to the nucleotide sequence comprising the splice site acceptor (SEQ ID NO: 1); the lowercase letters (non-boldface) correspond to the nucleotide sequence comprising CFTR exons 11-27 (SEQ ID NO: 37); and the lowercase boldface letters correspond to the nucleotide sequence comprises 3′UTR elements (SEQ ID NO: 159).
Results
Design of dual AAV gene-editing systems for correcting mutations of the CFTR gene:
Identification of effective saCAS9-gRNAs gene-editing complexes in CFTR intron 10: Lung progenitor cells (LPCs) were electroporated with saCAS9 mRNA together with a sgRNA targeting a site located in CFTR intron (see
LPCs derived from two Cystic Fibrosis donors (14071 and 14335) were then electroporated with saCAS9 mRNA together with a gRNA targeting one of the 10 candidate CFTR intron 10 target sites. Indel rates were determined by using TIDE assay 72 h after electroporation. There were no significant differences in Indel rates between LPC donors for each of the 10 candidate sgRNA target sites (
Determination of Indel pattern consistency of the 10 candidate sgRNA target site: Indel patterns for the 10 candidate sgRNA target sites (identified and labeled in
Determination of the rates of CFTR super-exon insertion by HDR in LPC: LPCs were electroporated with saCAS9 mRNA together with an sgRNA targeting one of the 10 candidate target sites. LPC cells were seeded with media containing CFTR super-exon AAV vectors. Homology dependent recombination (HDR) rates were measured by ddPCR after 5 days of treatment in LPCs and after 5 weeks of LPC differentiation into HBEs. CFTR function was measured in 5 weeks differentiated HBEs by Ussing assay (see
Determination of functional CFTR correction in HBEs derived from gene-edited LPCs: dF508/dF508 LPCs were electroporated with or without saCAS9 mRNA and an sgRNA targeting one of the 10 candidate target sites. LPC cells were seeded with media containing CFTR super-exon AAV vectors. Homology dependent recombination (HDR) rates were measured by ddPCR after 5 days of treatment in LPCs and after 5 weeks of LPC differentiation into HBEs. CFTR function was measured in 5 weeks differentiated HBEs by Ussing assay (see
All of the features disclosed in this specification may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature disclosed is only an example of a generic series of equivalent or similar features.
From the above description, one of skill in the art can easily ascertain the essential characteristics of the present disclosure, and without departing from the spirit and scope thereof, can make various changes and modifications of the disclosure to adapt it to various usages and conditions. Thus, other embodiments are also within the claims.
While several inventive embodiments have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the inventive embodiments described herein. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the inventive teachings is/are used. Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, many equivalents to the specific inventive embodiments described herein. It is, therefore, to be understood that the foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, inventive embodiments may be practiced otherwise than as specifically described and claimed. Inventive embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the inventive scope of the present disclosure.
All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which each is cited, which in some cases may encompass the entirety of the document.
The indefinite articles “a” and “an,” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean “at least one.”
The phrase “and/or,” as used herein in the specification and in the claims, should be understood to mean “either or both” of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with “and/or” should be construed in the same fashion, i.e., “one or more” of the elements so conjoined. Other elements may optionally be present other than the elements specifically identified by the “and/or” clause, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
As used herein in the specification and in the claims, “or” should be understood to have the same meaning as “and/or” as defined above. For example, when separating items in a list, “or” or “and/or” shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as “only one of” or “exactly one of,” or, when used in the claims, “consisting of,” will refer to the inclusion of exactly one element of a number or list of elements. In general, the term “or” as used herein shall only be interpreted as indicating exclusive alternatives (i.e., “one or the other but not both”) when preceded by terms of exclusivity, such as “either,” “one of,” “only one of,” or “exactly one of.” “Consisting essentially of,” when used in the claims, shall have its ordinary meaning as used in the field of patent law.
As used herein in the specification and in the claims, the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, “at least one of A and B” (or, equivalently, “at least one of A or B,” or, equivalently “at least one of A and/or B”) can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
This application is a continuation of International Patent Application Serial No. PCT/US2019/064718, filed Dec. 5, 2019, which claims the benefit of priority under 35 U.S.C. § 119(e) of the filing date of U.S. Provisional Application Ser. No. 62/775,637, entitled “GENE-EDITING SYSTEMS FOR EDITING A CYSTIC FIBROSIS TRANSMEMBRANE REGULATOR (CFTR) GENE”, filed on Dec. 5, 2018; the entire contents of each of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
62775637 | Dec 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2019/064718 | Dec 2019 | US |
Child | 17339425 | US |