The present application is being filed along with a Sequence Listing in electronic format. The Sequence Listing file, entitled 2057_1077PCT_SL.txt, was created on Sep. 3, 2020, and is 6,461,592 bytes in size. The information in electronic format of the Sequence Listing is incorporated herein by reference in its entirety.
The present disclosure relates to composition, methods, and processes for the preparation, use, and/or formulation of adeno-associated virus (AAV) particles comprising at least one element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM).
Genetic engineering where DNA can be inserted, replaced, deleted or modified in the genome of a living organism is referred to as “genome editing,” “genome engineering,” or “gene editing”. Unlike early techniques that randomly inserted genetic material into a host genome, genome editing targets the insertions to site specific locations.
And while other genome editing technologies such as zinc finger (ZFNs), transcription activator-like effector nucleases (TALENs) and homing meganuclease have recently been utilized to edit DNA in hopes of repairing genomic defect, these suffer from lack of scalability, difficulty in individualized engineering and the expense of designing a unique complex for each locus to be altered. Additionally, work has to be done to define the optimized components of a system which will be effective in exploiting the cas9 CRISPR system as a tool for research, diagnostics and therapeutics with the ultimate goal of target-specific genome editing.
The present disclosure addresses the need for new technologies for treating genetic disorders caused by abnormalities in the genome whether heritable or acquired, monogenic or multifactorial, by providing AAV-based compositions and complexes which go beyond those of the art. Provided herein are recombinant adeno-associated virus particles having at least one element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM).
For delivery of VENOM elements, an AAV is used as the delivery modality for a nucleic acid sequence encoding the VENOM elements, or a fragment thereof, which results in in vivo expression of the encoded payload. The mechanism underlying the vectorized delivery of VENOM elements is thought to proceed through the following steps. First, the vector genome enters the cell via endocytosis, then escapes from the endosomal compartment and is transported to the nucleus wherein the vector genome is released and converted into a double-stranded episomal molecule of DNA by the host. The transcriptionally active episome results in the expression of encoded VENOM elements that may then be secreted from the cell into the circulation. Therefore, AAV delivery of VENOM elements may therefore enable continuous, sustained and long-term delivery by a single injection of AAV particles.
The present disclosure provides compositions, methods and processes for the design, preparation, manufacture and/or formulation of recombinant adeno-associated virus (AAV) particles having at least one element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM).
Provided herein are AAV particles comprising a capsid and a vector genome which has at least one Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM) element and at least one payload.
In some aspects, the VENOM element may comprise a polynucleotide encoding at least one guide RNA (gRNA) and a promoter, and the expression of the at least one guide gRNA may be driven by the promoter. The promoter may be an RNA polymerase III-dependent promoter.
In some aspects, the VENOM element may comprise a polynucleotide encoding a first guide RNA, a second guide RNA, a donor template and a promoter, and the expression of the first guide RNA, the second guide RNA and the donor template may be driven by the promoter. The promoter may be an RNA polymerase III-dependent promoter.
In some aspects, the VENOM element may comprise at least one VENOM element comprising a polynucleotide encoding an enzyme and a promoter, and the enzyme may be, but is not limited to, Cas9, Cas9 orthologue, Cpf1 (Cas12a), Cpf1 (Cas12a) orthologue, Cas13, Cas13 orthologue, Cas13b and Cas13b orthologue. The expression of the enzyme may be driven by the promoter such as, but not limited to, a ubiquitous promoter or tissue-specific promoter. As a non-limiting example, the promoter is a ubiquitous promoter. As another non-limiting example, the promoter is a tissue-specific promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by a tissue-specific promoter. The tissue-specific promoter may be, but is not limited to, a neuron specific promoter, a muscle specific promoter, a cardiac specific promoter, and a liver specific promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by a constitutive promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by an inducible promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by a tissue-specific promoter. The tissue-specific promoter may be, but is not limited to, a neuron specific promoter, a muscle specific promoter, a cardiac specific promoter, and a liver specific promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by a constitutive promoter.
In some aspects, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by an inducible promoter.
In some aspects, the VENOM element may comprise at least one VENOM element comprising a polynucleotide encoding an enzyme and a promoter, and the enzyme may be, but is not limited to, Cas9, Cas9 orthologue, Cpf1 (Cas12a) and Cpf1 (Cas12a) orthologue, Cas13, Cas13 orthologue, Cas13b and Cas13b orthologue and further encodes at least one gRNA. The expression of the at least one gRNA may be driven by an RNA polymerase III-dependent promoter.
Also provided herein are methods for modulating the expression level or the sequence of a gene in a cell from a subject by contacting the cell with an effective amount of at least one AAV particle described herein. The expression level of the gene may be suppressed or stimulated.
In some aspects, the modulation of the sequence of the gene comprises correcting one or more mutations of the gene, a frameshift mutation which causes a premature stop codon or a truncated gene product, a disrupted reading frame via gene deletion, an aberrant splice acceptor site, or an aberrant splice donor site.
In some aspects, the modulation of the sequence of the gene comprises correcting one or more point-mutations of the gene.
In some aspects, the modulation of the sequence of the gene comprises contacting the cell with the AAV particle and a DNA donor, wherein the AAV particle comprises a viral genome encoding a first gRNA, a second gRNA, a DNA donor, a first promoter, a second promoter, a third promoter, wherein the expression of the first gRNA is controlled by the first promoter, the expression of the second gRNA is controlled by the second promoter, wherein the donor DNA comprises a single-strand oligonucleotide of 100-200 bp. The cell may be, but is not limited to, a neuronal cell, a neural stem cell, an astrocyte, an oligodendrocyte, a microglia cell, a retinal cell, a tumor cell, a hematopoietic stem cell, an insulin producing beta cell, a lung epithelium cell, an endothelial cell, a liver cell, a skeletal muscle cell, a muscle stem cell, a muscle satellite cell, or a cardiac muscle cell
In some aspects, the modulation of the sequence of the gene comprises inserting the full-length sequence or fragment of the gene.
In some aspects, modulation of the sequence of the gene comprises inserting a fragment of gene, wherein the fragment may be 1-500, 501-1000, or 1001-2000 nucleotides of the gene.
In some aspects, the modulation of the sequence of the gene comprises contacting the cell with the AAV particle comprising a viral genome encoding a first gRNA, a second gRNA, a DNA donor, a first promoter, a second promoter, and a third promoter. The expression of the first gRNA may be driven by the first promoter, the expression of the second gRNA may be driven by the second promoter, and the expression of the DNA donor may be driven by the third promoter.
In some aspects, the repair of the sequence is by homology-directed repair wherein the full-length or fragment of the gene is deleted. The fragment deleted may be 1-500, 501-1000, or 1001-2000 nucleotides of the gene.
In some aspects, the modulation of the sequence of the gene comprises contacting the cell with the AAV particle comprising a viral genome encoding a first gRNA, a second gRNA, a first promoter, and a second promoter. The expression of the first gRNA may be controlled by the first promoter, and the expression of the second gRNA may be controlled by the second promoter.
In some aspects, the modulation of the sequence of the gene is by an indel mutation and comprises contacting the cell with the AAV particle comprising a viral genome encoding one or more gRNA.
In some aspects, the repair of the sequence is by homologous end joining repair.
In some aspects, the AAV particles and methods described herein may be used to treat a disease, disorder and/or condition. As a non-limiting example, the disease, disorder and/or condition is a neurological disease such as, but not limited to, Parkinson's disease, Friedreich's Ataxia, Amyotrophic lateral sclerosis (ALS), Huntington's disease, and Spinal muscular atrophy (SMA). As a non-limiting example, the disease, disorder and/or condition is a muscular disease such as, but not limited to, Duchenne muscular disease. As a non-limiting example, the disease, disorder and/or condition is cancer.
The present disclosure relates to compositions, methods and processes for the design, preparation, manufacture and/or formulation of recombinant adeno-associated virus (AAV) particles having at least one element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM) (herein referred to as “VENOM elements”) and methods of using the same. AAV particles comprising at least one VENOM element may be referred to herein as VENOM AAV Particles (herein “VAP”).
As used herein, an “AAV particle” is a virus which comprises a vector genome with at least one payload region and at least one inverted terminal repeat (ITR) region. The AAV particle and/or its component capsid and vector genome may be engineered to alter tropism to a particular cell-type, tissue, organ or organism.
Vectors of the present disclosure may be produced recombinantly and may be based on adeno-associated virus (AAV) parent or reference sequences. As used herein, a “vector” is any molecule or moiety which transports, transduces, or otherwise acts as a carrier of a heterologous molecule such as the nucleic acids described herein.
As used herein, “vector genome” or “viral genome” refers to the nucleic acid sequence(s) encapsulated in an AAV particle. A vector genome comprises a nucleic acid sequence with at least one payload region encoding a payload and at least one ITR.
As used herein, a “payload” or “payload region” is any nucleic acid molecule which encodes one or more polypeptides. At a minimum, a payload region comprises nucleic acid sequences that encode an element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM), or a fragment thereof, but may also optionally comprise one or more functional or regulatory elements to facilitate transcriptional expression and/or polypeptide translation.
The nucleic acid sequences and polypeptides disclosed herein may be engineered to contain a payload region with at least one element to modify the expression of overt mutations in a gene of interest. In some embodiments, the nucleic acid sequence comprising the payload region may comprise one or more of a promoter region, an intron, a Kozak sequence, an enhancer, or a polyadenylation sequence. The payload regions may be delivered to one or more target cells, tissues, organs, or organisms within the vector genome of an AAV particle and/or a VAP.
In some embodiments, the AAV particle and/or VAP comprises a VENOM element and a “tunable element.” As used herein, a “tunable element” or a “regulatable element” can impart regulatable or tunable feature(s) to the viral genome encoding them. In some embodiments, the VENOM element and tunable elements are the same element. In some embodiments, the VENOM element and the tunable element are different elements in the AAV particle and/or VAP.
In some embodiments, VENOM elements may be used to introduce mutations into a gene as a protection against a disease, disorder, and/or condition.
In some embodiments, the VENOM element is a CRISPR genome editing element. As used herein, the term “CRISPR genome editing element” refers to any component/element involved in directing the activity of CRISPR-associated (“Cas”) proteins to alter the genome in a cell.
The CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas system is widely found in bacterial and archaeal genomes as a defense mechanism against invading viruses and mobile genetic elements such as bacteriophages and plasmids, which sometimes is called RNA-mediated adaptive immune system. A CRISPR locus in a bacterial genome is a DNA region with an array of short identically repeated sequences of generally 21-37 base pairs, separated by spacers with unique sequences of generally 20-40 base pairs. The CRISPR locus can be found on both chromosomal and plasmid DNA. The spacers are often derived from nucleic acid of invading viruses and plasmids, which can be used as recognition elements to find matching virus genomes or plasmid sequences and destroy them as part of defense system. CRISPR activity requires the presence of a set of CRISPR associated (cas) genes, which are usually found adjacent to the CRISPR array and encode CRISPR associated (Cas) proteins with a variety of predicted nucleic acid-manipulating activities such as nucleases, helicases and polymerases. The CRISPR-Cas system targets DNA and/or RNA as a way of protecting against viruses and other mobile genetic elements and can be developed for programmable genetic editing.
In bacteria, the CRISPR-Cas system relies on the activity of short mature CRISPR RNAs (crRNAs) that guide Cas proteins (e.g. Cas9) to silence invading nucleic acids. The crRNRs are processed from the DNA sequences clustered in the CRISPR array. The CRISPR array transcript, the precursor CRISPR RNA (pre-crRNA), is processed into individual short mature crRNAs. In the CRISPR-Cas type II system, a small, non-coding RNA molecule called trans-activating crRNA (tracrRNA), which has a sequence complementary to the identical repeat in the CRISPR, is required for crRNA maturation. The tracrRNA is encoded in the vicinity of the cas genes and CRISPR repeat-spacer array. Following the hybridization of tracrRNA to the short identical repeat in the pre-crRNA, the bacterial double-stranded RNA specific endoribonuclease, RNase III, processes/cleaves the pre-crRNA transcript to generate a dual-tracrRNA:crRNA that guides the CRISPR-associated endonuclease Cas9 (Csn1) to cleave site-specifically cognate target DNA.
In bacteria, CRISPR-Cas immunity operates in three steps with the principle that an intruder once memorized by the system will be remembered and silenced upon a repeated infection. During the initial adaptation phase, a part of an invading nucleic acid sequence is incorporated as a new spacer within the repeat-spacer CRISPR array and the infection is thus memorized. During the expression phase, the CRISPR array is transcribed as a pre-crRNA molecule that undergoes processing to generate short mature crRNAs, each complementary to a unique invader sequence. During the interference phase, the individual crRNAs guide Cas protein(s) to cleave the cognate invading nucleic acids in a sequence-specific manner for their ultimate destruction.
The CRISPR-Cas systems have recently been classified into three distinct types (I-III). Types I and III share some common features, with crRNAs and Cas proteins being the only known components required for the steps of expression and interference. In both types I and III, the mature crRNAs guide a complex of several Cas proteins to the cognate-invading nucleic acids and a Cas endonuclease of the ribonucleoprotein complex cleaves the target nucleic acids. Type II CRISPR-Cas has evolved distinct pre-crRNA processing and interference mechanisms, as described above. Pre-crRNA processing requires base pairing of every pre-crRNA repeat with a tracrRNA. The tracrRNA:crRNA duplex forms a ternary silencing complex in the presence of Cas9, the endonuclease of Type II CRISPR-Cas system.
Though naturally being identified as a defense mechanism in prokaryotes, the CRISPR-Cas system (e.g., the Type 11 CRISPR-Cas9 system) has been developed as a RNA-guided DNA targeting platform. It has been widely studied for the use of genomic editing and transcription modulation in eukaryotic cells, and has shown great potential in correcting mutations in human genetic diseases. As learned from the bacterial system, two distinct components are required in CRISPR-Cas based genome editing: (1) a guide RNA molecule, such as a tracrRNA:crRNA duplex and (2) an endonuclease, such as Cas9. When the guide RNA molecule and Cas9 are expressed in a cell, the guide RNA/Cas9 complex is recruited to the target sequence by the base-pairing between the guide RNA sequence and the target sequence in the genomic DNA. The recruited Cas9 cuts both strands of DNA causing a Double Strand Break (DSB). For successful location of Cas9 to the target sequence, the genomic target sequence must also contain the correct Protospacer Adjacent Motif (PAM) sequence immediately following the target sequence. Cas9 cuts 3-4 nucleotides upstream of the PAM sequence. A DSB can be repaired through one of two general repair pathways: (1) the Non-Homologous End Joining (NHEJ) DNA repair pathway, or (2) the Homology Directed Repair (HDR) pathway. The NHEJ repair pathway often results in inserts/deletions at the DSB site that can lead to frameshifts and/or premature stop codons, effectively disrupting the open reading frame (ORF) of the targeted gene. The HDR pathway requires the presence of a repair template, which is used to fix the DSB. HDR faithfully copies the sequence of the repair template to the cut target sequence. Specific nucleotide changes can be introduced into a targeted gene using HDR with a repair template.
The CRISPR-Cas9 system has been used as a tool to manipulate the genome in mammalian cells (i.e. eukaryotes). Vu et al (Wu et al., Science, 2013, 339: 819-823) and Mali et al (Mali et al., Science. 2013, 339: 823-826) first demonstrated that expressing a codon-optimized Cas9 protein and a guide RNA leads to efficient cleavage and short insertion/deletion of target loci, which could inactivate protein-coding genes by inducing frameshifts and/or creating premature stop codons. The CRISPR-cas system can be used to simultaneously edit more than one genes by delivering multiple guide RNAs (Yang et al., Cell, 2013, 154:1370-1379; and Jao et al., Proc Natl Acad Sci USA. 2013, 110:13904-13909); to introduce deletions and inversions of regions ranging from 100 bps to 1000000 bps on a chromosome (Xiao et al., Nucleic Acids Res., 2013, 41:e141; and Canver et al., J Biol Chem., 2014, 289(31): 21312-21324); to engineer chromosomal translocations between different chromosomes (Torres et al., Nat Commun. 2014, 5: 3964); to correct mutations in disease genes (Yin et al., Nat Biotechnol. 2014, 32(6): 551-553; Schwank et al., Cell Stem Cell. 2013, 13: 653-658; and Wu et al., Cell Stem Cell. 2013, 13: 659-662); and to introduce specific nucleotide modifications at the target sequence in combination with a DNA repair template containing the desired sequence but having a high degree of homology to the target sequence, for example, by cotransfection of the template DNA into the cell along with the guide RNA/Cas9 complex.
The CRISPR-Cas system has also been adapted to label proteins by introducing specific sequences such as HA-tag (Auer et al., Genome Res. 2014: 24:142-153). Other expended applications of the CRISPR-Cas system include site-specific imaging of endogenous loci in living cells, by using a fluorescent marker (e.g., GFP). In addition to human and rodent cells, the system has also been adapted to many other species, including monkey, pig, rat, zebrafish, worm, yeast, and several plants.
The CRISPR-Cas system is a remarkably flexible tool for genome manipulation. One of the primary advantages of this technology is that the nuclease activity and the DNA-binding activity of Cas9 are discrete functions in the protein. The Cas9 nuclease activity (cutting) is performed by 2 separate domains, RuvC and HNH. Each domain cuts one strand of DNA and each can be inactivated by a single point mutation. In S. pyogenes, a Cas9 D10A mutant has an inactive RuvC domain (RuvC−) and an active HNH domain (HNH+) and a Cas9 H840A mutant has an inactive HNH domain (HNH−) and an active RuvC domain (RuvC+). When both domains are inactive (D10A and H840A, RuvC− and HNH−) the Cas9 protein has no nuclease activity (catalytically inactive) and is said to be ‘dead’ (dCas9): however, the inactive dCas9 still retains the ability to bind to DNA based on guide RNA specificity. Such dCas9 protein may be used as a platform to recruit other functional proteins to a target DNA sequence.
The CRISPR-cas system may also be used to modulate transcription in a genome by introducing sequence specific control of gene expression. In some applications, a catalytically inactive dCas9 can be generated by mutating the two nuclease domains of Cas9, which can bind DNA without introducing cleavage or mutation. When targeted to a promoter, the nuclease-null dCas9 binding alone can interfere with transcription initiation, likely by blocking binding of transcription factors or RNA polymerases. When targeted to the non-template strand within the gene, the dCas9 complex blocks RNA polymerase II transcription elongation (Jinek et al., Science. 2012, 337(6096): 816-821; Qi et al., Cell. 2013, 152:1173-1183; and Gilbert et al., Cell. 2013, 154: 442-451). In other applications, the inactive or nuclease-null cas9 may be fused with effector domains with distinct regulatory functions, such as transcription repressor domains (e.g., the Krueppel-associated box (KRAB)) to lead to stronger silencing of mammalian genes (Gilbert et al., Cell, 2013, 154(2): 442-451), or with activator domains (e.g., VP64) to activate transcriptions (Larson et al., Nat Protoc. 2013, 8: 2180-2196). Such guide RNA based methods are referred to as CRISPR interference (i.e. CRISPRi) (Qi et al., Cell, 2013, 152: 1173-1183; and Larson et al., Nature Protocols, 2013, 8: 2180-2196). Such RNA based methods for regulation of gene expression in a genome-wide scale are also known as CRISPR interference ((i.e., CRISPR interference, CRISPRi).
In addition to modulating gene expression, the fusion of dCas9 with other heterologous effector domains could enable many other applications. For example, one could fuse catalytically inactive dCas9 with chromatin modifiers to change the epigenetic state of a locus (Mali et al., Nat Methods. 2013, 10:957-963: and Sander et al., Nat Biotechnol. 2014, 32: 347-355). dCas9 fused to an epitope tag(s) can be used to purify genomic DNA bound by the guide RNA. Building on the well-established concept of ChIP (Chromatin Immunoprecipitation), researchers have created enChIP (engineered DNA-binding molecule-mediated ChIP) that allows for the purification of any genomic sequence specified by a particular guide RNA. By creating pooled libraries of guide RNAs, the CRISPR-Cas system can be used in powerful genomic screening techniques or for nucleic acid enrichment (See, e.g., US patent publication NO.: 20140356867, the contents of which is herein incorporated by reference in its entirety).
Safe delivery of CRISPR/Cas endonucleases remains one of the major barriers to the widespread application of in vivo genome editing as with this type of viral delivery system, active endonucleases will remain in the retina for an extended period, making genotoxicity a significant consideration in clinical applications. One way to address this issue is to include a self-destructing “kamikaze” CRISPR/Cas system from Li et al. that disrupts the Cas enzyme itself following expression (Hum Gene Ther. 2019 Aug. 2. doi: 10.1089/hum.2019.021; the contents of which are herein incorporated by reference in their entirety). Li et al. initially designed four guide RNAs (sgRNAs) to target Streptococcus pyogenes Cas9 (SpCas9) and the selected sgRNAs were cloned into a dual AAV vector genome. One vector genome delivered SpCas9 and the other delivered sgRNAs directed against SpCas9 and the target locus. Both constructs were packaged into an AAV particle and/or VAP particle and intravitreally administered. Li et al found that a reduction of SpCas9 mRNA when the dual approached was used as compared to CRISPR/Cas alone. In some embodiments, the AAV particles and/or VAPs described herein may include a self-destructive “kamikaze” CRISPR/Cas system.
In some embodiments, the VENOM elements are CRISPR/Cas9 and they may be used to introduce mutations into a gene as a protection against a disease, disorder, and/or condition.
In some embodiments, the VENOM elements encoded in the AAV particle are Cas9, a DNA-targeting RNA, a targeter-RNA that hybridizes with a target sequence of the target DNA molecule and an activator-RNA that hybridizes with the targeter-RNA to form a double-stranded RNA duplex as described in U.S. Pat. No. 10,227,611, the contents of which are herein incorporated by reference in their entirety. As a non-limiting example, the targeter-RNA includes the 12-nucleotide crRNA sequence as described in U.S. Pat. No. 10,227,611 as SEQ ID NO: 679. As a non-limiting example, the activator-RNA includes the 67-nucleotide tracrRNA sequence as described in U.S. Pat. No. 10,227,611 as SEQ ID NO: 432.
Currently the most commonly used RNA-guided endonuclease for genome editing in the CRISPR-Cas system is the Type II CRISPR associated (Cas) nuclease. Cas9. The Cas9 nuclease is a DNA endonuclease with two nuclease domains, namely, the N-terminal RuvC-like nuclease (RNAse H fold) and the HNH (McrA-like) nuclease domain that is located in the middle of the protein, each cleaving each of the two DNA strands. When both of these domains are active, the Cas9 protein causes double strand breaks (DSBs) in the genomic DNA. In the absence of a suitable repair template, the DSB is repaired by the Non-Homologous End Joining (NHEJ) DNA repair pathway. During NHEJ repair, InDels (insertions/deletions) may occur as a small number of nucleotides are either inserted or deleted at random at the DSB site.
Cas9 may be inactivated with both the functional domains mutated ((RuvC− and HNH−), generating a nuclease-null Cas9 (dCas9). Cas9 may also be modified as “nickase: a Cas9 protein containing a single inactive catalytic domain, either RuvC− or HNH−. With only one active nuclease domain, the Cas9 nickase cuts only one strand of the target DNA, creating a single-strand break or ‘nick’. Similar to the inactive dCas9, a Cas9 nickase is still able to bind DNA based on guide RNA specificity, though nickases will only cut one of the DNA strands. A single-strand break, or nick, is normally quickly repaired through the HDR pathway, using the intact complementary DNA strand as the template. Two proximal, opposite strand nicks introduced by a Cas9 nickase (often referred to as a ‘double nick’ or ‘dual nickase’ CRISPR system) are treated as a Double Strand Break (DSB), which can be repaired by either NHEJ or HDR depending on the desired effect on the gene target.
The most commonly used Cas9 is derived from Streptococcus pyogenes and the RuvC domain can be inactivated by a D10A mutation and the HNH domain can be inactivated by an H840A mutation.
In addition to Cas9 derived from S. pyogenes, other RNA guided endonucleases (RGEN) may also be used for programmable genome editing. Cas9 sequences have been identified in more than 600 bacterial strains. Though Cas9 family shows high diversity of amino acid sequences and protein sizes, all Cas9 proteins share a common architecture with a central HNH nuclease domain and a split RuvC/RHase H domain. Examples of Cas9 orthologs from other bacterial strains including but not limited to, Cas proteins identified in Acaryochloris marina MBIC 11017; Acetohalobium arabaticum DSM 5501; Acidithiobacillus caldus; Acidithiobacillus ferrooxidans ATCC 23270; Alicyclobacillus acidocaldarius LAA1; Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446; Allochromatium vinosum DSM 180; Ammonfex degensii KC4; Anabaena variabilis ATCC 29413; Arthrospira maxima CS-328; Arthrospira platensis str. Paraca; Arthrospira sp. PCC 8005; Bacillus pseudomycoides DSM 12442; Bacillus selenitireducens MLS10; Burkholderiales bacterium 1_1_47; Caldicellulosiruptor becscii DSM 6725; Candidatus Desulforudis audaxviator MP104C; Caldicellulosiruptor hydrothermalis_108; Clostridium phage c-st; Clostridium botulinum A3 str. Loch Maree; Clostridium botulinum Ba4 str. 657; Clostridium difficile QCD-63q42; Crocosphaera watsonii WH 8501; Cyanothece sp. ATCC 51142; Cyanothece sp. CCY0110; Cyanothece sp. PCC 7424; Cyanothece sp. PCC 7822; Exiguobacterium sibiricum 255-15; Finegoldia magna ATCC 29328; Ktedonobacter racemifer DSM 44963; Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4; Lactobacillus salivarius ATCC 11741; Listeria innocua; Lyngbva sp. PCC 8106; Marinobacter sp. ELB17; Methanohalobium evestigatum Z-7303; Microcystis phage Ma-LMM01; Microcystis aeruginosa NIES-843; Microscilla marina ATCC 23134; Microcoleus chthonoplastes PCC 7420; Neisseria meningitidis; Nitrosococcus halophilus Nc4; Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111; Nodularia spumigena CCY9414; Nostoc sp. PCC 7120; Oscillatoria sp. PCC 6506; Pelotomaculum_thermopropionicum_SI; Petrotoga mobilis SJ95; Polaromonas naphthalenivorans CJ2; Polaromonas sp. JS666; Pseudoalteromonas haloplanktis TAC125; Streptomyces pristinaespiralis ATCC 25486; Streptomyces pristnaespiralis ATCC 25486; Streptococcus thermophilus; Streptomyces viridochromogenes DSM 40736; Streptosporangium roseum DSM 43021; Synechococcus sp. PCC 7335; and Thermosipho africanus TCF52B (Chylinski et al., RNA Biol., 2013; 10(5): 726-737).
In addition to Cas9 orthologs, other Cas9 variants such as fusion proteins of inactive dCas9 and effector domains with different functions may be served as a platform for genetic modulation.
Cas12 is a type V CRISPR-CAS system and is also known as Cpf1 or C2c1. It is a compact enzyme from Francisella novicida, Acidaminococcus sp., Lachnospiracaea sp., and Prevotella sp. that is about 1,100 to 1,300 amino acids in length with a guide spacer that is 18-25 nucleotides and a total guide length of 42-44 nucleotides. Cas12 creates staggered cuts in dsDNA and creates a 5 nucleotide 5′ overhang when it cuts. Cas12 also processes its own guide RNAs which leads to increased multiplexing ability. Recently it was determined that Cas12a can cut single-stranded DNA once it is activated by a target DNA molecule matching the spacer sequence.
In some embodiments, the VENOM element is Cas12.
In some embodiments, the VENOM element is Cas12a.
In some embodiments, a Cas12a and a CRISPR array can be encoded in a single transcript using a stabilizer tertiary RNA structure as described by Campa et al. (Multiplexed genome engineering by Cas12a and CRISPR arrays encoded on single transcripts, Nature Methods 2019, doi.org/10.1038/s41592-019-0508-6; the contents of which are herein incorporated by reference in their entirety). Using this method, Campa et al. noted that up to 25 individual CRISPR RNAs can be delivered on a single plasmid. As a non-limiting example, the VENOM elements are Cas12a and a CRISPR array. As a non-limiting example, the VENOM element is Cas12a and a CRISPR array encoded in a single transcript using a stabilizer tertiary RNA structure.
Type V CRISPR systems have several differences from Type II systems. For example, Cas12 is a single RNA-guided endonuclease that, in contrast to Type II systems, lacks tracrRNA and thus are processed into mature crRNAs without the requirement of an additional trans-activating tracrRNA. Cas12 also uses a T-rich protospacer-adjacent motif (PAM) such that Cas12-crRNA complexes efficiently cleave target DNA preceded by a short T-rich PAM so Type V systems cleave at a point that is distant from the PAM.
Cas13 is a Type VI CRISPR-Cas system and is also known as C2c2 or CasRx. It is different than most of the other CRISPR systems because Cas13 targets RNA and not DNA. This is beneficial as Cas13 can be used as a therapeutic for influencing gene expression without altering the genome sequence of the subject. Cas13 is from Leptotrichia buccalis, Leptotrichia shahii, Ruminococcus flavefaciens, Bergeyella zoohelcum, Prevotella buccae, and Listeria seeligeri, and is about 900 to 1,300 amino acids in length with a guide spacer length of 22-30 nucleotides and a total guide length of 52-66 nucleotides. Once Cas13 is activated by a ssRNA sequence which is complementary to its crRNA spacer, Cas13 destroys all nearby RNA regardless of the sequence.
In some embodiments, the VENOM element is Cas13. As a non-limiting example, the Cas13 VENOM element may be used in diagnostics in vitro. As another non-limiting example, the Cas13 VENOM element can be used for specific RNA knockdown or RNA sequence editing in mammalian cells.
In some embodiments, the VENOM element is Cas13 or a variant of Cas13.
In some embodiments, the VENOM element may target any of the Cas13 or Cas13 orthologues as described in International Patent Publication No. WO2019005884. WO2019071048, and WO2019084062, the contents of each of which are herein incorporated by reference in their entirety.
In some embodiments, the VENOM element is Cas13b.
In some embodiments, the VENOM element is any of the Cas13b orthologues as described in International Patent Publication No. WO2018170333, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the VENOM element may target any of the Cas13 orthologues as described in International Patent Publication No. WO201905886, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the VENOM element may include or target any of the RNA-targeting CRISPR effector proteins as described in International Patent Publication No. WO2019018423, the contents of which are herein incorporated by reference in its entirety.
The present disclosure provides compositions, methods and processes for the design, preparation, manufacture and/or formulation of recombinant adeno-associated virus (AAV) particles having at least one element for the Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM).
In some embodiments, AAV particles and/or VAPs comprising a capsid and a vector genome which has at least one Vectorized Editing of Nucleic acids to correct Overt Mutations (VENOM) element and at least one payload.
In some embodiments, the VENOM element may comprise a polynucleotide encoding at least one guide RNA (gRNA) and a promoter, and the expression of the at least one guide gRNA may be driven by the promoter. The promoter may be an RNA polymerase III-dependent promoter.
In some embodiments, the VENOM element may comprise a polynucleotide encoding a first guide RNA, a second guide RNA, a donor template and a promoter, and the expression of the first guide RNA, the second guide RNA and the donor template may be driven by the promoter. The promoter may be an RNA polymerase Ill-dependent promoter.
In some embodiments, the VENOM element may comprise at least one VENOM element comprising a polynucleotide encoding an enzyme and a promoter, and the enzyme may be, but is not limited to, Cas9, Cas9 orthologue, Cpf1 (Cas12a) and Cpf1 (Cas12a) orthologue, Cas13, Cas13 orthologue, Cas13b and Cas13b orthologue. The expression of the enzyme may be driven by the promoter such as, but not limited to, a ubiquitous promoter and tissue-specific promoter. As a non-limiting example, the promoter is a ubiquitous promoter. As another non-limiting example, the promoter is a tissue-specific promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by a tissue-specific promoter. The tissue-specific promoter may be, but is not limited to, a neuron specific promoter, a muscle specific promoter, a cardiac specific promoter, and a liver specific promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by a constitutive promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cas9 and the expression of Cas9 may be driven by an inducible promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by a tissue-specific promoter. The tissue-specific promoter may be, but is not limited to, a neuron specific promoter, a muscle specific promoter, a cardiac specific promoter, and a liver specific promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by a constitutive promoter.
In some embodiments, the VENOM element comprises the polynucleotide encoding Cpf1 (Cas12a) and the expression of Cpf1 (Cas12a) may be driven by an inducible promoter.
In some embodiments, the VENOM element may comprise at least one VENOM element comprising a polynucleotide encoding an enzyme and a promoter, and the enzyme may be, but is not limited to, Cas9, Cas9 orthologue, Cpf1 (Cas12a), Cpf1 (Cas12a) orthologue, Cas13, Cas13 orthologue, Cas13b and Cas13b orthologue and further encodes at least one gRNA. The expression of the at least one gRNA may be driven by an RNA polymerase III-dependent promoter.
In some embodiments, the VENOM element is a CRISPR variant from Prevotella and Francisella 1 (CPF1). Non-limiting examples of CPF1 (Cas12a) variants which may be used in the AAV particles of the present disclosure include those taught in US Patent Publication No. US20180030425, the contents of which are herein incorporated by reference in their entirety. Examples of CPF1 (Cas12a) variants with an altered PAM specificity are described in US Patent Publication No. US20190010481, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the VENOM element is a fusion protein which includes a catalytically inactive Lachnospiraceae bacterium D2006 protein Cpf1 (dLbCpf1) fused to at least one activation domain. Non-limiting examples of the fusion proteins are described in International Patent Publication No. WO2018195540, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, provided are methods for modulating the expression level or the sequence of a gene in a cell from a subject by contacting the cell with an effective amount of at least one AAV particle. The expression level of the gene may be suppressed or stimulated.
In some embodiments, the modulating the sequence of the gene comprises correcting one or more mutations of the gene, a frameshift mutation which causes a premature stop codon or a truncated gene product, a disrupted reading frame via gene deletion, an aberrant splice acceptor site, or an aberrant splice donor site.
In some embodiments, the modulating the sequence of the gene comprises correcting one or more point-mutations of the gene.
In some embodiments, the modulating the sequence of the gene comprises contacting the cell with the AAV particle and a DNA donor, wherein the AAV particle comprises a viral genome encoding a first gRNA, a second gRNA, a DNA donor, a first promoter, a second promoter, a third promoter, wherein the expression of the first gRNA is controlled by the first promoter, the expression of the second gRNA is controlled by the second promoter, wherein the donor DNA comprises a single-strand oligonucleotide of 100-200 bp. The cell may be, but is not limited to, a neuronal cell, a neural stem cell, an astrocyte, an oligodendrocyte, a microglial cell, a retinal cell, a tumor cell, a hematopoietic stem cell, an insulin producing beta cell, a lung epithelium cell, an endothelial cell, a liver cell, a skeletal muscle cell, a muscle stem cell, a muscle satellite cell, or a cardiac muscle cell
In some embodiments, the modulating the sequence of the gene comprises inserting the full-length sequence or fragment of the gene.
In some embodiments, modulating the sequence of the gene comprises inserting a fragment of gene, wherein the fragment may be 1-500, 501-1000, or 1001-2000 nucleotides of the gene.
In some embodiments, the modulating the sequence of the gene comprises contacting the cell with the AAV particle comprising a viral genome encoding a first gRNA, a second gRNA, a DNA donor, a first promoter, a second promoter, and a third promoter. The expression of the first gRNA may be driven by the first promoter, the expression of the second gRNA may be driven by the second promoter, and the expression of the DNA donor may be driven by the third promoter.
In some embodiments, the repair of the sequence is by homology-directed repair wherein the full-length or fragment of the gene is deleted. The fragment deleted may be 1-500, 501-1000, or 1001-2000 nucleotides of the gene.
In some embodiments the modulating the sequence of the gene comprises contacting the cell with the AAV particle comprising a viral genome encoding a first gRNA, a second gRNA, a first promoter, and a second promoter. The expression of the first gRNA may be controlled by the first promoter, and the expression of the second gRNA may be controlled by the second promoter.
In some embodiments, the modulating the sequence of the gene is by an indel mutation and comprises contacting the cell with the AAV particle comprising a viral genome encoding one or more gRNA.
In some embodiments, the repair of the sequence is by homologous end joining repair.
In some embodiments, the AAV particles and methods described herein may be used to treat a disease, disorder and/or condition. As a non-limiting example, the disease, disorder and/or condition is a neurological disease such as, but not limited to, Parkinson's disease, Friedreich's Ataxia, Amyotrophic lateral sclerosis (ALS), Huntington's disease, and Spinal muscular atrophy (SMA). As a non-limiting example, the disease, disorder and/or condition is a muscular disease such as, but not limited to, Duchenne muscular disease. As a non-limiting example, the disease, disorder and/or condition is cancer.
In some embodiments, the VENOM element includes or is used with a genome-editing nuclease and/or customizable DNA-binding domain fusion protein described in International Patent Publication No. WO2018071892, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the VENOM element is a nucleobase editing molecule which causes specific nucleotides to convert to a different nucleotide. As a non-limiting example, the nucleobase editing molecule is a DNA base editing molecule. As another non-limiting example, the nucleobase editing molecule is an RNA base editing molecule.
In some embodiments, the nucleobase editing molecule is a programmable RNA base editing molecule. As a non-limiting example, the programmable RNA base editing molecule may be RESCUE (RNA Editing for Specific C to U Exchange), as described by Abudayyeh et al., 2019 (see Abudayyeh et al., 2019, Science July 26; 365(6451):382-386, the contents of which are herein incorporated by reference in their entirety). As another non-limiting example, the programmable RNA base editing molecule may be REPAIR (RNA editing for programmable A to 1 (G) replacement), as described by Cox et al., 2017 (see Cox et al., 2018, Science November 24; 358(6366):1019-1027, the contents of which are herein incorporated by reference in their entirety). The RESCUE and REPAIR programmable RNA base editing molecules may be used separately or in combination.
In some embodiments, RESCUE may enable cytidine (C) to uridine (U) RNA editing of dsRNA in mammalian cells. RESCUE may comprise a cytidine deaminase synthetically evolved from the adenine deaminase domain of adenosine deaminase acting on RNA ADAR2 (ADAR2dd) fused to a catalytically inactivated RNA-targeting CRISPR-Cas13 (dCas13). ADAR2dd may be fused to dRanCas13b, which is the catalytically inactive Cas13b ortholog from Riemerella anatipestifer. In some embodiments, dRanCas13b is equivalent to dPsPcas13b, which is the catalytically inactive Cas13b ortholog from Prevotella sp. P5-125. In some embodiments, the RESCUE construct comprises dRanCas13b.
In some embodiments, mutations in RESCUE through ADAR2dd may allow for direct interactions with the RNA target within a catalytic pocket, which may enable fitting of either adenosine or cytidine for deamination. In some embodiments, RESCUE may comprise a cytidine deaminase. In some embodiments, RESCUE may comprise an adenine deaminase.
In some embodiments, mutations in the RESCUE catalytic core, such as but not limited to V351G and K3501, modulate RESCUE activity. In some embodiments, mutations in RESCUE contacting the RNA target, such as but not limited to S486A and S495N, modulate RESCUE activity.
In some embodiments, RESCUE retains adenosine deaminase activity and the native pre-CRISPR(cr)RNA processing of Cas13b enables multiplexed adenine and cytosine deamination. In such embodiments, the delivery of RESCUE along with a pre-CRISPR(cr)RNA may target an adenine and cytosine in a transcript.
In some embodiments, RESCUE is optimally active with C or U base-flips across the target base using guides. In certain embodiments, the guides may be 30 nucleotides (nt) in length. In certain other embodiments the guides follow guide design rules. In some embodiments, the guide design rule relates to features of the motif, including but not limited to preference for a 5′ U or A. In other instances, the guide rule may relate to a mismatch position. In some embodiments tailored guide RNAs may be used to enable RESCUE multiplexed C-to-U and A-to-I editing.
In some embodiments, the targetable amino acid codon space of RESCUE may enable modulation of post-translational modifications, such as but not limited to phosphorylation, glycosylation, and methylation.
In some embodiments, the targetable amino acid codon space of RESCUE may enable targeting of catalytic residues.
In some embodiments, the targetable amino acid codon space of RESCUE may enable targeting of disease mutations.
In some embodiments, the targetable amino acid codon space of RESCUE may enable targeting of protective alleles.
In some embodiments, RESCUE constructs may be shortened for viral delivery to cells or subjects. In some embodiments, the RESCUE construct may be shortened via truncations, such as but not limited to C-terminal truncations. In certain embodiments, the C-terminal truncations may be of the catalytically inactive Cas13b, which may be dRanCas13b. In some embodiments, RESCUE constructs shortened for viral delivery to cells or subjects may exhibit enhanced deaminase activity.
In some embodiments amino acid conversions possible using cytidine deamination by RESCUE include, but are not limited, to Leu-to-Phe, Gln-to-stop, His-to-Tyr, Ala-to-Val, Pro-to-Ser, Pro-to-Leu, Ser-to-Leu, Ser-to-Phe, Thr-to-Lle, Thr-to-Met, Arg-to-stop, Arg-to-Cys and Arg-to-Trp conversions.
In some embodiments, REPAIR comprises a Cas13b enzyme. As a non-limiting example, Cas13b may be from Prevotella sp. P5-125 (PspCas13b, also referred to herein as dCas13b). As another non-limiting example, Cas13b may be from Porphyromonas gulae (PguCas13b) C terminally fused to the HIV Rev nuclear export sequence (NES). As yet another non-limiting example. Cas13b may be from Riemerella anatipestifer (RanCas13b)C-terminally fused to the mitogen-activated protein kinase NES.
In some embodiments, REPAIR comprises a Cas13b enzyme that possesses pre-CRISPR-RNA processing activity to allow for multiplex editing of multiple variants.
In some embodiments, REPAIR may comprise PspCas13b that is engineered to lack nuclease activity via mutations in conserved catalytic residues in the higher eukaryotes and prokaryotes nucleotide-binding (HEPN) domain.
In some embodiments, REPAIR comprises human ADAR1dd or ADAR2dd containing hyperactivating mutations that may enhance catalytic activity. As a non-limiting example, the hyperactivating mutation may be E1008Q (ADAR1dd). As another non-limiting example, the hyperactivating mutation may be E488Q (ADAR2dd).
In some embodiments dCas13b-ADAR1dd(E1008Q) may require a longer guide RNA than dCas13b-ADAR2dd(E488Q), which is functional with a number of RNA guide lengths and has greater editing efficiency.
In some embodiments REPAIR uses guide RNAs comprising spacers. As a non-limiting example, the spacers are 30 nt in length. As another non-limiting example, the spacers are 50nt in length.
In some embodiments, REPAIR comprises a linker between dCas13b and ADAR2dd (E488Q).
In some embodiments, REPAIR comprises the ADAR2dd with the E488Q mutation fused to catalytically inactive PsPCas13b, which may also be referred to herein as dCas13b. In some embodiments, engineered dCas13b-ADARdd fusions naturally deaminates adenosines to inosines, a base that functionally mimics guanosine in the cell, in dsRNA.
In some embodiments, a guide RNA that hybridizes with the target RNA may generate the requisite duplex RNA substrate for editing and recruiting of the dCas13b-ADARdd fusion comprising REPAIR. The guide RNA crRNA component hybridizes to bases surrounding the target adenosine. The mismatch cytidine in the crRNA opposite the target may enhance the RNA editing reaction to promote adenosine deamination to inosine. In some embodiments, REPAIR comprises a mismatched cytidine opposite the target adenosine that may increase deamination frequency.
In some embodiments, REPAIR may be used for RNA editing in mammalian cells.
In some embodiments, REPAIR is localized to the cytoplasm of cells.
In some embodiments, REPAIR directly deaminates target adenosines to inosines without relying on endogenous cellular repair pathways for desired editing outcomes and therefore may be used in post-mitotic cells. As a non-limiting example, the post-mitotic cell is a neuron.
In some embodiments, destabilizing REPAIR ADAR2dd(E488Q)-RNA binding may selectively decrease off-target editing. As a non-limiting example, REPAIR ADAR2dd(E488Q)-RNA binding is destabilized by applying a rational mutagenesis strategy to vary ADAR2dd(E488Q) residues that contact the RNA duplex.
In some embodiments, the REPAIR ADAR2dd(E488Q) is mutated. As a nonlimiting example, ADAR2dd(E488Q) is mutated to generate in ADAR2dd(E488QMT375G).
In some embodiments, REPAIR may be used for RNA editing of disease-relevant mutations. The disease-relevant mutations may be G to A mutations. As a non-limiting example, the disease relevant mutation may be 878G to A (AVPR2 W293X) in X-linked nephrogenic diabetes insipidus.
In some embodiments, REPAIR may be used for the treatment of diseases associated with temporary changes in cell state, such as but not limited to inflammation.
In some embodiments, REPAIR may be used to modify the function of proteins involved in disease-related signal transduction. As a non-limiting example, REPAIR may be used to recode serine, threonine and tyrosine residues targeted for phosphorylation by kinases in neurodegenerative diseases and/or conditions, such as but not limited to Alzheimer's disease.
In some embodiments, REPAIR may be used to transiently or chronically alter the sequence of expressed, risk-modifying G-to-A variants to decrease the chance of entering a disease state for patient. As a non-limiting example, REPAIR may be used to functionally mimic A-to-G alleles of the Interferon Induced With Helicase C Domain 1 (IFIH1) gene to protect against autoimmune disorders, such as but not limited to type I diabetes and systemic lupus erythematosus.
In some embodiments, REPAIR constructs may be modified for packaging into therapeutically relevant viral vectors, such as but not limited to, AAV vectors. As a non-limiting examples constructs may be modified by C terminal truncations of dCas13 fused to ADAR2dd(E488Q). As another non-limiting example, constructs may be modified by N terminal truncations of dCas13 fused to ADAR2dd(E488Q).
The REPAIR method has no strict sequence constraints and can be used to edit full-length transcripts containing mutations. A variant of the REPAIR method may be used when more specificity is needed. REPAIRv2 as described by Cox et al (Science. 2017 Nov. 24; 358(6366): 1019-1027. doi:10.1126/science.aaq0180) is a method that is at least 919 more specific than REPAIR and is more adaptable to viral delivery.
In some embodiments, the REPAIR method uses the mutant ADAR2dd(E488Q/T375G).
Adenine base editors (ABEs) mediate conversion of A to T and G to C in genomic DNA. Gaudelli et al. reported a tRNA adenosine deaminase to operate on DNA when fused to a catalytically impaired CRISPR-Cas9 (Nature. 2017 Nov. 23; 551(7681): 464-471. doi:10.1038/nature24644; the contents of which are herein incorporated by reference). ABEs are able to introduce point mutations more efficiently and cleanly than the Cas9 nuclease-based method, induce less off-target genome modification than Cas9, and install disease-correcting or disease-suppressing mutations in cells. Additionally, ABEs are able to introduce mutations without double-stranded cleavage.
In some embodiments, the VENOM element is an ABE and can mediate conversion of A to T and G to C in genomic DNA.
Base editor 3 (BE3) converts C to G base pairs or T to A base pairs in a variety of cell lines and with a higher efficiency and lower indel frequency than what can be achieved using other genome editing methods (Komor A. et al. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature. 2016; 533:420-424 and Komor et al., Sci. Adv. 2017; 3: eaao4774: the contents of which are herein incorporated by reference in their entireties). In order to operate efficiently, BE3 requires the presence of an NGG PAM that places the target C within a five-nucleotide window near the PAM-distal end of the protospacer, but this limits the number of sites in the human genome that can be efficiently targeted by BE3. In order to expand the number of sites that could be targeted, Kim et al. replaced the nickase form of SpCas9 with that of SaCas9 in BE3 to generate APEBEC1-SaCas9n-UG1 (SaBE3) which can mediate base editing at sites not accessible to BE3 (Nat Biotechnol. 2017 April; 35(4): 371-376. doi:10.1038/nbt.3803; the contents of which are herein incorporated by reference in their entireties).
In some embodiments, the VENOM element is a BE3 and can mediate conversion of C to G base pairs or T to A base pairs in genomic DNA.
In some embodiments, the VENOM element is a SaBE3 and can mediate conversion of C to G base pairs or T to A base pairs in genomic DNA.
The targeting range of base editors can be expanded by using engineered Cas9 variants that expand or alter PAM specificities (see Kim et al. Nat Biotechnol. 2017 April; 35(4): 371-376. doi:10.1038/nbt.3803; the contents of which are herein incorporated by reference in their entireties). Non-limiting examples of engineered Cas9 variants includes SpCas9 variants that accept NGA (VQR-Cas9), NGAN (VQR-BE3), NGAG (EQR-Cas9, EQR-BE3) or NGCG (VRER-Cas9, VRER-BE3) PAM sequences or SaCas9 variants that contain three mutations (SaKKH-BE3) that expand the PAM requirement to NNNRRT (SaKKH-BE3).
In some embodiments, the VENOM element is an engineered Cas9 variants that expand or alter PAM specificities. As a non-limiting example, the engineered Cas9 variant is VQR-Cas9, VQR-BE3, EQR-Cas9, EQR-BE3, VRER-Cas9, VRER-BE3, SaKKH-BE3, SaKKH-BE3.
In some embodiments, the VENOM element is a SpCas9 variant that contains four point mutations N497A, R661A, Q695A and Q926A.
In some embodiments, the VENOM element is a BE3 variant that contains four point mutations N497A, R661A, Q695A and Q926A and is called HF-BE3 as described by Rees et al. (Nature Communications, 8:15790, DOI: 10.1038/ncomms15790; the contents of which are incorporated by reference in their entirety).
In some embodiments, nucleobase editing may be used to introduce a stop codon into a gene. The introduction of the stop codon may be used to stop the expression of toxic repeats in a subject. As a non-limiting example, the introduction of the stop codon may be used to treat autosomal dominant disorders.
In some embodiments, nucleobase editing may be used to introduce a protective mutation into a gene. The mutation may be introduced adjacent to the aspartyl protease beta-site in a gene. As a non-limiting example, nucleobase editing may be used to introduce the protective mutation (A673T) in the APP gene to protects against Alzheimer's disease and cognitive decline in the elderly. As a non-limiting example, nucleobase editing may be used to introduce the protective mutation adjacent to the aspartyl protease beta-site in the APP gene to protects against Alzheimer's disease and cognitive decline in the elderly.
In some embodiments, nucleobase editing may be used to correct a gene mutation which causes a disease, disorder and/or condition.
In some embodiments, nucleobase editing may be used to correct a gene mutation which causes a neurological disease in a subject. As a non-limiting example, the neurological disease is Alzheimer's Disease.
In some embodiments, the nucleobase editing may be used to correct a gene mutation in that APP gene which causes Alzheimer's Disease. As a non-limiting example, the mutation is V717I.
In some embodiments, the REPAIR method may be used for nucleobase editing to correct the V717I mutation in the APP gene. This correction may be used to treat or reduce the symptoms of Alzheimer's Disease.
In some embodiments, the CRISPR-Cas system described in US Patent Publication No. US20170073670, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex wherein a nucleic acid sequence-recognizing module that specifically binds to a target nucleotide sequence in a given double stranded DNA and a nucleic acid base converting enzyme are bonded and the nucleic acid sequence-recognizing module is a CRISPR-Cas system wherein at least one DNA cleavage ability of Cas is inactivated, which complex converts one or more nucleotides in the targeted site to other one or more nucleotides or deletes one or more nucleotides, or inserts one or more nucleotides into said targeted site, without cleaving at least one strand of said double stranded DNA in the targeted site.
In some embodiments, the CRISPR-Cas system described in US Patent Publication No. US20170321210, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex wherein a nucleic acid sequence-recognizing module that specifically binds to a target nucleotide sequence in a given double stranded DNA and DNA glycosylase with sufficiently low reactivity with a DNA having an unrelaxed double helix structure (unrelaxed DNA) are bonded, which converts one or more nucleotides in the targeted site to other one or more nucleotides or deletes one or more nucleotides, or inserts one or more nucleotides into said targeted site, without cleaving at least one strand of said double stranded DNA in the targeted site.
In some embodiments, the CRISPR-Cas system described in US Patent Publication No. US20190024098, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex for modifying a targeted site of a double stranded DNA in a host cell, the complex comprising (a) a crRNA comprising a sequence complementary to a target strand of a target nucleotide sequence in the given double stranded DNA, and (b) a nucleic acid-modifying enzyme complex comprising a protein group constituting Cascade, and a nucleic acid base converting enzyme that has formed a complex with any protein in the protein group.
In some embodiments, the CRISPR-Cas system described in EP Patent Publication No. EP3348638, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex of a nucleic acid sequence-recognizing module that specifically binds to a target nucleotide sequence in a double stranded DNA of a gram-positive bacterium and a nucleic acid base converting enzyme bonded to each other, which complex converts one or more nucleotides in the targeted site to other one or more nucleotides or deletes one or more nucleotides, or inserts one or more nucleotides into said targeted site, without cleaving at least one strand of said double stranded DNA in the targeted site and is functionable in the gram-positive bacterium.
In some embodiments, the CRISPR-Cas system described in US Patent Publication No. US20190085342, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex wherein a nucleic acid sequence-recognizing module that specifically binds to a target nucleotide sequence in a double stranded DNA of a monocot cell and a nucleic acid base converting enzyme are bonded, which functions in the monocot cell and converts one or more nucleotides in the targeted site to other one or more nucleotides or deletes one or more nucleotides, or inserts one or more nucleotides into said targeted site, without cleaving at least one strand of said double stranded DNA in the targeted site.
In some embodiments, the CRISPR-Cas system described in EP Patent Publication No. EP3447139, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the VENOM element may be a nucleic acid-modifying enzyme complex wherein a nucleic acid sequence-recognizing module that specifically binds to a target nucleotide sequence in a given double-stranded DNA, a nucleic acid base converting enzyme and a base excision repair inhibitor are bonded, which complex converts one or more nucleotides in the targeted site to other one or more nucleotides or deletes one or more nucleotides, or inserts one or more nucleotides into said targeted site, without cleaving at least one strand of said double-stranded DNA in the targeted site, wherein the nucleic acid sequence-recognizing module is a CRISPR-Cas system wherein at least one DNA cleavage ability of Cas is inactivated.
In some embodiments, the Cas9 fusion protein described in U.S. Pat. No. 10,167,457, the contents of which are incorporated by reference in their entirety, may be used as the VENOM element. As a non-limiting example, the fusion protein may comprise: (i) a Cas9 domain, wherein the Cas9 domain when in conjunction with a bound guide RNA (gRNA) specifically binds to a target nucleic acid sequence; (ii) a cytidine deaminase domain, wherein the cytidine deaminase domain deaminates a cytosine base in a single-stranded portion of the target nucleic acid sequence when in conjunction with the Cas9 domain and the gRNA; and (iii) an uracil glycosylase inhibitor (UGI) domain, wherein the UGI domain inhibits a uracil-DNA glycosylase. The Cas9 domain may be a Cas9 nickase (nCas9) domain that cuts a nucleotide target strand of a nucleotide duplex. The nucleotide target strand may be the strand that binds to the gRNA. The cytidine deaminase domain is a deaminase from an apolipoprotein B mRNA-editing complex (APOBEC) family deaminase such as, but not limited to, OBEC1 deaminase, APOBEC2 deaminase, APOBEC3A deaminase, APOBEC3B deaminase, APOBEC3C deaminase, APOBEC3D deaminase. APOBEC3F deaminase. APOBEC3G deaminase, and APOBEC3H deaminase.
In some embodiments, any of the adenosine deaminases may be used as a VENOM element for base editing as described in U.S. Pat. No. 10,113,163, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the adenosine deaminase may include any of the mutations described in U.S. Pat. No. 10,113,163, such as, but not limited to, those taught in Table 4.
In some embodiments, VENOM elements may be used to introduce unnatural amino acids into proteins. As a non-limiting example, the unnatural amino acids may be introduced using base editing as described in International Patent Publication No. WO2018039438, the contents of which are incorporated by reference in their entirety.
In some embodiments, a split Cas9 protein as described in US Patent publication No. US20180127780, the contents of which are incorporated by reference in their entirety, may be used as a VENOM element. The split Cas9 may be used alone or in combination with a nucleobase editor in order to form a complete and functional gene editing mechanism.
In some embodiments, VENOM elements may be used to introduce protective and/or loss of function variants of a gene. As a non-limiting example, the VENOM element may be the CRISPR/Cas9-based nucleobase editor described in WO2018119359 and WO2018119354, the contents of each of which are incorporated by reference in their entirety. Introduction of the protective and/or loss of function variants of a gene may treat or reduce the symptoms of the disease, disorder and/or condition.
In some embodiments, a VENOM element may be used to introduce heteroclitic or cryptic peptides that are more immunogenic than the native peptide derived from the tumor associated antigens. The heteroclitic or cryptic peptides are able to elicit a strong tumor-specific immune response and can inhibit tumor growth and metastasis. Non-limiting examples of nucleobase editors which can be used as VENOM elements are described in International Patent Publication No. WO2018165631, the contents of which are incorporated by reference in their entirety.
In some embodiments, a nucleobase editor capable of inducing a cytosine (C) to guanine (G) change in a nucleic acid as described in International Patent Publication No. WO2018165629, the contents of which are incorporated by reference in their entirety, may be used as a VENOM element. As a non-limiting example, the VENOM element may encode a fusion protein comprising (i) a nucleic acid programmable DNA binding protein (napDNAbp), and (ii) a cytidine deaminase domain, and (iii) a uracil binding protein (UBP).
In some embodiments, targeted editing of nucleic acids may be achieved using the fusion proteins as described in US Patent Publication No. US20180312828, the contents of which are incorporated by reference in their entirety. As a non-limiting example, VENOM elements may encode fusion proteins of nucleic acid programmable DNA binding proteins (napDNAbp), e.g., Cpf1 or variants thereof, and nucleic acid editing proteins or protein domains, e.g., deaminase domains.
In some embodiments, the VENOM elements may be regulated using the ligand-responsive self-cleaving catalytic RNAs (aptazymes) which are incorporated into guide RNAs as described in International Patent Publication No. WO2018209320, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the aptazyme may be any of those described in International Patent Publication No. WO2018209320. The aptazyme may include a ribozyme and may be, but is not limited to, a hammerhead ribozyme and a ligand-responsive ribozyme.
In some embodiments, the VENOM element may be an evolved base editor such as, but not limited to, the APOBEC1. CDA, and AID cytidine deaminase domains described in International Patent Publication No. WO2019023680, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the evolved base editor may be obtained as a result of the phage-assisted continuous evolution (PACE) system.
In some embodiments, the VENOM element may be an adenosine nucleobase editor such as, but not limited to, the adenosine nucleobase editor described in International Patent Publication No. WO2019079347, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the adenosine nucleobase editor may introduce a point mutation that increases the expression of a gene which is currently under-expressed and is thus causing a disease, disorder and/or condition.
In some embodiments, the VENOM element may be a nucleobase editor such as, but not limited to, the nucleobase editor described in International Patent Publication No. WO2018165504, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the adenosine nucleobase editor may introduce a point mutation that increases the expression of a gene which is currently under-expressed and is thus causing a disease, disorder and/or condition.
In some embodiments, the VENOM element may be a Cas9 variant as described in US Patent Publication No. US20160215276, the contents of which are incorporated by reference in their entirety. As a non-limiting example, the Cas9 variant may be a dimer or tetramer of a fusion protein wherein the fusion protein comprises two domains: (i) a nuclease-inactivated Cas9 (dCas9); and (ii) a recombinase catalytic domain.
In some embodiments, guide RNAs (gRNAs) may be used to activity and/or improving the specificity of RNA-programmable endonucleases, such as Cas9. Non-limiting examples of methods controlling the activity and/or improving the specificity of RNA-programmable endonucleases are described in U.S. Pat. No. 9,228,207, the contents of which are incorporated by reference in their entirety. Also provided in U.S. Pat. No. 9,228,207 are sgRNAs which may be used to control the activity and/or improve the specificity of RNA-programmable endonucleases.
In some embodiments, the VENOM element may for modifying a cytosine in a target locus. International Patent Publication No. WO2018213726, the contents of which are herein incorporated by reference in its entirety, describes various methods proteins and molecules which may be used to modify cytosine. As a non-limiting example. Cytosine may be modified in a target locus of interest, by delivering to the locus (a) a Cpf1 nickase protein, (b) a guide molecule which comprises a guide sequence linked to a direct repeat; and (c) a cytidine deaminase protein or catalytic domain thereof. The cytidine deaminase protein or catalytic domain thereof may be covalently or non-covalently linked to said Cpf1 nickase protein or the guide molecule or may be adapted to link thereto after delivery. The guide molecule may form a complex with the Cpf1 nickase protein and direct the complex to bind a first DNA strand at a target locus of interest.
In some embodiments, the VENOM element may be or include an adenosine deaminase as described in International Patent Publication No. WO2019005884 and WO2019071048, the contents of each which are herein incorporated by reference in its entirety.
In some embodiments, the VENOM element may be or include a cytidine deaminase as described in International Patent Publication No. WO2019005886, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the VENOM element may be or include a base editing (BE) technology as described in International Patent Publication No. WO2018218188, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the VENOM element may cause nuclcobase mutations in a gene of interest. As a non-limiting example, the nucleobase mutation may be A to G or T to C using the deaminases described in International Patent Publication No. WO2019079347, the contents of which are herein incorporated by reference in its entirety.
Most gene editing methods require two homology arms for gene editing. However, Suzuki et al. have recently described precise in vivo gene editing via a single homology arm and intron-targeting gene integration (Cell Research (2019) 0:1-16; doi.org/10.1038/s41422-019-0213-0; the contents of which are herein incorporated by reference in their entireties). The single homology arm provides reduces the amount of homology needed for the gene editing machine to attach to the target gene but it does not address the common issue that incorrect mutations (including deletions) can occur during the cutting and insertion of the new sequence. To address this aspect, Suzuki et al. have discovered that if you target the intron for gene integration, the effect of these undesired mutations is minimized since the intron is not translated. This method is called Single homology Arm donor mediated intron-Targeting Integration (SATI) and allows for the targeting of a broad range of mutations and cell types
In some embodiments, the VENOM element is SATI.
In some embodiments, the SATI method may be used to correct a gene mutation which causes a disease, disorder and/or condition.
A number of switches that function to regulate transgene expression in the context of cell systems and animal models have been described. One such mechanism of regulation relies on a chemical agent, such as a drug, or a physiological stimulus that acts as a switch to turn the expression of a transgene on or off. The most extensively studied regulatory switch mechanism is the Tet ON/OFF system, in which a tet repressor protein can only activate transcription from a promoter with a tet response element in the presence or absence of teracyclin, first described by Bujard and Gossen (Proc Natl Acad Sci USA. 1992 Jun. 15; 89(12):5547-51, Tight control of gene expression in mammalian cells by tetracycline-responsive promoters; the contents of which are herein incorporated by reference in its entirety). Several ligand and hormone regulatable systems, which employ the dimerization of two separate proteins for activation or repression, have also been described.
Approaches in which a second transgene encodes a regulatory enzyme such as a CRE recombinase, which modulates expression of a target transgene through site specific recombination, is extensively used in transgenic mouse studies, where a tissue restricted or temporally restricted expression pattern is desired.
In addition, transgene expression can also be controlled through regulation of transcript mRNA stability or protein stability, through the inclusion of stabilizing or destabilizing elements.
Adeno-associated viruses (AAV) are small non-enveloped icosahedral capsid viruses of the Parvoviridae family characterized by a single stranded DNA vector genome. Parvoviridae family viruses consist of two subfamilies: Parvovirinae, which infect vertebrates, and Densovirinae, which infect invertebrates. The Parvoviridae family comprises the Dependovirus genus which includes AAV, capable of replication in vertebrate hosts including, but not limited to, human, primate, bovine, canine, equine, and ovine species.
The parvoviruses and other members of the Parvoviridae family are generally described in Kenneth 1. Berns, “Parvoviridae: The Viruses and Their Replication,” Chapter 69 in FIELDS VIROLOGY (3d Ed. 1996), the contents of which are incorporated by reference in their entirety.
AAV have proven to be useful as a biological tool due to their relatively simple structure, their ability to infect a wide range of cells (including quiescent and dividing cells) without integration into the host genome and without replicating, and their relatively benign immunogenic profile. The genome of the virus may be manipulated to contain a minimum of components for the assembly of a functional recombinant virus, or viral particle, which is loaded with or engineered to target a particular tissue and express or deliver a desired payload.
The wild-type vector genome is a linear, single-stranded DNA (ssDNA) molecule approximately 5,000 nucleotides (nt) in length. Inverted terminal repeats (ITRs) traditionally cap the vector genome at both the 5′ and the 3′ end, providing origins of replication for the vector genome. While not wishing to be bound by theory, an AAV vector genome typically comprises two ITR sequences. These ITRs have a characteristic T-shaped hairpin structure defined by a self-complementary region (145nt in wild-type AAV) at the 5′ and 3′ ends of the ssDNA which form an energetically stable double stranded region. The double stranded hairpin structures comprise multiple functions including, but not limited to, acting as an origin for DNA replication by functioning as primers for the endogenous DNA polymerase complex of the host viral replication cell.
The wild-type AAV vector genome further comprises nucleotide sequences for two open reading frames, one for the four non-structural Rep proteins (Rep78, Rep68, Rep52. Rep40, encoded by Rep genes) and one for the three capsid, or structural, proteins (VP1, VP2, VP3, encoded by capsid genes or Cap genes). The Rep proteins are important for replication and packaging, while the capsid proteins are assembled to create the protein shell of the AAV, or AAV capsid. Alternative splicing and alternate initiation codons and promoters result in the generation of four different Rep proteins from a single open reading frame and the generation of three capsid proteins from a single open reading frame. Though it varies by AAV serotype, as a non-limiting example, for AAV9/hu.14 (SEQ ID NO: 123 of U.S. Pat. No. 7,906,111, the contents of which are herein incorporated by reference in their entirety) VP1 refers to amino acids 1-736, VP2 refers to amino acids 138-736, and VP3 refers to amino acids 203-736. In other words, VP1 is the full-length capsid sequence, while VP2 and VP3 are shorter components of the whole. As a result, changes in the sequence in the VP3 region, are also changes to VP1 and VP2, however, the percent difference as compared to the parent sequence will be greatest for VP3 since it is the shortest sequence of the three. Though described here in relation to the amino acid sequence, the nucleic acid sequence encoding these proteins can be similarly described. Together, the three capsid proteins assemble to create the AAV capsid protein. While not wishing to be bound by theory, the AAV capsid protein typically comprises a molar ratio of 1:1:10 of VP1:VP2:VP3. As used herein, an “AAV serotype” is defined primarily by the AAV capsid. In some instances, the ITRs are also specifically described by the AAV serotype (e.g., AAV2/9).
For use as a biological tool, the wild-type AAV vector genome can be modified to replace the rep/cap sequences with a nucleic acid sequence comprising a payload region with at least one ITR region. Typically, in recombinant AAV vector genomes there are two ITR regions. The rep/cap sequences can be provided in trans during production to generate AAV particles and/or VAPs.
In addition to the encoded heterologous payload, vector genomes may comprise the vector genome, in whole or in part, of any naturally occurring and/or recombinant AAV serotype nucleotide sequence or variant. AAV variants may have sequences of significant homology at the nucleic acid (genome or capsid) and amino acid levels (capsids), to produce constructs which are generally physical and functional equivalents, replicate by similar mechanisms, and assemble by similar mechanisms. Chiorini et al., J. Vir. 71: 6823-33(1997); Srivastava et al., J. Vir. 45:555-64 (1983): Chiorini et al., J. Vir. 73:1309-1319 (1999); Rutledge et al., J. Vir. 72:309-319 (1998); and Wu et al., J. Vir. 74: 8635-47 (2000), the contents of each of which are incorporated herein by reference in their entirety.
In some embodiments, AAV particles and/or VAPs of the present disclosure are recombinant AAV viral vectors which are replication defective and lacking sequences encoding functional Rep and Cap proteins within their vector genome. These defective vector genomes may lack most or all parental coding sequences and essentially carry only one or two AAV ITR sequences and the nucleic acid of interest for delivery to a cell, a tissue, an organ, or an organism.
In some embodiments, the vector genome of the AAV particles and/or VAPs of the present disclosure comprise at least one control element which provides for the replication, transcription, and translation of a coding sequence encoded therein. Not all of the control elements need always be present as long as the coding sequence is capable of being replicated, transcribed, and/or translated in an appropriate host cell. Non-limiting examples of expression control elements include sequences for transcription initiation and/or termination, promoter and/or enhancer sequences, efficient RNA processing signals such as splicing and polyadenylation signals, sequences that stabilize cytoplasmic mRNA, sequences that enhance translation efficacy (e.g., Kozak consensus sequence), sequences that enhance protein stability, and/or sequences that enhance protein processing and/or secretion.
According to the present disclosure, AAV particles and/or VAPs for use in therapeutics and/or diagnostics comprise a virus that has been distilled or reduced to the minimum components necessary for transduction of a nucleic acid payload or cargo of interest. In this manner, AAV particles and/or VAPs are engineered as vehicles for specific delivery while lacking the deleterious replication and/or integration features found in wild-type viruses.
In addition to single stranded AAV vector genomes (e.g., ssAAVs), the present disclosure also provides for self-complementary AAV (scAAVs) vector genomes, scAAV vector genomes contain DNA strands which anneal together to form double stranded DNA. By skipping second strand synthesis, scAAVs allow for rapid expression in the transduced cell.
In some embodiments, the AAV particle and/or VAP of the present disclosure is an scAAV.
In some embodiments, the AAV particle and/or VAP of the present disclosure is an ssAAV.
Methods for producing and/or modifying AAV particles and/or VAPs are disclosed in the art such as pseudotyped AAV vector genomes (PCT Patent Publication Nos. WO200028004; WO200123001: WO2004112727; WO2005005610; and WO2005072364, the content of each of which is incorporated herein by reference in its entirety).
AAV particles and/or VAPs may be modified to enhance the efficiency of delivery. Such modified AAV particles and/or VAPs can be packaged efficiently and be used to successfully infect the target cells at high frequency and with minimal toxicity. In some embodiments, the capsids of the AAV particles and/or VAPs are engineered according to the methods described in US Publication Number US20130195801, the contents of which are incorporated herein by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs comprising a payload region encoding the polypeptides may be introduced into mammalian cells.
AAV particles and/or VAPs of the present disclosure may comprise or be derived from any natural or recombinant AAV serotype. According to the present disclosure, the AAV particles and/or VAPs may utilize or be based on a serotype or include a peptide selected from any of the following VOY101, VOY201, AAVPHP.B (PHP.B), AAVPHP.A (PHP.A), AAVG2B-26, AAVG2B-13, AAVTH1.1-32. AAVTH1.1-35, AAVPHP.B2 (PHP.B2), AAVPHP.B3 (PHP.B3), AAVPHP.N/PHP.B-DGT, AAVPHP.B-EST, AAVPHP.B-GGT, AAVPHP.B-ATP, AAVPHP.B-ATT-T, AAVPHP.B-DGT-T, AAVPHP.B-GGT-T, AAVPHP.B-SGS, AAVPHP.B-AQP, AAVPHP.B-QQP, AAVPHP.B-SNP(3), AAVPHP.B-SNP, AAVPHP.B-QGT, AAVPHP.B-NQT, AAVPHP.B-EGS, AAVPHP.B-SGN, AAVPHP.B-EGT, AAVPHP.B-DST, AAVPHP.B-DST. AAVPHP.B-STP, AAVPHP.B-PQP, AAVPHP.B-SQP, AAVPHP.B-QLP, AAVPHP.B-TMP, AAVPHP.B-TTP, AAVPHP.S/G2A12, AAVG2A15/G2A3 (G2A3), AAVG2B4 (G2B4), AAVG2B5 (G2B5), PHP.S, AAV1, AAV2, AAV2G9, AAV3, AAV3a. AAV3b, AAV3-3, AAV4, AAV4-4, AAV5, AAV6, AAV6.1, AAV6.2, AAV6.1.2, AAV7, AAV7.2, AAV8, AAV9, AAV9.11. AAV9.13, AAV9.16, AAV9.24, AAV9.45, AAV9.47, AAV9.61, AAV9.68, AAV9.84, AAV9.9, AAV10, AAV11, AAV12, AAV16.3. AAV24.1, AAV27.3, AAV42.12, AAV42-1b, AAV42-2, AAV42-3a. AAV42-3b, AAV42-4, AAV42-5a, AAV42-5b, AAV42-6b, AAV42-8, AAV42-10, AAV42-11, AAV42-12, AAV42-13, AAV42-15, AAV42-aa, AAV43-1. AAV43-12, AAV43-20, AAV43-21, AAV43-23, AAV43-25, AAV43-5, AAV44.1, AAV44.2, AAV44.5, AAV223.1. AAV223.2, AAV223.4, AAV223.5, AAV223.6, AAV223.7, AAV1-7/rb.48, AAV1-8/rh.49, AAV2-15/rh.62, AAV2-3/rh.61, AAV2-4/rh.50, AAV2-5/rh.51, AAV3.1/hu.6, AAV3.1/hu.9, AAV3-9/rh.52, AAV3-11/rh.53, AAV4-8/r11.64, AAV4-9/rh.54, AAV4-19/rh.55, AAV5-3/rh.57, AAV5-22/rh.58, AAV7.3/hu.7, AAV16.8/hu.10, AAV16.12/hu.11, AAV29.3/bb.1, AAV29.5/bb.2, AAV106.1/hu.37, AAV114.3/hu.40, AAV127.2/hu.41, AAV127.5/hu.42, AAV128.3/hu.44, AAV130.4/hu.48, AAV145.1/hu.53, AAV145.5/hu.54, AAV145.6/hu.55, AAV161.10/hu.60, AAV161.6/hu.61, AAV33.12/hu.17, AAV33.4/hu.15, AAV33.8/hu.16, AAV52/hu.19, AAV52.1/hu.20, AAV58.2/hu.25, AAVA3.3, AAVA3.4, AAVA3.5, AAVA3.7, AAVC1, AAVC2, AAVC5, AAV-DJ, AAV-DJ8, AAVF3. AAVF5, AAVH2, AAVrh.72, AAVhu.8, AAVrh.68, AAVrh.70, AAVp1.1, AAVpi.3, AAVpi.2, AAVrh.60, AAVrh.44, AAVrh.65, AAVrh.55, AAVrh.47, AAVrb.69, AAVrh.45, AAVrh.59, AAVhu.12, AAVH6, AAVLK03, AAVH-1/hu.1, AAVH-5/hu.3, AAVLG-10/rh.40, AAVLG-4/rh.38, AAVLG-9/hu.39, AAVN721-8/rh.43, AAVCh.5. AAVCh.5R1, AAVcy.2, AAVcy.3, AAVcy.4, AAVcy.5, AAVCy.5R1, AAVCy.5R2, AAVCy.5R3, AAVCy.5R4, AAVcy.6, AAVhu.1, AAVhu.2, AAVhu.3, AAVhu.4, AAVhu.5, AAVhu.6, AAVhu.7, AAVhu.9, AAVhu.10, AAVhu.11, AAVhu.13, AAVhu.15, AAVhu.16, AAVhu.17, AAVhu.18, AAVhu.20, AAVhu.21, AAVhu.22, AAVhu.23.2, AAVhu.24, AAVhu.25, AAVhu.27, AAVhu.28, AAVhu.29. AAVhu.29R. AAVhu.31, AAVhu.32, AAVhu.34, AAVhu.35, AAVhu.37, AAVhu.39, AAVhu.40, AAVhu.41, AAVhu.42, AAVhu.43, AAVhu.44, AAVhu.44R1, AAVhu.44R2, AAVhu.44R3, AAVhu.45, AAVhu.46, AAVhu.47, AAVhu.48, AAVhu.48R1, AAVhu.48R2, AAVhu.48R3, AAVhu.49, AAVhu.51, AAVhu.52, AAVhu.54, AAVhu.55, AAVhu.56, AAVhu.57, AAVhu.58, AAVhu.60, AAVhu.61, AAVhu.63, AAVhu.64, AAVhu.66, AAVhu.67, AAVhu.14/9, AAVhu.t 19, AAVrh.2, AAVrh.2R, AAVrh.8, AAVrh.8R, AAVrh.10. AAVrh.12, AAVrh.13, AAVrh.13R, AAVrh.14, AAVrh.17. AAVrh.18, AAVrh.19, AAVrh.20, AAVrh.21, AAVrh.22, AAVrh.23, AAVrh.24, AAVrh.25, AAVrh.31. AAVrh.32, AAVrh.33, AAVrh.34, AAVrh.35, AAVrh.36, AAVrh.37, AAVrh.37R2, AAVrh.38, AAVrh.39, AAVrh.40, AAVrh.46, AAVrh.48, AAVrh.48.1, AAVrh.48.1.2, AAVrh.48.2, AAVrh.49, AAVrb.51, AAVrh.52, AAVrh.53, AAVrh.54, AAVrh.56, AAVrh.57, AAVrh.58, AAVrb.61, AAVrh.64, AAVrh.64R1, AAVrh.64R2, AAVrh.67. AAVrh.73, AAVrh.74, AAVrh8R. AAVrh8R A586R mutant, AAVrh8R R533A mutant, AAAV, BAAV, caprine AAV, bovine AAV, AAVhE1.1. AAVhEr1.5, AAVhER1.14, AAVhEr1.8, AAVhEr1.16, AAVhEr1.18, AAVhEr1.35, AAVhEr1.7, AAVhEr1.36, AAVhEr2.29, AAVhEr2.4, AAVhEr2.16, AAVhEr2.30, AAVhEr2.31, AAVhEr2.36, AAVhER1.23. AAVhEr3.1, AAV2.5T, AAV-PAEC, AAV-LK01, AAV-LK02, AAV-LK03, AAV-LK04, AAV-LK05, AAV-LK06, AAV-LK07, AAV-LK08, AAV-LK09, AAV-LK10, AAV-LK 11, AAV-LK12, AAV-LK13, AAV-LK14, AAV-LK15, AAV-LK16, AAV-LK17. AAV-LK18, AAV-LK19. AAV-PAEC2, AAV-PAEC4, AAV-PAEC6, AAV-PAEC7, AAV-PAEC8, AAV-PAEC11, AAV-PAEC12, AAV-2-pre-miRNA-101, AAV-8h, AAV-8b, AAV-h, AAV-b, AAV SM 10-2, AAV Shuffle 100-1, AAV Shuffle 100-3, AAV Shuffle 100-7, AAV Shuffle 10-2, AAV Shuffle 10-6, AAV Shuffle 10-8, AAV Shuffle 100-2. AAV SM 10-1, AAV SM 10-8, AAV SM 100-3, AAV SM 100-10, BNP61 AAV, BNP62 AAV, BNP63 AAV, AAVrh.50, AAVrh.43, AAVrh.62, AAVrh.48, AAVhu.19, AAVhu.11, AAVhu.53, AAV4-8/rh.64, AAVLG-9/hu.39, AAV54.5/hu.23, AAV54.2/hu.22, AAV54.7/hu.24, AAV54.1/hu.21, AAV54.4R/hu.27, AAV46.2/hu.28, AAV46.6/hu.29, AAV128.1/hu.43, true type AAV (ttAAV), UPENN AAV 10, Japanese AAV 10 serotypes, AAV CBr-7.1, AAV CBr-7.10, AAV CBr-7.2, AAV CBr-7.3, AAV CBr-7.4, AAV CBr-7.5, AAV CBr-7.7, AAV CBr-7.8, AAV CBr-B7.3, AAV CBr-B7.4, AAV CBr-E1, AAV CBr-E2, AAV CBr-E3, AAV CBr-E4, AAV CBr-E5, AAV CBr-e5, AAV CBr-E6, AAV CBr-E7, AAV CBr-E8, AAV CHt-1, AAV CHt-2, AAV CHt-3, AAV CHt-6.1, AAV CHt-6.10, AAV CHt-6.5, AAV CHt-6.6, AAV CHt-6.7, AAV CHt-6.8, AAV CHt-P1, AAV CHt-P2, AAV CHt-P5, AAV CHt-P6, AAV CHt-P8, AAV CHt-P9, AAV CKd-1, AAV CKd-10, AAV CKd-2. AAV CKd-3, AAV CKd-4, AAV CKd-6, AAV CKd-7. AAV CKd-8, AAV CKd-B1, AAV CKd-B2, AAV CKd-B3, AAV CKd-B4, AAV CKd-B5, AAV CKd-B6, AAV CKd-B7, AAV CKd-B8, AAV CKd-H1, AAV CKd-H2, AAV CKd-H3, AAV CKd-H4, AAV CKd-H5, AAV CKd-H6, AAV CKd-N3, AAV CKd-N4. AAV CKd-N9, AAV CLg-F1, AAV CLg-F2, AAV CLg-F3, AAV CLg-F4, AAV CLg-F5, AAV CLg-F6, AAV CLg-F7, AAV CLg-F8, AAV CLv-1, AAV CLv1-1, AAV Clv1-10, AAV CLv1-2, AAV CLv-12, AAV CLv1-3, AAV CLv-13, AAV CLv1-4, AAV Clv1-7, AAV Clv1-8, AAV Clv1-9, AAV CLv-2, AAV CLv-3, AAV CLv-4, AAV CLv-6, AAV CLv-8, AAV CLv-D1, AAV CLv-D2, AAV CLv-D3, AAV CLv-D4, AAV CLv-D5, AAV CLv-D6, AAV CLv-D7, AAV CLv-D8, AAV CLv-E1, AAV CLv-K1, AAV CLv-K3, AAV CLv-K6, AAV CLv-L4, AAV CLv-L5, AAV CLv-L6, AAV CLv-M1, AAV CLv-M11, AAV CLv-M2, AAV CLv-M5, AAV CLv-M6, AAV CLv-M7, AAV CLv-M8, AAV CLv-M9, AAV CLv-R1, AAV CLv-R2, AAV CLv-R3, AAV CLv-R4, AAV CLv-R5, AAV CLv-R6, AAV CLv-R7, AAV CLv-R8, AAV CLv-R9, AAV CSp-1, AAV CSp-10, AAV CSp-11, AAV CSp-2, AAV CSp-3, AAV CSp-4, AAV CSp-6, AAV CSp-7, AAV CSp-8, AAV CSp-8.10, AAV CSp-8.2, AAV CSp-8.4, AAV CSp-8.5, AAV CSp-8.6, AAV CSp-8.7, AAV CSp-8.8, AAV CSp-8.9, AAV CSp-9, AAV.hu.48R3, AAV.VR-355, AAV3B, AAV4, AAV5, AAVF1/HSC1, AAVF11/HSC11, AAVF12/HSC12, AAVF13/HSC13, AAVF14/HSC14, AAVF15/HSC15, AAVF16/HSC16, AAVF17/HSC17, AAVF2/HSC2, AAVF3/HSC3, AAVF4/HSC4, AAVF5/HSC5, AAVF6/HSC6, AAVF7/HSC7, AAVF8/HSC8, and/or AAVF9/HSC9 and variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Publication No. US20030138772, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV1 (SEQ ID NO: 6 and 64 of US20030138772), AAV2 (SEQ ID NO: 7 and 70 of US20030138772), AAV3 (SEQ ID NO: 8 and 71 of US20030138772), AAV4 (SEQ ID NO: 63 of US20030138772), AAV5 (SEQ ID NO: 114 of US20030138772), AAV6 (SEQ ID NO: 65 of US20030138772), AAV7 (SEQ ID NO: 1-3 of US20030138772), AAV8 (SEQ ID NO: 4 and 95 of US20030138772), AAV9 (SEQ ID NO: 5 and 100 of US20030138772), AAV10 (SEQ ID NO: 117 of US20030138772), AAV11 (SEQ ID NO: 118 of US20030138772), AAV12 (SEQ ID NO: 119 of US20030138772), AAVrh10 (amino acids 1 to 738 of SEQ ID NO: 81 of US20030138772), AAV16.3 (US20030138772 SEQ ID NO: 10), AAV29.3/bb.1 (US20030138772 SEQ ID NO: 11), AAV29.4 (US20030138772 SEQ ID NO: 12), AAV29.5/bb.2 (US20030138772 SEQ ID NO: 13), AAV1.3 (US20030138772 SEQ ID NO: 14), AAV13.3 (US20030138772 SEQ ID NO: 15), AAV24.1 (US20030138772 SEQ ID NO: 16), AAV27.3 (US20030138772 SEQ ID NO: 17), AAV7.2 (US20030138772 SEQ ID NO: 18), AAVC1 (US20030138772 SEQ ID NO: 19), AAVC3 (US20030138772 SEQ ID NO: 20), AAVC5 (US20030138772 SEQ ID NO: 21), AAVF1 (US20030138772 SEQ ID NO: 22), AAVF3 (US20030138772 SEQ ID NO: 23), AAVF5 (US20030138772 SEQ ID NO: 24), AAVH6 (US20030138772 SEQ ID NO: 25), AAVH2 (US20030138772 SEQ ID NO: 26), AAV42-8 (US20030138772 SEQ ID NO: 27), AAV42-15 (US20030138772 SEQ ID NO: 28), AAV42-5b (US20030138772 SEQ ID NO: 29), AAV42-1b (US20030138772 SEQ ID NO: 30), AAV42-13 (US20030138772 SEQ ID NO: 31), AAV42-3a (US20030138772 SEQ ID NO: 32), AAV42-4 (US20030138772 SEQ ID NO: 33), AAV42-5a (US20030138772 SEQ ID NO: 34), AAV42-10 (US20030138772 SEQ ID NO: 35), AAV42-3b (US20030138772 SEQ ID NO: 36), AAV42-11 (US20030138772 SEQ ID NO: 37), AAV42-6b (US20030138772 SEQ ID NO: 38), AAV43-1 (US20030138772 SEQ ID NO: 39), AAV43-5 (US20030138772 SEQ ID NO: 40), AAV43-12 (US20030138772 SEQ ID NO: 41), AAV43-20 (US20030138772 SEQ ID NO: 42), AAV43-21 (US20030138772 SEQ ID NO: 43), AAV43-23 (US20030138772 SEQ ID NO: 44), AAV43-25 (US20030138772 SEQ ID NO: 45), AAV44.1 (US20030138772 SEQ ID NO: 46), AAV44.5 (US20030138772 SEQ ID NO: 47), AAV223.1 (US20030138772 SEQ ID NO: 48), AAV223.2 (US20030138772 SEQ ID NO: 49), AAV223.4 (US20030138772 SEQ ID NO: 50), AAV223.5 (US20030138772 SEQ ID NO: 51), AAV223.6 (US20030138772 SEQ ID NO: 52). AAV223.7 (US20030138772 SEQ ID NO: 53), AAVA3.4 (US20030138772 SEQ ID NO: 54), AAVA3.5 (US20030138772 SEQ ID NO: 55), AAVA3.7 (US20030138772 SEQ ID NO: 56), AAVA3.3 (US20030138772 SEQ ID NO: 57), AAV42.12 (US20030138772 SEQ ID NO: 58). AAV44.2 (US20030138772 SEQ ID NO: 59), AAV42-2 (US20030138772 SEQ ID NO: 9), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Publication No. US20150159173, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV2 (SEQ ID NO: 7 and 23 of US20150159173), rh20 (SEQ ID NO: 1 of US20150159173), rh32/33 (SEQ ID NO: 2 of US20150159173), rh39 (SEQ ID NO: 3, 20 and 36 of US20150159173), rh46 (SEQ ID NO: 4 and 22 of US20150159173), rh73 (SEQ ID NO: 5 of US20150159173), rh74 (SEQ ID NO: 6 of US20150159173), AAV6.1 (SEQ ID NO: 29 of US20150159173), rh.8 (SEQ ID NO: 41 of US20150159173), rh.48.1 (SEQ ID NO: 44 of US20150159173), hu.44 (SEQ ID NO: 45 of US20150159173), hu.29 (SEQ ID NO: 42 of US20150159173), hu.48 (SEQ ID NO: 38 of US20150159173), rh54 (SEQ ID NO: 49 of US20150159173), AAV2 (SEQ ID NO: 7 of US20150159173), cy.5 (SEQ ID NO: 8 and 24 of US20150159173), rh.10 (SEQ ID NO: 9 and 25 of US20150159173), rh.13 (SEQ ID NO: 10 and 26 of US20150159173), AAV1 (SEQ ID NO: 11 and 27 of US20150159173), AAV3 (SEQ ID NO: 12 and 28 of US20150159173), AAV6 (SEQ ID NO: 13 and 29 of US20150159173), AAV7 (SEQ ID NO: 14 and 30 of US20150159173), AAV8 (SEQ ID NO: 15 and 31 of US20150159173), hu.13 (SEQ ID NO: 16 and 32 of US20150159173), hu.26 (SEQ ID NO: 17 and 33 of US20150159173), hu.37 (SEQ ID NO: 18 and 34 of US20150159173), hu.53 (SEQ ID NO: 19 and 35 of US20150159173), rh.43 (SEQ ID NO: 21 and 37 of US20150159173), rh2 (SEQ ID NO: 39 of US20150159173), rh.37 (SEQ ID NO: 40 of US20150159173), rh.64 (SEQ ID NO: 43 of US20150159173), rh.48 (SEQ ID NO: 44 of US20150159173), ch.5 (SEQ ID NO 46 of US20150159173), rh.67 (SEQ ID NO: 47 of US20150159173), rh.58 (SEQ ID NO: 48 of US20150159173), or variants thereof including, but not limited to Cy5R1, Cy5R2, Cy5R3, Cy5R4, rh.13R, rh.37R2, rh.2R, rh.8R, rh.48.1, rh.48.2, rh.48.1.2, hu.44R1, hu.44R2, hu.44R3, hu.29R, ch.5R1, rh64R1, rh64R2, AAV6.2, AAV6.1, AAV6.12, hu.48R1, hu.48R2, and hu.48R3.
In some embodiments, the AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 7,198,951, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV9 (SEQ ID NO: 1-3 of U.S. Pat. No. 7,198,951), AAV2 (SEQ ID NO: 4 of U.S. Pat. No. 7,198,951), AAV1 (SEQ ID NO: 5 of U.S. Pat. No. 7,198,951), AAV3 (SEQ ID NO: 6 of U.S. Pat. No. 7,198,951), and AAV8 (SEQ ID NO: 7 of U.S. Pat. No. 7,198,951).
In some embodiments, the AAV serotype may be, or have, a mutation in the AAV9 sequence as described by N Pulicherla et al. (Molecular Therapy 19(6):1070-1078 (2011), herein incorporated by reference in its entirety), such as but not limited to, AAV9.9. AAV9.11, AAV9.13, AAV9.16, AAV9.24, AAV9.45, AAV9.47, AAV9.61, AAV9.68, AAV9.84.
In some embodiments, the AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 6,156,303, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV3B (SEQ ID NO: 1 and 10 of U.S. Pat. No. 6,156,303), AAV6 (SEQ ID NO: 2, 7 and 11 of U.S. Pat. No. 6,156,303), AAV2 (SEQ ID NO: 3 and 8 of U.S. Pat. No. 6,156,303), AAV3A (SEQ ID NO: 4 and 9, of U.S. Pat. No. 6,156,303), or derivatives thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Publication No. US20140359799, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV8 (SEQ ID NO: 1 of US20140359799), AAVDJ (SEQ ID NO: 2 and 3 of US20140359799), or variants thereof.
In some embodiments, the serotype may be AAVDJ or a variant thereof, such as AAVDJ8 (or AAV-DJ8), as described by Grimm et al. (Journal of Virology 82(12): 5887-5911 (2008), herein incorporated by reference in its entirety). The amino acid sequence of AAVDJ8 may comprise two or more mutations in order to remove the heparin binding domain (HBD). As a non-limiting example, the AAV-DJ sequence described as SEQ ID NO: 1 in U.S. Pat. No. 7,588,772, the contents of which are herein incorporated by reference in their entirety, may comprise two mutations: (1) R587Q where arginine (R; Arg) at amino acid 587 is changed to glutamine (Q; Gln) and (2) R590T where arginine (R; Arg) at amino acid 590 is changed to threonine (T; Thr). As another non-limiting example, may comprise three mutations: (1) K406R where lysine (K; Lys) at amino acid 406 is changed to arginine (R; Arg), (2) R587Q where arginine (R; Arg) at amino acid 587 is changed to glutamine (Q; Gln) and (3) R590T where arginine (R; Arg) at amino acid 590 is changed to threonine (T; Thr).
In some embodiments, the AAV serotype may be, or have, a sequence of AAV4 as described in International Publication No. WO1998011244, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to AAV4 (SEQ ID NO: 1-20 of WO1998011244).
In some embodiments, the AAV serotype may be, or have, a mutation in the AAV2 sequence to generate AAV2G9 as described in International Publication No. WO2014144229 and herein incorporated by reference in its entirety.
In some embodiments, the AAV serotype may be, or have, a sequence as described in International Publication No. WO2005033321, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to AAV3-3 (SEQ ID NO: 217 of WO2005033321), AAV1 (SEQ ID NO: 219 and 202 of WO2005033321), AAV106.1/hu.37 (SEQ ID No: 10 of WO2005033321), AAV 114.3/hu.40 (SEQ ID No: 11 of WO2005033321), AAV127.2/hu.41 (SEQ ID NO:6 and 8 of WO2005033321), AAV128.3/hu.44 (SEQ ID No: 81 of WO2005033321), AAV130.4/hu.48 (SEQ ID NO: 78 of WO2005033321), AAV145.1/hu.53 (SEQ ID No: 176 and 177 of WO2005033321), AAV145.6/hu.56 (SEQ ID NO: 168 and 192 of WO2005033321), AAV16.12/hu.11 (SEQ ID NO: 153 and 57 of WO2005033321), AAV16.8/hu.10 (SEQ ID NO: 156 and 56 of WO2005033321), AAV161.10/hu.60 (SEQ ID No: 170 of WO2005033321), AAV161.6/hu.61 (SEQ ID No: 174 of WO2005033321), AAV1-7/rh.48 (SEQ ID NO: 32 of WO2005033321), AAV1-8/rh.49 (SEQ ID NOs: 103 and 25 of WO2005033321), AAV2 (SEQ ID NO: 211 and 221 of WO2005033321), AAV2-15/rh.62 (SEQ ID No: 33 and 114 of WO2005033321), AAV2-3/rh.61 (SEQ ID NO: 21 of WO2005033321), AAV2-4/rh.50 (SEQ ID No: 23 and 108 of WO2005033321), AAV2-5/rh.51 (SEQ ID NO: 104 and 22 of WO2005033321), AAV3.1/hu.6 (SEQ ID NO: 5 and 84 of WO2005033321), AAV3.1/hu.9 (SEQ ID NO: 155 and 58 of WO2005033321), AAV3-11/rh.53 (SEQ ID NO: 186 and 176 of WO2005033321), AAV3-3 (SEQ ID NO: 200 of WO2005033321), AAV33.12/hu.17 (SEQ ID NO:4 of WO2005033321), AAV33.4/hu.15 (SEQ ID No: 50 of WO2005033321), AAV33.8/hu.16 (SEQ ID No: 51 of WO2005033321), AAV3-9/rh.52 (SEQ ID NO: 96 and 18 of WO2005033321), AAV4-19/rh.55 (SEQ ID NO: 117 of WO2005033321), AAV4-4 (SEQ ID NO: 201 and 218 of WO2005033321), AAV4-9/rh.54 (SEQ ID NO: 116 of WO2005033321), AAV5 (SEQ ID NO: 199 and 216 of WO2005033321), AAV52.1/hu.20 (SEQ ID NO: 63 of WO2005033321), AAV52/hu.19 (SEQ ID NO: 133 of WO2005033321), AAV5-22/rh.58 (SEQ ID No: 27 of WO2005033321), AAV5-3/rh.57 (SEQ ID NO: 105 of WO2005033321), AAV5-3/rh.57 (SEQ ID No: 26 of WO2005033321), AAV58.2/hu.25 (SEQ ID No: 49 of WO2005033321), AAV6 (SEQ ID NO: 203 and 220 of WO2005033321), AAV7 (SEQ ID NO: 222 and 213 of WO2005033321), AAV7.3/hu.7 (SEQ ID No: 55 of WO2005033321), AAV8 (SEQ ID NO: 223 and 214 of WO2005033321), AAVH-1/hu.1 (SEQ ID No: 46 of WO2005033321), AAVH-5/hu.3 (SEQ ID No: 44 of WO2005033321), AAVhu.1 (SEQ ID NO: 144 of WO2005033321), AAVhu.10 (SEQ ID NO: 156 of WO2005033321), AAVhu.11 (SEQ ID NO: 153 of WO2005033321), AAVhu.12 (WO2005033321 SEQ ID NO: 59), AAVhu.13 (SEQ ID NO: 129 of WO2005033321), AAVhu.14/AAV9 (SEQ ID NO: 123 and 3 of WO2005033321), AAVhu.15 (SEQ ID NO: 147 of WO2005033321), AAVhu.16 (SEQ ID NO: 148 of WO2005033321), AAVhu.17 (SEQ ID NO: 83 of WO2005033321), AAVhu.18 (SEQ ID NO: 149 of WO2005033321), AAVhu.19 (SEQ ID NO: 133 of WO2005033321), AAVhu.2 (SEQ ID NO: 143 of WO2005033321), AAVhu.20 (SEQ ID NO: 134 of WO2005033321), AAVhu.21 (SEQ ID NO: 135 of WO2005033321), AAVhu.22 (SEQ ID NO: 138 of WO2005033321), AAVhu.23.2 (SEQ ID NO: 137 of WO2005033321), AAVhu.24 (SEQ ID NO: 136 of WO2005033321), AAVhu.25 (SEQ ID NO: 146 of WO2005033321), AAVhu.27 (SEQ ID NO: 140 of WO2005033321), AAVhu.29 (SEQ ID NO: 132 of WO2005033321), AAVhu.3 (SEQ ID NO: 145 of WO2005033321), AAVhu.31 (SEQ ID NO: 121 of WO2005033321), AAVhu.32 (SEQ ID NO: 122 of WO2005033321), AAVhu.34 (SEQ ID NO: 125 of WO2005033321), AAVhu.35 (SEQ ID NO: 164 of WO2005033321), AAVhu.37 (SEQ ID NO: 88 of WO2005033321), AAVhu.39 (SEQ ID NO: 102 of WO2005033321), AAVhu.4 (SEQ ID NO: 141 of WO2005033321), AAVhu.40 (SEQ ID NO: 87 of WO2005033321), AAVhu.41 (SEQ ID NO: 91 of WO2005033321), AAVhu.42 (SEQ ID NO: 85 of WO2005033321), AAVhu.43 (SEQ ID NO: 160 of WO2005033321), AAVhu.44 (SEQ ID NO: 144 of WO2005033321), AAVhu.45 (SEQ ID NO: 127 of WO2005033321), AAVhu.46 (SEQ ID NO: 159 of WO2005033321), AAVhu.47 (SEQ ID NO: 128 of WO2005033321), AAVhu.48 (SEQ ID NO: 157 of WO2005033321), AAVhu.49 (SEQ ID NO: 189 of WO2005033321), AAVhu.51 (SEQ ID NO: 190 of WO2005033321), AAVhu.52 (SEQ ID NO: 191 of WO2005033321), AAVhu.53 (SEQ ID NO: 186 of WO2005033321), AAVhu.54 (SEQ ID NO: 188 of WO2005033321), AAVhu.55 (SEQ ID NO: 187 of WO2005033321), AAVhu.56 (SEQ ID NO: 192 of WO2005033321), AAVhu.57 (SEQ ID NO: 193 of WO2005033321), AAVhu.58 (SEQ ID NO: 194 of WO2005033321), AAVhu.6 (SEQ ID NO: 84 of WO2005033321), AAVhu.60 (SEQ ID NO: 184 of WO2005033321), AAVhu.61 (SEQ ID NO: 185 of WO2005033321), AAVhu.63 (SEQ ID NO: 195 of WO2005033321), AAVhu.64 (SEQ ID NO: 1% of WO2005033321), AAVhu.66 (SEQ ID NO: 197 of WO2005033321), AAVhu.67 (SEQ ID NO: 198 of WO2005033321), AAVhu.7 (SEQ ID NO: 150 of WO2005033321), AAVhu.8 (WO2005033321 SEQ ID NO: 12), AAVhu.9 (SEQ ID NO: 155 of WO2005033321), AAVLG-10/rh.40 (SEQ ID No: 14 of WO2005033321), AAVLG4/rh.38 (SEQ ID NO: 86 of WO2005033321), AAVLG-4/rh.38 (SEQ ID No: 7 of WO2005033321), AAVN721-8/rh.43 (SEQ ID NO: 163 of WO2005033321), AAVN721-8/rh.43 (SEQ ID No: 43 of WO2005033321), AAVpi.1 (WO2005033321 SEQ ID NO: 28), AAVpi.2 (WO2005033321 SEQ ID NO: 30), AAVpi.3 (WO2005033321 SEQ ID NO: 29), AAVrh.38 (SEQ ID NO: 86 of WO2005033321), AAVrh.40 (SEQ ID NO: 92 of WO2005033321), AAVrh.43 (SEQ ID NO: 163 of WO2005033321), AAVrh.44 (WO2005033321 SEQ ID NO: 34), AAVrh.45 (WO2005033321 SEQ ID NO: 41), AAVrh.47 (WO2005033321 SEQ ID NO: 38), AAVrh.48 (SEQ ID NO: 115 of WO2005033321), AAVrh.49 (SEQ ID NO: 103 of WO2005033321), AAVrh.50 (SEQ ID NO: 108 of WO2005033321), AAVrh.51 (SEQ ID NO: 104 of WO2005033321), AAVrh.52 (SEQ ID NO: 96 of WO2005033321), AAVrh.53 (SEQ ID NO: 97 of WO2005033321), AAVrh.55 (WO2005033321 SEQ ID NO: 37), AAVrh.56 (SEQ ID NO: 152 of WO2005033321), AAVrh.57 (SEQ ID NO: 105 of WO2005033321), AAVrh.58 (SEQ ID NO: 106 of WO2005033321), AAVrh.59 (WO2005033321 SEQ ID NO: 42), AAVrh.60 (WO2005033321 SEQ ID NO: 31), AAVrh.61 (SEQ ID NO: 107 of WO2005033321), AAVrh.62 (SEQ ID NO: 114 of WO2005033321), AAVrh.64 (SEQ ID NO: 99 of WO2005033321), AAVrh.65 (WO2005033321 SEQ ID NO: 35), AAVrh.68 (WO2005033321 SEQ ID NO: 16), AAVrh.69 (WO2005033321 SEQ ID NO: 39), AAVrh.70 (WO2005033321 SEQ ID NO: 20), AAVrh.72 (WO2005033321 SEQ ID NO: 9), or variants thereof including, but not limited to, AAVcy.2, AAVcy.3, AAVcy.4, AAVcy.5, AAVcy.6, AAVrh.12, AAVrh.17, AAVrh.18, AAVrh.19, AAVrh.21, AAVrh.22, AAVrh.23, AAVrh.24, AAVrh.25, AAVrh.25/42 15, AAVrh.31, AAVrh.32, AAVrh.33, AAVrh.34, AAVrh.35, AAVrh.36, AAVrh.37, AAVrh14. Non limiting examples of variants include SEQ ID NO: 13, 15, 17, 19, 24, 36, 40, 45, 47, 48, 51-54, 60-62, 64-77, 79, 80, 82, 89, 90, 93-95, 98, 100, 101, 109-113, 118-120, 124, 126, 131, 139, 142, 151,154, 158, 161, 162, 165-183, 202, 204-212, 215, 219, 224-236, of WO2005033321, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV serotype may be, or have, a sequence as described in International Publication No. WO2015168666, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAVrh8R (SEQ ID NO: 9 of WO2015168666), AAVrh8R A586R mutant (SEQ ID NO: 10 of WO2015168666), AAVrh8R R533A mutant (SEQ ID NO: 11 of WO2015168666), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 9,233,131, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAVhE1.1 (SEQ ID NO:44 of U.S. Pat. No. 9,233,131), AAVhEr1.5 (SEQ ID NO:45 of U.S. Pat. No. 9,233,131), AAVhER1.14 (SEQ ID NO:46 of U.S. Pat. No. 9,233,131), AAVhEr1.8 (SEQ ID NO:47 of U.S. Pat. No. 9,233,131), AAVhEr1.16 (SEQ ID NO:48 of U.S. Pat. No. 9,233,131), AAVhEr1.18 (SEQ ID NO:49 of U.S. Pat. No. 9,233,131), AAVhEr1.35 (SEQ ID NO:50 of U.S. Pat. No. 9,233,131), AAVhEr1.7 (SEQ ID NO:51 of U.S. Pat. No. 9,233,131), AAVhEr1.36 (SEQ ID NO:52 of U.S. Pat. No. 9,233,131), AAVhEr2.29 (SEQ ID NO:53 of U.S. Pat. No. 9,233,131), AAVhEr2.4 (SEQ ID NO:54 of U.S. Pat. No. 9,233,131), AAVhEr2.16 (SEQ ID NO:55 of U.S. Pat. No. 9,233,131), AAVhEr2.30 (SEQ ID NO:56 of U.S. Pat. No. 9,233,131), AAVhEr2.31 (SEQ ID NO:58 of U.S. Pat. No. 9,233,131), AAVhEr2.36 (SEQ ID NO:57 of U.S. Pat. No. 9,233,131), AAVhER1.23 (SEQ ID NO:53 of U.S. Pat. No. 9,233,131), AAVhEr3.1 (SEQ ID NO:59 of U.S. Pat. No. 9,233,131), AAV2.5T (SEQ ID NO:42 of U.S. Pat. No. 9,233,131), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Patent Publication No. US20150376607, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV-PAEC (SEQ ID NO:1 of US20150376607), AAV-LK01 (SEQ ID NO:2 of US20150376607), AAV-LK02 (SEQ ID NO:3 of US20150376607), AAV-LK03 (SEQ ID NO:4 of US20150376607), AAV-LK04 (SEQ ID NO:5 of US20150376607), AAV-LK05 (SEQ ID NO:6 of US20150376607), AAV-LK06 (SEQ ID NO:7 of US20150376607), AAV-LK07 (SEQ ID NO:8 of US20150376607), AAV-LK08 (SEQ ID NO:9 of US20150376607), AAV-LK09 (SEQ ID NO:10 of US20150376607), AAV-LK10 (SEQ ID NO:11 of US20150376607), AAV-LK11 (SEQ ID NO:12 of US20150376607), AAV-LK12 (SEQ ID NO:13 of US20150376607), AAV-LK13 (SEQ ID NO:14 of US20150376607), AAV-LK14 (SEQ ID NO:15 of US20150376607), AAV-LK15 (SEQ ID NO:16 of US20150376607), AAV-LK16 (SEQ ID NO:17 of US20150376607), AAV-LK17 (SEQ ID NO:18 of US20150376607), AAV-LK18 (SEQ ID NO:19 of US20150376607), AAV-LK19 (SEQ ID NO:20 of US20150376607), AAV-PAEC2 (SEQ ID NO:21 of US20150376607), AAV-PAEC4 (SEQ ID NO:22 of US20150376607), AAV-PAEC6 (SEQ ID NO:23 of US20150376607), AAV-PAEC7 (SEQ ID NO:24 of US20150376607), AAV-PAEC8 (SEQ ID NO:25 of US20150376607), AAV-PAEC11 (SEQ ID NO:26 of US20150376607), AAV-PAEC12 (SEQ ID NO:27, of US20150376607), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 9,163,261, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV-2-pre-miRNA-101 (SEQ ID NO: 1 U.S. Pat. No. 9,163,261), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Patent Publication No. US20150376240, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV-8h (SEQ ID NO: 6 of US20150376240), AAV-8b (SEQ ID NO: 5 of US20150376240), AAV-h (SEQ ID NO: 2 of US20150376240), AAV-b (SEQ ID NO: 1 of US20150376240), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Patent Publication No. US20160017295, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV SM 10-2 (SEQ ID NO: 22 of US20160017295), AAV Shuffle 100-1 (SEQ ID NO: 23 of US20160017295), AAV Shuffle 100-3 (SEQ ID NO: 24 of US20160017295), AAV Shuffle 100-7 (SEQ ID NO: 25 of US20160017295), AAV Shuffle 10-2 (SEQ ID NO: 34 of US20160017295), AAV Shuffle 10-6 (SEQ ID NO: 35 of US20160017295), AAV Shuffle 10-8 (SEQ ID NO: 36 of US20160017295). AAV Shuffle 100-2 (SEQ ID NO: 37 of US20160017295), AAV SM 10-1 (SEQ ID NO: 38 of US20160017295), AAV SM 10-8 (SEQ ID NO: 39 of US20160017295), AAV SM 100-3 (SEQ ID NO: 40 of US20160017295), AAV SM 100-10 (SEQ ID NO: 41 of US20160017295), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in United States Patent Publication No. US20150238550, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, BNP61 AAV (SEQ ID NO: 1 of US20150238550), BNP62 AAV (SEQ ID NO: 3 of US20150238550), BNP63 AAV (SEQ ID NO: 4 of US20150238550), or variants thereof.
In some embodiments, the AAV serotype may be or may have a sequence as described in United States Patent Publication No. US20150315612, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAVrh.50 (SEQ ID NO: 108 of US20150315612), AAVrh.43 (SEQ ID NO: 163 of US20150315612), AAVrh.62 (SEQ ID NO: 114 of US20150315612), AAVrh.48 (SEQ ID NO: 115 of US20150315612), AAVhu.19 (SEQ ID NO: 133 of US20150315612), AAVhu.11 (SEQ ID NO: 153 of US20150315612), AAVhu.53 (SEQ ID NO: 186 of US20150315612), AAV4-8/rh.64 (SEQ ID No: 15 of US20150315612), AAVLG-9/hu.39 (SEQ ID No: 24 of US20150315612), AAV54.5/hu.23 (SEQ ID No: 60 of US20150315612), AAV54.2/hu.22 (SEQ ID No: 67 of US20150315612), AAV54.7/hu.24 (SEQ ID No: 66 of US20150315612), AAV54.1/hu.21 (SEQ ID No: 65 of US20150315612), AAV54.4R/hu.27 (SEQ ID No: 64 of US20150315612), AAV46.2/hu.28 (SEQ ID No: 68 of US20150315612), AAV46.6/hu.29 (SEQ ID No: 69 of US20150315612). AAV128.1/hu.43 (SEQ ID No: 80 of US20150315612), or variants thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in International Publication No. WO2015121501, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, true type AAV (ttAAV) (SEQ ID NO: 2 of WO2015121501), “UPenn AAV10” (SEQ ID NO: 8 of WO2015121501), “Japanese AAV10” (SEQ ID NO: 9 of WO2015121501), or variants thereof.
According to the present disclosure. AAV capsid serotype selection or use may be from a variety of species. In some embodiments, the AAV may be an avian AAV (AAAV). The AAAV serotype may be, or have, a sequence as described in U.S. Pat. No. 9,238,800, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAAV (SEQ ID NO: 1, 2, 4, 6, 8, 10, 12, and 14 of U.S. Pat. No. 9,238,800), or variants thereof.
In some embodiments, the AAV may be a bovine AAV (BAAV). The BAAV serotype may be, or have, a sequence as described in U.S. Pat. No. 9,193,769, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, BAAV (SEQ ID NO: 1 and 6 of U.S. Pat. No. 9,193,769), or variants thereof. The BAAV serotype may be or have a sequence as described in U.S. Pat. No. 7,427,396, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, BAAV (SEQ ID NO: 5 and 6 of U.S. Pat. No. 7,427,396), or variants thereof.
In some embodiments, the AAV may be a caprine AAV. The caprine AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 7,427,396, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, caprine AAV (SEQ ID NO: 3 of U.S. Pat. No. 7,427,396), or variants thereof.
In other embodiments the AAV may be engineered as a hybrid AAV from two or more parental serotypes. In some embodiments, the AAV may be AAV2G9 which comprises sequences from AAV2 and AAV9. The AAV2G9 AAV serotype may be, or have, a sequence as described in United States Patent Publication No. US20160017005, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the AAV may be a serotype generated by the AAV9 capsid library with mutations in amino acids 390-627 (VP1 numbering) as described by Pulicherla et al. (Molecular Therapy 19(6):1070-1078 (2011), the contents of which are herein incorporated by reference in their entirety. The serotype and corresponding nucleotide and amino acid substitutions may be, but is not limited to, AAV9.1 (G1594C; D532H). AAV6.2 (T1418A and T1436X; V473D and 1479K), AAV9.3 (T1238A; F413Y), AAV9.4 (T1250C and A1617T; F417S), AAV9.5 (A1235G, A1314T, A1642G, C1760T; Q412R, T548A, A587V), AAV9.6 (T1231A; F411I), AAV9.9 (G1203A, G1785T; W595C), AAV9.10 (A1500G. T1676C; M559T), AAV9.11 (A1425T, A1702C, A1769T; T568P, Q590L), AAV9.13 (A1369C, A1720T: N457H, T574S), AAV9.14 (T1340A, T1362C, T1560C, G1713A; L447H), AAV9.16 (A1775T: Q592L), AAV9.24 (T1507C, T1521G; W503R), AAV9.26 (A1337G, A1769C; Y446C, Q590P), AAV9.33 (A1667C; D556A), AAV9.34 (A1534G, C1794T; N512D), AAV9.35 (A1289T, T1450A, C1494T, A1515T, C1794A, G1816A; Q430L, Y484N, N98K, V6061), AAV9.40 (A1694T. E565V), AAV9.41 (A1348T, T1362C; T450S), AAV9.44 (A1684C, A1701T, A1737G; N562H, K567N), AAV9.45 (A1492T, C1804T; N498Y, L602F), AAV9.46 (G1441C, T1525C, T1549G; G481R, W509R, L517V), 9.47 (G1241A, G1358A, A1669G, C1745T; S414N, G453D, K557E, T5821), AAV9.48 (C1445T, A1736T; P482L, Q579L), AAV9.50 (A1638T, C1683T, T1805A; Q546H, L602H), AAV9.53 (G1301A, A1405C, C1664T, G1811T; R134Q, S469R, A555V, G604V), AAV9.54 (C1531A, T1609A; L5111, L537M), AAV9.55 (T1605A; F535L), AAV9.58 (C1475T, C1579A; T4921, H527N), AAV.59 (T1336C; Y446H), AAV9.61 (A1493T; N4981), AAV9.64 (C1531A, A1617T; L5111), AAV9.65 (C1335T, T1530C, C1568A; A523D), AAV9.68 (C1510A; P504T), AAV9.80 (G1441A; G481R), AAV9.83 (C1402A, A1500T; P468T, E500D), AAV9.87 (T1464C, T1468C; S490P), AAV9.90 (A1196T; Y399F), AAV9.91 (T1316G, A1583T, C1782G, T1806C; L439R, K5281), AAV9.93 (A1273G, A1421G, A1638C, C1712T, G1732A, A1744T, A1832T; S425G, Q474R, Q546H, P571L, G578R. T582S, D611V), AAV9.94 (A1675T; M559L) and AAV9.95 (T1605A; F535L).
In some embodiments, the AAV serotype may be, or have, a sequence as described in International Publication No. WO2016049230, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to AAVF1/HSC1 (SEQ ID NO: 2 and 20 of WO2016049230), AAVF2/HSC2 (SEQ ID NO: 3 and 21 of WO2016049230), AAVF3/HSC3 (SEQ ID NO: 5 and 22 of WO2016049230), AAVF4/HSC4 (SEQ ID NO: 6 and 23 of WO2016049230), AAVF5/HSC5 (SEQ ID NO: 11 and 25 of WO2016049230), AAVF6/HSC6 (SEQ ID NO: 7 and 24 of WO2016049230), AAVF7/HSC7 (SEQ ID NO: 8 and 27 of WO2016049230), AAVF8/HSC8 (SEQ ID NO: 9 and 28 of WO2016049230), AAVF9/HSC9 (SEQ ID NO: 10 and 29 of WO2016049230), AAVF11/HSC11 (SEQ ID NO: 4 and 26 of WO2016049230), AAVF12/HSC12 (SEQ ID NO: 12 and 30 of WO2016049230), AAVF13/HSC13 (SEQ ID NO: 14 and 31 of WO2016049230), AAVF14/HSC14 (SEQ ID NO: 15 and 32 of WO2016049230), AAVF15/HSC15 (SEQ ID NO: 16 and 33 of WO2016049230), AAVF16/HSC16 (SEQ ID NO: 17 and 34 of WO2016049230), AAVF17/HSC17 (SEQ ID NO: 13 and 35 of WO2016049230), or variants or derivatives thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in U.S. Pat. No. 8,734,809, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV CBr-E1 (SEQ ID NO: 13 and 87 of U.S. Pat. No. 8,734,809), AAV CBr-E2 (SEQ ID NO: 14 and 88 of U.S. Pat. No. 8,734,809), AAV CBr-E3 (SEQ ID NO: 15 and 89 of U.S. Pat. No. 8,734,809), AAV CBr-E4 (SEQ ID NO: 16 and 90 of U.S. Pat. No. 8,734,809), AAV CBr-E5 (SEQ ID NO: 17 and 91 of U.S. Pat. No. 8,734,809), AAV CBr-c5 (SEQ ID NO: 18 and 92 of U.S. Pat. No. 8,734,809), AAV CBr-E6 (SEQ ID NO: 19 and 93 of U.S. Pat. No. 8,734,809), AAV CBr-E7 (SEQ ID NO: 20 and 94 of U.S. Pat. No. 8,734,809), AAV CBr-E8 (SEQ ID NO: 21 and 95 of U.S. Pat. No. 8,734,809), AAV CLv-D1 (SEQ ID NO: 22 and 96 of U.S. Pat. No. 8,734,809), AAV CLv-D2 (SEQ ID NO: 23 and 97 of U.S. Pat. No. 8,734,809), AAV CLv-D3 (SEQ ID NO: 24 and 98 of U.S. Pat. No. 8,734,809), AAV CLv-D4 (SEQ ID NO: 25 and 99 of U.S. Pat. No. 8,734,809), AAV CLv-D5 (SEQ ID NO: 26 and 100 of U.S. Pat. No. 8,734,809), AAV CLv-D6 (SEQ ID NO: 27 and 101 of U.S. Pat. No. 8,734,809), AAV CLv-D7 (SEQ ID NO: 28 and 102 of U.S. Pat. No. 8,734,809), AAV CLv-D8 (SEQ ID NO: 29 and 103 of U.S. Pat. No. 8,734,809), AAV CLv-E1 (SEQ ID NO: 13 and 87 of U.S. Pat. No. 8,734,809), AAV CLv-R1 (SEQ ID NO: 30 and 104 of U.S. Pat. No. 8,734,809), AAV CLv-R2 (SEQ ID NO: 31 and 105 of U.S. Pat. No. 8,734,809), AAV CLv-R3 (SEQ ID NO: 32 and 106 of U.S. Pat. No. 8,734,809), AAV CLv-R4 (SEQ ID NO: 33 and 107 of U.S. Pat. No. 8,734,809), AAV CLv-R5 (SEQ ID NO: 34 and 108 of U.S. Pat. No. 8,734,809), AAV CLv-R6 (SEQ ID NO: 35 and 109 of U.S. Pat. No. 8,734,809), AAV CLv-R7 (SEQ ID NO: 36 and 110 of U.S. Pat. No. 8,734,809), AAV CLv-R8 (SEQ ID NO: X and X of U.S. Pat. No. 8,734,809), AAV CLv-R9 (SEQ ID NO: X and X of US8734809), AAV CLg-F1 (SEQ ID NO: 39 and 113 of U.S. Pat. No. 8,734,809), AAV CLg-F2 (SEQ ID NO: 40 and 114 of U.S. Pat. No. 8,734,809), AAV CLg-F3 (SEQ ID NO: 41 and 115 of U.S. Pat. No. 8,734,809), AAV CLg-F4 (SEQ ID NO: 42 and 116 of U.S. Pat. No. 8,734,809), AAV CLg-F5 (SEQ ID NO: 43 and 117 of U.S. Pat. No. 8,734,809), AAV CLg-F6 (SEQ ID NO: 43 and 117 of U.S. Pat. No. 8,734,809), AAV CLg-F7 (SEQ ID NO: 44 and 118 of U.S. Pat. No. 8,734,809), AAV CLg-F8 (SEQ ID NO: 43 and 117 of U.S. Pat. No. 8,734,809), AAV CSp-1 (SEQ ID NO: 45 and 119 of U.S. Pat. No. 8,734,809), AAV CSp-10 (SEQ ID NO: 46 and 120 of U.S. Pat. No. 8,734,809), AAV CSp-11 (SEQ ID NO: 47 and 121 of U.S. Pat. No. 8,734,809), AAV CSp-2 (SEQ ID NO: 48 and 122 of U.S. Pat. No. 8,734,809), AAV CSp-3 (SEQ ID NO: 49 and 123 of U.S. Pat. No. 8,734,809), AAV CSp-4 (SEQ ID NO: 50 and 124 of U.S. Pat. No. 8,734,809), AAV CSp-6 (SEQ ID NO: 51 and 125 of U.S. Pat. No. 8,734,809), AAV CSp-7 (SEQ ID NO: 52 and 126 of U.S. Pat. No. 8,734,809), AAV CSp-8 (SEQ ID NO: 53 and 127 of U.S. Pat. No. 8,734,809), AAV CSp-9 (SEQ ID NO: 54 and 128 of U.S. Pat. No. 8,734,809), AAV CHt-2 (SEQ ID NO: 55 and 129 of U.S. Pat. No. 8,734,809), AAV CHt-3 (SEQ ID NO: 56 and 130 of U.S. Pat. No. 8,734,809), AAV CKd-1 (SEQ ID NO: 57 and 131 of U.S. Pat. No. 8,734,809), AAV CKd-10 (SEQ ID NO: 58 and 132 of U.S. Pat. No. 8,734,809), AAV CKd-2 (SEQ ID NO: 59 and 133 of U.S. Pat. No. 8,734,809), AAV CKd-3 (SEQ ID NO: 60 and 134 of U.S. Pat. No. 8,734,809), AAV CKd-4 (SEQ ID NO: 61 and 135 of U.S. Pat. No. 8,734,809), AAV CKd-6 (SEQ ID NO: 62 and 136 of U.S. Pat. No. 8,734,809), AAV CKd-7 (SEQ ID NO: 63 and 137 of U.S. Pat. No. 8,734,809), AAV CKd-8 (SEQ ID NO: 64 and 138 of U.S. Pat. No. 8,734,809), AAV CLv-1 (SEQ ID NO: 35 and 139 of U.S. Pat. No. 8,734,809), AAV CLv-12 (SEQ ID NO: 66 and 140 of U.S. Pat. No. 8,734,809), AAV CLv-13 (SEQ ID NO: 67 and 141 of U.S. Pat. No. 8,734,809), AAV CLv-2 (SEQ ID NO: 68 and 142 of U.S. Pat. No. 8,734,809), AAV CLv-3 (SEQ ID NO: 69 and 143 of U.S. Pat. No. 8,734,809), AAV CLv-4 (SEQ ID NO: 70 and 144 of U.S. Pat. No. 8,734,809), AAV CLv-6 (SEQ ID NO: 71 and 145 of U.S. Pat. No. 8,734,809), AAV CLv-8 (SEQ ID NO: 72 and 146 of U.S. Pat. No. 8,734,809), AAV CKd-B1 (SEQ ID NO: 73 and 147 of U.S. Pat. No. 8,734,809), AAV CKd-B2 (SEQ ID NO: 74 and 148 of U.S. Pat. No. 8,734,809), AAV CKd-B3 (SEQ ID NO: 75 and 149 of U.S. Pat. No. 8,734,809), AAV CKd-B4 (SEQ ID NO: 76 and 150 of U.S. Pat. No. 8,734,809), AAV CKd-B5 (SEQ ID NO: 77 and 151 of U.S. Pat. No. 8,734,809), AAV CKd-B6 (SEQ ID NO: 78 and 152 of U.S. Pat. No. 8,734,809), AAV CKd-B7 (SEQ ID NO: 79 and 153 of U.S. Pat. No. 8,734,809), AAV CKd-B8 (SEQ ID NO: 80 and 154 of U.S. Pat. No. 8,734,809), AAV CKd-H1 (SEQ ID NO: 81 and 155 of U.S. Pat. No. 8,734,809), AAV CKd-H2 (SEQ ID NO: 82 and 156 of U.S. Pat. No. 8,734,809), AAV CKd-H3 (SEQ ID NO: 83 and 157 of U.S. Pat. No. 8,734,809), AAV CKd-H4 (SEQ ID NO: 84 and 158 of U.S. Pat. No. 8,734,809), AAV CKd-H5 (SEQ ID NO: 85 and 159 of U.S. Pat. No. 8,734,809), AAV CKd-H6 (SEQ ID NO: 77 and 151 of U.S. Pat. No. 8,734,809), AAV CHt-1 (SEQ ID NO: 86 and 160 of U.S. Pat. No. 8,734,809), AAV CLv1-1 (SEQ ID NO: 171 of U.S. Pat. No. 8,734,809), AAV CLv1-2 (SEQ ID NO: 172 of U.S. Pat. No. 8,734,809), AAV CLv1-3 (SEQ ID NO: 173 of U.S. Pat. No. 8,734,809), AAV CLv1-4 (SEQ ID NO: 174 of U.S. Pat. No. 8,734,809), AAV Clv1-7 (SEQ ID NO: 175 of US8734809), AAV Clv1-8 (SEQ ID NO: 176 of U.S. Pat. No. 8,734,809), AAV Clv1-9 (SEQ ID NO: 177 of U.S. Pat. No. 8,734,809), AAV Clv1-10 (SEQ ID NO: 178 of U.S. Pat. No. 8,734,809), AAV.VR-355 (SEQ ID NO: 181 of U.S. Pat. No. 8,734,809), AAV.hu.48R3 (SEQ ID NO: 183 of U.S. Pat. No. 8,734,809), or variants or derivatives thereof.
In some embodiments, the AAV serotype may be, or have, a sequence as described in International Publication No. WO2016065001, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to AAV CHt-P2 (SEQ ID NO: 1 and 51 of WO2016065001), AAV CHt-P5 (SEQ ID NO: 2 and 52 of WO2016065001), AAV CHt-P9 (SEQ ID NO: 3 and 53 of WO2016065001), AAV CBr-7.1 (SEQ ID NO: 4 and 54 of WO2016065001), AAV CBr-7.2 (SEQ ID NO: 5 and 55 of WO2016065001), AAV CBr-7.3 (SEQ ID NO: 6 and 56 of WO2016065001), AAV CBr-7.4 (SEQ ID NO: 7 and 57 of WO2016065001), AAV CBr-7.5 (SEQ ID NO: 8 and 58 of WO2016065001), AAV CBr-7.7 (SEQ ID NO: 9 and 59 of WO2016065001), AAV CBr-7.8 (SEQ ID NO: 10 and 60 of WO2016065001), AAV CBr-7.10 (SEQ ID NO: 11 and 61 of WO2016065001), AAV CKd-N3 (SEQ ID NO: 12 and 62 of WO2016065001), AAV CKd-N4 (SEQ ID NO: 13 and 63 of WO2016065001), AAV CKd-N9 (SEQ ID NO: 14 and 64 of WO2016065001), AAV CLv-L4 (SEQ ID NO: 15 and 65 of WO2016065001), AAV CLv-L5 (SEQ ID NO: 16 and 66 of WO2016065001), AAV CLv-L6 (SEQ ID NO: 17 and 67 of WO2016065001), AAV CLv-K1 (SEQ ID NO: 18 and 68 of WO2016065001), AAV CLv-K3 (SEQ ID NO: 19 and 69 of WO2016065001), AAV CLv-K6 (SEQ ID NO: 20 and 70 of WO2016065001), AAV CLv-M1 (SEQ ID NO: 21 and 71 of WO2016065001), AAV CLv-M11 (SEQ ID NO: 22 and 72 of WO2016065001), AAV CLv-M2 (SEQ ID NO: 23 and 73 of WO2016065001), AAV CLv-M5 (SEQ ID NO: 24 and 74 of WO2016065001), AAV CLv-M6 (SEQ ID NO: 25 and 75 of WO2016065001), AAV CLv-M7 (SEQ ID NO: 26 and 76 of WO2016065001), AAV CLv-M8 (SEQ ID NO: 27 and 77 of WO2016065001), AAV CLv-M9 (SEQ ID NO: 28 and 78 of WO2016065001), AAV CHt-P1 (SEQ ID NO: 29 and 79 of WO2016065001), AAV CHt-P6 (SEQ ID NO: 30 and 80 of WO2016065001), AAV CHt-P8 (SEQ ID NO: 31 and 81 of WO2016065001), AAV CHt-6.1 (SEQ ID NO: 32 and 82 of WO2016065001), AAV CHt-6.10 (SEQ ID NO: 33 and 83 of WO2016065001), AAV CHt-6.5 (SEQ ID NO: 34 and 84 of WO2016065001), AAV CHt-6.6 (SEQ ID NO: 35 and 85 of WO2016065001), AAV CHt-6.7 (SEQ ID NO: 36 and 86 of WO2016065001), AAV CHt-6.8 (SEQ ID NO: 37 and 87 of WO2016065001), AAV CSp-8.10 (SEQ ID NO: 38 and 88 of WO2016065001), AAV CSp-8.2 (SEQ ID NO: 39 and 89 of WO2016065001), AAV CSp-8.4 (SEQ ID NO: 40 and 90 of WO2016065001), AAV CSp-8.5 (SEQ ID NO: 41 and 91 of WO2016065001), AAV CSp-8.6 (SEQ ID NO: 42 and 92 of WO2016065001), AAV CSp-8.7 (SEQ ID NO: 43 and 93 of WO2016065001), AAV CSp-8.8 (SEQ ID NO: 44 and 94 of WO2016065001), AAV CSp-8.9 (SEQ ID NO: 45 and 95 of WO2016065001), AAV CBr-B7.3 (SEQ ID NO: 46 and 96 of WO2016065001), AAV CBr-B7.4 (SEQ ID NO: 47 and 97 of WO2016065001), AAV3B (SEQ ID NO: 48 and 98 of WO2016065001), AAV4 (SEQ ID NO: 49 and 99 of WO2016065001), AAV5 (SEQ ID NO: 50 and 100 of WO2016065001), or variants or derivatives thereof.
In some embodiments, the AAV particle and/or VAP may have, or may be a serotype selected from any of those found in Table 1.
In some embodiments, the AAV capsid may comprise a sequence, fragment or variant thereof, of any of the sequences in Table 1.
In some embodiments, the AAV capsid may be encoded by a sequence, fragment or variant as described in Table 1.
In any of the DNA and RNA sequences referenced and/or described herein, the single letter symbol has the following description: A for adenine; C for cytosine; G for guanine; T for thymine; U for Uracil; W for weak bases such as adenine or thymine; S for strong nucleotides such as cytosine and guanine; M for amino nucleotides such as adenine and cytosine; K for keto nucleotides such as guanine and thymine; R for purines adenine and guanine; Y for pyrimidine cytosine and thymine, B for any base that is not A (e.g., cytosine, guanine, and thy-mine); D for any base that is not C (e.g., adenine, guanine and thymine): H for any base that is not G (e.g., adenine, cytosine, and thymine); V for any base that is not T (e.g., adenine, cytosine, and guanine); N for any nucleotide (which is not a gap); and Z is for zero.
In any of the amino acid sequences referenced and/or described herein, the single letter symbol has the following description: G (Gly) for Glycine; A (Ala) for Alanine: L (Leu) for Leucine; M (Met) for Methionine; F (Phe) for Phenylalanine, W (Trp) for Tryptophan: K (Lys) for Lysine; Q (Gin) for Glutamine; E (Glu) for Glutamic Acid, S (Ser) for Serine; P (Pro) for Proline; V (Val) for Valine I (Ile) for Isoleucine C (Cys) for Cysteine; Y (Tyr) for Tyrosine H (His) for Histidine: R (Arg) for Arginine N (Asn) for Asparagine: D (Asp) for Aspartic Acid: T (Thr) for Threonine; B (Asx) for Aspartic acid or Asparagine; J (Xle) for Leucine or Isoleucine; O (Pyl) for Pyrrolysine; U (Sec) for Selenocysteine; X (Xaa) for any amino acid, and Z (Glx) for Glutamine or Glutamic acid.
In some embodiments, the AAV serotype may be, or may have a sequence as described in International Patent Publication WO2015038958, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV9 (SEQ ID NO: 2 and 11 of WO2015038958 or SEQ ID NO: 137 and 138 respectively herein), PHP.B (SEQ ID NO: 8 and 9 of WO2015038958, herein SEQ ID NO: 5 and 6), G2B-13 (SEQ ID NO: 12 of WO2015038958, herein SEQ ID NO: 7), G2B-26 (SEQ ID NO: 13 of WO2015038958, herein SEQ ID NO: 5), TH1.1-32 (SEQ ID NO: 14 of WO2015038958, herein SEQ ID NO: 8), TH1.1-35 (SEQ ID NO: 15 of WO2015038958, herein SEQ ID NO: 9) or variants thereof. Further, any of the targeting peptides or amino acid inserts described in WO2015038958, may be inserted into any parent AAV serotype, such as, but not limited to, AAV9 (SEQ ID NO: 137 for the DNA sequence and SEQ ID NO: 138 for the amino acid sequence). In some embodiments, the amino acid insert is inserted between amino acids 586-592 of the parent AAV (e.g., AAV9). In another embodiment, the amino acid insert is inserted between amino acids 588-589 of the parent AAV sequence. The amino acid insert may be, but is not limited to, any of the following amino acid sequences, TLAVPFK (SEQ ID NO: 1 of WO2015038958; herein SEQ ID NO: 1262), KFPVALT (SEQ ID NO: 3 of WO2015038958; herein SEQ ID NO: 1263), LAVPFK (SEQ ID NO: 31 of WO2015038958; herein SEQ ID NO: 1264), AVPFK (SEQ ID NO: 32 of WO2015038958; herein SEQ ID NO: 1265), VPFK (SEQ ID NO: 33 of WO2015038958; herein SEQ ID NO: 1266), TLAVPF (SEQ ID NO: 34 of WO2015038958; herein SEQ ID NO: 1267), TLAVP (SEQ ID NO: 35 of WO2015038958; herein SEQ ID NO: 1268), TLAV (SEQ ID NO: 36 of WO2015038958; herein SEQ ID NO: 1269), SVSKPFL (SEQ ID NO: 28 of WO2015038958; herein SEQ ID NO: 1270), FTLTTPK (SEQ ID NO: 29 of WO2015038958; herein SEQ ID NO: 1271), MNATKNV (SEQ ID NO: 30 of WO2015038958; herein SEQ ID NO: 1272), QSSQTPR (SEQ ID NO: 54 of WO2015038958; herein SEQ ID NO: 1273), ILGTGTS (SEQ ID NO: 55 of WO2015038958; herein SEQ ID NO: 1274), TRTNPEA (SEQ ID NO: 56 of WO2015038958; herein SEQ ID NO: 1275), NGGTSSS (SEQ ID NO: 58 of WO2015038958; herein SEQ ID NO: 1276), or YTLSQGW (SEQ ID NO: 60 of WO2015038958; herein SEQ ID NO: 1277). Non-limiting examples of nucleotide sequences that may encode the amino acid inserts include the following, AAGTTTCCTGTGGCGTTGACT (for SEQ ID NO: 3 of WO2015038958; herein SEQ ID NO: 1278), ACTTGGCGGTGCCTTTTAAG (SEQ ID NO: 24 and 49 of WO2015038958; herein SEQ ID NO: 1279), AGTGTGAGTAAGCCTTTTG (SEQ ID NO: 25 of WO2015038958; herein SEQ ID NO: 1280), TTACGTTGACGACGCCTAAG (SEQ ID NO: 26 of WO2015038958; herein SEQ ID NO: 1281), ATGAATGCTACGAAGAATGTG (SEQ ID NO: 27 of WO2015038958; herein SEQ ID NO: 1282), CAGTCGTCGCAGACGCCTAGG (SEQ ID NO: 48 of WO2015038958; herein SEQ ID NO: 1283), ATTCTGGGGACTGGTACTTCG (SEQ ID NO: 50 and 52 of WO2015038958; herein SEQ ID NO: 1284), ACGCGGACTAATCCTGAGGCT (SEQ ID NO: 51 of WO2015038958; herein SEQ ID NO: 1285), AATGGGGGGACTAGTAGTTCT (SEQ ID NO: 53 of WO2015038958; herein SEQ ID NO: 1286), or TATACTTTGTCGCAGGGTTGG (SEQ ID NO: 59 of WO2015038958; herein SEQ ID NO: 1287).
In some embodiments, the AAV serotype may be, or may have a sequence as described in International Patent Publication WO2017100671, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV9 (SEQ ID NO: 45 of WO2017100671, herein SEQ ID NO: 11), PHP.N (SEQ ID NO: 46 of WO2017100671, herein SEQ ID NO: 4), PHP.S (SEQ ID NO: 47 of WO2017100671, herein SEQ ID NO: 10), or variants thereof. Further, any of the targeting peptides or amino acid inserts described in WO2017100671 may be inserted into any parent AAV serotype, such as, but not limited to, AAV9. In some embodiments, the amino acid insert is inserted between amino acids 586-592 of the parent AAV (e.g., AAV9). In another embodiment, the amino acid insert is inserted between amino acids 588-589 of the parent AAV sequence. The amino acid insert may be, but is not limited to, any of the following amino acid sequences. AQTLAVPFKAQ (SEQ ID NO: 1 of WO2017100671; herein SEQ ID NO: 1288), AQSVSKPFLAQ (SEQ ID NO: 2 of WO2017100671; herein SEQ ID NO: 1289), AQFTLTTPKAQ (SEQ ID NO: 3 in the sequence listing of WO2017100671; herein SEQ ID NO: 1290), DGTLAVPFKAQ (SEQ ID NO: 4 in the sequence listing of WO2017100671; herein SEQ ID NO: 1291), ESTLAVPFKAQ (SEQ ID NO: 5 of WO2017100671; herein SEQ ID NO: 1292), GGTLAVPFKAQ (SEQ ID NO: 6 of WO2017100671; herein SEQ ID NO: 1293), AQTLATPFKAQ (SEQ ID NO: 7 and 33 of WO2017100671; herein SEQ ID NO: 1294), ATTLATPFKAQ (SEQ ID NO: 8 of WO2017100671; herein SEQ ID NO: 1295), DGTLATPFKAQ (SEQ ID NO: 9 of WO2017100671; herein SEQ ID NO: 1296), GGTLATPFKAQ (SEQ ID NO: 10 of WO2017100671; herein SEQ ID NO: 1297), SGSLAVPFKAQ (SEQ ID NO: 11 of WO2017100671; herein SEQ ID NO: 1298), AQTLAQPFKAQ (SEQ ID NO: 12 of WO2017100671; herein SEQ ID NO: 1299), AQTLQQPFKAQ (SEQ ID NO: 13 of WO2017100671; herein SEQ ID NO: 1300), AQTLSNPFKAQ (SEQ ID NO: 14 of WO2017100671; herein SEQ ID NO: 1301), AQTLAVPFSNP (SEQ ID NO: 15 of WO2017100671; herein SEQ ID NO: 1302), QGTLAVPFKAQ (SEQ ID NO: 16 of WO2017100671; herein SEQ ID NO: 1303), NQTLAVPFKAQ (SEQ ID NO: 17 of WO2017100671; herein SEQ ID NO: 1304), EGSLAVPFKAQ (SEQ ID NO: 18 of WO2017100671; herein SEQ ID NO: 1305), SGNLAVPFKAQ (SEQ ID NO: 19 of WO2017100671; herein SEQ ID NO: 1306), EGTLAVPFKAQ (SEQ ID NO: 20 of WO2017100671; herein SEQ ID NO: 1307), DSTLAVPFKAQ (SEQ ID NO: 21 in Table 1 of WO2017100671; herein SEQ ID NO: 1308), AVTLAVPFKAQ (SEQ ID NO: 22 of WO2017100671; herein SEQ ID NO: 1309), AQTLSTPFKAQ (SEQ ID NO: 23 of WO2017100671; herein SEQ ID NO: 1310), AQTLPQPFKAQ (SEQ ID NO: 24 and 32 of WO2017100671; herein SEQ ID NO: 1311), AQTLSQPFKAQ (SEQ ID NO: 25 of WO2017100671; herein SEQ ID NO: 1312), AQTLQLPFKAQ (SEQ ID NO: 26 of WO2017100671; herein SEQ ID NO: 1313), AQTLTMPFKAQ (SEQ ID NO: 27, and 34 of WO2017100671 and SEQ ID NO: 35 in the sequence listing of WO2017100671; herein SEQ ID NO: 1314), AQTLTTPFKAQ (SEQ ID NO: 28 of WO2017100671; herein SEQ ID NO: 1315), AQYTLSQGWAQ (SEQ ID NO: 29 of WO2017100671; herein SEQ ID NO: 1316), AQMNATKNVAQ (SEQ ID NO: 30 of WO2017100671; herein SEQ ID NO: 1317), AQVSGGHHSAQ (SEQ ID NO: 31 of WO2017100671; herein SEQ ID NO: 1318), AQTLTAPFKAQ (SEQ ID NO: 35 in Table 1 of WO2017100671; herein SEQ ID NO: 1319), AQTLSKPFKAQ (SEQ ID NO: 36 of WO2017100671; herein SEQ ID NO: 1320), QAVRTSL (SEQ ID NO: 37 of WO2017100671; herein SEQ ID NO: 1321), YTLSQGW (SEQ ID NO: 38 of WO2017100671; herein SEQ ID NO: 1277), LAKERLS (SEQ ID NO: 39 of WO2017100671; herein SEQ ID NO: 1322), TLAVPFK (SEQ ID NO: 40 in the sequence listing of WO2017100671; herein SEQ ID NO: 1262), SVSKPFL (SEQ ID NO: 41 of WO2017100671; herein SEQ ID NO: 1270), FTLTTPK (SEQ ID NO: 42 of WO2017100671; herein SEQ ID NO: 1271), MNSTKNV (SEQ ID NO: 43 of WO2017100671; herein SEQ ID NO: 1323), VSGGHHS (SEQ ID NO: 44 of WO2017100671; herein SEQ ID NO: 1324), SAQTLAVPFKAQAQ (SEQ ID NO: 48 of WO2017100671; herein SEQ ID NO: 1325), SXXXLAVPFKAQAQ (SEQ ID NO: 49 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1326), SAQXXXVPFKAQAQ (SEQ ID NO: 50 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1327), SAQTLXXXFKAQAQ (SEQ ID NO: 51 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1328), SAQTLAVXXXAQAQ (SEQ ID NO: 52 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1329), SAQTLAVPFXXXAQ (SEQ ID NO: 53 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1330), TNHQSAQ (SEQ ID NO: 65 of WO2017100671; herein SEQ ID NO: 1331), AQAQTGW (SEQ ID NO: 66 of WO2017100671; herein SEQ ID NO: 1332), DGTLATPFK (SEQ ID NO: 67 of WO2017100671; herein SEQ ID NO: 1333), DGTLATPFKXX (SEQ ID NO: 68 of WO2017100671 wherein X may be any amino acid; herein SEQ ID NO: 1334), LAVPFKAQ (SEQ ID NO: 80 of WO2017100671; herein SEQ ID NO: 1335), VPFKAQ (SEQ ID NO: 81 of WO2017100671; herein SEQ ID NO: 1336), FKAQ (SEQ ID NO: 82 of WO2017100671; herein SEQ ID NO: 1337), AQTLAV (SEQ ID NO: 83 of WO2017100671; herein SEQ ID NO: 1338), AQTLAVPF (SEQ ID NO: 84 of WO2017100671; herein SEQ ID NO: 1339), QAVR (SEQ ID NO: 85 of WO2017100671; herein SEQ ID NO: 1340), AVRT (SEQ ID NO: 86 of WO2017100671; herein SEQ ID NO: 1341), VRTS (SEQ ID NO: 87 of WO2017100671; herein SEQ ID NO: 1342), RTSL (SEQ ID NO: 88 of WO2017100671; herein SEQ ID NO: 1343), QAVRT (SEQ ID NO: 89 of WO2017100671; herein SEQ ID NO: 1344), AVRTS (SEQ ID NO: 90 of WO2017100671; herein SEQ ID NO: 1345), VRTSL (SEQ ID NO: 91 of WO2017100671; herein SEQ ID NO: 1346), QAVRTS (SEQ ID NO: 92 of WO2017100671; herein SEQ ID NO: 1347), or AVRTSL (SEQ ID NO: 93 of WO2017100671; herein SEQ ID NO: 1348).
Non-limiting examples of nucleotide sequences that may encode the amino acid inserts include the following, GATGGGACTTGGCGGTGCCTTTTAAGGCACAG (SEQ ID NO: 54 of WO2017100671; herein SEQ ID NO: 1349), GATGGGACGTTGGCGGTGCCTTTTAAGGCACAG (SEQ ID NO: 55 of WO2017100671; herein SEQ ID NO: 1350), CAGGCGGTTAGGACGTCTTTG (SEQ ID NO: 56 of WO2017100671; herein SEQ ID NO: 1351), CAGGTCTTCACGGACTCAGACTATCAG (SEQ ID NO: 57 and 78 of WO2017100671; herein SEQ ID NO: 1352), CAAGTAAAACCTCTACAAATGTGGTAAAATCG (SEQ ID NO: 58 of WO2017100671; herein SEQ ID NO: 1353), ACTCATCGACCAATACTTGTACTATCTCTCTAGAAC (SEQ ID NO: 59 of WO2017100671; herein SEQ ID NO: 1354), GGAAGTATTCCTTGGTTTTGAACCCA (SEQ ID NO: 60 of WO2017100671; herein SEQ ID NO: 1355), GGTCGCGGTTCTTGTTTGTGGAT (SEQ ID NO: 61 of WO2017100671; herein SEQ ID NO: 1356), CGACCTTGAAGCGCATGAACTCCT (SEQ ID NO: 62 of WO2017100671; herein SEQ ID NO: 1357), GTATTCCTTGGTTTTGAACCCAACCGGTCTGCGCCTGTGCMNNMNNMNNMNNMNN MNNMNNTTGGGCACTCTGGTGGTTTGTC (SEQ ID NO: 63 of WO2017100671 wherein N may be A, C, T, or G; herein SEQ ID NO: 1358), GTATTCCTTGGTTTTGAACCCAACCGGTCTGCGCMNNMNNMNNAAAAGGCACCGCC AAAGTTTG (SEQ ID NO: 69 of WO2017100671 wherein N may be A, C, T, or G; herein SEQ ID NO: 1359), GTATTCCTGGTTTGAACCCAACCGGTCTGCGCCTGTGCMNNMNNMNNCACCGCC AAAGTTTGGGCACT (SEQ ID NO: 70 of WO2017100671 wherein N may be A, C, T, or G; herein SEQ ID NO: 1360), GTATTCCTTGGITGAACCCAACCGGTCTGCGCCTGTGCCTTAAAMNNMNNMNNC AAAGTTTGGGCACTCTGGTGG (SEQ ID NO: 71 of WO2017100671 wherein N may be A, C, T, or G; herein SEQ ID NO: 1361), GTATTCCTTGGTTTTGAACCCAACCGGTCTGCGCCTGTGCCTTAAAAGGCACMNNMN NMNNTTGGGCACTCTGGTGGTfTGTG (SEQ ID NO: 72 of WO2017100671 wherein N may be A, C, T, or G; herein SEQ ID NO: 1362), ACTTTGGCGGTGCCTTTTAAG (SEQ ID NO: 74 of WO2017100671; herein SEQ ID NO: 1279), AGTGTGAGTAAGCCTTTMTG (SEQ ID NO: 75 of WO2017100671; herein SEQ ID NO: 1280), TTTACGTTGACGACGCCTAAG (SEQ ID NO: 76 of WO2017100671; herein SEQ ID NO: 1281), TATACTTTGTCGCAGGGTTGG (SEQ ID NO: 77 of WO2017100671; herein SEQ ID NO: 1287), or CTTGCGAAGGAGCGGCTTTCG (SEQ ID NO: 79 of WO2017100671; herein SEQ ID NO: 1363).
In some embodiments, the AAV serotype may be, or may have a sequence as described in U.S. Pat. No. 9,624,274, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV1 (SEQ ID NO: 181 of U.S. Pat. No. 9,624,274), AAV6 (SEQ ID NO: 182 of U.S. Pat. No. 9,624,274), AAV2 (SEQ ID NO: 183 of U.S. Pat. No. 9,624,274), AAV3b (SEQ ID NO: 184 of U.S. Pat. No. 9,624,274), AAV7 (SEQ ID NO: 185 of U.S. Pat. No. 9,624,274), AAV8 (SEQ ID NO: 186 of U.S. Pat. No. 9,624,274), AAV10 (SEQ ID NO: 187 of U.S. Pat. No. 9,624,274), AAV4 (SEQ ID NO: 188 of U.S. Pat. No. 9,624,274), AAV 11 (SEQ ID NO: 189 of U.S. Pat. No. 9,624,274), bAAV (SEQ ID NO: 190 of U.S. Pat. No. 9,624,274), AAV5 (SEQ ID NO: 191 of U.S. Pat. No. 9,624,274), GPV (SEQ ID NO: 192 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 879), B19 (SEQ ID NO: 193 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 880), MVM (SEQ ID NO: 194 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 881), FPV (SEQ ID NO: 195 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 882), CPV (SEQ ID NO: 196 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 883) or variants thereof. Further, any of the structural protein inserts described in U.S. Pat. No. 9,624,274, may be inserted into, but not limited to, 1-453 and 1-587 of any parent AAV serotype, such as, but not limited to, AAV2 (SEQ ID NO: 183 of U.S. Pat. No. 9,624,274). The amino acid insert may be, but is not limited to, any of the following amino acid sequences, VNLTWSRASG (SEQ ID NO: 50 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1364), EFCINHRGYWVCGD (SEQ ID NO:55 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1365), EDGQVMDVDLS (SEQ ID NO: 85 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1366), EKQRNGTLT (SEQ ID NO: 86 of US %24274; herein SEQ ID NO: 1367), TYQCRVTHPHLPRALMR (SEQ ID NO: 87 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1368), RHSTTQPRKTKGSG (SEQ ID NO: 88 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1369), DSNPRGVSAYLSR (SEQ ID NO: 89 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1370), TITCLWDLAPSK (SEQ ID NO: 90 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1371), KTKGSGFFVF (SEQ ID NO: 91 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1372), THPHLPRALMRS (SEQ ID NO: 92 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1373), GETYQCRVTHPHLPRALMRSTFK (SEQ ID NO: 93 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1374), LPRALMRS (SEQ ID NO: 94 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1375), INHRGYWV (SEQ ID NO: 95 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1376), CDAGSVRTNAPD (SEQ ID NO: 60 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1377), AKAVSNLTESRSESLQS (SEQ ID NO: % of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1378), SLTGDEFKKVLET (SEQ ID NO: 97 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1379), REAVAYRFEED (SEQ ID NO: 98 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1380), INPEIITLDG (SEQ ID NO: 99 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1381), DISVTGAPVITATYL (SEQ ID NO: 100 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1382), DISVTGAPVITA (SEQ ID NO: 101 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1383), PKTVSNLTESSSESVQS (SEQ ID NO: 102 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1384), SLMGDEFKAVLET (SEQ ID NO: 103 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1385), QHSVAYTFEED (SEQ ID NO: 104 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1386), INPEIITRDG (SEQ ID NO: 105 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1387), DISLTGDPVITASYL (SEQ ID NO: 106 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1388), DISLTGDPVITA (SEQ ID NO: 107 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1389), DQSIDFEIDSA (SEQ ID NO: 108 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1390), KNVSEDLPLPTFSPTLLGDS (SEQ ID NO: 109 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1391), KNVSEDLPLPT (SEQ ID NO: 110 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1392), CDSGRVRTDAPD (SEQ ID NO: 111 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1393), FPEHLLVDFLQSLS (SEQ ID NO: 112 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1394), DAEFRHDSG (SEQ ID NO: 65 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1395), HYAAAQWDFGNTMCQL (SEQ ID NO: 113 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1396), YAAQWDFGNTMCQ (SEQ ID NO: 114 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1397), RSQKEGLHYT (SEQ ID NO: 115 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1398), SSRTPSDKPVAHWANPQAE (SEQ ID NO: 116 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1399), SRTPSDKPVAHWANP (SEQ ID NO: 117 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1400), SSRTPSDKP (SEQ ID NO: 118 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1401), NADGNVDYHMNSVP (SEQ ID NO: 119 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1402), DGNVDYHMNSV (SEQ ID NO: 120 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1403), RSFKEFLQSSLRALRQ (SEQ ID NO: 121 of US %24274; herein SEQ ID NO: 1404); FKEFLQSSLRA (SEQ ID NO: 122 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1405), or QMWAPQWGPD (SEQ ID NO: 123 of U.S. Pat. No. 9,624,274; herein SEQ ID NO: 1406).
In some embodiments, the AAV serotype may be, or may have a sequence as described in U.S. Pat. No. 9,475,845, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV capsid proteins comprising modification of one or more amino acids at amino acid positions 585 to 590 of the native AAV2 capsid protein. Further the modification may result in, but not be limited to, the amino acid sequence RGNRQA (SEQ ID NO: 3 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1407), SSSTDP (SEQ ID NO: 4 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1408), SSNTAP (SEQ ID NO: 5 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1409), SNSNLP (SEQ ID NO: 6 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1410), SSTTAP (SEQ ID NO: 7 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1411), AANTAA (SEQ ID NO: 8 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1412), QQNTAP (SEQ ID NO: 9 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1413), SAQAQA (SEQ ID NO: 10 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1414), QANTGP (SEQ ID NO: 11 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1415), NATTAP (SEQ ID NO: 12 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1416), SSTAGP (SEQ ID NO: 13 and 20 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1417), QQNTAA (SEQ ID NO: 14 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1418), PSTAGP (SEQ ID NO: 15 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1419), NQNTAP (SEQ ID NO: 16 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1420), QAANAP (SEQ ID NO: 17 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1421), SIVGLP (SEQ ID NO: 18 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1422), AASTAA (SEQ ID NO: 19, and 27 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1423), SQNTTA (SEQ ID NO: 21 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1424), QQDTAP (SEQ ID NO: 22 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1425), QTNTGP (SEQ ID NO: 23 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1426), QTNGAP (SEQ ID NO: 24 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1427), QQNAAP (SEQ ID NO: 25 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1428), or AANTQA (SEQ ID NO: 26 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1429). In some embodiments, the amino acid modification is a substitution at amino acid positions 262 through 265 in the native AAV2 capsid protein or the corresponding position in the capsid protein of another AAV with a targeting sequence. The targeting sequence may be, but is not limited to, any of the amino acid sequences, NGRAHA (SEQ ID NO: 38 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1430), QPEHSST (SEQ ID NO: 39 and 50 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1431), VNTANST (SEQ ID NO: 40 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1432), HGPMQKS (SEQ ID NO: 41 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1433), PHKPPLA (SEQ ID NO: 42 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1434), IKNNEMW (SEQ ID NO: 43 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1435), RNLDTPM (SEQ ID NO: 44 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1436), VDSHRQS (SEQ ID NO: 45 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1437), YDSKTKT (SEQ ID NO: 46 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1438), SQLPHQK (SEQ ID NO: 47 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1439), STMQQNT (SEQ ID NO: 48 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1440), TERYMTQ (SEQ ID NO: 49 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1441), DASLSTS (SEQ ID NO: 51 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1442), DLPNKKT (SEQ ID NO: 52 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1443), DLTAARL (SEQ ID NO: 53 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1444), EPHQFNY (SEQ ID NO: 54 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1445), EPQSNHT (SEQ ID NO: 55 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1446), MSSWPSQ (SEQ ID NO: 56 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1447), NPKHNAT (SEQ ID NO: 57 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1448), PDGMRTT (SEQ ID NO: 58 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1449), PNNNKTT (SEQ ID NO: 59 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1450), QSTTHDS (SEQ ID NO: 60 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1451), TGSKQKQ (SEQ ID NO: 61 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1452), SLKHQAL (SEQ ID NO: 62 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1453), SPIDGEQ (SEQ ID NO: 63 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1454), WIFPWIQL (SEQ ID NO: 64 and 112 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1455), CDCRGDCFC (SEQ ID NO: 65 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1456), CNGRC (SEQ ID NO: 66 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1457), CPRECES (SEQ ID NO: 67 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1458), CTIHWGFTLC (SEQ ID NO: 68 and 123 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1459), CGRRAGGSC (SEQ ID NO: 69 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1460), CKGGRAKDC (SEQ ID NO: 70 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1461), CVPELGHEC (SEQ ID NO: 71 and 115 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1462), CRRETAWAK (SEQ ID NO: 72 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1463), VSWFSHRYSPFAVS (SEQ ID NO: 73 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1464), GYRDGYAGPILYN (SEQ ID NO: 74 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1465), XXXYXXX (SEQ ID NO: 75 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1466), YXNW (SEQ ID NO: 76 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1467), RPLPPLP (SEQ ID NO: 77 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1468), APPLPPR (SEQ ID NO: 78 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1469), DVFYPYPYASGS (SEQ ID NO: 79 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1470), MYWYPY (SEQ ID NO: 80 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1471), DITWDQLWDLMK (SEQ ID NO: 81 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1472), CWDDXWLC (SEQ ID NO: 82 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1473), EWCEYLGGYLRCYA (SEQ ID NO: 83 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1474), YXCXXGPXTWXCXP (SEQ ID NO: 84 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1475), IEGPTLRQWLAARA (SEQ ID NO: 85 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1476), LWXXX (SEQ ID NO: 86 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1477), XFXXYLW (SEQ ID NO: 87 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1478), SSIISHFRWGLCD (SEQ ID NO: 88 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1479), MSRPACPPNDKYE (SEQ ID NO: 89 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1480), CLRSGRGC (SEQ ID NO: 90 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1481), CHWMFSPWC (SEQ ID NO: 91 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1482), WXXF (SEQ ID NO: 92 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1483), CSSRLDAC (SEQ ID NO: 93 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1484), CLPVASC (SEQ ID NO: 94 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1485), CGFECVRQCPERC (SEQ ID NO: 95 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1486), CVALCREACGEGC (SEQ ID NO: 96 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1487), SWCEPGWCR (SEQ ID NO: 97 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1488), YSGKWGW (SEQ ID NO: 98 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1489), GLSGGRS (SEQ ID NO: 99 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1490), LMLPRAD (SEQ ID NO: 100 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1491), CSCFRDVCC (SEQ ID NO: 101 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1492), CRDVVSVIC (SEQ ID NO: 102 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1493), MARSGL (SEQ ID NO: 103 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1494), MARAKE (SEQ ID NO: 104 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1495), MSRTMS (SEQ ID NO: 105 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1496, KCCYSL (SEQ ID NO: 106 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1497), MYWGDSHWLQYWYE (SEQ ID NO: 107 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1498), MQLPLAT (SEQ ID NO: 108 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1499), EWLS (SEQ ID NO: 109 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1500), SNEW (SEQ ID NO: 110 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1501), TNYL (SEQ ID NO: 111 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1502), WDLAWMFRLPVG (SEQ ID NO: 113 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1503), CTVALPGGYVRVC (SEQ ID NO: 114 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1504), CVAYCIEHHCWTC (SEQ ID NO: 116 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1505). CVFAHNYDYLVC (SEQ ID NO: 117 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1506), CVFTSNYAFC (SEQ ID NO: 118 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1507), VHSPNKK (SEQ ID NO: 119 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1508), CRGDGWC (SEQ ID NO: 120 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1509), XRGCDX (SEQ ID NO: 121 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1510), PXXX (SEQ ID NO: 122 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1511), SGKGPRQITAL (SEQ ID NO: 124 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1512), AAAAAAAAAXXXXX (SEQ ID NO: 125 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1513), VYMSPF (SEQ ID NO: 126 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1514), ATWLPPR (SEQ ID NO: 127 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1515), HTMYYHHYQHHL (SEQ ID NO: 128 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1516), SEVGCRAGPLQWLCEKYFG (SEQ ID NO: 129 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1517), CGLLPVGRPDRNVWRWLC (SEQ ID NO: 130 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1518), CKGQCDRFKGLPWEC (SEQ ID NO: 131 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1519), SGRSA (SEQ ID NO: 132 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1520), WGFP (SEQ ID NO: 133 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1521), AEPMPHSLNFSQYLWYT (SEQ ID NO: 134 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1522), WAYXSP (SEQ ID NO: 135 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1523), IELLQAR (SEQ ID NO: 136 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1524), AYTKCSRQWRTCMTTH (SEQ ID NO: 137 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1525), PQNSKIPGPTFLDPH (SEQ ID NO: 138 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1526), SMEPALPDWWWKMFK (SEQ ID NO: 139 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1527), ANTPCGPYTHDCPVKR (SEQ ID NO: 140 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1528), TACHQHVRMVRP (SEQ ID NO: 141 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1529), VPWMEPAYQRFL (SEQ ID NO: 142 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1530), DPRATPGS (SEQ ID NO: 143 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1531), FRPNRAQDYNTN (SEQ ID NO: 144 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1532), CTKNSYLMC (SEQ ID NO: 145 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1533), CXXTXXXGXGC (SEQ ID NO: 146 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1534), CPIEDRPMC (SEQ ID NO: 147 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1535), HEWSYLAPYPWF (SEQ ID NO: 148 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1536), MCPKHPLGC (SEQ ID NO: 149 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1537), RMWPSSTVNLSAGRR (SEQ ID NO: 150 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1538), SAKTAVSQRVWLPSHRGGEP (SEQ ID NO: 151 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1539), KSREHVNNSACPSKRITAAL (SEQ ID NO: 152 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1540), EGFR (SEQ ID NO: 153 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1541), AGLGVR (SEQ ID NO: 154 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1542), GTRQGHTMRLGVSDG (SEQ ID NO: 155 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1543), IAGLATPGWSHWLAL (SEQ ID NO: 156 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1544), SMSIARL (SEQ ID NO: 157 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1545), HTFEPGV (SEQ ID NO: 158 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1546), NTSLKRISNKRIRRK (SEQ ID NO: 159 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1547), LRIKRKRRKRKKTRK (SEQ ID NO: 160 of U.S. Pat. No. 9,475,845; herein SEQ ID NO: 1548), GGG, GFS, LWS, EGG, LLV, LSP, LBS, AGG, GRR, GGH and GTV.
In some embodiments, the AAV serotype may be, or may have a sequence as described in United States Publication No. US 20160369298, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, site-specific mutated capsid protein of AAV2 (SEQ ID NO: 97 of US 20160369298; herein SEQ ID NO: 1549) or variants thereof, wherein the specific site is at least one site selected from sites R447, G453, S578, N587, N587+1. S662 of VP1 or fragment thereof.
Further, any of the mutated sequences described in US 20160369298, may be or may have, but not limited to, any of the following sequences SDSGASN (SEQ ID NO: 1 and SEQ ID NO: 231 of US20160369298; herein SEQ ID NO: 1550), SPSGASN (SEQ ID NO: 2 of US20160369298; herein SEQ ID NO: 1551), SHSGASN (SEQ ID NO: 3 of US20160369298; herein SEQ ID NO: 1552), SRSGASN (SEQ ID NO: 4 of US20160369298; herein SEQ ID NO: 1553), SKSGASN (SEQ ID NO: 5 of US20160369298; herein SEQ ID NO: 1554), SNSGASN (SEQ ID NO: 6 of US20160369298; herein SEQ ID NO: 1555), SGSGASN (SEQ ID NO: 7 of US20160369298; herein SEQ ID NO: 1556), SASGASN (SEQ ID NO: 8, 175, and 221 of US20160369298; herein SEQ ID NO: 1557), SESGTSN (SEQ ID NO: 9 of US20160369298; herein SEQ ID NO: 1558), STTGGSN (SEQ ID NO: 10 of US20160369298; herein SEQ ID NO: 1559), SSAGSTN (SEQ ID NO: 11 of US20160369298; herein SEQ ID NO: 1560), NNDSQA (SEQ ID NO: 12 of US20160369298; herein SEQ ID NO: 1561), NNRNQA (SEQ ID NO: 13 of US20160369298; herein SEQ ID NO: 1562), NNNKQA (SEQ ID NO: 14 of US20160369298; herein SEQ ID NO: 1563), NAKRQA (SEQ ID NO: 15 of US20160369298; herein SEQ ID NO: 1564), NDEHQA (SEQ ID NO: 16 of US20160369298; herein SEQ ID NO: 1565), NTSQKA (SEQ ID NO: 17 of US20160369298; herein SEQ ID NO: 1566), YYLSRTNTPSGTDTQSRLVFSQAGA (SEQ ID NO: 18 of US20160369298; herein SEQ ID NO: 1567), YYLSRTNTDSGTETQSGLDFSQAGA (SEQ ID NO: 19 of US20160369298; herein SEQ ID NO: 1568), YYLSRTNTESGTPTQSALEFSQAGA (SEQ ID NO: 20 of US20160369298; herein SEQ ID NO: 1569), YYLSRTNTHSGTHTQSPLHFSQAGA (SEQ ID NO: 21 of US20160369298; herein SEQ ID NO: 1570), YYLSRTNTSSGTITISHLIFSQAGA (SEQ ID NO: 22 of US20160369298; herein SEQ ID NO: 1571), YYLSRTNTRSGIMTKSSLMFSQAGA (SEQ ID NO: 23 of US20160369298; herein SEQ ID NO: 1572), YYLSRTNTKSGRKTLSNLSFSQAGA (SEQ ID NO: 24 of US20160369298; herein SEQ ID NO: 1573), YYLSRTNDGSGPVTPSKLRFSQRGA (SEQ ID NO: 25 of US20160369298; herein SEQ ID NO: 1574), YYLSRTNAASGHATHSDLKFSQPGA (SEQ ID NO: 26 of US20160369298; herein SEQ ID NO: 1575), YYLSRTNGQAGSLTMSELGFSQVGA (SEQ ID NO: 27 of US20160369298; herein SEQ ID NO: 1576), YYLSRTNSTGGNQTTSQLLFSQLSA (SEQ ID NO: 28 of US20160369298; herein SEQ ID NO: 1577), YFLSRTNNNTGLNTNSTLNFSQGRA (SEQ ID NO: 29 of US20160369298; herein SEQ ID NO: 1578), SKTGADNNNSEYSWTG (SEQ ID NO: 30 of US20160369298; herein SEQ ID NO: 1579), SKTDADNNNSEYSWTG (SEQ ID NO: 31 of US20160369298; herein SEQ ID NO: 1580), SKTEADNNNSEYSWTG (SEQ ID NO: 32 of US20160369298; herein SEQ ID NO: 1581), SKTPADNNNSEYSWTG (SEQ ID NO: 33 of US20160369298; herein SEQ ID NO: 1582), SKTHADNNNSEYSWTG (SEQ ID NO: 34 of US20160369298; herein SEQ ID NO: 1583), SKTQADNNNSEYSWTG (SEQ ID NO: 35 of US20160369298; herein SEQ ID NO: 1584), SKTIADNNNSEYSWTG (SEQ ID NO: 36 of US20160369298; herein SEQ ID NO: 1585), SKTMADNNNSEYSWTG (SEQ ID NO: 37 of US20160369298; herein SEQ ID NO: 1586), SKTRADNNNSEYSWTG (SEQ ID NO: 38 of US20160369298; herein SEQ ID NO: 1587), SKTNADNNNSEYSWTG (SEQ ID NO: 39 of US20160369298; herein SEQ ID NO: 1588), SKTVGRNNNSEYSWTG (SEQ ID NO: 40 of US20160369298; herein SEQ ID NO: 1589), SKTADRNNNSEYSWTG (SEQ ID NO: 41 of US20160369298; herein SEQ ID NO: 1590), SKKLSQNNNSKYSWQG (SEQ ID NO: 42 of US20160369298; herein SEQ ID NO: 1591), SKPTTGNNNSDYSWPG (SEQ ID NO: 43 of US20160369298; herein SEQ ID NO: 1592), STQKNENNNSNYSWPG (SEQ ID NO: 44 of US20160369298; herein SEQ ID NO: 1593), HKDDEGKF (SEQ ID NO: 45 of US20160369298; herein SEQ ID NO: 1594), HKDDNRKF (SEQ ID NO: 46 of US20160369298; herein SEQ ID NO: 1595), HKDDTNKF (SEQ ID NO: 47 of US20160369298; herein SEQ ID NO: 1596), HEDSDKNF (SEQ ID NO: 48 of US20160369298; herein SEQ ID NO: 1597), HRDGADSF (SEQ ID NO: 49 of US20160369298; herein SEQ ID NO: 1598), HGDNKSRF (SEQ ID NO: 50 of US20160369298; herein SEQ ID NO: 1599), KQGSEKTNVDFEEV (SEQ ID NO: 51 of US20160369298; herein SEQ ID NO: 1600), KQGSEKTNVDSEEV (SEQ ID NO: 52 of US20160369298; herein SEQ ID NO: 1601), KQGSEKTNVDVEEV (SEQ ID NO: 53 of US20160369298; herein SEQ ID NO: 1602), KQGSDKTNVDDAGV (SEQ ID NO: 54 of US20160369298; herein SEQ ID NO: 1603), KQGSSKTNVDPREV (SEQ ID NO: 55 of US20160369298; herein SEQ ID NO: 1604), KQGSRKTNVDHKQV (SEQ ID NO: 56 of US20160369298; herein SEQ ID NO: 1605), KQGSKGGNVDTNRV (SEQ ID NO: 57 of US20160369298; herein SEQ ID NO: 1606), KQGSGEANVDNGDV (SEQ ID NO: 58 of US20160369298; herein SEQ ID NO: 1607), KQDAAADNIDYDHV (SEQ ID NO: 59 of US20160369298; herein SEQ ID NO: 1608), KQSGTRSNAAASSV (SEQ ID NO: 60 of US20160369298; herein SEQ ID NO: 1609), KENTNTNDTELTNV (SEQ ID NO: 61 of US20160369298; herein SEQ ID NO: 1610), QRGNNVAATADVNT (SEQ ID NO: 62 of US20160369298; herein SEQ ID NO: 1611), QRGNNEAATADVNT (SEQ ID NO: 63 of US20160369298; herein SEQ ID NO: 1612), QRGNNPAATADVNT (SEQ ID NO: 64 of US20160369298; herein SEQ ID NO: 1613), QRGNNHAATADVNT (SEQ ID NO: 65 of US20160369298; herein SEQ ID NO: 1614), QEENNIAATPGVNT (SEQ ID NO: 66 of US20160369298; herein SEQ ID NO: 1615), QPPNNMAATHEVNT (SEQ ID NO: 67 of US20160369298; herein SEQ ID NO: 1616), QHHNNSAATTIVNT (SEQ ID NO: 68 of US20160369298; herein SEQ ID NO: 1617), QTTNNRAAFNMVET (SEQ ID NO: 69 of US20160369298; herein SEQ ID NO: 1618), QKKNNNAASKKVAT (SEQ ID NO: 70 of US20160369298; herein SEQ ID NO: 1619), QGGNNKAADDAVKT (SEQ ID NO: 71 of US20160369298; herein SEQ ID NO: 1620). QAAKGGAADDAVKT (SEQ ID NO: 72 of US20160369298; herein SEQ ID NO: 1621), QDDRAAAANESVDT (SEQ ID NO: 73 of US20160369298; herein SEQ ID NO: 1622), QQQHDDAAYQRVHT (SEQ ID NO: 74 of US20160369298; herein SEQ ID NO: 1623), QSSSSLAAVSTVQT (SEQ ID NO: 75 of US20160369298; herein SEQ ID NO: 1624), QNNQTTAAIRNVTT (SEQ ID NO: 76 of US20160369298; herein SEQ ID NO: 1625), NYNKKSDNVDFT (SEQ ID NO: 77 of US20160369298; herein SEQ ID NO: 1626), NYNKKSENVDFT (SEQ ID NO: 78 of US20160369298; herein SEQ ID NO: 1627), NYNKKSLNVDFT (SEQ ID NO: 79 of US20160369298; herein SEQ ID NO: 1628), NYNKKSPNVDFT (SEQ ID NO: 80 of US20160369298; herein SEQ ID NO: 1629), NYSKKSHCVDFT (SEQ ID NO: 81 of US20160369298; herein SEQ ID NO: 1630). NYRKTIYVDFT (SEQ ID NO: 82 of US20160369298; herein SEQ ID NO: 1631), NYKEKKDVHFT (SEQ ID NO: 83 of US20160369298; herein SEQ ID NO: 1632), NYGHRAIVQFT (SEQ ID NO: 84 of US20160369298; herein SEQ ID NO: 1633), NYANHQFVVCT (SEQ ID NO: 85 of US20160369298; herein SEQ ID NO: 1634), NYDDDPTGVLLT (SEQ ID NO: 86 of US20160369298; herein SEQ ID NO: 1635), NYDDPTGVLLT (SEQ ID NO: 87 of US20160369298; herein SEQ ID NO: 1636), NFEQQNSVEWT (SEQ ID NO: 88 of US20160369298; herein SEQ ID NO: 1637), SQSGASN (SEQ ID NO: 89 and SEQ ID NO: 241 of US20160369298; herein SEQ ID NO: 1638), NNGSQA (SEQ ID NO: 90 of US20160369298; herein SEQ ID NO: 1639), YYLSRTNTPSGTTTWSRLQFSQAGA (SEQ ID NO: 91 of US20160369298; herein SEQ ID NO: 1640), SKTSADNNNSEYSWTG (SEQ ID NO: 92 of US20160369298; herein SEQ ID NO: 1641), HKDDEEKF (SEQ ID NO: 93, 209, 214, 219, 224, 234, 239, and 244 of US20160369298; herein SEQ ID NO: 1642), KQGSEKTNVDIEEV (SEQ ID NO: 94 of US20160369298; herein SEQ ID NO: 1643), QRGNNQAATADVNT (SEQ ID NO: 95 of US20160369298; herein SEQ ID NO: 1644), NYNKKSVNVDFT (SEQ ID NO: 96 of US20160369298; herein SEQ ID NO: 1645), SQSGASNYNTPSGTTTQSRLQFSTSADNNNSEYSWVTGATKYH (SEQ ID NO: 106 of US20160369298; herein SEQ ID NO: 1646), SASGASNFNSEGGSLTQSSLGFSTDGENNNSDFSWTGATKYH (SEQ ID NO: 107 of US20160369298; herein SEQ ID NO: 1647), SQSGASNYNTPSGTTTQSRLQFSTDGENNNSDFSWTGATKYH (SEQ ID NO: 108 of US20160369298; herein SEQ ID NO: 1648), SASGASNYNTPSGTTQSRLQFSTSADNNNSEFSWPGA TTYH (SEQ ID NO: 109 of US20160369298; herein SEQ ID NO: 1649), SQSGASNFNSEGGSLTQSSLGFSTDGENNNSDFSWTGATKYH (SEQ ID NO: 110 of US20160369298; herein SEQ ID NO: 1650), SASGASNYNTPSGSLTQSSLGFSTDGENNNSDFSWTGATKYH (SEQ ID NO: 111 of US20160369298; herein SEQ ID NO: 1651), SQSGASNYNTPSGTTTQSRLQFSTSADNNNSDFSWfGATKYH (SEQ ID NO: 112 of US20160369298; herein SEQ ID NO: 1652), SGAGASNFNSEGGSLTQSSLGFSTDGENNNSDFSWTGATKYH (SEQ ID NO: 113 of US20160369298; herein SEQ ID NO: 1653), SGAGASN (SEQ ID NO: 176 of US20160369298; herein SEQ ID NO: 1654), NSEGGSLTQSSLGFS (SEQ ID NO: 177, 185, 193 and 202 of US20160369298; herein SEQ ID NO: 1655), TDGENNNSDFS (SEQ ID NO: 178 of US20160369298; herein SEQ ID NO: 1656), SEFSWPGATT (SEQ ID NO: 179 of US20160369298; herein SEQ ID NO: 1657), TSADNNNSDFSWT (SEQ ID NO: 180 of US20160369298; herein SEQ ID NO: 1658), SQSGASNY (SEQ ID NO: 181, 187, and 198 of US20160369298; herein SEQ ID NO: 1659), NTPSGTTTQSRLQFS (SEQ ID NO: 182, 188, 191, and 199 of US20160369298; herein SEQ ID NO: 1660), TSADNNNSEYSWTGATKYH (SEQ ID NO: 183 of US20160369298; herein SEQ ID NO: 1661), SASGASNF (SEQ ID NO: 184 of US20160369298; herein SEQ ID NO: 1662), TDGENNNSDFSWTGATKYH (SEQ ID NO: 186, 189, 194, 197, and 203 of US20160369298; herein SEQ ID NO: 1663), SASGASNY (SEQ ID NO: 190 and SEQ ID NO: 195 of US20160369298; herein SEQ ID NO: 1664), TSADNNNSEFSWPGATTYH (SEQ ID NO: 192 of US20160369298; herein SEQ ID NO: 1665), NTPSGSLTQSSLGFS (SEQ ID NO: 196 of US20160369298; herein SEQ ID NO: 1666), TSADNNNSDFSWTGATKYH (SEQ ID NO: 200 of US20160369298; herein SEQ ID NO: 1667), SGAGASNF (SEQ ID NO: 201 of US20160369298; herein SEQ ID NO: 1668), CTCCAGVVSVVSMRSRVCVNSGCAGCTDHCVVSRNSGTCVMSACACAA (SEQ ID NO: 204 of US20160369298; herein SEQ ID NO: 1669), CTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATGTCAACACACAA (SEQ ID NO: 205 of US20160369298; herein SEQ ID NO: 1670), SAAGASN (SEQ ID NO: 206 of US20160369298; herein SEQ ID NO: 1671), YFLSRTNTESGSTTQSTLRFSQAG (SEQ ID NO: 207 of US20160369298; herein SEQ ID NO: 1672), SKTSADNNNSDFS (SEQ ID NO: 208, 228, and 253 of US20160369298; herein SEQ ID NO: 1673), KQGSEKTDVDIDKV (SEQ ID NO: 210 of US20160369298; herein SEQ ID NO: 1674), STAGASN (SEQ ID NO: 211 of US20160369298; herein SEQ ID NO: 1675), YFLSRTNTTSGIETQSTLRFSQAG (SEQ ID NO: 212 and SEQ ID NO: 247 of US20160369298; herein SEQ ID NO: 1676), SKTDGENNNSDFS (SEQ ID NO: 213 and SEQ ID NO: 248 of US20160369298; herein SEQ ID NO: 1677). KQGAAADDVEIDGV (SEQ ID NO: 215 and SEQ ID NO: 250 of US20160369298; herein SEQ ID NO: 1678), SEAGASN (SEQ ID NO: 216 of US20160369298; herein SEQ ID NO: 1679), YYLSRTNTPSGTTTQSRLQFSQAG (SEQ ID NO: 217, 232 and 242 of US20160369298; herein SEQ ID NO: 1680), SKTSADNNNSEYS (SEQ ID NO: 218, 233, 238, and 243 of US20160369298; herein SEQ ID NO: 1681), KQGSEKTNVDIEKV (SEQ ID NO: 220, 225 and 245 of US20160369298; herein SEQ ID NO: 1682), YFLSRTNDASGSDTKSTLLFSQAG (SEQ ID NO: 222 of US20160369298; herein SEQ ID NO: 1683), STTPSENNNSEYS (SEQ ID NO: 223 of US20160369298; herein SEQ ID NO: 1684), SAAGATN (SEQ ID NO: 226 and SEQ ID NO: 251 of US20160369298; herein SEQ ID NO: 1685), YFLSRTNGEAGSATLSELRFSQAG (SEQ ID NO: 227 of US20160369298; herein SEQ ID NO: 1686), HGDDADRF (SEQ ID NO: 229 and SEQ ID NO: 254 of US20160369298; herein SEQ ID NO: 1687), KQGAEKSDVEVDRV (SEQ ID NO: 230 and SEQ ID NO: 255 of US20160369298; herein SEQ ID NO: 1688), KQDSGGDNIDIDQV (SEQ ID NO: 235 of US20160369298; herein SEQ ID NO: 1689), SDAGASN (SEQ ID NO: 236 of US20160369298; herein SEQ ID NO: 1690), YFLSRTNTEGGHDTQSTLRFSQAG (SEQ ID NO: 237 of US20160369298; herein SEQ ID NO: 1691), KEDGGGSDVAIDEV (SEQ ID NO: 240 of US20160369298; herein SEQ ID NO: 1692), SNAGASN (SEQ ID NO: 246 of US20160369298; herein SEQ ID NO: 1693), and YFLSRTNGEAGSATLSELRFSQPG (SEQ ID NO: 252 of US20160369298; herein SEQ ID NO: 1694). Non-limiting examples of nucleotide sequences that may encode the amino acid mutated sites include the following, AGCVVMDCAGGARSCASCAAC (SEQ ID NO: 97 of US20160369298; herein SEQ ID NO: 1695), AACRACRRSMRSMAGGCA (SEQ ID NO: 98 of US20160369298; herein SEQ ID NO: 1696), CACRRGGACRRCRMSRRSARSTTT (SEQ ID NO: 99 of US20160369298; herein SEQ ID NO: 1697), TATTTCTTGAGCAGAACAAACRVCVVSRSCGGAMNCVHSACGMHSTCAVVSCTTVDS TTTTCTCAGSBCRGSGCG (SEQ ID NO: 100 of US20160369298; herein SEQ ID NO: 1698), TCAAMAMMAVNSRVCSRSAACAACAACAGTRASTTCTCGTGGMMAGGA (SEQ ID NO: 101 of US20160369298; herein SEQ ID NO: 1699), AAGSAARRCRSCRVSRVARVCRATRYCGMSNHCRVMVRSGTC (SEQ ID NO: 102 of US20160369298; herein SEQ ID NO: 1700), CAGVVSVVSMRSRVCVNSGCAGCTDHCVVSRNSGTCVMSACA (SEQ ID NO: 103 of US20160369298; herein SEQ ID NO: 1701), AACTWCRVSVASMVSVHSDDTGTGSWSTKSACT (SEQ ID NO: 104 of US20160369298; herein SEQ ID NO: 1702), TTGTTGAACATCACCACGTGACGCACGTTC (SEQ ID NO: 256 of US20160369298; herein SEQ ID NO: 1703), TCCCCGTGGTTCTACTACATAATGTGGCCG (SEQ ID NO: 257 of US20160369298; herein SEQ ID NO: 1704), TTCCACACTCCGTTTGGATAATGTFGAAC (SEQ ID NO: 258 of US20160369298; herein SEQ ID NO: 1705), AGGGACATCCCCAGCTCCATGCTGTGGTCG (SEQ ID NO: 259 of US20160369298; herein SEQ ID NO: 1706), AGGGACAACCCCTCCGACTCGCCCTAATCC (SEQ ID NO: 260 of US20160369298; herein SEQ ID NO: 1707), TCCTAGTAGAAGACACCCTCTCACTGCCCG (SEQ ID NO: 261 of US20160369298; herein SEQ ID NO: 1708), AGTACCATGTACACCCACTCTCCCAGTGCC (SEQ ID NO: 262 of US20160369298; herein SEQ ID NO: 1709), ATATGGACGTTCATGCTGATCACCATACCG (SEQ ID NO: 263 of US20160369298; herein SEQ ID NO: 1710), AGCAGGAGCTCCTTGGCCTCAGCGTGCGAG (SEQ ID NO: 264 of US20160369298; herein SEQ ID NO: 1711), ACAAGCAGCTTCACTATGACAACCACTGAC (SEQ ID NO: 265 of US20160369298; herein SEQ ID NO: 1712), CAGCCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCAGCAGAGAGTCTCAAMAMM AVNSRVCSRSAACAACAACAGTRASTTCTCCTGGMMAGGAGCTACCAAGTACCACCT CAATGGCAGAGACTCTCTGGTGAATCCCGGACCAGCTATGGCAAGCCACRRGGACR RCRMSRRSARSTTTTTCCTCAGAGCGGGGTTCTCATCTTGGGAAGSAARRCRSCRV SRVARVCRATRYCGMSNHCRVMVRSGTCATGATTACAGACGAAGAGGAGATCTGGA C (SEQ ID NO: 266 of US20160369298; herein SEQ ID NO: 1713), TGGGACAATGGCGGTCGTCTCTCAGAGTTKTKKT (SEQ ID NO: 267 of US20160369298; herein SEQ ID NO: 1714), AGAGGACCKKTCCTCGATGGTTCATGGTGGAGTTA (SEQ ID NO: 268 of US20160369298; herein SEQ ID NO: 1715), CCACTTAGGGCCTGGTCGATACCGTTCGGTG (SEQ ID NO: 269 of US20160369298; herein SEQ ID NO: 1716), and TCTCGCCCCAAGAGTAGAAACCCTTCSTTYYG (SEQ ID NO: 270 of US20160369298; herein SEQ ID NO: 1717).
In some embodiments, the AAV serotype may comprise an ocular cell targeting peptide as described in International Patent Publication WO2016134375, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to SEQ ID NO: 9, and SEQ ID NO:10 of WO2016134375. Further, any of the ocular cell targeting peptides or amino acids described in WO2016134375, may be inserted into any parent AAV serotype, such as, but not limited to, AAV2 (SEQ ID NO:8 of WO2016134375; herein SEQ ID NO: 1718), or AAV9 (SEQ ID NO: 11 of WO2016134375; herein SEQ ID NO: 1719). In some embodiments, modifications, such as insertions are made in AAV2 proteins at P34-A35, T138-A139, A139-P140, G453-T454, N587-R588, and/or R588-Q589. In certain embodiments, insertions are made at D384, G385, 1560, T561, N562, E563, E564, E565, N704, and/or Y705 of AAV9. The ocular cell targeting peptide may be, but is not limited to, any of the following amino acid sequences, GSTPPPM (SEQ ID NO: 1 of WO2016134375; herein SEQ ID NO: 1720), or GETRAPL (SEQ ID NO: 4 of WO2016134375; herein SEQ ID NO: 1721).
In some embodiments, the AAV serotype may be modified as described in the United States Publication US 20170145405 the contents of which are herein incorporated by reference in their entirety. AAV serotypes may include, modified AAV2 (e.g., modifications at Y444F, Y500F, Y730F and/or S662V), modified AAV3 (e.g., modifications at Y705F, Y731F and/or T492V), and modified AAV6 (e.g., modifications at S663V and/or T492V).
In some embodiments, the AAV serotype may be modified as described in the International Publication WO2017083722 the contents of which are herein incorporated by reference in their entirety. AAV serotypes may include, AAV1 (Y705+731F+T492V), AAV2 (Y444+500+730F+T491V), AAV3 (Y705+731F), AAV5, AAV 5 (Y436+693+719F), AAV6 (VP3 variant Y705F/Y731F/T492V), AAV8 (Y733F), AAV9, AAV9 (VP3 variant Y731F), and AAV10 (Y733F).
In some embodiments, the AAV serotype may comprise, as described in International Patent Publication WO2017015102, the contents of which are herein incorporated by reference in their entirety, an engineered epitope comprising the amino acids SPAKFA (SEQ ID NO: 24 of WO2017015102; herein SEQ ID NO: 1722) or NKDKLN (SEQ ID NO:2 of WO2017015102; herein SEQ ID NO: 1723). The epitope may be inserted in the region of amino acids 665 to 670 based on the numbering of the VP1 capsid of AAV8 (SEQ ID NO: 3 of WO2017015102) and/or residues 664 to 668 of AAV3B (SEQ ID NO: 3).
In some embodiments, the AAV serotype may be, or may have a sequence as described in International Patent Publication WO2017058892, the contents of which are herein incorporated by reference in their entirety, such as, but not limited to, AAV variants with capsid proteins that may comprise a substitution at one or more (e.g., 2, 3, 4, 5, 6, or 7) of amino acid residues 262-268, 370-379, 451-459, 472473, 493-500, 528-534, 547-552, 588-597, 709-710, 716-722 of AAV1, in any combination, or the equivalent amino acid residues in AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAVrh8, AAVrh10, AAVrh32.33, bovine AAV or avian AAV. The amino acid substitution may be, but is not limited to, any of the amino acid sequences described in WO2017058892. In some embodiments, the AAV may comprise an amino acid substitution at residues 256L, 258K, 259Q, 261S, 263A, 264S, 265T, 266G, 272H, 385S, 386Q, S472R, V473D, N500E 547S, 709A, 710N, 716D, 717N, 718N, 720L, A456T, Q457T, N458Q, K459S, T492S, K493A, S586R, S587G, S588N, T589R and/or 722T of AAV1 (SEQ ID NO: 1 of WO2017058892) in any combination, 244N, 246Q, 248R, 249E, 250I, 251K, 252S, 253G, 254S, 255V, 256D, 263Y, 377E, 378N, 453L, 456R, 532Q, 533P, 535N, 536P, 537G, 538T, 539T, 540A, 541T, 542Y, 543L, 546N, 653V. 654P, 656S, 697Q, 698F, 704D, 705S, 706T, 707G, 708E, 709Y and/or 710R of AAV5 (SEQ ID NO:5 of WO2017058892) in any combination, 248R, 316V, 317Q, 318D, 319S, 443N, 530N, 531 S, 532Q 533P, 534A, 535N, 540A, 541 T, 542Y, 543L, 5450, 546N, 697Q, 704D, 706T, 708E, 709Y and/or 710R of AAV5 (SEQ ID NO: 5 of WO2017058892) in any combination, 264S, 266G, 269N, 272H, 457Q, 588S and/or 5891 of AAV6 (SEQ ID NO:6 WO2017058892) in any combination, 457T, 459N, 496G, 499N, 500N, 589Q, 590N and/or 592A of AAV8 (SEQ ID NO: 8 WO2017058892) in any combination, 451I, 452N, 453G, 454S, 455G. 456Q, 457N and/or 458Q of AAV9 (SEQ ID NO: 9 WO2017058892) in any combination.
In some embodiments, the AAV may include a sequence of amino acids at positions 155, 156 and 157 of VP1 or at positions 17, 18, 19 and 20 of VP2, as described in International Publication No. WO 2017066764, the contents of which are herein incorporated by reference in their entirety. The sequences of amino acid may be, but not limited to, N—S—S, S—X—S, S—S—Y, N—X—S, N—S—Y, S—X—Y and N—X—Y, where N, X and Y are, but not limited to, independently non-serine, or non-threonine amino acids, wherein the AAV may be, but not limited to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11 and AAV12. In some embodiments, the AAV may include a deletion of at least one amino acid at positions 156, 157 or 158 of VP1 or at positions 19, 20 or 21 of VP2, wherein the AAV may be, but not limited to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6. AAV7, AAV8, AAV9, AAV10, AAV 11 and AAV12.
In some embodiments, the AAV may be a serotype generated by Cre-recombination-based AAV targeted evolution (CREATE) as described by Deverman et al., (Nature Biotechnology 34(2):204-209 (2016)), the contents of which are herein incorporated by reference in their entirety. In some embodiments, AAV serotypes generated in this manner have improved CNS transduction and/or neuronal and astrocytic tropism, as compared to other AAV serotypes. As non-limiting examples, the AAV serotype may include a peptide such as, but not limited to, PHP.B, PHP.B2, PHP.B3, PHP.A, PHP.S, G2A12, G2A15, G2A3, G2B4, and G2B5. In some embodiments, these AAV serotypes may be AAV9 (SEQ ID NO: 11 or 138) derivatives with a 7-amino acid insert between amino acids 588-589. Non-limiting examples of these 7-amino acid inserts include TLAVPFK (PHP.B: SEQ ID NO: 1262), SVSKPFL (PHP.B2; SEQ ID NO: 1270), FTLTTPK (PHP.B3; SEQ ID NO: 1271), YTLSQGW (PHP.A; SEQ ID NO: 1277), QAVRTSL (PHP.S; SEQ ID NO: 1321), LAKERLS (G2A3; SEQ ID NO: 1322), MNSTKNV (G2B4; SEQ ID NO: 1323), and/or VSGGHHS (G2B5; SEQ ID NO: 1324).
In some embodiments, the AAV serotype may be as described in Jackson et al (Frontiers in Molecular Neuroscience 9:154 (2016)), the contents of which are herein incorporated by reference in their entirety. In some embodiments, the AAV serotype is PHP.B or AAV9. In some embodiments, the AAV serotype is paired with a synapsin promoter to enhance neuronal transduction, as compared to when more ubiquitous promoters are used (i.e., CBA or CMV).
In some embodiments, the AAV serotype is a serotype comprising the AAVPHP.N (PHP.N) peptide, or a variant thereof.
In some embodiments the AAV serotype is a serotype comprising the AAVPHP.B (PHP.B) peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the AAVPHP.A (PHP.A) peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the PHP.S peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the PHP.B2 peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the PHP.B3 peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the G2B4 peptide, or a variant thereof.
In some embodiments, the AAV serotype is a serotype comprising the G2B5 peptide, or a variant thereof.
In some embodiments the AAV serotype is VOY101, or a variant thereof.
In some embodiments, the AAV serotype is VOY201, or a variant thereof.
In some embodiments, the AAV serotype is selected for use due to its tropism for cells of the central nervous system. In some embodiments, the cells of the central nervous system are neurons. In another embodiment, the cells of the central nervous system are astrocytes.
In some embodiments, the AAV serotype is selected for use due to its tropism for cells of the muscle(s).
In some embodiments, the initiation codon for translation of the AAV VP1 capsid protein may be CTG, TTG, or GTG as described in U.S. Pat. No. 8,163,543, the contents of which are herein incorporated by reference in its entirety.
The present disclosure refers to structural capsid proteins (including VP1, VP2 and VP3) which are encoded by capsid (Cap) genes. These capsid proteins form an outer protein structural shell (i.e. capsid) of a viral vector such as AAV. VP capsid proteins synthesized from Cap polynucleotides generally include a methionine as the first amino acid in the peptide sequence (Met1), which is associated with the start codon (AUG or ATG) in the corresponding Cap nucleotide sequence. However, it is common for a first-methionine (Met1) residue or generally any first amino acid (AA1) to be cleaved off after or during polypeptide synthesis by protein processing enzymes such as Met-aminopeptidases. This “Met/AA-clipping” process often correlates with a corresponding acetylation of the second amino acid in the polypeptide sequence (e.g., alanine, valine, serine, threonine, etc.). Met-clipping commonly occurs with VP1 and VP3 capsid proteins but can also occur with VP2 capsid proteins.
Where the Met/AA-clipping is incomplete, a mixture of one or more (one, two or three) VP capsid proteins comprising the viral capsid may be produced, some of which may include a Met1/AA1 amino acid (Met+/AA+) and some of which may lack a Met1/AA1 amino acid as a result of Met/AA-clipping (Met−/AA−). For further discussion regarding Met/AA-clipping in capsid proteins, see Jin, et al. Direct Liquid Chromatography/Mass Spectrometry Analysis for Complete Characterization of Recombinant Adeno-Associated Virus Capsid Proteins. Hum Gene Ther Methods. 2017 Oct. 28(5):255-267; Hwang, et al. N-Terminal Acetylation of Cellular Proteins Creates Specific Degradation Signals. Science. 2010 Feb. 19, 327(5968): 973-977; the contents of which are each incorporated herein by reference in its entirety.
According to the present disclosure, references to capsid proteins is not limited to either clipped (Met−/AA−) or unclipped (Met+/AA+) and may, in context, refer to independent capsid proteins, viral capsids comprised of a mixture of capsid proteins, and/or polynucleotide sequences (or fragments thereof) which encode, describe, produce or result in capsid proteins of the present disclosure. A direct reference to a “capsid protein” or “capsid polypeptide” (such as VP1, VP2 or VP2) may also comprise VP capsid proteins which include a Met1/AA1 amino acid (Met+/AA+) as well as corresponding VP capsid proteins which lack the Met1/AA1 amino acid as a result of Met/AA-clipping (Met−/AA−).
Further according to the present disclosure, a reference to a specific SEQ ID NO: (whether a protein or nucleic acid) which comprises or encodes, respectively, one or more capsid proteins which include a Met1/AA1 amino acid (Met+/AA+) should be understood to teach the VP capsid proteins which lack the Met1/AA1 amino acid as upon review of the sequence, it is readily apparent any sequence which merely lacks the first listed amino acid (whether or not Met1/AA1).
As a non-limiting example, reference to a VP1 polypeptide sequence which is 736 amino acids in length and which includes a “Met1” amino acid (Met+) encoded by the AUG/ATG start codon may also be understood to teach a VP1 polypeptide sequence which is 735 amino acids in length and which does not include the “Met1” amino acid (Met−) of the 736 amino acid Met+ sequence. As a second non-limiting example, reference to a VP1 polypeptide sequence which is 736 amino acids in length and which includes an “AA1” amino acid (AA1+) encoded by any NNN initiator codon may also be understood to teach a VP1 polypeptide sequence which is 735 amino acids in length and which does not include the “AA1” amino acid (AA1−) of the 736 amino acid AA1+ sequence.
References to viral capsids formed from VP capsid proteins (such as reference to specific AAV capsid serotypes), can incorporate VP capsid proteins which include a Met1/AA1 amino acid (Met+/AA1+), corresponding VP capsid proteins which lack the Met1/AA1 amino acid as a result of Met/AA1-clipping (Met−/AA1−), and combinations thereof (Met+/AA1+ and Met−/AA1−).
As a non-limiting example, an AAV capsid serotype can include VP1 (Met+/AA1+), VP1 (Met−/AA1−), or a combination of VP1 (Met+/AA1+) and VP1 (Met−/AA1−). An AAV capsid serotype can also include VP3 (Met+/AA1+), VP3 (Met−/AA1−), or a combination of VP3 (Met+/AA1+) and VP3 (Met−/AA1−): and can also include similar optional combinations of VP2 (Met+/AA1) and VP2 (Met−/AA1−).
Vector genome Component: Inverted Terminal Repeats (ITRs)
The AAV particles and/or VAPs of the present disclosure comprise a vector genome with at least one ITR region and a payload region. In some embodiments, the vector genome has two ITRs. These two ITRs flank the payload region at the 5′ and 3′ ends. The ITRs function as origins of replication comprising recognition sites for replication. ITRs comprise sequence regions which can be complementary and symmetrically arranged. ITRs incorporated into vector genomes may be comprised of naturally occurring polynucleotide sequences or recombinantly derived polynucleotide sequences.
The ITRs may be derived from the same serotype as the capsid, selected from any of the serotypes listed in Table 1, or a derivative thereof. The ITR may be of a different serotype than the capsid. In some embodiments, the AAV particle and/or VAP has more than one ITR. In a non-limiting example, the AAV particle and/or VAP has a vector genome comprising two ITRs. In some embodiments, the ITRs are of the same serotype as one another. In another embodiment, the ITRs are of different serotypes. Non-limiting examples include zero, one or both of the ITRs having the same serotype as the capsid. In some embodiments both ITRs of the vector genome of the AAV particle and/or VAP are AAV2 ITRs.
Independently, each ITR may be about 100 to about 150 nucleotides in length. An ITR may be about 100-105 nucleotides in length, 106-110 nucleotides in length, 111-115 nucleotides in length, 116-120 nucleotides in length, 121-125 nucleotides in length, 126-130 nucleotides in length, 131-135 nucleotides in length, 136-140 nucleotides in length, 141-145 nucleotides in length or 146-150 nucleotides in length. In some embodiments, the ITRs are 140-142 nucleotides in length. Non-limiting examples of ITR length are 102, 130, 140, 141, 142, 145 nucleotides in length, and those having at least 95% identity thereto.
In some embodiments, each ITR may be 141 nucleotides in length.
In some embodiments, each ITR may be 130 nucleotides in length.
In some embodiments, the AAV particles and/or VAPs comprise two ITRs and one ITR is 141 nucleotides in length and the other ITR is 130 nucleotides in length.
In some embodiments, the payload region of the vector genome comprises at least one element to enhance the transgene target specificity and expression (See e.g., Powell et al. Viral Expression Cassette Elements to Enhance Transgene Target Specificity and Expression in Gene Therapy, 2015: the contents of which are herein incorporated by reference in its entirety). Non-limiting examples of elements to enhance the transgene target specificity and expression include promoters, endogenous miRNAs, post-transcriptional regulatory elements (PREs), polyadenylation (PolyA) signal sequences and upstream enhancers (USEs), CMV enhancers and introns.
A person skilled in the art may recognize that expression of the polypeptides in a target cell may require a specific promoter, including but not limited to, a promoter that is species specific, inducible, tissue-specific, or cell cycle-specific (Parr et al., Nat. Med. 3: 1145-9 (1997); the contents of which are herein incorporated by reference in their entirety).
In some embodiments, the promoter is deemed to be efficient when it drives expression of the polypeptide(s) encoded in the payload region of the vector genome of the AAV particle and/or VAR
In some embodiments, the promoter is a promoter deemed to be efficient when it drives expression in the cell being targeted.
In some embodiments, the promoter drives expression of at least one VENOM element for a period of time in targeted tissues. Expression driven by a promoter may be for a period of 1 hour, 2, hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours. 8 hours, 9 hours, 10 hours, 11 hours, 12 hours, 13 hours, 14 hours, 15 hours, 16 hours, 17 hours, 18 hours, 19 hours, 20 hours, 21 hours, 22 hours, 23 hours, 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week, 8 days, 9 days, 10 days, 11 days, 12 days, 13 days, 2 weeks, 15 days, 16 days, 17 days, 18 days, 19 days, 20 days. 3 weeks, 22 days, 23 days, 24 days, 25 days, 26 days, 27 days, 28 days, 29 days, 30 days, 31 days, 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 1 year, 13 months, 14 months, 15 months, 16 months, 17 months, 18 months. 19 months, 20 months, 21 months, 22 months, 23 months, 2 years, 3 years, 4 years, 5 years, 6 years, 7 years, 8 years, 9 years, 10 years or more than 10 years. Expression may be for 1-5 hours, 1-12 hours, 1-2 days, 1-5 days, 1-2 weeks, 1-3 weeks, 1-4 weeks, 1-2 months, 1-4 months, 1-6 months, 2-6 months, 3-6 months, 3-9 months, 4-8 months, 6-12 months, 1-2 years, 1-5 years, 2-5 years, 3-6 years, 3-8 years, 4-8 years, or 5-10 years.
In some embodiments, the promoter drives expression of at least one VENOM element for at least 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 1 year, 2 years, 3 years 4 years, 5 years, 6 years, 7 years, 8 years, 9 years, 10 years, 11 years, 12 years, 13 years, 14 years, 15 years, 16 years, 17 years, 18 years, 19 years, 20 years, 21 years, 22 years, 23 years, 24 years, 25 years, 26 years, 27 years, 28 years, 29 years, 30 years, 31 years, 32 years, 33 years, 34 years, 35 years, 36 years, 37 years, 38 years, 39 years, 40 years, 41 years, 42 years, 43 years, 44 years, 45 years, 46 years, 47 years. 48 years. 49 years, 50 years, 55 years, 60 years, 65 years, or more than 65 years.
Promoters may be naturally occurring or non-naturally occurring. Non-limiting examples of promoters include viral promoters, plant promoters and mammalian promoters. In some embodiments, the promoters may be human promoters. In some embodiments, the promoter may be truncated.
Promoters which drive or promote expression in most tissues include, but are not limited to, human elongation factor 1α-subunit (EF1α), cytomegalovirus (CMV) immediate-early enhancer and/or promoter, chicken β-actin (CBA) and its derivative CAG, β glucuronidase (GUSB), or ubiquitin C (UBC). Tissue-specific expression elements can be used to restrict expression to certain cell types such as, but not limited to, muscle specific promoters, B cell promoters, monocyte promoters, leukocyte promoters, macrophage promoters, pancreatic acinar cell promoters, endothelial cell promoters, lung tissue promoters, astrocyte promoters, or nervous system promoters which can be used to restrict expression to neurons, astrocytes, or oligodendrocytes.
Non-limiting examples of muscle-specific promoters include mammalian muscle creatine kinase (MCK) promoter, mammalian desmin (DES) promoter, mammalian troponin I (TNN12) promoter, and mammalian skeletal alpha-actin (ASKA) promoter (see, e.g. U.S. Patent Publication US20110212529, the contents of which are herein incorporated by reference in their entirety)
Non-limiting examples of tissue-specific expression elements for neurons include neuron-specific enolase (NSE), platelet-derived growth factor (PDGF), platelet-derived growth factor B-chain (PDGF-β), synapsin (Syn), methyl-CpG binding protein 2 (MeCP2), Ca2+/calmodulin-dependent protein kinase II (CaMKII), metabotropic glutamate receptor 2 (mGluR2), neurofilament light (NFL) or heavy (NFH), β-globin minigene nβ2, preproenkephalin (PPE), enkephalin (Enk) and excitatory amino acid transporter 2 (EAAT2) promoters. Non-limiting examples of tissue-specific expression elements for astrocytes include glial fibrillary acidic protein (GFAP) and EAAT2 promoters. A non-limiting example of a tissue-specific expression element for oligodendrocytes includes the myelin basic protein (MBP) promoter.
In some embodiments, the promoter may be less than 1 kb. The promoter may have a length of 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, or more than 800 nucleotides. The promoter may have a length between 200-300, 200-400, 200-500, 200-600, 200-700, 200-800, 300-400, 300-500, 300-600, 300-700, 300-800, 400-500, 400-600, 400-700, 400-800, 500-600, 500-700, 500-800, 600-700, 600-800, or 700-800.
In some embodiments, the promoter may be a combination of two or more components of the same or different starting or parental promoters such as, but not limited to, CMV and CBA. Each component may have a length of 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, or more than 800. Each component may have a length between 200-300, 200400, 200-500, 200-600, 200-700, 200-800, 300400, 300-500, 300-600, 300-700, 300-800, 400-500, 400-600, 400-700, 400-800, 500-600, 500-700, 500-800, 600-700, 600-800 or 700-800. In some embodiments, the promoter is a combination of a 382 nucleotide CMV-enhancer sequence and a 260 nucleotide CBA-promoter sequence.
In some embodiments, the vector genome comprises a ubiquitous promoter. Non-limiting examples of ubiquitous promoters include CMV, CBA (including derivatives CAG, CB6, CBh, etc.), EF-1α, PGK, UBC, GUSB (hGBp), and UCOE (promoter of HNRPA2B1-CBX3).
Yu et al. (Molecular Pain 2011, 7:63; the contents of which are herein incorporated by reference in their entirety) evaluated the expression of eGFP under the CAG, EF1α, PGK and UBC promoters in rat DRG cells and primary DRG cells using lentiviral vectors and found that UBC showed weaker expression than the other 3 promoters and only 10-12% glial expression was seen for all promoters. Soderblom et al. (E. Neuro 2015; the contents of which are herein incorporated by reference in its entirety) evaluated the expression of eGFP in AAV8 with CMV and UBC promoters and AAV2 with the CMV promoter after injection in the motor cortex. Intranasal administration of a plasmid containing a UBC or EF1α promoter showed a sustained airway expression greater than the expression with the CMV promoter (See e.g., Gill et al., Gene Therapy 2001, Vol. 8, 1539-1546; the contents of which are herein incorporated by reference in their entirety). Husain et al. (Gene Therapy 2009; the contents of which are herein incorporated by reference in its entirety) evaluated an HβH construct with a hGUSB promoter, an HSV-1LAT promoter and an NSE promoter and found that the HβH construct showed weaker expression than NSE in mouse brain. Passini and Wolfe (J. Virol. 2001, 12382-12392, the contents of which are herein incorporated by reference in its entirety) evaluated the long-term effects of the HβH vector following an intraventricular injection in neonatal mice and found that there was sustained expression for at least 1 year. Low expression in all brain regions was found by Xu et al. (Gene Therapy 2001, 8, 1323-1332; the contents of which are herein incorporated by reference in their entirety) when NFL and NFH promoters were used as compared to the CMV-lacZ, CMV-luc, EF, GFAP, hENK, nAChR, PPE, PPE+wpre, NSE (0.3 kb), NSE (1.8 kb) and NSE (1.8 kb+wpre). Xu et al. found that the promoter activity in descending order was NSE (1.8 kb), EF, NSE (0.3 kb), GFAP, CMV, hENK, PPE, NFL and NFH. NFL is a 650 nucleotide promoter and NFH is a 920-nucleotide promoter which are both absent in the liver but NFH is abundant in the sensory proprioceptive neurons, brain and spinal cord and NFH is present in the heart. Scn8a is a 470 nucleotide promoter which expresses throughout the DRG, spinal cord and brain with particularly high expression seen in the hippocampal neurons and cerebellar Purkinje cells, cortex, thalamus, and hypothalamus (See e.g., Drews et al. Identification of evolutionary conserved, functional noncoding elements in the promoter region of the sodium channel gene SCN8A, Mamm Genome (2007) 18:723-731; and Raymond et al. Expression of Alternatively Spliced Sodium Channel α-subunit genes, Journal of Biological Chemistry (2004) 279(44) 46234-46241; the contents of each of which are herein incorporated by reference in their entireties).
Any of promoters taught by the aforementioned Yu, Soderblom, Gill, Husain, Passini, Xu, Drews, or Raymond may be used in the present disclosures.
In some embodiments, the promoter is not cell specific.
In some embodiments, the promoter is a ubiquitin c (UBC) promoter. The UBC promoter may have a size of 300-350 nucleotides. As a non-limiting example, the UBC promoter is 332 nucleotides.
In some embodiments, the promoter is a β-glucuronidase (GUSB) promoter. The GUSB promoter may have a size of 350-400 nucleotides. As a non-limiting example, the GUSB promoter is 378 nucleotides.
In some embodiments, the promoter is a neurofilament light (NFL) promoter. The NFL promoter may have a size of 600-700 nucleotides. As a non-limiting example, the NFL promoter is 650 nucleotides.
In some embodiments, the promoter is a neurofilament heavy (NFH) promoter. The NFH promoter may have a size of 900-950 nucleotides. As a non-limiting example, the NFH promoter is 920 nucleotides.
In some embodiments, the promoter is a scn8a promoter. The scn8a promoter may have a size of 450-500 nucleotides. As a non-limiting example, the scn8a promoter is 470 nucleotides.
In some embodiments, the promoter is a phosphoglycerate kinase 1 (PGK) promoter.
In some embodiments, the promoter is a chicken β-actin (CBA) promoter, or a variant thereof.
In some embodiments, the promoter is a CB6 promoter.
In some embodiments, the promoter is a minimal CB promoter.
In some embodiments, the promoter is a cytomegalovirus (CMV) promoter.
In some embodiments, the promoter is a CAG promoter.
In some embodiments, the promoter is a GFAP promoter.
In some embodiments, the promoter is a synapsin promoter.
In some embodiments, the promoter is a liver or a skeletal muscle promoter. Non-limiting examples of liver promoters include human α-1-antitrypsin (hAAT) and thyroxine binding globulin (TBG). Non-limiting examples of skeletal muscle promoters include Desmin, MCK or synthetic C5-12.
In some embodiments, the promoter is an RNA pol III promoter. As a non-limiting example, the RNA pol III promoter is U6. As a non-limiting example, the RNA pol III promoter is H1.
In some embodiments, the vector genome comprises two promoters. As a non-limiting example, the promoters are an EF1α promoter and a CMV promoter.
In some embodiments, the vector genome comprises an enhancer element, a promoter and/or a 5′UTR intron. The enhancer element, also referred to herein as an “enhancer,” may be, but is not limited to, a CMV enhancer, the promoter may be, but is not limited to, a CMV, CBA, UBC, GUSB, NSE, Synapsin, McCP2, and GFAP promoter and the 5′UTR/intron may be, but is not limited to, SV40, and CBA-MVM. As a non-limiting example, the enhancer, promoter and/or intron used in combination may be: (1) CMV enhancer, CMV promoter, SV40 5′UTR intron; (2) CMV enhancer, CBA promoter, SV 40 5′UTR intron; (3) CMV enhancer, CBA promoter, CBA-MVM 5′UTR intron; (4) UBC promoter; (5) GUSB promoter; (6) NSE promoter; (7) Synapsin promoter; (8) MeCP2 promoter; and (9) GFAP promoter.
In some embodiments, the vector genome comprises an engineered promoter.
In another embodiment, the vector genome comprises a promoter from a naturally expressed protein.
By definition, wild type untranslated regions (UTRs) of a gene are transcribed but not translated. Generally, the 5′ UTR starts at the transcription start site and ends at the start codon and the 3′ UTR starts immediately following the stop codon and continues until the termination signal for transcription.
Features typically found in abundantly expressed genes of specific target organs may be engineered into UTRs to enhance the stability and protein production. As a non-limiting example, a 5′ UTR from mRNA normally expressed in the liver (e.g., albumin, serum amyloid A, Apolipoprotein A/B/E, transferrin, alpha fetoprotein, erythropoietin, or Factor VIII) may be used in the vector genomes of the AAV particles and/or VAPs to enhance expression in hepatic cell lines or liver.
While not wishing to be bound by theory, wild-type 5′ untranslated regions (UTRs) include features which play roles in translation initiation. Kozak sequences, which are commonly known to be involved in the process by which the ribosome initiates translation of many genes, are usually included in 5′ UTRs. Kozak sequences have the consensus CCR(A/G)CCAUGG, where R is a purine (adenine or guanine) three bases upstream of the start codon (ATG), which is followed by another ‘G’.
In some embodiments, the 5′UTR in the vector genome includes a Kozak sequence.
In some embodiments, the 5′UTR in the vector genome does not include a Kozak sequence.
While not wishing to be bound by theory, wild-type 3′ UTRs are known to have stretches of Adenosines and Uridines embedded therein. These AU rich signatures are particularly prevalent in genes with high rates of turnover. Based on their sequence features and functional properties, the AU rich elements (AREs) can be separated into three classes (Chen et al, 1995, the contents of which are herein incorporated by reference in its entirety): Class I AREs, such as, but not limited to, c-Myc and MyoD, contain several dispersed copies of an AUUUA motif within U-rich regions. Class II AREs, such as, but not limited to, GM-CSF and TNF-α, possess two or more overlapping UUAUUUA(U/A)(U/A) nonamers. Class III ARES, such as, but not limited to, c-Jun and Myogenin, are less well defined. These U rich regions do not contain an AUUUA motif. Most proteins binding to the AREs are known to destabilize the messenger, whereas members of the ELAV family, most notably HuR, have been documented to increase the stability of mRNA. HuR binds to AREs of all the three classes. Engineering the HuR specific binding sites into the 3′ UTR of nucleic acid molecules will lead to HuR binding and thus, stabilization of the message in vivo.
Introduction, removal or modification of 3′ UTR AU rich elements (AREs) can be used to modulate the stability of polynucleotides. When engineering specific polynucleotides, e.g., payload regions of vector genomes, one or more copies of an ARE can be introduced to make polynucleotides less stable and thereby curtail translation and decrease production of the resultant protein. Likewise, AREs can be identified and removed or mutated to increase the intracellular stability and thus increase translation and production of the resultant protein.
In some embodiments, the 3′ UTR of the vector genome may include an oligo(dT) sequence for templated addition of a poly-A tail.
In some embodiments, the vector genome may include at least one miRNA seed, binding site or full sequence. microRNAs (or miRNA or miR) are 19-25 nucleotide noncoding RNAs that bind to the sites of nucleic acid targets and down-regulate gene expression either by reducing nucleic acid molecule stability or by inhibiting translation. A microRNA sequence comprises a “seed” region, i.e., a sequence in the region of positions 2-8 of the mature microRNA, which sequence has perfect Watson-Crick complementarity to the miRNA target sequence of the nucleic acid.
In some embodiments, the vector genome may be engineered to include, alter or remove at least one miRNA binding site, sequence, or seed region.
Any UTR from any gene known in the art may be incorporated into the vector genome of the AAV particle and/or VAP. These UTRs. or portions thereof, may be placed in the same orientation as in the gene from which they were selected, or they may be altered in orientation or location. In some embodiments, the UTR used in the vector genome of the AAV particle and/or VAP may be inverted, shortened, lengthened, made with one or more other 5′ UTRs or 3′ UTRs known in the art. As used herein, the term “altered” as it relates to a UTR, means that the UTR has been changed in some way in relation to a reference sequence. For example, a 3′ or 5′ UTR may be altered relative to a wild type or native UTR by the change in orientation or location as taught above or may be altered by the inclusion of additional nucleotides, deletion of nucleotides, swapping or transposition of nucleotides.
In some embodiments, the vector genome of the AAV particle and/or VAP comprises at least one artificial UTRs which is not a variant of a wild type UTR.
In some embodiments, the vector genome of the AAV particle and/or VAP comprises UTRs which have been selected from a family of transcripts whose proteins share a common function, structure, feature or property.
In some embodiments, the vector genome of the AAV particles and/or VAP of the present disclosure comprise at least one polyadenylation sequence. The vector genome of the AAV particle and/or VAP may comprise a polyadenylation sequence between the 3′ end of the payload coding sequence and the 5′ end of the 3′ITR.
In some embodiments, the polyadenylation sequence or “polyA sequence” may range from absent to about 500 nucleotides in length. The polyadenylation sequence may be, but is not limited to, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111,112, 113,114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218,219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403,404, 405, 406, 407,408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479,480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, and 600 nucleotides in length.
In some embodiments, the polyadenylation sequence is 50-100 nucleotides in length.
In some embodiments, the polyadenylation sequence is 50-150 nucleotides in length.
In some embodiments, the polyadenylation sequence is 50-160 nucleotides in length.
In some embodiments, the polyadenylation sequence is 50-200 nucleotides in length.
In some embodiments, the polyadenylation sequence is 60-100 nucleotides in length.
In some embodiments, the polyadenylation sequence is 60-150 nucleotides in length.
In some embodiments, the polyadenylation sequence is 60-160 nucleotides in length.
In some embodiments, the polyadenylation sequence is 60-200 nucleotides in length.
In some embodiments, the polyadenylation sequence is 70-100 nucleotides in length.
In some embodiments, the polyadenylation sequence is 70-150 nucleotides in length.
In some embodiments, the polyadenylation sequence is 70-160 nucleotides in length.
In some embodiments, the polyadenylation sequence is 70-200 nucleotides in length.
In some embodiments, the polyadenylation sequence is 80-100 nucleotides in length.
In some embodiments, the polyadenylation sequence is 80-150 nucleotides in length.
In some embodiments, the polyadenylation sequence is 80-160 nucleotides in length.
In some embodiments, the polyadenylation sequence is 80-200 nucleotides in length.
In some embodiments, the polyadenylation sequence is 90-100 nucleotides in length.
In some embodiments, the polyadenylation sequence is 90-150 nucleotides in length.
In some embodiments, the polyadenylation sequence is 90-160 nucleotides in length.
In some embodiments, the polyadenylation sequence is 90-200 nucleotides in length.
In some embodiments, the polyadenylation sequence is 127 nucleotides in length.
In some embodiments, the polyadenylation sequence is 477 nucleotides in length.
In some embodiments, the polyadenylation sequence is 552 nucleotides in length.
Vector genomes may be engineered with one or more spacer or linker regions to separate coding or non-coding regions.
In some embodiments, the payload region of the AAV particle and/or VAP may optionally encode one or more linker sequences. In some cases, the linker may be a peptide linker that may be used to connect the polypeptides encoded by the payload region. Some peptide linkers may be cleaved after expression to separate the polypeptides. Linker cleavage may be enzymatic. In some cases, linkers comprise an enzymatic cleavage site to facilitate intracellular or extracellular cleavage. Some payload regions encode linkers that interrupt polypeptide synthesis during translation of the linker sequence from an mRNA transcript. Such linkers may facilitate the translation of separate protein domains from a single transcript. In some cases, two or more linkers are encoded by a payload region of the vector genome. Non-limiting examples of linkers that may be encoded by the payload region of an AAV particle and/or VAP vector genome are given in Table 2.
Some payload regions encode linkers comprising furin cleavage sites. Furin is a calcium dependent serine endoprotease that cleaves proteins just downstream of a basic amino acid target sequence (Arg-X-(Arg/Lys)-Arg) (Thomas, G., 2002. Nature Reviews Molecular Cell Biology 3(10): 753-66; the contents of which are herein incorporated by reference in its entirety). Furin is enriched in the trans-golgi network where it is involved in processing cellular precursor proteins. Furin also plays a role in activating a number of pathogens. This activity can be taken advantage of for expression of polypeptides.
2A peptides are small “self-cleaving” peptides (18-22 amino acids) derived from viruses such as foot-and-mouth disease virus (F2A), porcine teschovirus-1 (P2A). Thoseaasigna virus (T2A), or equine rhinitis A virus (E2A). The 2A designation refers specifically to a region of picornavirus polyproteins that lead to a ribosomal skip at the glycyl-prolyl bond in the C-terminus of the 2A peptide (Kim, J. H. et al., 2011. PLoS One 6(4): e18556; the contents of which are herein incorporated by reference in its entirety). This skip results in a cleavage between the 2A peptide and its immediate downstream peptide. As opposed to IRES linkers, 2A peptides generate stoichiometric expression of proteins flanking the 2A peptide and their shorter length can be advantageous in generating viral expression vectors.
Internal ribosomal entry site (IRES) is a nucleotide sequence (>500 nucleotides) that allows for initiation of translation in the middle of an mRNA sequence (Kim, J. H. et al., 2011. PLoS One 6(4): e18556; the contents of which are herein incorporated by reference in its entirety). Use of an IRES sequence ensures co-expression of genes before and after the IRES, though the sequence following the IRES may be transcribed and translated at lower levels than the sequence preceding the IRES sequence.
In some embodiments, the payload region may encode one or more linkers comprising cathepsin, matrix metalloproteinases or legumain cleavage sites. Such linkers are described e.g. by Cizeau and Macdonald in International Publication No. WO2008052322, the contents of which are herein incorporated in their entirety. Cathepsins are a family of proteases with unique mechanisms to cleave specific proteins. Cathepsin B is a cysteine protease and cathepsin D is an aspartyl protease. Matrix metalloproteinases are a family of calcium-dependent and zinc-containing endopeptidases. Legumain is an enzyme catalyzing the hydrolysis of (-Asn-Xaa-) bonds of proteins and small molecule substrates.
In some embodiments, payload regions may encode linkers that are not cleaved. Such linkers may include a simple amino acid sequence, such as a glycine rich sequence. In some cases, linkers may comprise flexible peptide linkers comprising glycine and serine residues. The linker may comprise flexible peptide linkers of different lengths, e.g. n×G4S, where n=1-10 (SEQ ID NO: 1743), and the length of the encoded linker varies between 5 and 50 amino acids. In a non-limiting example, the linker may be 5×G4S (SEQ ID NO: 1742). These flexible linkers are small and without side chains, so they tend not to influence secondary protein structure while providing a flexible linker between antibody segments (George, R. A., et al., 2002. Protein Engineering 15(11): 871-9; Huston. J. S. et al., 1988. PNAS 85:5879-83; and Shan, D. et al., 1999. Journal of Immunology. 162(11):6589-95; the contents of each of which are herein incorporated by reference in their entirety). Furthermore, the polarity of the serine residues improves solubility and prevents aggregation problems.
In some embodiments, payload regions may encode small and unbranched serine-rich peptide linkers, such as those described by Huston et al. in U.S. Pat. No. 5,525,491, the contents of which are herein incorporated in their entirety. Polypeptides encoded by the payload region, linked by serine-rich linkers, have increased solubility.
In some embodiments, payload regions may encode artificial linkers, such as those described by Whitlow and Filpula in U.S. Pat. No. 5,856,456 and Ladner et al. in U.S. Pat. No. 4,946,778, the contents of each of which are herein incorporated by their entirety.
In some embodiments, the payload region encodes at least one G4S3 linker (“G4S3” disclosed as SEQ ID NO: 1741).
In some embodiments, the payload region encodes at least one G4S linker (“G4S” disclosed as SEQ ID NO: 1740).
In some embodiments, the payload region encodes at least one furin site.
In some embodiments, the payload region encodes at least one T2A linker.
In some embodiments, the payload region encodes at least one F2A linker.
In some embodiments, the payload region encodes at least one P2A linker.
In some embodiments, the payload region encodes at least one IRES sequence.
In some embodiments, the payload region encodes at least one G4S5 linker (“G4S5” disclosed as SEQ ID NO: 1742).
In some embodiments, the payload region encodes at least one furin and one 2A linker.
In some embodiments, the payload region encodes at least one hinge region. As a non-limiting example, the hinge is an IgG hinge.
In some embodiments, the linker region may be 1-50, 1-100, 50-100, 50-150, 100-150, 100-200, 150-200, 150-250, 200-250, 200-300, 250-300, 250-350, 300-350, 300-400, 350-400, 350-450, 400-450, 400-500, 450-500, 450-550, 500-550, 500-600, 550-600, 550-650, or 600-650 nucleotides in length. The linker region may have a length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 115, 120, 125, 130, 135, 140, 145, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 165, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 185, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 640, 650 or greater than 650. In some embodiments, the linker region may be 12 nucleotides in length. In some embodiments, the linker region may be 18 nucleotides in length. In some embodiments, the linker region may be 45 nucleotides in length. In some embodiments, the linker region may be 54 nucleotides in length. In some embodiments, the linker region may be 66 nucleotides in length. In some embodiments, the linker region may be 75 nucleotides in length. In some embodiments, the linker region may be 78 nucleotides in length. In some embodiments, the linker region may be 87 nucleotides in length. In some embodiments, the linker region may be 108 nucleotides in length. In some embodiments, the linker region may be 153 nucleotides in length. In some embodiments, the linker region may be 198 nucleotides in length. In some embodiments, the linker region may be 623 nucleotides in length.
In some embodiments, the payload region comprises at least one element to enhance the expression such as one or more introns or portions thereof. Non-limiting examples of introns include, MVM (67-97 bps), F.IX truncated intron 1 (300 bps), pi-globin SD/immunoglobulin heavy chain splice acceptor (250 bps), adenovirus splice donor/immunoglobin splice acceptor (500 bps), SV40 late splice donor/splice acceptor (19S/16S) (180 bps) and hybrid adenovirus splice donor/IgG splice acceptor (230 bps).
In some embodiments, the intron or intron portion may be 1-100, 100-500, 500-1000, or 1000-1500 nucleotides in length. The intron may have a length of 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, or greater than 500. The intron may have a length between 80-100, 80-120, 80-140, 80-160, 80-180, 80-200, 80-250, 80-300, 80-350, 80-400, 80-450, 80-500, 200-300, 200-400, 200-500, 300-400, 300-500, or 400-500. In some embodiments, the intron may be 15 nucleotides in length. In some embodiments, the intron may be 32 nucleotides in length. In some embodiments, the intron may be 41 nucleotides in length. In some embodiments, the intron may be 53 nucleotides in length. In some embodiments, the intron may be 54 nucleotides in length. In some embodiments, the intron may be 59 nucleotides in length. In some embodiments, the intron may be 73 nucleotides in length. In some embodiments, the intron may be 102 nucleotides in length. In some embodiments, the intron may be 134 nucleotides in length. In some embodiments, the intron may be 168 nucleotides in length. In some embodiments, the intron may be 172 nucleotides in length. In some embodiments, the intron may be 347 nucleotides in length. In some embodiments, the intron may be 1074 nucleotides in length.
Any, or all components of a vector genome may be modified or optimized to improve expression or targeting of the payload. Such components include, but are not limited to, intron, signal peptide sequences, VENOM element, linkers, cleavage sites, polyadenylation sequences, stuffer sequences, other regulatory sequences, and/or the backbone of the ITR to ITR sequence.
The AAV particles and/or VAPs of the present disclosure comprise at least one payload region. As used herein, “payload” or “payload region” refers to one or more polynucleotides or polynucleotide regions encoded by or within a vector genome or an expression product of such polynucleotide or polynucleotide region, e.g., a transgene, a polynucleotide encoding a polypeptide or multi-polypeptide or a modulatory nucleic acid or regulatory nucleic acid. Payloads of the present disclosure typically VENOM elements or fragments or variants thereof.
The payload region may be constructed in such a way as to reflect a region similar to or mirroring the natural organization of an mRNA.
The payload region may comprise a combination of coding and non-coding nucleic acid sequences.
In some embodiments, the AAV payload region may encode a coding or non-coding RNA.
In some embodiments, the AAV particle and/or VAP comprises a vector genome with a payload region comprising nucleic acid sequences encoding more than VENOM element. In such an embodiment, a vector genome encoding more than one polypeptide may be replicated and packaged into a viral particle. A target cell transduced with a viral particle comprising more than one polypeptide may express each of the polypeptides in a single cell.
In some embodiments, the payload region may comprise at least one inverted terminal repeat (ITR), a promoter region, an intron region, and a coding region.
Where the AAV particle and/or VAP payload region encodes a polypeptide, the polypeptide may be a peptide or protein. A protein encoded by the AAV particle and/or VAP payload region may comprise a secreted protein, an intracellular protein, an extracellular protein, and/or a membrane protein. The encoded proteins may be structural or functional. In addition to the VENOM elements, proteins encoded by the payload region may include, in combination, certain mammalian proteins involved in immune system regulation. The AAV vector genomes encoding polypeptides described herein may be useful in the fields of human disease, viruses, infections, veterinary applications and a variety of in vivo and in vitro settings.
In some embodiments, the AAV particles and/or VAPs are useful in the field of medicine for the treatment, prophylaxis, palliation, or amelioration of neurological diseases and/or disorders.
In some embodiments, the payload region of the AAV particle comprises one or more nucleic acid sequences encoding VENOM elements, variants or fragments thereof.
In some embodiments, the payload region of the AAV particle comprises one or more nucleic acid sequences encoding one or more of the VENOM elements, or variants or fragments thereof. As used herein, “VENOM polynucleotide” refers to a nucleic acid sequence encoding VENOM element.
In some embodiments, the AAV particles or VAPs may comprise a vector genome, wherein one or more components may be codon-optimized. Codon-optimization may be achieved by any method known to one with skill in the art such as, but not limited to, by a method according to Genescript, EMBOSS, Bioinformatics, NUS, NUS2, Geneinfinity, IDT, NUS3, GregThatcher, Insilico, Molbio, N2P, Snapgene, and/or VectorNTI. Antibody heavy and/or light chain sequences within the same vector genome may be codon-optimized according to the same or according to different methods.
VENOM elements by payload regions of the vector genomes may be translated as a whole polypeptide, a plurality of polypeptides or fragments of polypeptides, which independently may be encoded by one or more nucleic acids, fragments of nucleic acids or variants of any of the aforementioned. As used herein, “polypeptide” means a polymer of amino acid residues (natural or unnatural) linked together most often by peptide bonds. The term, as used herein, refers to proteins, polypeptides, and peptides of any size, structure, or function. In some instances, the polypeptide encoded is smaller than about 50 amino acids and the polypeptide is then termed a peptide. If the polypeptide is a peptide, it will be at least about 2, 3, 4, or at least 5 amino acid residues long. Thus, polypeptides include gene products, naturally occurring polypeptides, synthetic polypeptides, homologs, orthologs, paralogs, fragments and other equivalents, variants, and analogs of the foregoing. A polypeptide may be a single molecule or may be a multi-molecular complex such as a dimer, trimer or tetramer. They may also comprise single chain or multichain polypeptides and may be associated or linked. The term polypeptide may also apply to amino acid polymers in which one or more amino acid residues are an artificial chemical analogue of a corresponding naturally occurring amino acid.
The term “polypeptide variant” refers to molecules which differ in their amino acid sequence from a native or reference sequence. The amino acid sequence variants may possess substitutions, deletions, and/or insertions at certain positions within the amino acid sequence, as compared to a native or reference sequence. Ordinarily, variants will possess at least about 50% identity (homology) to a native or reference sequence, and preferably, they will be at least about 80%, more preferably at least about 90% identical (homologous) to a native or reference sequence.
In some embodiments “variant mimics” are provided. As used herein, the term “variant mimic” is one which contains one or more amino acids which would mimic an activated sequence. For example, glutamate may serve as a mimic for phosphoro-threonine and/or phosphoro-serine. Alternatively, variant mimics may result in deactivation or in an inactivated product containing the mimic, e.g., phenylalanine may act as an inactivating substitution for tyrosine; or alanine may act as an inactivating substitution for serine.
The term “amino acid sequence variant” refers to molecules with some differences in their amino acid sequences as compared to a native or starting sequence. The amino acid sequence variants may possess substitutions, deletions, and/or insertions at certain positions within the amino acid sequence. “Native” or “starting” sequence should not be confused with a wild type sequence. As used herein, a native or starting sequence is a relative term referring to an original molecule against which a comparison may be made. “Native” or “starting” sequences or molecules may represent the wild-type (that sequence found in nature) but do not have to be the wild-type sequence.
Ordinarily, variants will possess at least about 70% homology to a native sequence, and preferably, they will be at least about 80%, more preferably at least about 90% homologous to a native sequence. “Homology” as it applies to amino acid sequences is defined as the percentage of residues in the candidate amino acid sequence that are identical with the residues in the amino acid sequence of a second sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent homology. Methods and computer programs for the alignment are well known in the art. It is understood that homology depends on a calculation of percent identity but may differ in value due to gaps and penalties introduced in the calculation.
By “homologs” as it applies to amino acid sequences is meant the corresponding sequence of other species having substantial identity to a second sequence of a second species.
“Analogs” is meant to include polypeptide variants which differ by one or more amino acid alterations, e.g., substitutions, additions, or deletions of amino acid residues that still maintain the properties of the parent polypeptide.
Sequence tags or amino acids, such as one or more lysines, can be added to the peptide sequences (e.g., at the N-terminal or C-terminal ends). Sequence tags can be used for peptide purification or localization. Lysines can be used to increase peptide solubility or to allow for biotinylation. Alternatively, amino acid residues located at the carboxy and amino terminal regions of the amino acid sequence of a peptide or protein may optionally be deleted providing for truncated sequences. Certain amino acids (e.g., C-terminal or N-terminal residues) may alternatively be deleted depending on the use of the sequence, as for example, expression of the sequence as part of a larger sequence which is soluble, or linked to a solid support.
“Substitutional variants” when referring to proteins are those that have at least one amino acid residue in a native or starting sequence removed and a different amino acid inserted in its place at the same position. The substitutions may be single, where only one amino acid in the molecule has been substituted, or they may be multiple, where two or more amino acids have been substituted in the same molecule.
As used herein the term “conservative amino acid substitution” refers to the substitution of an amino acid that is normally present in the sequence with a different amino acid of similar size, charge, or polarity. Examples of conservative substitutions include the substitution of a non-polar (hydrophobic) residue such as isoleucine, valine, and leucine for another non-polar residue. Likewise, examples of conservative substitutions include the substitution of one polar (hydrophilic) residue for another such as between arginine and lysine, between glutamine and asparagine, and between glycine and serine. Additionally, the substitution of a basic residue such as lysine, arginine, or histidine for another, or the substitution of one acidic residue such as aspartic acid or glutamic acid for another acidic residue are additional examples of conservative substitutions. Examples of non-conservative substitutions include the substitution of a non-polar (hydrophobic) amino acid residue such as isoleucine, valine, leucine, alanine, methionine for a polar (hydrophilic) residue such as cysteine, glutamine, glutamic acid or lysine and/or a polar residue for a non-polar residue.
“Insertional variants” when referring to proteins are those with one or more amino acids inserted immediately adjacent to an amino acid at a particular position in a native or starting sequence. “Immediately adjacent” to an amino acid means connected to either the alpha-carboxy or alpha-amino functional group of the amino acid.
“Deletional variants” when referring to proteins, are those with one or more amino acids in the native or starting amino acid sequence removed. Ordinarily, deletional variants will have one or more amino acids deleted in a particular region of the molecule.
As used herein, the term “derivative” is used synonymously with the term “variant” and refers to a molecule that has been modified or changed in any way relative to a reference molecule or starting molecule. In some embodiments, derivatives include native or starting proteins that have been modified with an organic proteinaceous or non-proteinaceous derivatizing agent, and post-translational modifications. Covalent modifications are traditionally introduced by reacting targeted amino acid residues of the protein with an organic derivatizing agent that is capable of reacting with selected side-chains or terminal residues, or by harnessing mechanisms of post-translational modifications that function in selected recombinant host cells. The resultant covalent derivatives are useful in programs directed at identifying residues important for biological activity, for immunoassays, or for the preparation of anti-protein antibodies for immunoaffinity purification of the recombinant glycoprotein. Such modifications are within the ordinary skill in the art and are performed without undue experimentation.
Certain post-translational modifications are the result of the action of recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post-translationally deamidated to the corresponding glutamyl and aspartyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of these residues may be present in the proteins used in accordance with the present disclosure.
Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the alpha-amino groups of lysine, arginine, and histidine side chains (T. E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)).
“Features” when referring to proteins are defined as distinct amino acid sequence-based components of a molecule. Features of the proteins of the present disclosure include surface manifestations, local conformational shape, folds, loops, half-loops, domains, half-domains, sites, termini or any combination thereof.
As used herein when referring to proteins the term “surface manifestation” refers to a polypeptide-based component of a protein appearing on an outermost surface.
As used herein when referring to proteins the term “local conformational shape” means a polypeptide based structural manifestation of a protein which is located within a definable space of the protein.
As used herein when referring to proteins the term “fold” means the resultant conformation of an amino acid sequence upon energy minimization. A fold may occur at the secondary or tertiary level of the folding process. Examples of secondary level folds include beta sheets and alpha helices. Examples of tertiary folds include domains and regions formed due to aggregation or separation of energetic forces. Regions formed in this way include hydrophobic and hydrophilic pockets, and the like.
As used herein the term “turn” as it relates to protein conformation means a bend which alters the direction of the backbone of a peptide or polypeptide and may involve one, two, three or more amino acid residues.
As used herein when referring to proteins the term “loop” refers to a structural feature of a peptide or polypeptide which reverses the direction of the backbone of a peptide or polypeptide and comprises four or more amino acid residues. Oliva et al. have identified at least 5 classes of protein loops (J. Mol Biol 266 (4): 814-830; 1997).
As used herein when referring to proteins the term “half-loop” refers to a portion of an identified loop having at least half the number of amino acid residues as the loop from which it is derived. It is understood that loops may not always contain an even number of amino acid residues. Therefore, in those cases where a loop contains or is identified to comprise an odd number of amino acids, a half-loop of the odd-numbered loop will comprise the whole number portion or next whole number portion of the loop (number of amino acids of the loop/2+/−0.5 amino acids). For example, a loop identified as a 7-amino acid loop could produce half-loops of 3 amino acids or 4 amino acids (7/2=3.5+/−0.5 being 3 or 4).
As used herein when referring to proteins the term “domain” refers to a motif of a polypeptide having one or more identifiable structural or functional characteristics or properties (e.g., binding capacity, serving as a site for protein-protein interactions).
As used herein when referring to proteins the term “half-domain” means portion of an identified domain having at least half the number of amino acid residues as the domain from which it is derived. It is understood that domains may not always contain an even number of amino acid residues. Therefore, in those cases where a domain contains or is identified to comprise an odd number of amino acids, a half-domain of the odd-numbered domain will comprise the whole number portion or next whole number portion of the domain (number of amino acids of the domain/2+/−0.5 amino acids). For example, a domain identified as a 7-amino acid domain could produce half-domains of 3 amino acids or 4 amino acids (7/2=3.5+/−0.5 being 3 or 4). It is also understood that sub-domains may be identified within domains or half-domains, these subdomains possessing less than all of the structural or functional properties identified in the domains or half domains from which they were derived. It is also understood that the amino acids that comprise any of the domain types herein need not be contiguous along the backbone of the polypeptide (i.e., nonadjacent amino acids may fold structurally to produce a domain, half-domain or subdomain).
As used herein when referring to proteins the terms “site” as it pertains to amino acid-based embodiments is used synonymous with “amino acid residue” and “amino acid side chain”. A site represents a position within a peptide or polypeptide that may be modified, manipulated, altered, derivatized or varied within the polypeptide-based molecules of the present disclosure.
As used herein the terms “termini or terminus” when referring to proteins refers to an extremity of a peptide or polypeptide. Such extremity is not limited only to the first or final site of the peptide or polypeptide but may include additional amino acids in the terminal regions. The polypeptide-based molecules of the present disclosure may be characterized as having both an N-terminus (terminated by an amino acid with a free amino group (NH2)) and a C-terminus (terminated by an amino acid with a free carboxyl group (COOH)). Proteins are in some cases made up of multiple polypeptide chains brought together by disulfide bonds or by non-covalent forces (multimers, oligomers). These sorts of proteins will have multiple N- and C-termini. Alternatively, the termini of the polypeptides may be modified such that they begin or end, as the case may be, with a non-polypeptide-based moiety such as an organic conjugate.
Once any of the features have been identified or defined as a component of a molecule, any of several manipulations and/or modifications of these features may be performed by moving, swapping, inverting, deleting, randomizing, or duplicating. Furthermore, it is understood that manipulation of features may result in the same outcome as a modification to the molecules. For example, a manipulation which involves deleting a domain would result in the alteration of the length of a molecule just as modification of a nucleic acid to encode less than a full-length molecule would.
Modifications and manipulations can be accomplished by methods known in the art such as site directed mutagenesis. The resulting modified molecules may then be tested for activity using in vitro or in vivo assays such as those described herein, or any other suitable screening assay known in the art.
The present disclosure provides methods for the generation of parvoviral particles, e.g. AAV particles, by vector genome replication in a viral replication cell.
In accordance with the disclosure, the vector genome comprising at least one VENOM element or fragment thereof, will be incorporated into the AAV particle produced in the viral replication cell. Methods of making AAV particles are well known in the art and are described in e.g., U.S. Pat. Nos. 6,204,059, 5,756,283, 6,258,595, 6,261,551, 6,270,996, 6,281,010, 6,365,394, 6,475,769, 6,482,634, 6,485,966, 6,943,019, 6,953,690, 7,022,519, 7,238,526, 7,291,498 and 7,491,508, 5,064,764, 6,194,191, 6,566,118, 8,137,948; or International Publication Nos. WO1996039530, WO1998010088, WO1999014354, WO1999015685, WO1999047691, WO2000055342, WO2000075353, and WO2001023597; Methods In Molecular Biology, ed. Richard, Humana Press, N J (1995); O'Reilly et al., Baculovirus Expression Vectors, A Laboratory Manual, Oxford Univ. Press (1994); Samulski et al., J. Vir. 63:3822-8 (1989); Kajigaya et al., Proc. Nat'l. Acad. Sci. USA 88: 4646-50 (1991); Ruffing et al., J. Vir. 66:6922-30 (1992); Kimbauer et al., Vir., 219:37-44 (1996); Zhao et al., Vir. 272:382-93 (2000); the contents of each of which are herein incorporated by reference in their entirety. In some embodiments, the AAV particles are made using the methods described in WO2015191508, the contents of which are herein incorporated by reference in their entirety.
Viral replication cells commonly used for production of recombinant AAV viral vectors include but are not limited to 293 cells, COS cells, HeLa cells, KB cells, and other mammalian cell lines as described in U.S. Pat. Nos. 6,156,303, 5,387,484, 5,741,683, 5,691,176, and 5,688,676; U.S. patent publication No. 2002/0081721, and International Patent Publication Nos. WO 00/47757, WO 00/24916, and WO 96/17947, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, the AAV particles of the present disclosure may be produced in insect cells (e.g., Sf9 cells).
In some embodiments, the AAV particles of the present disclosure may be produced using triple transfection.
In some embodiments, the AAV particles of the present disclosure may be produced in mammalian cells.
In some embodiments, the AAV particles of the present disclosure may be produced by triple transfection in mammalian cells.
In some embodiments, the AAV particles of the present disclosure may be produced by triple transfection in HEK293 cells.
The present disclosure provides a method for producing an AAV particle comprising the steps of: 1) co-transfecting competent bacterial cells with a bacmid vector and either a viral construct vector and/or AAV payload construct vector, 2) isolating the resultant viral construct expression vector and AAV payload construct expression vector and separately transfecting viral replication cells, 3) isolating and purifying resultant payload and viral construct particles comprising viral construct expression vector or AAV payload construct expression vector, 4) co-infecting a viral replication cell with both the AAV payload and viral construct particles comprising viral construct expression vector or AAV payload construct expression vector. 5) harvesting and purifying the viral particle comprising a parvovector genome.
In some embodiments, the present disclosure provides a method for producing an AAV particle comprising the steps of 1) simultaneously co-transfecting mammalian cells, such as, but not limited to HEK293 cells, with a payload region, a construct expressing rep and cap genes and a helper construct, 2) harvesting and purifying the AAV particle comprising a vector genome.
In some embodiments, the viral construct vector(s) used for AAV production may contain a nucleotide sequence encoding the AAV capsid proteins where the initiation codon of the AAV VP1 capsid protein is a non-ATG, i.e., a suboptimal initiation codon, allowing the expression of a modified ratio of the viral capsid proteins in the production system, to provide improved infectivity of the host cell. In a non-limiting example, a viral construct vector may contain a nucleic acid construct comprising a nucleotide sequence encoding AAV VP1, VP2, and VP3 capsid proteins, wherein the initiation codon for translation of the AAV VP1 capsid protein is CTG, TTG, or GTG, as described in U.S. Pat. No. 8,163,543, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the viral construct vector(s) used for AAV production may contain a nucleotide sequence encoding the AAV rep proteins where the initiation codon of the AAV rep protein or proteins is a non-ATG. In some embodiments, a single coding sequence is used for the Rep78 and Rep52 proteins, wherein initiation codon for translation of the Rep78 protein is a suboptimal initiation codon, selected from the group consisting of ACG, TTG, CTG and GTG, that effects partial exon skipping upon expression in insect cells, as described in U.S. Pat. No. 8,512,981, the contents of which is herein incorporated by reference in its entirety, for example to promote less abundant expression of Rep78 as compared to Rep52, which may be advantageous in that it promotes high vector yields.
In some embodiments, the vector genome of the AAV particle optionally encodes a selectable marker. The selectable marker may comprise a cell-surface marker, such as any protein expressed on the surface of the cell including, but not limited to receptors, CD markers, lectins, integrins, or truncated versions thereof.
In some embodiments, selectable marker reporter genes are selected from those described in International Application No. WO 96/23810; Heim et al., Current Biology 2:178-182 (1996): Heim et al., Proc. Natl. Acad. Sci. USA (1995); or Heim et al., Science 373:663-664 (1995); WO 96/30540, the contents of each of which are incorporated herein by reference in their entireties).
The AAV vector genomes encoding at least one VENOM element described herein may be useful in the fields of human disease, veterinary applications and a variety of in vivo and in vitro settings. The AAV particles and/or VAPs of the present disclosure may be useful in the field of medicine for the treatment, prophylaxis, palliation or amelioration of diseases and/or disorders. The AAV particles and/or VAPs of the present disclosure may be useful in the field of medicine for the treatment, prophylaxis, palliation or amelioration of neurological diseases and/or disorders.
Various embodiments herein provide a pharmaceutical composition comprising the AAV particles and/or VAPs described herein and a pharmaceutically acceptable excipient.
Various embodiments herein provide a method of treating a subject in need thereof comprising administering to the subject a therapeutically effective amount of the pharmaceutical composition described herein.
Certain embodiments of the method provide that the subject is treated by a route of administration of the pharmaceutical composition selected from the group consisting of: intravenous, intracerebroventricular, intraparenchymal, intrathecal, subpial and intramuscular, or a combination thereof. Certain embodiments of the method provide that the subject is treated for a tauopathy and/or other neurological disorder. In one aspect of the method, a pathological feature of the tauopathy or other neurological disorder is alleviated and/or the progression of the tauopathy or other neurological disorder is halted, slowed, ameliorated or reversed.
Various embodiments herein describe a method of decreasing the level of soluble tau in the central nervous system of a subject in need thereof comprising administering to said subject an effective amount of the pharmaceutical composition described herein.
Also described herein are compositions, methods, processes, kits and devices for the design, preparation, manufacture and/or formulation of AAV particles and/or VAPs. In some embodiments, payloads, such as but not limited to anti-tau antibodies, may be encoded by payload constructs or contained within plasmids or vectors or recombinant adeno-associated viruses (AAVs).
The present disclosure also provides administration and/or delivery methods for vectors and viral particles, e.g., AAV particles and/or VAPs, for the treatment or amelioration of neurological disease, such as, but not limited to tauopathy.
According to the present disclosure the AAV particles and/or VAPs may be prepared as pharmaceutical compositions. It will be understood that such compositions necessarily comprise one or more active ingredients and, most often, a pharmaceutically acceptable excipient.
Relative amounts of the active ingredient (e.g. AAV particle and/or VAP), a pharmaceutically acceptable excipient, and/or any additional ingredients in a pharmaceutical composition in accordance with the present disclosure may vary, depending upon the identity, size, and/or condition of the subject being treated and further depending upon the route by which the composition is to be administered. For example, the composition may comprise between 0.1% and 99% (w/w) of the active ingredient. By way of example, the composition may comprise between 0.1% and 100%, e.g., between 0.5 and 50%, between 1-30%, between 5-80%, at least 80% (w/w) active ingredient.
In some embodiments, the AAV particle and/or VAP pharmaceutical compositions described herein may comprise at least one payload. As a non-limiting example, the pharmaceutical compositions may contain an AAV particle and/or VAP with 1, 2, 3, 4 or 5 payloads. In some embodiments, the pharmaceutical composition may contain a nucleic acid encoding a payload construct comprising at least one VENOM elements.
Although the descriptions of pharmaceutical compositions provided herein are principally directed to pharmaceutical compositions which are suitable for administration to humans, it will be understood by the skilled artisan that such compositions are generally suitable for administration to any other animal, e.g., to non-human animals, e.g. non-human mammals. Modification of pharmaceutical compositions suitable for administration to humans in order to render the compositions suitable for administration to various animals is well understood, and the ordinarily skilled veterinary pharmacologist can design and/or perform such modification with merely ordinary, if any, experimentation. Subjects to which administration of the pharmaceutical compositions is contemplated include, but are not limited to, humans and/or other primates; mammals, including commercially relevant mammals such as cattle, pigs, horses, sheep, cats, dogs, mice, rats, birds, including commercially relevant birds such as poultry, chickens, ducks, geese, and/or turkeys.
In some embodiments, compositions are administered to humans, human patients, or subjects.
The AAV particles and/or VAPs can be formulated using one or more excipients to: (1) increase stability, (2) increase cell transfection or transduction; (3) permit the sustained or delayed expression of the payload; (4) alter the biodistribution (e.g., target the viral particle to specific tissues or cell types); (5) increase the translation of encoded protein; (6) alter the release profile of encoded protein; and/or (7) allow for regulatable expression of the payload.
Formulations of the present disclosure can include, without limitation, saline, liposomes, lipid nanoparticles, polymers, peptides, proteins, cells transfected with viral vectors (e.g., for transfer or transplantation into a subject) and combinations thereof.
Formulations of the pharmaceutical compositions described herein may be prepared by any method known or hereafter developed in the art of pharmacology. As used herein the term “pharmaceutical composition” refers to compositions comprising at least one active ingredient and optionally one or more pharmaceutically acceptable excipients.
In general, such preparatory methods include the step of associating the active ingredient with an excipient and/or one or more other accessory ingredients. As used herein, the phrase “active ingredient” generally refers either to an AAV particle and/or VAP carrying a payload region encoding the polypeptides encoded by a vector genome of by an AAV particle and/or VAP as described herein.
A pharmaceutical composition in accordance with the present disclosure may be prepared, packaged, and/or sold in bulk, as a single unit dose, and/or as a plurality of single unit doses. As used herein, a “unit dose” refers to a discrete amount of the pharmaceutical composition comprising a predetermined amount of the active ingredient. The amount of the active ingredient is generally equal to the dosage of the active ingredient which would be administered to a subject and/or a convenient fraction of such a dosage such as, for example, one-half or one-third of such a dosage.
In some embodiments, the AAV particles and/or VAPs may be formulated in phosphate buffered saline (PBS), in combination with an ethylene oxide/propylene oxide copolymer (also known as Pluronic or poloxamer).
In some embodiments, the AAV particles and/or VAPs may be formulated in PBS with 0.001% Pluronic acid (F-68) (poloxamer 188) at a pH of about 7.0.
In some embodiments, the AAV particles and/or VAPs may be formulated in PBS with 0.001% Pluronic acid (F-68) (poloxamer 188) at a pH of about 7.3.
In some embodiments, the AAV particles and/or VAPs may be formulated in PBS with 0.001% Pluronic acid (F-68) (poloxamer 188) at a pH of about 7.4.
In some embodiments, the AAV particles and/or VAPs may be formulated in a solution comprising sodium chloride, sodium phosphate and an ethylene oxide/propylene oxide copolymer.
In some embodiments, the AAV particles and/or VAPs may be formulated in a solution comprising sodium chloride, sodium phosphate dibasic, sodium phosphate monobasic and poloxamer 188/Pluronic acid (F-68).
In some embodiments, the AAV particles and/or VAPs may be formulated in a solution comprising about 180 mM sodium chloride, about 10 mM sodium phosphate and about 0.001% poloxamer 188. In some embodiments, this formulation may be at a pH of about 7.3. The concentration of sodium chloride in the final solution may be 150 mM-200 mM. As non-limiting examples, the concentration of sodium chloride in the final solution may be 150 mM, 160 mM, 170 mM, 180 mM, 190 mM or 200 mM. The concentration of sodium phosphate in the final solution may be 1 mM-50 mM. As non-limiting examples, the concentration of sodium phosphate in the final solution may be 1 mM, 2 mM, 3 mM, 4 mM, 5 mM, 6 mM, 7 mM, 8 mM, 9 mM, 10 mM, 15 mM, 20 mM, 25 mM, 30 mM, 40 mM, or 50 mM. The concentration of poloxamer 188 (Pluronic acid (F-68)) may be 0.0001%-1%. As non-limiting examples, the concentration of poloxamer 188 (Pluronic acid (F-68)) may be 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1%, 0.5%, or 1%. The final solution may have a pH of 6.8-7.7. Non-limiting examples for the pH of the final solution include a pH of 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, or 7.7.
In some embodiments, the AAV particles and/or VAPs of the disclosure may be formulated in a solution comprising about 1.05% sodium chloride, about 0.212% sodium phosphate dibasic, heptahydrate, about 0.025% sodium phosphate monobasic, monohydrate, and 0.001% poloxamer 188, at a pH of about 7.4. As a non-limiting example, the concentration of AAV particle and/or VAP in this formulated solution may be about 0.001%. The concentration of sodium chloride in the final solution may be 0.1-2.0%, with non-limiting examples of 0.1%, 0.25%, 0.5%, 0.75%, 0.95%, 0.96%, 0.97%, 0.98%, 0.99%, 1.00%, 1.01%, 1.02%, 1.03%, 1.04%, 1.05%, 1.06%, 1.07%, 1.08%, 1.09%, 1.10%, 1.25%, 1.5%, 1.75%, or 2%. The concentration of sodium phosphate dibasic in the final solution may be 0.100-0.300% with non-limiting examples including 0.100%, 0.125%, 0.150%, 0.175%, 0.200%, 0.210%, 0.211%, 0.212%, 0.213%, 0.214%, 0.215%, 0.225%, 0.250%, 0.275%, 0.300%. The concentration of sodium phosphate monobasic in the final solution may be 0.010-0.050%, with non-limiting examples of 0.010%, 0.015%, 0.020%, 0.021%, 0.022%, 0.023%, 0.024%, 0.025%, 0.026%, 0.027%, 0.028%, 0.029%, 0.030%, 0.035%, 0.040%, 0.045%, or 0.050%. The concentration of poloxamer 188 (Pluronic acid (F-68)) may be 0.0001-1%. As non-limiting examples, the concentration of poloxamer 188 (Pluronic acid (F-68)) may be 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1%, 0.5%, or 1%. The final solution may have a pH of 6.8-7.7. Non-limiting examples for the pH of the final solution include a pH of 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, or 7.7.
Relative amounts of the active ingredient (e.g. AAV particle and/or VAP), the pharmaceutically acceptable excipient, and/or any additional ingredients in a pharmaceutical composition in accordance with the present disclosure may vary, depending upon the identity, size, and/or condition of the subject being treated and further depending upon the route by which the composition is to be administered. For example, the composition may comprise between 0.1% and 99% (w/w) of the active ingredient. By way of example, the composition may comprise between 0.1% and 100%, e.g., between 0.5 and 50%, between 1-30%, between 5-80%, or at least 80% (w/w) active ingredient.
In some embodiments, the AAV formulations described herein may contain sufficient AAV particles and/or VAPs for expression of at least one VENOM element. As a non-limiting example, the AAV particles and/or VAPs may contain vector genomes encoding 1, 2, 3, 4, or 5 functional antibodies.
According to the present disclosure AAV particles and/or VAPs may be formulated for CNS delivery. Agents that cross the brain blood barrier may be used. For example, some cell penetrating peptides that can target molecules to the brain blood barrier endothelium may be used for formulation (e.g., Mathupala, Expert Opin Ther Pat., 2009, 19, 137-140; the content of which is incorporated herein by reference in its entirety).
In some embodiments, a pharmaceutically acceptable excipient may be at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% pure. In some embodiments, an excipient is approved for use for humans and for veterinary use. In some embodiments, an excipient may be approved by United States Food and Drug Administration. In some embodiments, an excipient may be of pharmaceutical grade. In some embodiments, an excipient may meet the standards of the United States Pharmacopoeia (USP), the European Pharmacopoeia (EP), the British Pharmacopoeia, and/or the International Pharmacopoeia.
Excipients, as used herein, include, but are not limited to, any and all solvents, dispersion media, diluents, or other liquid vehicles, dispersion or suspension aids, surface active agents, isotonic agents, thickening or emulsifying agents, preservatives, and the like, as suited to the particular dosage form desired. Various excipients for formulating pharmaceutical compositions and techniques for preparing the composition are known in the art (see Remington: The Science and Practice of Pharmacy, 21st Edition, A. R. Gennaro, Lippincott, Williams & Wilkins, Baltimore, Md., 2006; incorporated herein by reference in its entirety). The use of a conventional excipient medium may be contemplated within the scope of the present disclosure, except insofar as any conventional excipient medium may be incompatible with a substance or its derivatives, such as by producing any undesirable biological effect or otherwise interacting in a deleterious manner with any other component(s) of the pharmaceutical composition.
Exemplary diluents include, but are not limited to, calcium carbonate, sodium carbonate, calcium phosphate, dicalcium phosphate, calcium sulfate, calcium hydrogen phosphate, sodium phosphate lactose, sucrose, cellulose, microcrystalline cellulose, kaolin, mannitol, sorbitol, inositol, sodium chloride, dry starch, cornstarch, powdered sugar, etc., and/or combinations thereof.
In some embodiments, AAV particle and/or VAP formulations may comprise at least one inactive ingredient. As used herein, the term “inactive ingredient” refers to one or more agents that do not contribute to the activity of the active ingredient of the pharmaceutical composition included in formulations. In some embodiments, all, none or some of the inactive ingredients which may be used in the formulations of the present disclosure may be approved by the US Food and Drug Administration (FDA).
Pharmaceutical composition formulations of AAV particles and/or VAPs disclosed herein may include cations or anions. In some embodiments, the formulations include metal cations such as, but not limited to, Zn2+, Ca2+, Cu2+, Mn2+, Mg+ and combinations thereof. As a non-limiting example, formulations may include polymers and complexes with a metal cation (See e.g., U.S. Pat. Nos. 6,265,389 and 6,555,525, each of which is herein incorporated by reference in its entirety).
The AAV particles and/or VAPs of the present disclosure may be administered by any delivery route which results in a therapeutically effective outcome. These include, but are not limited to, enteral (into the intestine), gastroenteral, epidural (into the dura mater), oral (by way of the mouth), transdermal, intracerebral (into the cerebrum), intracerebroventricular (into the cerebral ventricles), epicutaneous (application onto the skin), intradermal (into the skin itself), subcutaneous (under the skin), nasal administration (through the nose), intravenous (into a vein), intravenous bolus, intravenous drip, intra-arterial (into an artery), intramuscular (into a muscle), intracardiac (into the heart), intraosseous infusion (into the bone marrow), intrathecal (into the spinal canal), intraparenchymal (into the substance of, e.g., into brain tissue), intraperitoneal (infusion or injection into the peritoneum), intravesical infusion, intravitreal (through the eye), intracavernous injection (into a pathologic cavity) intracavitary (into the base of the penis), intravaginal administration, intrauterine, extra-amniotic administration, transdermal (diffusion through the intact skin for systemic distribution), transmucosal (diffusion through a mucous membrane), transvaginal, insufflation (snorting), sublingual, sublabial, enema, eye drops (onto the conjunctiva), ear drops, auricular (in or by way of the ear), buccal (directed toward the cheek), conjunctival, cutaneous, dental (to a tooth or teeth), electro-osmosis, endocervical, endosinusial, endotracheal, extracorporeal, hemodialysis, infiltration, interstitial, intra-abdominal, intra-amniotic, intra-articular, intrabiliary, intrabronchial, intrabursal, intracartilaginous (within a cartilage), intracaudal (within the cauda equine), intracisternal (within the cisterna magna cerebellomedularis), intracorneal (within the cornea), dental intracoronal, intracoronary (within the coronary arteries), intracorporus cavernosum (within the dilatable spaces of the corporus cavernosa of the penis), intradiscal (within a disc), intraductal (within a duct of a gland), intraduodenal (within the duodenum), intradural (within or beneath the dura), intraepidermal (to the epidermis), intraesophageal (to the esophagus), intragastric (within the stomach), intragingival (within the gingivae), intraileal (within the distal portion of the small intestine), intralesional (within or introduced directly to a localized lesion), intraluminal (within a lumen of a tube), intralymphatic (within the lymph), intramedullary (within the marrow cavity of a bone), intrameningeal (within the meninges), intramyocardial (within the myocardium), intraocular (within the eye), intraovarian (within the ovary), intrapericardial (within the pericardium), intrapleural (within the pleura), intraprostatic (within the prostate gland), intrapulmonary (within the lungs or its bronchi), intrasinal (within the nasal or periorbital sinuses), intraspinal (within the vertebral column), intrasynovial (within the synovial cavity of a joint), intratendinous (within a tendon), intratesticular (within the testicle), intrathecal (within the cerebrospinal fluid at any level of the cerebrospinal axis), intrathoracic (within the thorax), intratubular (within the tubules of an organ), intratumor (within a tumor), intratympanic (within the aurus media), intravascular (within a vessel or vessels), intraventricular (within a ventricle), iontophoresis (by means of electric current where ions of soluble salts migrate into the tissues of the body), irrigation (to bathe or flush open wounds or body cavities), laryngeal (directly upon the larynx), nasogastric (through the nose and into the stomach), occlusive dressing technique (topical route administration which is then covered by a dressing which occludes the area), ophthalmic (to the external eye), oropharyngeal (directly to the mouth and pharynx), parenteral, percutaneous, periarticular, peridural, perineural, periodontal, rectal, respiratory (within the respiratory tract by inhaling orally or nasally for local or systemic effect), retrobulbar (behind the pons or behind the eyeball), soft tissue, subarachnoid, subconjunctival, submucosal, topical, transplacental (through or across the placenta), transtracheal (through the wall of the trachea), transtympanic (across or through the tympanic cavity), ureteral (to the ureter), urethral (to the urethra), vaginal, caudal block, diagnostic, nerve block, biliary perfusion, cardiac perfusion, photopheresis, and spinal.
In some embodiments, compositions may be administered in a way which allows them to cross the blood-brain barrier, vascular barrier, or other epithelial barrier. The AAV particles and/or VAPs of the present disclosure may be administered in any suitable form, either as a liquid solution or suspension, as a solid form suitable for liquid solution or suspension in a liquid solution. The AAV particles and/or VAPs may be formulated with any appropriate and pharmaceutically acceptable excipient.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be delivered to a subject via a single route administration.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be delivered to a subject via a multi-site route of administration. A subject may be administered at 2, 3, 4, 5, or more than 5 sites.
In some embodiments, a subject may be administered the AAV particles and/or VAPs of the present disclosure using a bolus infusion.
In some embodiments, a subject may be administered the AAV particles and/or VAPs of the present disclosure using sustained delivery over a period of minutes, hours, or days. The infusion rate may be changed depending on the subject, distribution, formulation or another delivery parameter.
In some embodiments, the AAV particles and/or VAPs may be delivered by more than one route of administration. As non-limiting examples of combination administrations, AAV particles and/or VAPs may be delivered by intrathecal and intracerebroventricular, or by intravenous and intraparenchymal administration.
In some embodiments, the AAV particles and/or VAPs may be administered to a subject by systemic administration.
In some embodiments, the systemic administration is intravenous administration.
In another embodiment, the systemic administration is intraarterial administration.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be administered to a subject by intravenous administration.
In some embodiments, the intravenous administration may be achieved by subcutaneous delivery.
In some embodiments, the intravenous administration may be achieved by a tail vein injection (e.g., in a mouse model).
In some embodiments, the intravenous administration may be achieved by retro-orbital injection.
In some embodiments, the AAV particles and/or VAPs may be delivered by direct injection into the brain. As a non-limiting example, the brain delivery may be by intrahippocampal administration.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be administered to a subject by intraparenchymal administration. In some embodiments, the intraparenchymal administration is to tissue of the central nervous system.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be administered to a subject by intracranial delivery (See, e.g., U.S. Pat. No. 8,119,611; the content of which is incorporated herein by reference in its entirety).
In some embodiments, the AAV particles and/or VAPs may be delivered by injection into the CSF pathway. Non-limiting examples of delivery to the CSF pathway include intrathecal and intracerbroventricular administration.
In some embodiments, the AAV particles and/or VAPs may be delivered to the brain by systemic delivery. As a non-limiting example, the systemic delivery may be by intravascular administration. As a non-limiting example, the systemic or intravascular administration may be intravenous.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be delivered by intraocular delivery route. A non-limiting example of intraocular administration include an intravitreal injection.
In some embodiments, the AAV particles and/or VAPs may be delivered by intramuscular administration. Whilst not wishing to be bound by theory, the multi-nucleated nature of muscle cells provides an advantage to gene transduction subsequent to AAV delivery. Cells of the muscle are capable of expressing recombinant proteins with the appropriate post-translational modifications. The enrichment of muscle tissue with vascular structures allows for transfer to the blood stream and whole-body delivery. Examples of intramuscular administration include systemic (e.g., intravenous), subcutaneous or directly into the muscle. In some embodiments, more than one injection is administered.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be delivered by intramuscular delivery route. (See, e.g., U.S. Pat. No. 6,506,379; the content of which is incorporated herein by reference in its entirety). Non-limiting examples of intramuscular administration include an intravenous injection or a subcutaneous injection.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to a subject and transduce muscle of a subject. As a non-limiting example, the AAV particles and/or VAPs are administered by intramuscular administration.
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be administered to a subject by subcutaneous administration.
In some embodiments, the intramuscular administration is via systemic delivery.
In some embodiments, the intramuscular administration is via intravenous delivery.
In some embodiments, the intramuscular administration is via direct injection to the muscle.
In some embodiments, the muscle is transduced by administration, and this is referred to as intramuscular administration.
In some embodiments, the intramuscular delivery comprises administration at one site.
In some embodiments, the intramuscular delivery comprises administration at more than one site. In some embodiments, the intramuscular delivery comprises administration at two sites. In some embodiments, the intramuscular delivery comprises administration at three sites. In some embodiments, the intramuscular delivery comprises administration at four sites. In some embodiments, the intramuscular delivery comprises administration at more than four sites.
In some embodiments, intramuscular delivery is combined with at least one other method of administration.
In some embodiments, the AAV particles and/or VAPs that may be administered to a subject by peripheral injections. Non-limiting examples of peripheral injections include intraperitoneal, intramuscular, intravenous, conjunctival, or joint injection. It was disclosed in the art that the peripheral administration of AAV vector genomes can be transported to the central nervous system, for example, to the motor neurons (e.g., U.S. Patent Publication Nos. US20100240739 and US20100130594; the content of each of which is incorporated herein by reference in their entirety).
In some embodiments, the AAV particles and/or VAPs of the present disclosure may be administered to a subject by intraparenchymal administration. In some embodiments, the intraparenchymal administration is to muscle tissue.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are delivered as described in Bright et al 2015 (Neurobiol Aging. 36(2):693-709), the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to the gastrocnemius muscle of a subject.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to the bicep femorii of the subject.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to the tibialis anterior muscles.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to the soleus muscle.
As described herein, in some embodiments, pharmaceutical compositions, AAV particles and/or VAPs of the present disclosure are formulated in depots for extended release. Generally, specific organs or tissues (“target tissues”) are targeted for administration.
In some aspects, pharmaceutical compositions, AAV particles and/or VAPs of the present disclosure are spatially retained within or proximal to target tissues. Provided are methods of providing pharmaceutical compositions, AAV particles and/or VAPs, to target tissues of mammalian subjects by contacting target tissues (which comprise one or more target cells) with pharmaceutical compositions, AAV particles and/or VAPs, under conditions such that they are substantially retained in target tissues, meaning that at least 10, 20, 30, 40, 50, 60, 70, 80, 85, 90, 95, 96, 97, 98, 99, 99.9, 99.99 or greater than 99.991% of the composition is retained in the target tissues. Advantageously, retention is determined by measuring the amount of pharmaceutical compositions, AAV particles and/or VAPs, that enter one or more target cells. For example, at least 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, 99.9%, 99.99%, or greater than 99.99% of pharmaceutical compositions, AAV particles and/or VAPs, administered to subjects are present intracellularly at a period of time following administration. For example, intramuscular injection to mammalian subjects may be performed using aqueous compositions comprising pharmaceutical compositions. AAV particles and/or VAPs of the present disclosure and one or more transfection reagents, and retention is determined by measuring the amount of pharmaceutical compositions, AAV particles and/or VAPs, present in muscle cells.
Certain aspects are directed to methods of providing pharmaceutical compositions, AAV particles and/or VAPs of the present disclosure to a target tissues of mammalian subjects, by contacting target tissues (comprising one or more target cells) with pharmaceutical compositions, AAV particles and/or VAPs under conditions such that they are substantially retained in such target tissues. Pharmaceutical compositions, AAV particles and/or VAPs comprise enough active ingredient such that the effect of interest is produced in at least one target cell. In some embodiments, pharmaceutical compositions, AAV particles and/or VAPs generally comprise one or more cell penetration agents, although “naked” formulations (such as without cell penetration agents or other agents) are also contemplated, with or without pharmaceutically acceptable carriers.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for treatment of disease described in U.S. Pat. No. 8,999,948, or International Publication No. WO2014178863, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering gene therapy in Alzheimer's Disease or other neurodegenerative conditions as described in US Application No. 20150126590, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivery of a CNS gene therapy as described in U.S. Pat. Nos. 6,436,708, and 8,946,152, and International Publication No. WO2015168666, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering proteins using AAV vector genomes described in European Patent Application No. EP2678433, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering DNA to the bloodstream described in U.S. Pat. No. 6,211,163, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering a payload to the central nervous system described in U.S. Pat. No. 7,588,757, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering a payload described in U.S. Pat. No. 8,283,151, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering a payload using a glutamic acid decarboxylase (GAD) delivery vector described in International Patent Publication No. WO2001089583, the contents of which are herein incorporated by reference in their entirety.
In some embodiments, the AAV particles and/or VAPs or pharmaceutical compositions of the present disclosure may be administered or delivered using the methods for delivering a payload to neural cells described in International Patent Publication No. WO2012057363, the contents of which are herein incorporated by reference in their entirety.
The present disclosure provides a method of delivering to a cell or tissue any of the above-described AAV particles and/or VAPs, comprising contacting the cell or tissue with said AAV particle and/or VAP or contacting the cell or tissue with a formulation comprising said AAV particle and/or VAP, or contacting the cell or tissue with any of the described compositions, including pharmaceutical compositions. The method of delivering the AAV particle and/or VAP to a cell or tissue can be accomplished in vitro, ex vivo, or in vivo.
The present disclosure additionally provides a method of delivering to a subject, including a mammalian subject, any of the above-described AAV particles and/or VAPs comprising administering to the subject said AAV particle and/or VAP, or administering to the subject a formulation comprising said AAV particle and/or VAP, or administering to the subject any of the described compositions, including pharmaceutical compositions.
The present disclosure provides methods of administering AAV particles and/or VAPs in accordance with the disclosure to a subject in need thereof. The pharmaceutical, diagnostic, or prophylactic AAV particles and/or VAPs and compositions of the present disclosure may be administered to a subject using any amount and any route of administration effective for preventing, treating, managing, or diagnosing diseases, disorders and/or conditions. The exact amount required will vary from subject to subject, depending on the species, age, and general condition of the subject, the severity of the disease, the particular composition, its mode of administration, its mode of activity, and the like. The subject may be a human, a mammal, or an animal. Compositions in accordance with the disclosure are typically formulated in unit dosage form for ease of administration and uniformity of dosage. It will be understood, however, that the total daily usage of the compositions of the present disclosure may be decided by the attending physician within the scope of sound medical judgment. The specific therapeutically effective, prophylactically effective, or appropriate diagnostic dose level for any particular individual will depend upon a variety of factors including the disorder being treated and the severity of the disorder; the activity of the specific payload employed; the specific composition employed; the age, body weight, general health, sex and diet of the patient; the time of administration, route of administration, and rate of excretion of the specific AAV particle and/or VAP employed; the duration of the treatment; drugs used in combination or coincidental with the specific AAV particle and/or VAP employed; and like factors well known in the medical arts.
In certain embodiments, AAV particle and/or VAP pharmaceutical compositions in accordance with the present disclosure may be administered at dosage levels sufficient to deliver from about 0.0001 mg/kg to about 100 mg/kg, from about 0.001 mg/kg to about 0.05 mg/kg, from about 0.005 mg/kg to about 0.05 mg/kg, from about 0.001 mg/kg to about 0.005 mg/kg, from about 0.05 mg/kg to about 0.5 mg/kg, from about 0.01 mg/kg to about 50 mg/kg, from about 0.1 mg/kg to about 40 mg/kg, from about 0.5 mg/kg to about 30 mg/kg, from about 0.01 mg/kg to about 10 mg/kg, from about 0.1 mg/kg to about 10 mg/kg, or from about 1 mg/kg to about 25 mg/kg, of subject body weight per day, one or more times a day, to obtain the desired therapeutic, diagnostic, or prophylactic, effect. It will be understood that the above dosing concentrations may be converted to vg or vector genomes per kg or into total vector genomes administered by one of skill in the art.
In certain embodiments, AAV particle and/or VAP pharmaceutical compositions in accordance with the present disclosure may be administered at about 10 to about 600 μl/site, 50 to about 500 μl/site, 100 to about 400 μl/site, 120 to about 300 μl/site, 140 to about 200 μl/site, about 160 μl/site. As non-limiting examples, AAV particles and/or VAPs may be administered at 50 μl/site and/or 150 μl/site.
In certain embodiments, the desired dosage may be delivered using multiple administrations (e.g., two, three, four, or more than four administrations). When multiple administrations are employed, split dosing regimens such as those described herein may be used. As used herein, a “split dose” is the division of “single unit dose” or total daily dose into two or more doses, e.g., two or more administrations of the “single unit dose”. As used herein, a “single unit dose” is a dose of any therapeutic administered in one dose/at one time/single route/single point of contact. i.e., single administration event.
The desired dosage of the AAV particles and/or VAPs of the present disclosure may be administered as a “pulse dose” or as a “continuous flow”. As used herein, a “pulse dose” is a series of single unit doses of any therapeutic administered with a set frequency over a period of time. As used herein, a “continuous flow” is a dose of therapeutic administered continuously for a period of time in a single route/single point of contact, i.e., continuous administration event. A total daily dose, an amount given or prescribed in 24-hour period, may be administered by any of these methods, or as a combination of these methods, or by any other methods suitable for a pharmaceutical administration.
In some embodiments, delivery of the AAV particles and/or VAPs of the present disclosure to a subject provides neutralizing activity to a subject. The neutralizing activity can be for at least 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 1 year, 13 months, 14 months, 15 months, 16 months, 17 months, 18 months. 19 months, 20 months, 20 months, 21 months, 22 months, 23 months, 2 years, 3 years, 4 years, 5 years, 6 years, 7 years, 8 years, 9 years, 10 years or more than 10 years.
In some embodiments, delivery of the AAV particles and/or VAPs of the present disclosure results in minimal serious adverse events (SAEs) as a result of the delivery of the AAV particles and/or VAPs.
In some embodiments, delivery of AAV particles and/or VAPs may comprise a total dose between about 1×106 VG and about 1×1016 VG. In some embodiments, delivery may comprise a total dose of about 1×106, 2×106, 3×106, 4×106, 5×106, 6×106, 7×106, 8×106, 9×106, 1×107, 2×107, 3×107, 4×107, 5×107, 6×107, 7×107, 8×107, 9×107, 1×108, 2×108, 3×108, 4×108, 5×108, 6×108, 7×108, 8×108, 9×108, 1×109, 2×109, 3×109, 4×109, 5×109, 6×109, 7×109, 8×109, 9×109, 1×1010, 1.9×1010, 2×1010, 3×1010, 3.73×1010, 4×1010, 5×1010, 6×1010, 7×1010, 8×1010, 9×1010, 1×1011, 2×1011, 2.5×1011, 3×1011, 4×1011, 5×101, 6×101, 7×1011, 8×1011, 9×1011, 1×1011, 2×1012, 3×1012, 4×1012, 5×1012, 6×1012, 7×1012, 8×1012, 9×1012, 1×1013, 2×1013, 3×1013, 4×1013, 5×1013, 6×1013, 7×1013, 8×1013, 9×1013, 1×1014, 2×1014, 3×1014, 4×1014, 5×1014, 6×1014, 7×1014, 8×1014, 9×1014, 1×1015, 2×1015, 3×1015, 4×1015, 5×1015, 6×1015, 7×1015, 8×1015, 9×1015, or 1×1016 VG. As a non-limiting example, the total dose is 1×1013 VG. As another non-limiting example, the total dose is 2.1×1012VG.
In some embodiments, delivery of AAV particles and/or VAPs may comprise a composition concentration between about 1×106 VG/mL and about 1×1016 VG/mL. In some embodiments, delivery may comprise a composition concentration of about 1×106, 2×106, 3×106, 4×106, 5×106, 6×106, 7×106, 8×106, 9×106, 1×107, 2×107, 3×107, 4×107, 5×107, 6×107, 7×107, 8×107, 9×107, 1×108, 2×108, 3×108, 4×108, 5×108, 6×108, 7×108, 8×108, 9×108, 1×109, 2×109, 3×109, 4×109, 5×109 6×109, 7×109, 8×109, 9×109, 1×1010, 2×1010, 3×1010, 4×1010, 5×1010, 6×1010, 7×1010, 8×1010, 9×1010, 1×1011, 2×1011, 3×1011, 4×1011, 5×1011, 6×1011, 7×1011, 8×1011, 9×1011, 1×1012, 2×1012, 3×1012, 4×1012, 5×1012, 6×1012, 7×1012, 8×1012, 9×1012, 1×1013, 2×1013, 3×1013, 4×1013, 5×1013, 6×1013, 7×1013, 8×1013, 9×1013, 1×1014, 2×1014, 3×1014, 4×1014, 5×1014, 6×1014, 7×1014, 8×1014 9×1014, 1×1015, 2×1015, 3×1015, 4×1015, 5×1015, 6×1015, 7×1015, 8×1015, 9×1015, or 1×1016 VG/mL. In some embodiments, the delivery comprises a composition concentration of 1×1013 VG/mL. In some embodiments, the delivery comprises a composition concentration of 2.1×1012 VG/mL.
The AAV particles and/or VAPs may be used in combination with one or more other therapeutic, prophylactic, research or diagnostic agents. By “in combination with,” it is not intended to imply that the agents must be administered at the same time and/or formulated for delivery together, although these methods of delivery are within the scope of the present disclosure. Compositions can be administered concurrently with, prior to, or subsequent to, one or more other desired therapeutics or medical procedures. In general, each agent will be administered at a dose and/or on a time schedule determined for that agent. In some embodiments, the present disclosure encompasses the delivery of pharmaceutical, prophylactic, research, or diagnostic compositions in combination with agents that may improve their bioavailability, reduce and/or modify their metabolism, inhibit their excretion, and/or modify their distribution within the body.
Expression of payloads from vector genomes may be determined using various methods known in the art such as, but not limited to immunochemistry (e.g., IHC), in situ hybridization (ISH), enzyme-linked immunosorbent assay (ELISA), affinity ELISA. ELISPOT, flow cytometry, immunocytology, immunohistochemistry, surface plasmon resonance analysis, kinetic exclusion assay, liquid chromatography-mass spectrometry (LCMS), high-performance liquid chromatography (HPLC). BCA assay, immunoelectrophoresis. Western blot, SDS-PAGE, protein immunoprecipitation, and/or PCR.
In some embodiments, the ELISA assays used are those described in Liu et al 2016, the contents of which are herein incorporated by reference in their entirety (Liu. W et al., 2016 J Neurosci 36(49):12425-12435).
The present disclosure provides a method for treating a disease, disorder and/or condition in a mammalian subject, including a human subject, comprising administering to the subject any of the AAV particles and/or VAPs described herein or administering to the subject any of the described compositions, including pharmaceutical compositions, described herein.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to a subject prophylactically.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to a subject having at least one of the diseases described herein.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are administered to a subject to treat a disease or disorder described herein. The subject may have the disease or disorder or may be at-risk to developing the disease or disorder.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are part of an active immunization strategy to protect against diseases and disorders. In an active immunization strategy, a vaccine or AAV particles and/or VAPs are administered to a subject to prevent an infectious disease by activating the subject's production of antibodies that can fight off invading bacteria or viruses.
In some embodiments, the AAV particles and/or VAPs of the present disclosure are part of a passive immunization strategy. In a passive immunization strategy, antibodies against a particular infectious agent are given directly to the subject.
The AAV particles and/or VAPs of the present disclosure may be used for diagnostic purposes or as diagnostic tools for any disease or disorder. As non-limiting examples, the AAV particles and/or VAPs of the present disclosure or the antibodies encoded within the vector genome therein may be used as a biomarker for disease diagnosis. As a second non-limiting example, the AAV particles and/or VAPs of the present disclosure or the antibodies encoded within the vector genome therein may be used for diagnostic imaging purposes, e.g., MRI, PET, CT or ultrasound.
The AAV particles and/or VAPs of the present disclosure or the antibodies encoded by the vector genome therein may be used to prevent disease or stabilize the progression of disease. In some embodiments, the AAV particles and/or VAPs of the present disclosure are used to as a prophylactic to prevent a disease or disorder in the future. In some embodiments, the AAV particles and/or VAPs of the present disclosure are used to halt further progression of a disease or disorder. As a non-limiting example, the AAV particles and/or VAPs may be used in a manner similar to that of a vaccine.
The AAV particles and/or VAPs of the present disclosure or the antibodies encoded by the vector genome therein may also be used as research tools. The AAV particles and/or VAPs may be used as in any research experiment, e.g., in vivo or in vitro experiments. In a non-limiting example, the AAV particles and/or VAPs may be used in cultured cells. The cultured cells may be derived from any origin known to one with skill in the art, and may be as non-limiting examples, derived from a stable cell line, an animal model or a human patient or control subject. In a non-limiting example, the AAV particles and/or VAPs may be used in in vivo experiments in animal models (i.e., mouse, rat, rabbit, dog, cat, non-human primate, guinea pig, ferret, c-elegans, drosophila, zebrafish, or any other animal used for research purposes, known in the art). In another non-limiting example, the AAV particles and/or VAPs may be used in human research experiments or human clinical trials.
The AAV particles and/or VAPs may be used as a combination therapy with any other therapeutic molecule known in the art. The therapeutic molecule may be approved by the US Food and Drug Administration or may be in clinical trial or at the preclinical research stage. The therapeutic molecule may utilize any therapeutic modality known in the art, with non-limiting examples including gene silencing or interference (i.e., miRNA, siRNA, RNAi, shRNA), gene editing (i.e., TALEN, CRISPR/Cas9 systems, zinc finger nucleases), and gene, protein or enzyme replacement.
The present disclosure additionally provides a method for treating neurological diseases and/or disorders in a mammalian subject, including a human subject, comprising administering to the subject any of the AAV particles and/or VAPs.
In some embodiments, methods of treating diseases and/or disorders in a subject in need thereof may comprise the steps of: (1) deriving, generating and/or selecting at least one VENOM element or fragment thereof; (2) producing an AAV particle with a vector genome that includes a payload region encoding the VENOM element of (1) (referred to as a VAP); and (3) administering the VAP (or pharmaceutical composition thereof) to the subject.
The present disclosure provides a method for administering to a subject in need thereof, including a human subject, a therapeutically effective amount of the AAV particles and/or VAPs to slow, stop or reverse disease progression.
In some embodiments, the AAV particles and/or VAPs are used to correct toxic repeats in autosomal dominant disorders such as, but not limited to, Huntington's Disease (HD), Amyotrophic Lateral Sclerosis (ALS) and frontotemporal dementia (FTD). As a non-limiting example, the autosomal dominant disorder is HD and the toxic repeats are in the Huntingtin gene (HTT). As another non-limiting example, the autosomal dominant disorder is ALS and the toxic repeats are in C9orf72. As another non-limiting example, the autosomal dominant disorder is FTD and the toxic repeats are in C9orf72.
In some embodiments, the AAV particles and/or VAPs correct toxic repeats in autosomal dominant disorders by introducing a stop codon into the gene. The introduction of the stop codon is able to reduce the number of toxic repeats and treats the autosomal dominant disorder.
In some embodiments, the AAV particles and/or VAPs correct toxic repeats in autosomal dominant disorders by using Cas13 to induce exon-skipping which reduces the number of toxic repeats and treats the autosomal dominant disorder.
In some embodiments, the AAV particles and/or VAPs are used to introduce protective mutations in a gene in order to treat Alzheimer's Disease (AD). The AD may be familial AD (FAD) (when a familial history of AD is known), early-onset AD (when AD is diagnosed in a subject under the age of 65), or sporadic AD (SAD) (AD in patients without any family history of AD).
In some embodiments, the protective mutation may be introduced by nucleobase editing. The gene may be, but is not limited to, Amyloid Beta Precursor Protein (APP), Presenilin 1 (PSEN1 or PS1), and Presenilin 2 (PSEN2 or PS2).
VENOM elements such as, but not limited to CRISPR/Cas9 may be used to introduce mutations into a gene as a protection against Alzheimer's Disease. Paquet et al. (Nature. 2016 May 5; 533(7601):125-9. doi: 10.1038/nature17664; the contents of which are herein incorporated by reference in their entirety) and Tremblay et al. (International Publication No. WO2015168800, the contents of which are herein incorporated by reference in their entirety) describe treating Alzheimer's Disease by introducing the A673T mutation into the APP gene using the CRISPR/Cas9 system.
In some embodiments, the AAV particles and/or VAPs are used to introduce the A673T protective mutation in the APP gene to treat and/or reduce the symptoms of Alzheimer's Disease.
In some embodiments, the AAV particles and/or VAPs are used to correct mutations in a gene to treat AD. As a non-limiting example, the gene is APP and the mutation is at amino acid 396 (Glu693Gln), amino acid 717 (Val717IIe, Val717Leu, Val717Phe, or Val717Gly), amino acid 693 (Glu693Gln), amino acid 670, amino acid 671, and/or amino acid 692. As a non-limiting example, the gene is PS1 and the mutation is at amino acid 206 (Gly206Ala), amino acid 146 (Met146Len), amino acid 280 (Glu280Ala), amino acid 163 (His163Arg), amino acid 264 (Pro264Leu), and/or amino acid 318 (Glu318Gly). As a non-limiting example, the gene is PS2.
In some embodiments, the AAV particles and/or VAPs use the REPAIR method as the VENOM element in order to treat and/or reduce the symptoms of Alzheimer's Disease. As a non-limiting example, the REPAIR method may be used to correct the V717I mutation in the APP gene.
In some embodiments, to treat Alzheimer's Disease protective mutations may be introduced into a gene and mutations causing a disease, disorder and/or condition may be removed. As a non-limiting example, the A673T protective mutation may be introduced into the APP gene and the V717I mutation may be corrected in the APP gene in order to treat Alzheimer's Disease. The REPAIR method may be used for the introduction of the A673T mutation and the correction of the V717I mutation.
In some embodiments, the disclosure provides a variety of kits for conveniently and/or effectively carrying out methods of the present disclosure. Typically, kits will comprise sufficient amounts and/or numbers of components to allow a user to perform multiple treatments of a subject(s) and/or to perform multiple experiments.
Any of the AAV particles and/or VAPs of the present disclosure may be comprised in a kit. In some embodiments, kits may further include reagents and/or instructions for creating and/or synthesizing compounds and/or compositions of the present disclosure. In some embodiments, kits may also include one or more buffers. In some embodiments, kits may include components for making protein or nucleic acid arrays or libraries and thus, may include, for example, solid supports.
In some embodiments, kit components may be packaged either in aqueous media or in lyophilized form. The container means of the kits will generally include at least one vial, test tube, flask, bottle, syringe or other container means, into which a component may be placed, and preferably, suitably aliquoted. Where there is more than one kit component, (labeling reagent and label may be packaged together), kits may also generally contain second, third or other additional containers into which additional components may be separately placed. In some embodiments, kits may also comprise second container means for containing sterile, pharmaceutically acceptable buffers and/or other diluents. In some embodiments, various combinations of components may be comprised in one or more vial. Kits of the present disclosure may also typically include means for containing compounds and/or compositions of the present disclosure, e.g., proteins, nucleic acids, and any other reagent containers in close confinement for commercial sale. Such containers may include injection or blow-molded plastic containers into which desired vials are retained.
In some embodiments, kit components are provided in one and/or more liquid solutions. In some embodiments, liquid solutions are aqueous solutions, with sterile aqueous solutions being particularly preferred. In some embodiments, kit components may be provided as dried powder(s). When reagents and/or components are provided as dry powders, such powders may be reconstituted by the addition of suitable volumes of solvent. In some embodiments, it is envisioned that solvents may also be provided in another container means. In some embodiments, labeling dyes are provided as dried powders. In some embodiments, it is contemplated that 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, 600, 700, 800, 900, 1000 micrograms or at least or at most those amounts of dried dye are provided in kits. In such embodiments, dye may then be resuspended in any suitable solvent, such as DMSO.
In some embodiments, kits may include instructions for employing kit components as well the use of any other reagent not included in the kit. Instructions may include variations that may be implemented.
In some embodiments, the AAV particles and/or VAPs may delivered to a subject using a device to deliver the AAV particles and/or VAPs and a head fixation assembly. The head fixation assembly may be, but is not limited to, any of the head fixation assemblies sold by MRI interventions. As a non-limiting example, the head fixation assembly may be any of the assemblies described in U.S. Pat. Nos. 8,099,150, 8,548,569, and 9,031,636 and International Patent Publication Nos. WO201108495 and WO2014014585, the contents of each of which are incorporated by reference in their entireties. A head fixation assembly may be used in combination with an MRI compatible drill such as, but not limited to, the MRI compatible drills described in International Patent Publication No. WO2013181008 and US Patent Publication No. US20130325012, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the AAV particles and/or VAPs may be delivered using a method, system and/or computer program for positioning apparatus to a target point on a subject to deliver the AAV particles and/or VAPs. As a non-limiting example, the method, system and/or computer program may be the methods, systems and/or computer programs described in U.S. Pat. No. 8,340,743, the contents of which are herein incorporated by reference in its entirety. The method may include: determining a target point in the body and a reference point, wherein the target point and the reference point define a planned trajectory line (PTL) extending through each; determining a visualization plane, wherein the PTL intersects the visualization plane at a sighting point; mounting the guide device relative to the body to move with respect to the PTL, wherein the guide device does not intersect the visualization plane; determining a point of intersection (GPP) between the guide axis and the visualization plane; and aligning the GPP with the sighting point in the visualization plane.
In some embodiments, the AAV particles and/or VAPs may be delivered to a subject using a convention-enhanced delivery device. Non-limiting examples of targeted delivery of drugs using convection are described in US Patent Publication Nos. US20100217228, US20130035574, and US 20130035660 and International Patent Publication No. WO2013019830 and WO2008144585, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, a subject may be imaged prior to, during and/or after delivery of the AAV particles and/or VAPs. The imaging method may be a method known in the art and/or described herein, such as but not limited to, magnetic resonance imaging (MRI). As a non-limiting example, imaging may be used to assess therapeutic effect. As another non-limiting example, imaging may be used for assisted delivery of AAV particles and/or VAPs.
In some embodiments, the AAV particles and/or VAPs may be delivered using an MRI-guided device. Non-limiting examples of MRI-guided devices are described in U.S. Pat. Nos. 9,055,884, 9,042,958, 8,886,288, 8,768,433, 8,396,532, 8,369,930, 8,374,677, and 8,175,677 and US Patent Application No. US20140024927 the contents of each of which are herein incorporated by reference in their entireties. As a non-limiting example, the MRI-guided device may be able to provide data in real time such as those described in U.S. Pat. Nos. 8,886,288 and 8,768,433, the contents of each of which is herein incorporated by reference in its entirety. As another non-limiting example, the MRI-guided device or system may be used with a targeting cannula such as the systems described in U.S. Pat. Nos. 8,175,677 and 8,374,677, the contents of each of which are herein incorporated by reference in their entireties. As yet another non-limiting example, the MRI-guided device includes a trajectory guide frame for guiding an interventional device as described, for example, in U.S. Pat. No. 9,055,884 and US Patent Application No. US20140024927, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, the AAV particles and/or VAPs may be delivered using an MRI-compatible tip assembly. Non-limiting examples of MRI-compatible tip assemblies are described in US Patent Publication No. US20140275980, the contents of which is herein incorporated by reference in its entirety.
In some embodiments, the AAV particles and/or VAPs may be delivered using a cannula which is MRI-compatible. Non-limiting examples of MRI-compatible cannulas include those taught in International Patent Publication No. WO2011130107, the contents of which are herein incorporated by reference in its entirety.
In some embodiments, the AAV particles and/or VAPs may be delivered using a catheter which is MRI-compatible. Non-limiting examples of MRI-compatible catheters include those taught in International Patent Publication No. WO2012116265, U.S. Pat. No. 8,825,133 and US Patent Publication No. US20140024909, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, the AAV particles and/or VAPs may be delivered using a device with an elongated tubular body and a diaphragm as described in US Patent Publication Nos. US20140276582 and US20140276614, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, the AAV particles and/or VAPs may be delivered using an MRI compatible localization and/or guidance system such as, but not limited to, those described in US Patent Publication Nos. US20150223905 and US20150230871, the contents of each of which are herein incorporated by reference in their entireties. As a non-limiting example, the MRI compatible localization and/or guidance systems may comprise a mount adapted for fixation to a patient, a targeting cannula with a lumen configured to attach to the mount so as to be able to controllably translate in at least three dimensions, and an elongate probe configured to snugly advance via slide and retract in the targeting cannula lumen, the elongate probe comprising at least one of a stimulation or recording electrode.
In some embodiments, the AAV particles and/or VAPs may be delivered to a subject using a trajectory frame as described in US Patent Publication Nos. US20150031982 and US20140066750 and International Patent Publication Nos. WO2015057807 and WO2014039481, the contents of each of which are herein incorporated by reference in their entireties.
In some embodiments, the AAV particles and/or VAPs may be delivered to a subject using a gene gun.
At various places in the present specification, substituents of compounds of the present disclosure are disclosed in groups or in ranges. It is specifically intended that the present disclosure include each and every individual sub combination of the members of such groups and ranges.
About: As used herein, the term “about” means+/−10% of the recited value.
Adeno-associated virus: The term “adeno-associated virus” or “AAV” as used herein refers to members of the dependovirus genus comprising any particle, sequence, gene, protein, or component derived therefrom.
AAV Particle: As used herein, an “AAV particle” is a virus which comprises a vector genome with at least one payload region and at least one ITR region. AAV vector genomes of the present disclosure may be produced recombinantly and may be based on adeno-associated virus (AAV) parent or reference sequences. AAV particle may be derived from any serotype, described herein or known in the art, including combinations of serotypes (i.e., “pseudotyped” AAV) or from various genomes (e.g., single stranded or self-complementary). In addition, the AAV particle may be replication defective and/or targeted.
Activity: As used herein, the term “activity” refers to the condition in which things are happening or being done. Compositions may have activity and this activity may involve one or more biological events.
Administered in combination: As used herein, the term “administered in combination” or “combined administration” means that two or more agents are administered to a subject at the same time or within an interval such that there may be an overlap of an effect of each agent on the patient. In some embodiments, they are administered within about 60, 30, 15, 10, 5, or 1 minute of one another. In some embodiments, the administrations of the agents are spaced sufficiently closely together such that a combinatorial (e.g., a synergistic) effect is achieved.
Amelioration: As used herein, the term “amelioration” or “ameliorating” refers to a lessening of severity of at least one indicator of a condition or disease. For example, in the context of neurodegeneration disorder, amelioration includes the reduction of neuron loss.
Animal: As used herein, the term “animal” refers to any member of the animal kingdom. In some embodiments, “animal” refers to humans at any stage of development. In some embodiments, “animal” refers to non-human animals at any stage of development. In certain embodiments, the non-human animal is a mammal (e.g., a rodent, a mouse, a rat, a rabbit, a monkey, a dog, a cat, a sheep, cattle, a primate, or a pig). In some embodiments, animals include, but are not limited to, mammals, birds, reptiles, amphibians, fish, and worms. In some embodiments, the animal is a transgenic animal, genetically-engineered animal, or a clone.
Antibody: As used herein, the term “antibody” is referred to in the broadest sense and specifically covers various embodiments including, but not limited to monoclonal antibodies, polyclonal antibodies, multispecific antibodies (e.g. bispecific antibodies formed from at least two intact antibodies), and antibody fragments (e.g., diabodies) so long as they exhibit a desired biological activity (e.g., “functional”). Antibodies are primarily amino-acid based molecules but may also comprise one or more modifications (including, but not limited to the addition of sugar moieties, fluorescent moieties, chemical tags, etc.). Non-limiting examples of antibodies or fragments thereof include VH and VL domains, scFvs, Fab, Fab′, F(ab′)2, Fv fragment, diabodies, linear antibodies, single chain antibody molecules, multispecific antibodies, bispecific antibodies, intrabodies, monoclonal antibodies, polyclonal antibodies, humanized antibodies, codon-optimized antibodies, tandem scFv antibodies, bispecific T-cell engagers, mAb2 antibodies, chimeric antigen receptors (CAR), tetravalent bispecific antibodies, biosynthetic antibodies, native antibodies, miniaturized antibodies, unibodies, maxibodies, antibodies to senescent cells, antibodies to conformers, antibodies to disease specific epitopes, or antibodies to innate defense molecules.
Approximately: As used herein, the term “approximately” or “about,” as applied to one or more values of interest, refers to a value that is similar to a stated reference value. In certain embodiments, the term “approximately” or “about” refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value).
Associated with: As used herein, the terms “associated with,” “conjugated,” “linked,” “attached,” and “tethered,” when used with respect to two or more moieties, means that the moieties are physically associated or connected with one another, either directly or via one or more additional moieties that serves as a linking agent, to form a structure that is sufficiently stable so that the moieties remain physically associated under the conditions in which the structure is used, e.g., physiological conditions. An “association” need not be strictly through direct covalent chemical bonding. It may also suggest ionic or hydrogen bonding or a hybridization-based connectivity sufficiently stable such that the “associated” entities remain physically associated.
Bifunctional: As used herein, the term “bifunctional” refers to any substance, molecule or moiety which is capable of or maintains at least two functions. The functions may affect the same outcome or a different outcome. The structure that produces the function may be the same or different.
Biocompatible: As used herein, the term “biocompatible” means compatible with living cells, tissues, organs or systems posing little to no risk of injury, toxicity or rejection by the immune system.
Biodegradable: As used herein, the term “biodegradable” means capable of being broken down into innocuous products by the action of living things.
Biologically active: As used herein, the phrase “biologically active” refers to a characteristic of any substance that has activity in a biological system and/or organism. For instance, a substance that, when administered to an organism, has a biological effect on that organism, is considered to be biologically active. In particular embodiments, an AAV particle of the present disclosure may be considered biologically active if even a portion of the encoded payload is biologically active or mimics an activity considered biologically relevant.
Capsid: As used herein, the term “capsid” refers to the protein shell of a virus particle.
Chimeric antigen receptor (CAR): As used herein, the term “chimeric antigen receptor” or “CAR” refers to an artificial chimeric protein comprising at least one antigen specific targeting region (ASTR), a transmembrane domain and an intracellular signaling domain, wherein the antigen specific targeting region comprises a full-length antibody or a fragment thereof. Any molecule that is capable of binding a target antigen with high affinity can be used in the ASTR of a CAR. The CAR may optionally have an extracellular spacer domain and/or a co-stimulatory domain. A CAR may also be used to generate a cytotoxic cell carrying the CAR.
Complementary and substantially complementary: As used herein, the term “complementary” refers to the ability of polynucleotides to form base pairs with one another. Base pairs are typically formed by hydrogen bonds between nucleotide units in antiparallel polynucleotide strands. Complementary polynucleotide strands can form base pair in the Watson-Crick manner (e.g., A to T, A to U, C to G), or in any other manner that allows for the formation of duplexes. As persons skilled in the art are aware, when using RNA as opposed to DNA, uracil rather than thymine is the base that is considered to be complementary to adenosine. However, when a U is denoted in the context of the present disclosure, the ability to substitute a T is implied, unless otherwise stated. Perfect complementarity or 100% complementarity refers to the situation in which each nucleotide unit of one polynucleotide strand can form hydrogen bond with a nucleotide unit of a second polynucleotide strand. Less than perfect complementarity refers to the situation in which some, but not all, nucleotide units of two strands can form hydrogen bond with each other. For example, for two 20-mers, if only two base pairs on each strand can form hydrogen bond with each other, the polynucleotide strands exhibit 10% complementarity. In the same example, if 18 base pairs on each strand can form hydrogen bonds with each other, the polynucleotide strands exhibit 90% complementarity. As used herein, the term “substantially complementary” means that the siRNA has a sequence (e.g., in the antisense strand) which is sufficient to bind the desired target mRNA, and to trigger the RNA silencing of the target mRNA.
Compound: Compounds of the present disclosure include all of the isotopes of the atoms occurring in the intermediate or final compounds. “Isotopes” refers to atoms having the same atomic number but different mass numbers resulting from a different number of neutrons in the nuclei. For example, isotopes of hydrogen include tritium and deuterium.
The compounds and salts of the present disclosure can be prepared in combination with solvent or water molecules to form solvates and hydrates by routine methods.
Conditionally active: As used herein, the term “conditionally active” refers to a mutant or variant of a wild-type polypeptide, wherein the mutant or variant is more or less active at physiological conditions than the parent polypeptide. Further, the conditionally active polypeptide may have increased or decreased activity at aberrant conditions as compared to the parent polypeptide. A conditionally active polypeptide may be reversibly or irreversibly inactivated at normal physiological conditions or aberrant conditions.
Conserved: As used herein, the term “conserved” refers to nucleotides or amino acid residues of a polynucleotide sequence or polypeptide sequence, respectively, that are those that occur unaltered in the same position of two or more sequences being compared. Nucleotides or amino acids that are relatively conserved are those that are conserved amongst more related sequences than nucleotides or amino acids appearing elsewhere in the sequences.
In some embodiments, two or more sequences are said to be “completely conserved” if they are 100% identical to one another. In some embodiments, two or more sequences are said to be “highly conserved” if they are at least 70% identical, at least 80% identical, at least 90% identical, or at least 95% identical to one another. In some embodiments, two or more sequences are said to be “highly conserved” if they are about 70% identical, about 80% identical, about 90% identical, about 95%, about 98%, or about 99% identical to one another. In some embodiments, two or more sequences are said to be “conserved” if they are at least 30% identical, at least 40% identical, at least 50% identical, at least 60% identical, at least 70% identical, at least 80% identical, at least 90° 1% identical, or at least 95% identical to one another. In some embodiments, two or more sequences are said to be “conserved” if they are about 30% identical, about 40% identical, about 50% identical, about 60% identical, about 70% identical, about 80% identical, about 90% identical, about 95% identical, about 98% identical, or about 99% identical to one another. Conservation of sequence may apply to the entire length of a polynucleotide or polypeptide or may apply to a portion, region or feature thereof.
Control Elements: As used herein, “control elements”, “regulatory control elements”, or “regulatory sequences” refers to promoter regions, polyadenylation signals, transcription termination sequences, upstream regulatory domains, origins of replication, internal ribosome entry sites (“IRES”), enhancers, and the like, which provide for the replication, transcription and translation of a coding sequence in a recipient cell. Not all of these control elements need always be present as long as the selected coding sequence is capable of being replicated, transcribed and/or translated in an appropriate host cell.
Controlled Release: As used herein, the term “controlled release” refers to a pharmaceutical composition or compound release profile that conforms to a particular pattern of release to affect a therapeutic outcome.
CRISPR genome editing element: As used herein, the term “CRISPR genome editing element” refers to any component/element involved in directing the activity of CRISPR-associated (“Cas”) proteins to alter the genome in a cell, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-complementary sequence (encompassing a “direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a target (guide) sequence (also referred to as a “spacer” in the context of an endogenous CRISPR system), or other sequences and transcripts from a CRISPR locus. In some embodiments, one or more elements of a CRISPR system is derived from a type I, type II, or type III CRISPR-Cas system. In some embodiments, one or more elements of a CRISPR-Cas system is derived from a particular organism which may comprise an endogenous CRISPR-Cas system, such as Streptococcus pyogenes. In general, a CRISPR-Cas system is characterized by components that promote the formation of a CRISPR/Cas complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR-Cas system). In the context of formation of a CRISPR complex, “target sequence” refers to a sequence to which a target recognition sequence is designed to have complementarity, where hybridization between a target sequence and a target recognition sequence promotes the formation of a CRISPR/Cas complex. A target sequence may comprise any polynucleotide, such as DNA or RNA polynucleotides. In some embodiments, a target sequence is located in the nucleus or cytoplasm of a cell.
Cytostatic: As used herein, “cytostatic” refers to inhibiting, reducing, suppressing the growth, division, or multiplication of a cell (e.g., a mammalian cell (e.g., a human cell)), bacterium, virus, fungus, protozoan, parasite, prion, or a combination thereof.
Cytotoxic: As used herein, “cytotoxic” refers to killing or causing injurious, toxic, or deadly effect on a cell (e.g., a mammalian cell (e.g., a human cell)), bacterium, virus, fungus, protozoan, parasite, prion, or a combination thereof.
Delivery: As used herein, “delivery” refers to the act or manner of delivering an AAV particle, a compound, substance, entity, moiety, cargo or payload.
Delivery Agent: As used herein, “delivery agent” refers to any substance which facilitates, at least in part, the in vivo delivery of an AAV particle to targeted cells.
Destabilized: As used herein, the term “destable”, “destabilize”, or “destabilizing region” means a region or molecule that is less stable than a starting, wild-type or native form of the same region or molecule.
Detectable label: As used herein, “detectable label” refers to one or more markers, signals, or moieties which are attached, incorporated or associated with another entity that is readily detected by methods known in the art including radiography, fluorescence, chemiluminescence, enzymatic activity, absorbance and the like. Detectable labels include radioisotopes, fluorophores, chromophores, enzymes, dyes, metal ions, ligands such as biotin, avidin, streptavidin and haptens, quantum dots, and the like. Detectable labels may be located at any position in the peptides or proteins disclosed herein. They may be within the amino acids, the peptides, or proteins, or located at the N- or C-termini.
Digest: As used herein, the term “digest” means to break apart into smaller pieces or components. When referring to polypeptides or proteins, digestion results in the production of peptides.
Distal: As used herein, the term “distal” means situated away from the center or away from a point or region of interest.
Dosing regimen: As used herein, a “dosing regimen” is a schedule of administration or physician determined regimen of treatment, prophylaxis, or palliative care.
Element: As used herein, the term “element” refers to a distinct portion of an entity. In some embodiments, an element may be a polynucleotide sequence with a specific purpose, incorporated into a longer polynucleotide sequence.
Encapsulate: As used herein, the term “encapsulate” means to enclose, surround or encase.
Engineered: As used herein, embodiments are “engineered” when they are designed to have a feature or property, whether structural or chemical, that varies from a starting point, wild type or native molecule.
Effective Amount: As used herein, the term “effective amount” of an agent is that amount sufficient to effect beneficial or desired results, for example, clinical results, and, as such, an “effective amount” depends upon the context in which it is being applied. For example, in the context of administering an agent that treats cancer, an effective amount of an agent is, for example, an amount sufficient to achieve treatment, as defined herein, of cancer, as compared to the response obtained without administration of the agent.
Epitope: As used herein, an “epitope” refers to a surface or region on a molecule that is capable of interacting with a biomolecule. For example, a protein may contain one or more amino acids, e.g., an epitope, which interacts with an antibody, e.g., a biomolecule. In some embodiments, when referring to a protein or protein module, an epitope may comprise a linear stretch of amino acids or a three-dimensional structure formed by folded amino acid chains.
EvoMap™: As used herein, an EvoMap™ refers to a map of a polypeptide, wherein detailed informatics are presented about the effects of single amino acid mutations within the length of the polypeptide and their influence on the properties and characteristics of that polypeptide.
Expression: As used herein, “expression” of a nucleic acid sequence refers to one or more of the following events: (1) production of an RNA template from a DNA sequence (e.g., by transcription); (2) processing of an RNA transcript (e.g., by splicing, editing, 5′ cap formation, and/or 3′ end processing); (3) translation of an RNA into a polypeptide or protein; and (4) post-translational modification of a polypeptide or protein.
Feature: As used herein, a “feature” refers to a characteristic, a property, or a distinctive element.
Formulation: As used herein, a “formulation” includes at least one AAV particle and a delivery agent.
Fragment: A “fragment,” as used herein, refers to a portion. For example, fragments of proteins may comprise polypeptides obtained by digesting full-length protein isolated from cultured cells.
Functional: As used herein, a “functional” biological molecule is a biological molecule in a form in which it exhibits a property and/or activity by which it is characterized.
Fusion protein: As used herein, a “fusion protein” or “chimeric protein” is a protein created through the combination of two or more genes encoding separate proteins. Translation of this fusion gene results in one or more polypeptides with functional properties derived from each of the original proteins. In some embodiments, the fusion protein or chimeric protein can be engineered to include the full sequence of both original proteins. In other embodiments, the fusion protein or chimeric protein includes a part, such as a functional part or domain, from each of the proteins. Various spatial arrangements of the domains may be envisioned according to the present disclosure. It is contemplated as part of the current disclosure that the domains may be N-terminal, C-terminal, or interspersed with respect to each other in various orientations. In some embodiments, the fusion proteins may comprise multiple copies of a particular domain. The various domains of the fusion proteins may be connected to each other by linkers. In some embodiments, the domains may be encoded on separate polynucleotides. In some embodiments, they may be encoded on the same polynucleotide. In some embodiments, the various domains may be on the same polypeptide. In other embodiments, the domains may be on separate polypeptides. It is also understood that the fusion proteins or proteins of the present disclosure may be codon optimized for expression in the system of interest according to methods known in the art.
Gene expression: The term “gene expression” refers to the process by which a nucleic acid sequence undergoes successful transcription and in most instances translation to produce a protein or peptide. For clarity, when reference is made to measurement of “gene expression”, this should be understood to mean that measurements may be of the nucleic acid product of transcription, e.g., RNA or mRNA or of the amino acid product of translation, e.g., polypeptides or peptides. Methods of measuring the amount or levels of RNA, mRNA, polypeptides and peptides are well known in the art.
Homology: As used herein, the term “homology” refers to the overall relatedness between polymeric molecules, e.g. between polynucleotide molecules (e.g. DNA molecules and/or RNA molecules) and/or between polypeptide molecules. In some embodiments, polymeric molecules are considered to be “homologous” to one another if their sequences are at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identical or similar. The term “homologous” necessarily refers to a comparison between at least two sequences (polynucleotide or polypeptide sequences). In accordance with the disclosure, two polynucleotide sequences are considered to be homologous if the polypeptides they encode are at least about 50%, 60%, 70%, 80%, 90%, 95%, or even 99% for at least one stretch of at least about 20 amino acids. In some embodiments, homologous polynucleotide sequences are characterized by the ability to encode a stretch of at least 4-5 uniquely specified amino acids. For polynucleotide sequences less than 60 nucleotides in length, homology is determined by the ability to encode a stretch of at least 4-5 uniquely specified amino acids. In accordance with the disclosure, two protein sequences are considered to be homologous if the proteins are at least about 50%, 60%, 70%, 80%, or 90% identical for at least one stretch of at least about 20 amino acids.
Heterologous Region: As used herein the term “heterologous region” refers to a region which would not be considered a homologous region.
Homology directed repair (HL)R): As used herein, the term “homology directed repair” refers to a mechanism in cells to repair DNA double-strand break using a homologous template. HDR generally requires a DNA donor template for requisite sequence homology with the sequence flanking the double-strand break so that the DNA donor can serve as a suitable template for repair. In some embodiments, HDR results in the replacement of the entire DNA donor or a portion of the DNA donor sequence at the site of the DNA target sequence.
Homologous Region: As used herein the term “homologous region” refers to a region which is similar in position, structure, evolution origin, character, form or function.
Identity: As used herein, the term “identity” refers to the overall relatedness between polymeric molecules, e.g., between polynucleotide molecules (e.g. DNA molecules and/or RNA molecules) and/or between polypeptide molecules. Calculation of the percent identity of two polynucleotide sequences, for example, can be performed by aligning the two sequences for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second nucleic acid sequences for optimal alignment and non-identical sequences can be disregarded for comparison purposes). In certain embodiments, the length of a sequence aligned for comparison purposes is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or 100% of the length of the reference sequence. The nucleotides at corresponding nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which needs to be introduced for optimal alignment of the two sequences. The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. For example, the percent identity between two nucleotide sequences can be determined using methods such as those described in Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; Computer Analysis of Sequence Data. Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; and Sequence Analysis Primer. Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; each of which is incorporated herein by reference. For example, the percent identity between two nucleotide sequences can be determined using the algorithm of Meyers and Miller (CABIOS, 1989, 4:11-17), which has been incorporated into the ALIGN program (version 2.0) using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. The percent identity between two nucleotide sequences can, alternatively, be determined using the GAP program in the GCG software package using an NWSgapdna.CMP matrix. Methods commonly employed to determine percent identity between sequences include, but are not limited to those disclosed in Carillo, H. and Lipman, D., SIAM J Applied Math., 48:1073 (1988); incorporated herein by reference. Techniques for determining identity are codified in publicly available computer programs. Exemplary computer software to determine homology between two sequences include, but are not limited to, GCG program package, Devereux, J., et al., Nucleic Acids Research, 12(1), 387 (1984)), BLASTP, BLASTN, and FASTA Altschul, S. F. et al., J. Molec. Biol., 215, 403 (1990)).
Inhibit expression of a gene: As used herein, the phrase “inhibit expression of a gene” means to cause a reduction in the amount of an expression product of the gene. The expression product can be an RNA transcribed from the gene (e.g., an mRNA) or a polypeptide translated from an mRNA transcribed from the gene. Typically, a reduction in the level of an mRNA results in a reduction in the level of a polypeptide translated therefrom. The level of expression may be determined using standard techniques for measuring mRNA or protein.
In vitro: As used herein, the term “in vitro” refers to events that occur in an artificial environment, e.g., in a test tube or reaction vessel, in cell culture, in a Petri dish, etc., rather than within an organism (e.g., animal, plant, or microbe).
In vivo: As used herein, the term “in vivo” refers to events that occur within an organism (e.g., animal, plant, or microbe or cell or tissue thereof).
Isolated: As used herein, the term “isolated” refers to a substance or entity that has been separated from at least some of the components with which it was associated (whether in nature or in an experimental setting). Isolated substances may have varying levels of purity in reference to the substances from which they have been associated. Isolated substances and/or entities may be separated from at least about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, or more of the other components with which they were initially associated. In some embodiments, isolated agents are more than about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% pure. As used herein, a substance is “pure” if it is substantially free of other components.
Substantially isolated: By “substantially isolated” is meant that a substance is substantially separated from the environment in which it was formed or detected. Partial separation can include, for example, a composition enriched in the substance or AAV particles of the present disclosure. Substantial separation can include compositions containing at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or at least about 99% by weight of the compound of the present disclosure, or salt thereof. Methods for isolating compounds and their salts are routine in the art.
Linker: As used herein “linker” refers to a molecule or group of molecules which connects two molecules. A linker may be a nucleic acid sequence connecting two nucleic acid sequences encoding two different polypeptides. The linker may or may not be translated. The linker may be a cleavable linker.
MicroRNA (miRNA) binding site: As used herein, a microRNA (miRNA) binding site represents a nucleotide location or region of a nucleic acid transcript to which at least the “seed” region of a miRNA binds.
Modified: As used herein “modified” refers to a changed state or structure of a molecule. Molecules may be modified in many ways including chemically, structurally, and functionally.
Naturally Occurring: As used herein, “naturally occurring” or “wild-type” means existing in nature without artificial aid, or involvement of the hand of man.
Non-homologous end joining repair: As used herein, the term “non-homologous end joining repair” refers to a mechanism to repair double-strand break in DNA by direct ligation of one end of the break to the other end without a requirement for a DNA donor template. NHEJ is an error-prone DNA repair pathway without the use of repair template and often results nucleotide being randomly inserted or deleted (Indel) at the site of the double strand break.
Non-human vertebrate: As used herein, a “non-human vertebrate” includes all vertebrates except Homo sapiens, including wild and domesticated species. Examples of non-human vertebrates include, but are not limited to, mammals, such as alpaca, banteng, bison, camel, cat, cattle, deer, dog, donkey, gayal, goat, guinea pig, horse, llama, mule, pig, rabbit, reindeer, sheep water buffalo, and yak.
Off-target: As used herein, “off target” refers to any unintended effect on any one or more target, gene, or cellular transcript.
Open reading frame: As used herein, “open reading frame” or “ORF” refers to a sequence which does not contain a stop codon in a given reading frame.
Operably linked: As used herein, the phrase “operably linked” refers to a functional connection between two or more molecules, constructs, transcripts, entities, moieties or the like.
Parent sequence: As used herein, a “parent sequence” is a nucleic acid or amino acid sequence from which a variant is derived. In some embodiments, a parent sequence is a sequence into which a heterologous sequence is inserted. In other words, a parent sequence may be considered an acceptor or recipient sequence. In some embodiments, a parent sequence is an AAV capsid sequence into which a targeting sequence is inserted.
Particle: As used herein, a “particle” is a virus comprised of at least two components, a protein capsid and a polynucleotide sequence enclosed within the capsid.
Patient: As used herein, “patient” refers to a subject who may seek or be in need of treatment, requires treatment, is receiving treatment, will receive treatment, or a subject who is under care by a trained professional for a particular disease or condition.
Payload: As used herein. “payload” or “payload region” refers to one or more polynucleotides or polynucleotide regions encoded by or within a vector genome or an expression product of such polynucleotide or polynucleotide region, e.g., a transgene, a polynucleotide encoding a polypeptide or multi-polypeptide or a modulatory nucleic acid or regulatory nucleic acid.
Payload construct: As used herein, “payload construct” or “construct” is one or more polynucleotide regions encoding or comprising a payload that is flanked on one or both sides by an inverted terminal repeat (ITR) sequence. The payload construct is a template that is replicated in a viral production cell to produce a vector genome.
Payload construct vector: As used herein. “payload construct vector” is a vector encoding or comprising a payload construct, and regulatory regions for replication and expression in bacterial cells.
Payload construct expression vector: As used herein, a “payload construct expression vector” is a vector encoding or comprising a payload construct and which further comprises one or more polynucleotide regions encoding or comprising components for viral expression in a viral replication cell.
Peptide: As used herein, “peptide” is less than or equal to 50 amino acids long, e.g., about 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 amino acids long.
Pharmaceutically acceptable: The phrase “pharmaceutically acceptable” is employed herein to refer to those compounds, materials, compositions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio.
Pharmaceutically acceptable excipients: The phrase “pharmaceutically acceptable excipient.” as used herein, refers any ingredient other than the compounds described herein (for example, a vehicle capable of suspending or dissolving the active compound) and having the properties of being substantially nontoxic and non-inflammatory in a patient. Excipients may include, for example: antiadherents, antioxidants, binders, coatings, compression aids, disintegrants, dyes (colors), emollients, emulsifiers, fillers (diluents), film formers or coatings, flavors, fragrances, glidants (flow enhancers), lubricants, preservatives, printing inks, sorbents, suspensing or dispersing agents, sweeteners, and waters of hydration. Exemplary excipients include, but are not limited to: butylated hydroxytoluene (BHT), calcium carbonate, calcium phosphate (dibasic), calcium stearate, croscarmellose, crosslinked polyvinyl pyrrolidone, citric acid, crospovidone, cysteine, ethylcellulose, gelatin, hydroxypropyl cellulose, hydroxypropyl methylcellulose, lactose, magnesium stearate, maltitol, mannitol, methionine, methylcellulose, methyl paraben, microcrystalline cellulose, polyethylene glycol, polyvinyl pyrrolidone, povidone, pregelatinized starch, propyl paraben, retinyl palmitate, shellac, silicon dioxide, sodium carboxymethyl cellulose, sodium citrate, sodium starch glycolate, sorbitol, starch (corn), stearic acid, sucrose, talc, titanium dioxide, vitamin A, vitamin E, vitamin C, and xylitol.
Pharmaceutically acceptable salts: The present disclosure also includes pharmaceutically acceptable salts of the compounds described herein. As used herein, “pharmaceutically acceptable salts” refers to derivatives of the disclosed compounds wherein the parent compound is modified by converting an existing acid or base moiety to its salt form (e.g., by reacting the free base group with a suitable organic acid). Examples of pharmaceutically acceptable salts include, but are not limited to, mineral or organic acid salts of basic residues such as amines; alkali or organic salts of acidic residues such as carboxylic acids; and the like. Representative acid addition salts include acetate, acetic acid, adipate, alginate, ascorbate, aspartate, benzenesulfonate, benzene sulfonic acid, benzoate, bisulfate, borate, butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, glucoheptonate, glycerophosphate, hemisulfate, heptonate, hexanoate, hydrobromide, hydrochloride, hydroiodide, 2-hydroxy-ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, oleate, oxalate, palmitate, pamoate, pectinate, persulfate, 3-phenylpropionate, phosphate, picrate, pivalate, propionate, stearate, succinate, sulfate, tartrate, thiocyanate, toluenesulfonate, undecanoate, valerate salts, and the like. Representative alkali or alkaline earth metal salts include sodium, lithium, potassium, calcium, magnesium, and the like, as well as nontoxic ammonium, quaternary ammonium, and amine cations, including, but not limited to ammonium, tetramethylammonium, tetraethylammonium, methylamine, dimethylamine, trimethylamine, triethylamine, ethylamine, and the like. The pharmaceutically acceptable salts of the present disclosure include the conventional non-toxic salts of the parent compound formed, for example, from non-toxic inorganic or organic acids. The pharmaceutically acceptable salts of the present disclosure can be synthesized from the parent compound which contains a basic or acidic moiety by conventional chemical methods. Generally, such salts can be prepared by reacting the free acid or base forms of these compounds with a stoichiometric amount of the appropriate base or acid in water or in an organic solvent, or in a mixture of the two; generally, nonaqueous media like ether, ethyl acetate, ethanol, isopropanol, or acetonitrile are preferred. Lists of suitable salts are found in Remington's Pharmaceutical Sciences, 171 ed., Mack Publishing Company, Easton, Pa., 1985, p. 1418, Pharmaceutical Salts: Properties, Selection, and Use, P. H. Stahl and C. G. Wermuth (eds.), Wiley-VCH, 2008, and Berge et al., Journal of Pharmaceutical Science, 66, 1-19 (1977), each of which is incorporated herein by reference in its entirety.
Pharmaceutically acceptable solvate: The term “pharmaceutically acceptable solvate,” as used herein, means a compound wherein molecules of a suitable solvent are incorporated in the crystal lattice. A suitable solvent is physiologically tolerable at the dosage administered. For example, solvates may be prepared by crystallization, recrystallization, or precipitation from a solution that includes organic solvents, water, or a mixture thereof. Examples of suitable solvents are ethanol, water (for example, mono-, di-, and tri-hydrates), N-methylpyrrolidinone (NMP), dimethyl sulfoxide (DMSO), N,N′-dimethylformamide (DMF), N,N′-dimethylacetamide (DMAC), 1,3-dimethyl-2-imidazolidinone (DMEU), 1,3-dimethyl-3,4,5,6-tetrahydro-2-(1H)-pyrimidinone (DMPU), acetonitrile (ACN), propylene glycol, ethyl acetate, benzyl alcohol, 2-pyrrolidone, benzyl benzoate, and the like. When water is the solvent, the solvate is referred to as a “hydrate.”
Pharmacokinetic: As used herein, “pharmacokinetic” refers to any one or more properties of a molecule or compound as it relates to the determination of the fate of substances administered to a living organism. Pharmacokinetics is divided into several areas including the extent and rate of absorption, distribution, metabolism and excretion. This is commonly referred to as ADME where: (A) Absorption is the process of a substance entering the blood circulation; (D) Distribution is the dispersion or dissemination of substances throughout the fluids and tissues of the body; (M) Metabolism (or Biotransformation) is the irreversible transformation of parent compounds into daughter metabolites; and (E) Excretion (or Elimination) refers to the elimination of the substances from the body. In rare cases, some drugs irreversibly accumulate in body tissue.
Physicochemical: As used herein, “physicochemical” means of or relating to a physical and/or chemical property.
Preventing: As used herein, the term “preventing” refers to partially or completely delaying onset of an infection, disease, disorder and/or condition; partially or completely delaying onset of one or more symptoms, features, or clinical manifestations of a particular infection, disease, disorder, and/or condition; partially or completely delaying onset of one or more symptoms, features, or manifestations of a particular infection, disease, disorder, and/or condition; partially or completely delaying progression from an infection, a particular disease, disorder and/or condition: and/or decreasing the risk of developing pathology associated with the infection, the disease, disorder, and/or condition.
Proliferate: As used herein, the term “proliferate” means to grow, expand or increase or cause to grow, expand or increase rapidly. “Proliferative” means having the ability to proliferate. “Anti-proliferative” means having properties counter to or inapposite to proliferative properties.
Prophylactic: As used herein, “prophylactic” refers to a therapeutic or course of action used to prevent the spread of disease.
Prophylaxis: As used herein, a “prophylaxis” refers to a measure taken to maintain health and prevent the spread of disease.
Protein of interest: As used herein, the terms “proteins of interest” or “desired proteins” include those provided herein and fragments, mutants, variants, and alterations thereof.
Proximal: As used herein, the term “proximal” means situated nearer to the center or to a point or region of interest.
Purified: As used herein, “purify,” “purified,” “purification” means to make substantially pure or clear from unwanted components, material defilement, admixture or imperfection. “Purified” refers to the state of being pure. “Purification” refers to the process of making pure.
Regulatable CRISPR genome-editing element: As used herein, the term “regulatable genome-editing element” refers to one or more polynucleotides encoding any regulatable or tunable component and any component of a CRISPR system including, but not limited, to one or more gRNA, one or more donor DNA templates, a Cas9, a Cas9 orthologue, a Cpf1 or a Cpf1 orthologues which impart regulatable or tunable feature to a viral genome encoding them. Where CRISPR regulatable elements are present, the AAV particle may be referred to as a “regulatable CRISPR-AAV particle” or “Tunable CRISPR-AAV particle.”
Regulatable Elements: As used herein, the term “regulatable element” refers to one or more components, factors, polynucleotide features or motifs which imparts regulatable or tunable features to regulate the expression of a payload. The expression of the regulatable elements may also be further regulated.
Region: As used herein, the term “region” refers to a zone or general area. In some embodiments, when referring to a protein or protein module, a region may comprise a linear sequence of amino acids along the protein or protein module or may comprise a three-dimensional area, an epitope and/or a cluster of epitopes. In some embodiments, regions comprise terminal regions. As used herein, the term “terminal region” refers to regions located at the ends or termini of a given agent. When referring to proteins, terminal regions may comprise N- and/or C-termini. N-termini refer to the end of a protein comprising an amino acid with a free amino group. C-termini refer to the end of a protein comprising an amino acid with a free carboxyl group. N- and/or C-terminal regions may there for comprise the N- and/or C-termini as well as surrounding amino acids. In some embodiments, N- and/or C-terminal regions comprise from about 3 amino acid to about 30 amino acids, from about 5 amino acids to about 40 amino acids, from about 10 amino acids to about 50 amino acids, from about 20 amino acids to about 100 amino acids and/or at least 100 amino acids. In some embodiments, N-terminal regions may comprise any length of amino acids that includes the N-terminus, but does not include the C-terminus. In some embodiments, C-terminal regions may comprise any length of amino acids, which include the C-terminus, but do not comprise the N-terminus.
In some embodiments, when referring to a polynucleotide, a region may comprise a linear sequence of nucleic acids along the polynucleotide or may comprise a three-dimensional area, secondary structure, or tertiary structure. In some embodiments, regions comprise terminal regions. As used herein, the term “terminal region” refers to regions located at the ends or termini of a given agent. When referring to polynucleotides, terminal regions may comprise 5′ and 3′ termini. 5′ termini refer to the end of a polynucleotide comprising a nucleic acid with a free phosphate group. 3′ termini refer to the end of a polynucleotide comprising a nucleic acid with a free hydroxyl group. 5′ and 3′ regions may there for comprise the 5′ and 3′ termini as well as surrounding nucleic acids. In some embodiments, 5′ and 3′ terminal regions comprise from about 9 nucleic acids to about 90 nucleic acids, from about 15 nucleic acids to about 120 nucleic acids, from about 30 nucleic acids to about 150 nucleic acids, from about 60 nucleic acids to about 300 nucleic acids and/or at least 300 nucleic acids. In some embodiments, 5′ regions may comprise any length of nucleic acids that includes the 5′ terminus, but does not include the 3′ terminus. In some embodiments, 3′ regions may comprise any length of nucleic acids, which include the 3′ terminus, but does not comprise the 5′ terminus.
RNA or RNA molecule: As used herein, the term “RNA” or “RNA molecule” or “ribonucleic acid molecule” refers to a polymer of ribonucleotides: the term “DNA” or “DNA molecule” or “deoxyribonucleic acid molecule” refers to a polymer of deoxyribonucleotides. DNA and RNA can be synthesized naturally, e.g., by DNA replication and transcription of DNA, respectively; or be chemically synthesized. DNA and RNA can be single-stranded (i.e., ssRNA or ssDNA, respectively) or multi-stranded (e.g., double stranded, i.e., dsRNA and dsDNA, respectively). The term “mRNA” or “messenger RNA”, as used herein, refers to a single stranded RNA that encodes the amino acid sequence of one or more polypeptide chains.
Sample: As used herein, the term “sample” or “biological sample” refers to a subset of its tissues, cells or component parts (e.g. body fluids, including but not limited to blood, mucus, lymphatic fluid, synovial fluid, cerebrospinal fluid, saliva, amniotic fluid, amniotic cord blood, urine, vaginal fluid and semen). A sample further may include a homogenate, lysate or extract prepared from a whole organism or a subset of its tissues, cells or component parts, or a fraction or portion thereof, including but not limited to, for example, plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs. A sample further refers to a medium, such as a nutrient broth or gel, which may contain cellular components, such as proteins or nucleic acid molecule.
Self-complementary viral particle: As used herein, a “self-complementary viral particle” is a particle comprised of at least two components, a protein capsid and a polynucleotide sequence encoding a self-complementary genome enclosed within the capsid.
Signal Sequences: As used herein, the phrase “signal sequences” refers to a sequence which can direct the transport or localization of a protein.
Single unit dose: As used herein, a “single unit dose” is a dose of any therapeutic administered in one dose/at one time/single route/single point of contact, i.e., single administration event. In some embodiments, a single unit dose is provided as a discrete dosage form (e.g., a tablet, capsule, patch, loaded syringe, vial, etc.).
Similarity: As used herein, the term “similarity” refers to the overall relatedness between polymeric molecules, e.g. between polynucleotide molecules (e.g. DNA molecules and/or RNA molecules) and/or between polypeptide molecules. Calculation of percent similarity of polymeric molecules to one another can be performed in the same manner as a calculation of percent identity, except that calculation of percent similarity takes into account conservative substitutions as is understood in the art.
Split dose: As used herein, a “split dose” is the division of single unit dose or total daily dose into two or more doses.
Stable: As used herein “stable” refers to a compound that is sufficiently robust to survive isolation to a useful degree of purity from a reaction mixture, and preferably capable of formulation into an efficacious therapeutic agent.
Stabilized: As used herein, the term “stabilize”, “stabilized,” “stabilized region” means to make or become stable.
Subject: As used herein, the term “subject” or “patient” refers to any organism to which a composition in accordance with the disclosure may be administered, e.g., for experimental, diagnostic, prophylactic, and/or therapeutic purposes. Typical subjects include animals (e.g., mammals such as mice, rats, rabbits, non-human primates, and humans) and/or plants.
Substantially: As used herein, the term “substantially” refers to the qualitative condition of exhibiting total or near-total extent or degree of a characteristic or property of interest. One of ordinary skill in the biological arts will understand that biological and chemical phenomena rarely, if ever, go to completion and/or proceed to completeness or achieve or avoid an absolute result. The term “substantially” is therefore used herein to capture the potential lack of completeness inherent in many biological and chemical phenomena.
Substantially equal: As used herein as it relates to time differences between doses, the term means plus/minus 2%.
Substantially simultaneously: As used herein and as it relates to plurality of doses, the term means within 2 seconds.
Suffering from: An individual who is “suffering from” a disease, disorder, and/or condition has been diagnosed with or displays one or more symptoms of a disease, disorder, and/or condition.
Susceptible to: An individual who is “susceptible to” a disease, disorder, and/or condition has not been diagnosed with and/or may not exhibit symptoms of the disease, disorder, and/or condition but harbors a propensity to develop a disease or its symptoms. In some embodiments, an individual who is susceptible to a disease, disorder, and/or condition (for example, cancer) may be characterized by one or more of the following: (1) a genetic mutation associated with development of the disease, disorder, and/or condition; (2) a genetic polymorphism associated with development of the disease, disorder, and/or condition; (3) increased and/or decreased expression and/or activity of a protein and/or nucleic acid associated with the disease, disorder, and/or condition; (4) habits and/or lifestyles associated with development of the disease, disorder, and/or condition; (5) a family history of the disease, disorder, and/or condition; and (6) exposure to and/or infection with a microbe associated with development of the disease, disorder, and/or condition. In some embodiments, an individual who is susceptible to a disease, disorder, and/or condition will develop the disease, disorder, and/or condition. In some embodiments, an individual who is susceptible to a disease, disorder, and/or condition will not develop the disease, disorder, and/or condition.
Sustained release: As used herein, the term “sustained release” refers to a pharmaceutical composition or compound release profile that conforms to a release rate over a specific period of time.
Synthetic: The term “synthetic” means produced, prepared, and/or manufactured by the hand of man. Synthesis of polynucleotides or polypeptides or other molecules of the present disclosure may be chemical or enzymatic.
Targeting: As used herein, “targeting” means the process of design and selection of nucleic acid sequence that will hybridize to a target nucleic acid and induce a desired effect.
Targeted Cells: As used herein, “targeted cells” refers to any one or more cells of interest. The cells may be found in vitro, in vivo, in situ or in the tissue or organ of an organism. The organism may be an animal, preferably a mammal, more preferably a human and most preferably a patient.
Therapeutic Agent: The term “therapeutic agent” refers to any agent that, when administered to a subject, has a therapeutic, diagnostic, and/or prophylactic effect and/or elicits a desired biological and/or pharmacological effect.
Therapeutically effective amount: As used herein, the term “therapeutically effective amount” means an amount of an agent to be delivered (e.g., nucleic acid, drug, therapeutic agent, diagnostic agent, prophylactic agent, etc.) that is sufficient, when administered to a subject suffering from or susceptible to an infection, disease, disorder, and/or condition, to treat, improve symptoms of, diagnose, prevent, and/or delay the onset of the infection, disease, disorder, and/or condition. In some embodiments, a therapeutically effective amount is provided in a single dose. In some embodiments, a therapeutically effective amount is administered in a dosage regimen comprising a plurality of doses. Those skilled in the art will appreciate that in some embodiments, a unit dosage form may be considered to comprise a therapeutically effective amount of a particular agent or entity if it comprises an amount that is effective when administered as part of such a dosage regimen.
Therapeutically effective outcome: As used herein, the term “therapeutically effective outcome” means an outcome that is sufficient in a subject suffering from or susceptible to an infection, disease, disorder, and/or condition, to treat, improve symptoms of, diagnose, prevent, and/or delay the onset of the infection, disease, disorder, and/or condition.
Total daily close: As used herein, a “total daily dose” is an amount given or prescribed in 24 hr. period. It may be administered as a single unit dose.
Transfection: As used herein, the term “transfection” refers to methods to introduce exogenous nucleic acids into a cell. Methods of transfection include, but are not limited to, chemical methods, physical treatments and cationic lipids or mixtures.
Treating: As used herein, the term “treating” refers to partially or completely alleviating, ameliorating, improving, relieving, delaying onset of, inhibiting progression of, reducing severity of, and/or reducing incidence of one or more symptoms or features of a particular infection, disease, disorder, and/or condition. For example, “treating” cancer may refer to inhibiting survival, growth, and/or spread of a tumor. Treatment may be administered to a subject who does not exhibit signs of a disease, disorder, and/or condition and/or to a subject who exhibits only early signs of a disease, disorder, and/or condition for the purpose of decreasing the risk of developing pathology associated with the disease, disorder, and/or condition.
Unmodified: As used herein, “unmodified” refers to any substance, compound or molecule prior to being changed in any way. Unmodified may, but does not always, refer to the wild type or native form of a biomolecule. Molecules may undergo a series of modifications whereby each modified molecule may serve as the “unmodified” starting molecule for a subsequent modification.
Vector: As used herein, a “vector” is any molecule or moiety which transports, transduces or otherwise acts as a carrier of a heterologous molecule. Vectors of the present disclosure may be produced recombinantly and may be based on and/or may comprise adeno-associated virus (AAV) parent or reference sequence. Such parent or reference AAV sequences may serve as an original, second, third or subsequent sequence for engineering vectors. In non-limiting examples, such parent or reference AAV sequences may comprise any one or more of the following sequences: a polynucleotide sequence encoding a polypeptide or multi-polypeptide, which sequence may be wild-type or modified from wild-type and which sequence may encode full-length or partial sequence of a protein, protein domain, or one or more subunits of a protein; a polynucleotide comprising a modulatory or regulatory nucleic acid which sequence may be wild-type or modified from wild-type; and a transgene that may or may not be modified from wild-type sequence. These AAV sequences may serve as either the “donor” sequence of one or more codons (at the nucleic acid level) or amino acids (at the polypeptide level) or “acceptor” sequences of one or more codons (at the nucleic acid level) or amino acids (at the polypeptide level).
Vector genome: As used herein, a “vector genome” or “viral genome” is a polynucleotide comprising at least one inverted terminal repeat (ITR) and at least one encoded payload. A vector genome encodes at least one copy of the payload.
Described herein are compositions, methods, processes, kits and devices for the design, preparation, manufacture and/or formulation of AAV particles. In some embodiments, payloads, such as but not limited to AAV polynucleotides, may be encoded by payload constructs or contained within plasmids or vectors or recombinant adeno-associated viruses (AAVs).
The details of one or more embodiments are set forth in the accompanying description below. Although any materials and methods similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, the preferred materials and methods are now described. Other features, objects and advantages will be apparent from the description. In the description, the singular forms also include the plural unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. In the case of conflict, the present description will control.
The present disclosure is further illustrated by the following non-limiting examples.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments in accordance with the invention described herein. The scope of the present disclosure is not intended to be limited to the above Description, but rather is as set forth in the appended claims.
In the claims, articles such as “a,” “an,” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The disclosure includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The disclosure includes embodiments in which more than one, or the entire group members are present in, employed in, or otherwise relevant to a given product or process.
It is also noted that the term “comprising” is intended to be open and permits but does not require the inclusion of additional elements or steps. When the term “comprising” is used herein, the term “consisting of” is thus also encompassed and disclosed.
Where ranges are given, endpoints are included. Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the disclosure, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise.
In addition, it is to be understood that any particular embodiment of the present disclosure that falls within the prior art may be explicitly excluded from any one or more of the claims. Since such embodiments are deemed to be known to one of ordinary skill in the art, they may be excluded even if the exclusion is not set forth explicitly herein. Any particular embodiment of the compositions of the disclosure (e.g., any antibiotic, therapeutic or active ingredient; any method of production; any method of use; etc.) can be excluded from any one or more claims, for any reason, whether or not related to the existence of prior art.
It is to be understood that the words which have been used are words of description rather than limitation, and that changes may be made within the purview of the appended claims without departing from the true scope and spirit of the disclosure in its broader aspects.
While the present disclosure has been described at some length and with some particularity with respect to the several described embodiments, it is not intended that it should be limited to any such particulars or embodiments or any particular embodiment, but it is to be construed with references to the appended claims so as to provide the broadest possible interpretation of such claims in view of the prior art and, therefore, to effectively encompass the intended scope of the disclosure.
The level of transgene expression by AAV particles and/or VAPs produced and purified by the methods described herein is determined by real-time quantitative polymerase chain reaction (qPCR). A culture of cells (sf9 or HEK293) engineered to produce helper components required for AAV production is infected by AAV particles and/or VAPs produced as described herein.
The target cells are harvested at a series of time points, lysed and the mRNA is purified. The level of transgene expressed is determined by reverse transcription (qPCR) on a thermal cycler equipped with an excitation source filters, and detector for quantification of the reaction such as, but not limited to, the 7500 FAST Real-Time PCR system (Applied Biosystems, Foster City Calif.).
AAV particles and/or VAPs produced and purified by the methods described herein are treated with proteinase K, serially diluted, and PCR-amplified using a fluor such as, but not limited to, SYBR green (Applied Biosystems, Foster City. Calif.) with primers specific to the transgene sequence. A reference transgene oligonucleotide is used as a copy number standard. The cycling conditions are: 95° C. for 3 min, followed by 35 cycles of 95° C. for 30 sec, 60° C. for 30 sec. and 72° C. for 30 sec.
The viral construct vector encodes the three structural cap proteins, VP1, VP2, and VP3, in a single open reading frame regulated by utilization of both alternative splice acceptor and non-canonical translational initiation codon(s). In-frame and out-of-frame ATG triplets preventing translation initiation at a position between the VP1 and VP2 start codons are eliminated. Both Rep78 and Rep52 are translated from a single transcript: Rep78 translation initiates at a non-AUG codon and Rep52 translation initiates at the first AUG in the transcript.
The nucleotides that encode the structural VP1, VP2, and VP3 capsid proteins and non-structural Rep78 and Rep52 proteins are contained on one viral expression construct under control of the baculovirus major late promoter.
The payload construct vector encodes two ITR sequences flanking a transgene polynucleotide encoding a polypeptide or modulatory nucleic acid and/or one or more regulatable elements. The ITR sequences allow for replication of a polynucleotide encoding the transgene and ITR sequences alone that will be packaged within the capsid of the viral construct vector. The replicated polynucleotide encodes ITR sequences on the 5′ and 3′ ends of the molecule.
The payload construct vector and viral construct vector each comprise a Tn7 transposon element that transposes the ITR and payload sequences or the Rep and Cap sequences respectively to a bacmid that comprises the attTn7 attachment site. Competent bacterial DH10 cells are transfected with either the payload construct vector or viral construct vector. The resultant viral construct expression vector and payload construct expression vector produced in the competent cell are then purified by detergent lysis and purification on DNA columns.
Separate seed cultures of Sf9 cells in serum free suspension culture are transfected with the viral construct expression vector or payload construct expression vector. The cultures are maintained for 48 hours while baculovirus is produced and released into the medium. The baculovirus released into the media continue to infect Sf9 cells in an exponential manner until all of the Sf9 cells in the culture are infected at least once. The baculoviral infected insect cells (BIIC) and media of the seed culture is harvested and divided into aliquots before being frozen in liquid nitrogen.
A naïve population of un-transfected Sf9 cells is expanded in serum free suspension cell culture conditions. Once the culture growth has reached peak log phase in 1 L of media as measured by optical density the culture is added to a large volume 20 L bioreactor. The bioreactor culture is co-inoculated with a frozen viral construct expression vector and payload construct expression vector BIIC aliquot. The conditions of the Sf9 cell suspension culture is monitored by instruments that measure and/or control external variables that support the growth and activity of viral replication cells such as mass, temperature, CO2, O2, pH, and/or optical density (OD). The Sf9 culture is maintained at optimal conditions until cell population growth has reached peak log phase and before cell growth has plateaued, as measured by optical density.
In each viral replication cell that has been infected with both baculoviruses the payload flanked on one end with an ITR sequence is replicated pathway producing a viral genome and packaged in a capsid assembled from the proteins VP1, VP2, and VP3.
The viral replication cells are lysed using the Microfluidizer™ (Microfluidics International Corp., Newton, Mass.), high shear force fluid processor. The resultant cell lysate is clarified by low speed centrifugation followed by tangential flow filtration. The resultant clarified lysate is filtered by a size exclusion column to remove any remaining baculoviral particles from solution. The final steps utilize ultracentrifugation and sterile filtration to produce viral particles suitable for use as described herein.
The titer of AAV particles produced and purified by the methods described herein is determined by real-time quantitative polymerase chain reaction (qPCR) on a thermal cycler equipped with an excitation source filters, and detector for quantification of the reaction such as, but not limited to, the 7500 FAST Real-Time PCR system (Applied Biosystems, Foster City Calif.). AAV particles produced and purified by the methods described herein is treated with proteinase K, serially diluted, and PCR-amplified using a fluor such as, but not limited to, SYBR green (Applied Biosystems, Foster City, Calif.) with primers specific to the AAV genome ITR sequences. A linearized viral genome is used as a copy number standard. The cycling conditions are: 95° C. for 3 min, followed by 35 cycles of 95° C. for 30 sec, 60° C. for 30 sec, and 72° C. for 30 sec.
In order to study methods to induce transient, burst expression of cas9, an AAV particle of Example 2 which encodes cas9, a DNA binding domain and a transactivating factor is prepared, produced and tested by methods known in the art and described herein. The cas9 sequence is located in the open reading frame of the vector and a DNA binding domain (DBD) and transactivating factor are located in VP2. The transactivating factor may be coupled to the DBD.
The DBD may be a pre-engineered DBD targeted specifically to the promoter used for cas9. While not wishing to be bound by theory, upon expression of VP2 in a biological system, the DBD locates and binds to the cas9 promoter. If the transactivating factor is for cas9 and the transactivating factor is coupled to the DBD, upon DBD binding to the cas9 promoter, the transactivating factor drives expression of cas9.
The nature of the interaction between the promoter and the DBD coupled to a transactivating factor generates a transient, burst expression of cas9. This punctuated expression may be beneficial as it may limit the possibility of side-effects of extended elevated expression of cas9.
To study the burst expression of cas9, an AAV particle with the DBD and the transactivating factor is purified and produced as described in Example 2. As a control, an AAV particle lacking the DBD and transactivating domain is produced in parallel.
Expression of cas9 is measured by methods described herein and known in the art, such as in HEK293 cells for the AAV particle with or without the DBD and transactivating factor. Cells are lysed at different time points and protein extracts are prepared for ELISA and Western blot analysis.
The specificity of cas9 cleavage is confirmed by deep sequencing of samples collected and the extent of dsDNA cleavage by cas9 or cas9-destabilizing domain fusion protein is measured by ligation-mediated purification or genome modification assays such as SURVEYOR. Indel percentage is calculated from the integrated intensities of the undigested PCR product and each of the cleavage products.
The ability of a CRISPR regulatable elements to regulate the expression of a luciferase payload composed of one or more CRISPR recognition sequences is tested in vitro and in vivo. Cleavage of the payload construct at the CRISPR recognition sequence ablates the payload expression. Using Cas9 with a destabilizing domain shortens the half-life of Cas9.
The following CRISPR-AAV constructs are prepared, produced and tested.
AAV construct 1 encodes a luciferase payload, which is driven by a constitutive promoter and which contains a CRISPR recognition sequence. The recognition sequence is chosen using methods known in the art and described herein to ensure that the sequence is unique to the viral genome and does not occur in the host genome. The AAV construct 2, which contains all of the components of construct 1, and additionally encodes a guide RNA specific to the CRISPR recognition sequence in the luciferase payload and a codon optimized Cas9 with a destabilizing domain. The guide RNA and Cas9 are both expressed from a constitutive promotor. Cas9 also contains a nuclear localization sequence. AAV construct 3 encodes luciferase driven by a CMV promoter without the CRISPR recognition sequence (positive control). AAV construct 4 contains all of the elements of construct 3, and further encodes a guide RNA specific to the CRISPR recognition sequence located in the luciferase payload and a Cas9 with a destabilizing domain. The guide RNA and Cas9 are both expressed from a constitutive promotor. Cas9 also contains a nuclear localization sequence.
In one example, two AAV vectors are used for transduction, one expressing the regulatable elements (Cas9 and guide RNA) and the other expressing the luciferase payload. These two vectors are delivered at a ratio determined to be optimal. Alternatively, one vector is used for transduction, expressing the regulatable elements and the luciferase payload.
HeLa cells are transduced with the AAV vector(s) constructs 1 through 4, according to methods known in the art in triplicate for each assay. The transduced HeLa cells are incubated in medium. Subsequently, cells are harvested according to a time course which includes 24, 48, and 72 hours. The cells are lysed and luciferase activity is measured and compared between the samples.
If the regulatable elements and the luciferase payload are on two separate vectors, the effective dose of the CRISPR regulatable element is determined. HeLa cells are transduced with construct 1 and different doses of the AAV vector encoding Cas9 and the guide RNA, each in triplicate. Cells are harvested, lysed and luciferase activity is measured for each dose of Cas9 and guide RNA employed.
The AAV constructs 1 and 2 are prepared, produced and tested. AAV particles 1 and 2 are each injected into mice of 3 to 5 months of age according to methods known in the art. At a set time post injection, such as for example 2 weeks, mice are injected with luciferin. Animals are then anesthetized, and images are acquired with an imaging system. Bioluminescence is measured as total flux (photons/second) of the entire mouse and compared between the samples.
The ability of an inducible system to regulate a CRISPR regulatable element, which in turn can regulate the payload in the context of AAV transduction is tested in vitro and in vivo. The ability of a regulatable element composed of two rapamycin inducible fusion proteins to regulate the expression of a CRISPR regulatable element is tested in vitro.
The following CRISPR-AAV constructs are prepared, produced and tested. AAV construct 1 encodes a luciferase payload, which contains a CRISPR recognition sequence, driven by a constitutive promoter. The recognition sequence is chosen using methods known in the art and described herein to ensure that the sequence is unique to the viral genome and does not occur in the host genome. AAV construct 2 contains the components of construct 1 and additionally encodes a guide RNA specific to the CRISPR recognition sequence located in the luciferase payload and a codon optimized Cas9 with a destabilizing domain. The guide RNA and Cas9 are both expressed from a constitutive promotor. Cas9 also contains a nuclear localization sequence. AAV construct 3 contains all of the components of construct 1, and additionally encodes a guide RNA specific to the CRISPR recognition sequence located in the luciferase payload and a codon optimized Cas9 with a destabilizing domain. The guide RNA and Cas9 are both expressed from a minimal promotor into which one or more ZHFD1 binding sites are inserted. Cas9 also contains a nuclear localization sequence. AAV construct 4 contains all of the components of construct 3, and additionally encodes two dimerizable fusion proteins, FKBP containing the DNA-binding domain of ZHFD1 and FRAP fused to the NF-kappaB p65 transactivation domain. The dimerizable fusion proteins are expressed from one constitutive promotor, and linked together through a 2A peptide sequence. The transcription factor and the transactivation domain fusion proteins both contain a nuclear localization sequence.
Two or more AAV vectors can be used for transduction, one or more expressing the regulatable elements (Cas9 and guide RNA, and dimerizable fusion proteins) and an additional vector expressing the luciferase payload. Alternatively, three AAV vectors are used, one encoding the CRISPR regulatable elements, one encoding the dimerizable fusion proteins, and one encoding the luciferase payload. These vectors are delivered at a ratio determined to be optimal. In another embodiment, one vector is used for transduction, expressing the regulatable elements and the luciferase payload. In some embodiments, an open reading frame for the DNA binding domain fusion protein and/or transactivation domain fusion protein is located in VP2.
HeLa cells are transduced with the AAV vector constructs 1 through 4, according to methods known in the art in triplicate for each assay. The transduced HeLa cells are incubated in medium in the presence or absence of rapamycin for a set time. Subsequently, cells are harvested according to a time course which includes 24, 48, and 72 hours. The cells are lysed and luciferase activity is measured. Throughout the time course, luciferase activity is compared between untreated and rapamycin treated samples for each AAV vector construct and also between samples transduced with the vector constructs 1 through 4.
The AAV constructs 1-4 are prepared, produced and tested. In addition, a 5th construct may be added, which is the same as 4, except that the fusion proteins are driven by a tissue specific promoter, such as a liver specific promoter. AAV particles 1-4 are each injected into two mice of 3 to 5 months of age according to methods known in the art. At approximately 2 weeks post injection, a further injection of rapamycin is administered to half of the mice. At a predetermined time, mice are injected with luciferin. Animals are then anesthetized and images are acquired with an imaging system. Bioluminescence is measured as total flux (photons/second) of the entire mouse. Luciferase activity is compared between untreated and rapamycin treated animals and also between animals injected with the AAV particles 1-4.
The present application claims the benefit of priority to U.S. Provisional Patent Application No. 62/895,212, entitled “VECTORIZED EDITING OF NUCLEIC ACIDS TO CORRECT OVERT MUTATIONS”, filed Sep. 3, 2019, the contents of which are herein incorporated by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/049104 | 9/3/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62895212 | Sep 2019 | US |