The contents of the text file submitted electronically herewith are incorporated herein by reference in their entirety: A computer readable format copy of the Sequence Listing (filename: B119570068WO00-SEQ.NRL, date recorded: Jan. 30, 2020; file size: 5,346 kilobytes).
AAV vectors provide a safe and versatile platform for gene therapy. For example, an AAV2 vector carrying the RPE65 gene is now an approved drug for the treatment of Leber's congenital amaurosis. Additionally, data from ongoing clinical trials supports the continued evaluation of AAV-based treatments for additional indications including hemophilia types A and B, Parkinson's disease, spinal muscular atrophy, and MPS I and II. Despite these encouraging results, expanding the use of in vivo gene therapy, especially in difficult to target organs such as the brain, is still hindered by delivery challenges.
The present disclosure is based, at least in part, on the identification of target proteins (e.g., Ly6 proteins) that enhance transcytosis of AAV capsids across the blood-brain barrier. The present disclosure provides, in some embodiments, methods for identifying AAV capsid proteins capable of crossing the blood-brain barrier, and compositions comprising such.
Some aspects of the present disclosure provide an AAV vector comprising an amino acid sequence that comprises at least 4 contiguous amino acids from a sequence listed in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19. Some aspects of the present disclosure provide an AAV vector comprising an amino acid sequence that is encoded by a nucleic acid sequence listed in any of the Tables included herein.
In some embodiments, the amino acid sequence is part of a capsid protein of the AAV vector. In some embodiments, the amino acid sequence is inserted at a position corresponding to the position between amino acids 586-592 of the sequence provided in SEQ ID NO: 730 or 731. In some embodiments, the amino acid sequence is inserted at a position corresponding to the position between amino acids 588-589 of the sequence provided in SEQ ID NO: 730 or 731.
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV vector comprises a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204.
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482), RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522).
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
In some embodiments, the AAV is an AAV9 vector. In some embodiments, the AAV vector is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV10 or AAV11 vector.
In some embodiments, the AAV vector comprises at least 5 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV vector comprises at least 6 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV vector comprises a sequence that is at least 80% identical to a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204.
In some embodiments, the the AAV vector comprises a sequence that contains a single amino acid substitution compared to a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204, and wherein the amino acid substitution is a conservative amino acid substitution
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids of: NSKTVVR (SEQ ID NO: 331), QRIQGQK (SEQ ID NO: 367), RGTRTEN (SEQ ID NO: 369), KLDKRMG (SEQ ID NO: 397), TRRDSLF (SEQ ID NO: 403), STKTVKL (SEQ ID NO: 420), LNNKQVR (SEQ ID NO: 454), RNTRTEA (SEQ ID NO: 479), GERSPRL (SEQ ID NO: 507), TPTNPRW (SEQ ID NO: 508), or SADRKHI (SEQ ID NO: 516).
In some embodiments, the amino acid sequence binds to a Ly6/uPAR protein. In some embodiments, the amino acid sequence specifically binds to a human Ly6/uPAR protein. In some embodiments, the amino acid sequence binds to a human Ly6/uPAR protein and binds to a non-human primate Ly6/uPAR protein. In some embodiments, the amino acid sequence binds to a human Ly6/uPAR protein, binds to a non-human primate Ly6/uPAR protein, and binds to a rodent Ly6/uPAR protein. In some embodiments, the Ly6/uPAR protein is CD59.
Some aspects of the present disclosure provide an AAV capsid protein comprising an amino acid sequence that comprises at least 4 contiguous amino acids from a sequence listed in Table 4, 5.6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or 19.
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV capsid protein comprises a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204.
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482), RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522).
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids of: NSKTVVR (SEQ ID NO: 331), QRIQGQK (SEQ ID NO: 367), RGTRTEN (SEQ ID NO: 369), KLDKRMG (SEQ ID NO: 397), TRRDSLF (SEQ ID NO: 403), STKTVKL (SEQ ID NO: 420), LNNKQVR (SEQ ID NO: 454), RNTRTEA (SEQ ID NO: 479), GERSPRL (SEQ ID NO: 507), TPTNPRW (SEQ ID NO: 508), or SADRKHI (SEQ ID NO: 516).
In some embodiments, the AAV capsid protein further comprises a nanoparticle or second molecule to which said AAV capsid protein is conjugated. In some embodiments, the AAV capsid protein is part of an AAV. In some embodiments, the AAV capsid protein is part of an AAV9.
In some embodiments, the AAV capsid protein comprises the amino acid sequence inserted at a position corresponding to the position between amino acids 586-592 of the sequence provided in SEQ ID NO: 730 or 731. In some embodiments, the AAV capsid protein comprises the amino acid sequence inserted at a position corresponding to the position between amino acids 588-589 of the sequence provided in SEQ ID NO: 730 or 731.
In some embodiments, the AAV capsid protein is part of an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV10 or AAV11.
In some embodiments, the AAV capsid protein comprises at least 5 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV capsid protein comprises at least 6 contiguous amino acids from a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the AAV capsid protein comprises a sequence that is at least 80% identical to a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204.
In some embodiments, the AAV capsid protein comprises a sequence that contains a single amino acid substitution compared to a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204, and wherein the amino acid substitution is a conservative amino acid substitution.
In some embodiments, the AAV capsid protein comprises the amino acid sequence that binds to a Ly6/uPAR protein. In some embodiments, the AAV capsid protein comprises the amino acid sequence that specifically binds to a human Ly6/uPAR protein. In some embodiments, the AAV capsid protein comprises the amino acid sequence that binds to a human Ly6/uPAR protein and binds to a non-human primate Ly6/uPAR protein. In some embodiments, the AAV capsid protein comprises the amino acid sequence that binds to a human Ly6/uPAR protein, binds to a non-human primate Ly6/uPAR protein, and binds to a rodent Ly6/uPAR protein. In some embodiments, the AAV capsid protein comprises the amino acid sequence that binds to CD59.
Some aspects of the present disclosure provide a library of AAV9 capsid proteins comprising an AAV9 capsid protein as described herein.
Some aspects of the present disclosure provide a nucleic acid sequence encoding an AAV capsid protein as described herein.
Some aspects of the present disclosure provide a pharmaceutical composition comprising an AAV capsid protein as described herein and one or more pharmaceutical acceptable carriers.
Some aspects of the present disclosure provide a peptide comprising an amino acid sequence set forth in SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the peptide further comprises a nanoparticle or second molecule to which said peptide is conjugated.
Some aspects of the present disclosure provide a method of delivering a nucleic acid to a target environment of a subject in need, comprising providing a composition comprising an AAV vector, wherein the AAV vector comprises a capsid protein that comprises an amino acid sequence that comprises at least 4 contiguous amino acids of a sequence selected from a sequence listed in Table 4, 5, 6, 78, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19, and wherein the AAV vector comprises a nucleic acid to be delivered to the target environment of the subject; and administering the composition to the subject.
In some embodiments, a method of delivering a nucleic acid to a target environment of a subject in need comprises providing a composition comprising any AAV vector described herein, and administering the composition to the subject.
In some embodiments, the target environment is the central nervous system, liver, muscle, heart, lungs, stomach, adrenal gland, adipose, intestine, or immune cells. In some embodiments, the target environment is neurons, astrocytes, cardiomyocytes, or a combination thereof.
In some embodiments, the nucleic acid to be delivered comprises one or more of: a) a nucleic acid sequence encoding a trophic factor, a growth factor, or a soluble protein; b) a cDNA that restores protein function to humans or animals harboring a genetic mutation(s) in that gene; c) a cDNA that encodes a protein that can be used to control or alter the activity or state of a cell; d) a cDNA that encodes a protein or a nucleic acid used for assessing the state of a cell; e) a cDNA and/or associated guide RNA for performing genomic engineering; f) a sequence for genome editing via homologous recombination; g) a DNA sequence encoding a therapeutic RNA; h) a shRNA or an artificial miRNA delivery system; and i) a DNA sequence that influences the splicing of an endogenous gene.
In some embodiments, the subject in need is a subject suffering from or at a risk to develop one or more of chronic pain, cardiac failure, cardiac arrhythmias, Friedreich's ataxia, Huntington's disease (HD), Alzheimer's disease (AD), Parkinson's disease (PD), Amyotrophic lateral sclerosis (ALS), spinal muscular atrophy types I and II (SMA I and II), Friedreich's Ataxia (FA), Spinocerebellar ataxia, lysosomal storage disorders that involve cells within the CNS.
In some embodiments, the AAV vector is administered to the subject via intravenous administration or systemic administration. In some embodiments, the nucleic acid is delivered to dorsal root ganglia, visceral organs, astrocytes, neurons, or a combination thereof of the subject.
Some aspects of the present disclosure provide a method comprising providing an AAV capsid protein; contacting the AAV capsid protein with a cell that expresses protein of Ly6/uPAR protein family attached to the surface of the cell; and selecting the AAV capsid protein if it specifically binds to the protein of the Ly6/uPAR protein family attached to the surface of the cell. In some embodiments, a method comprises any AAV capsid protein described herein.
In some embodiments, the protein of the Ly6/uPAR protein family is expressed recombinantly in the cell. In some embodiments, the protein of the Ly6/uPAR protein family is expressed endogenously in the cell. In some embodiments, the protein of the Ly6/uPAR protein family is a human protein. In some embodiments, the protein of the Ly6/uPAR protein family is expressed in the central nervous system. In some embodiments, the protein of the Ly6/uPAR protein family is LY6A, LY6C1, LY6E, CD59, Ly6H, LYNX1 or GPIHBP1. In some embodiments, the protein of the Ly6/uPAR protein family is ACRV1, CD177, CD59A, CD59B, GML, GML2, LY6A, LY6A2, LY6C1, LY6C2, LY6D, LY6E, LY6F, LY6G, LY6G2, LY6G5B, LY6G5C, LY6G6C, LY6G6D, LY6G6E, LY6G6F, LY6G6G, LY6I, LY6K, LY6L, LY6M, LYPD1, LYPD2, LYPD3, LYPD4, LYPD5, LYPD6, LYPD6B, LYPD8, LYPD9, LYPD10, LYPD11, PATE1, PATE2, PATE3, PATE4, PATE5, PATE6, PATE7, PATE8, PATE9, PATE10, PATE11, PATE12, PATE13, PATE14, PINLYP, PLAUR, PSCCA, SLURP1, SLURP2, SPACA4, or TEX101.
In some embodiments, the method comprises contacting the AAV capsid protein with a cell that expresses a GPI-anchored protein.
In some embodiments, the method is a method for identifying an AAV capsid protein that can cross the blood-brain barrier.
Some aspects of the present disclosure provide a method comprising providing a targeting peptide; incubating the targeting peptide with a protein of the Ly6/uPAR protein family; and selecting the targeting peptide if it specifically binds to the protein of the Ly6/uPAR protein family. In some embodiments, the protein of the Ly6/uPAR protein family is a fusion protein. In some embodiments, the protein of the Ly6/uPAR protein family is an Fc fusion. In some embodiments, the protein of the Ly6/uPAR protein family forms a dimer. In some embodiments, the protein of the Ly6/uPAR protein family is fused to a: AviTag, C-tag, Calmodulin-tag, E-tag, FLAG, HA, poly-HIS, MYC, NE, Rho1D4, S-tag, SBP, Softag, Spot-tag, T7-tag, TC, Ty, V5, VSV, Xpress, Isopeptag, SpyTag, SnoopTag, DogTag, SdyTag, BCCP, GST, GFP, Halo, SNAP, CLIP, Maltose binding protein (MBP), Nus-tag, Thioredoxin-tag, Fc-tag, CRDSAT, SUMO-tag, or B2M-tag. In some embodiment, the method as described herein is conducted in vitro.
In some embodiments, the targeting peptide is expressed within an AAV capsid protein. In some embodiments, the targeting peptide is expressed within an AAV9 capsid protein. In some embodiments, the targeting peptide is contained within an AAV capsid protein described herein. In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids of an amino acid sequence set forth in SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, the targeting peptide comprises an amino acid sequence set forth in SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204.
Some aspects of the present disclosure provide a method comprising delivering a protein, RNA, or DNA to a target environment of a subject and administering an adeno-associated virus (AAV) vector to the target environment of the subject. In some embodiments, the AAV vector comprises a capsid protein comprising at least 4 contiguous amino acids from a sequence listed in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19. In some embodiments, the AAV vector comprises a nucleic acid molecule to be delivered to the target environment of the subject. In some embodiments, the protein that is delivered is a LY6/uPAR protein. In some embodiments, the DNA or RNA that is delivered encodes a Ly6/uPAR protein. In some embodiments, the method as described herein is a method of treating a disorder or defect in a subject. In some embodiments, the nucleic acid molecule to be delivered to the target environment of the subject encodes a therapeutic protein. In some embodiments, the nucleic acid molecule is a therapeutic. In some embodiments, the therapeutic protein is effective for treating the disorder or defect in the subject. In some embodiments, the nucleic acid molecule is effective for treating the disorder or defect in the subject. In some embodiments, the LY6/uPAR protein is LY6A. In some embodiments, the LY6/uPAR protein is LY6C1. In some embodiments, the LY6/uPAR protein is a murine protein. In some embodiments, the AAV is a murine AAV. In some embodiments, the AAV targets the Ly6/uPAR protein.
In some embodiments, the nucleic acid molecule to be delivered comprises one or more of: a) a nucleic acid sequence encoding a trophic factor, a growth factor, or a soluble protein; b) a cDNA that restores protein function to humans or animals harboring a genetic mutation(s) in that gene; c) a cDNA that encodes a protein that can be used to control or alter the activity or state of a cell; d) a cDNA that encodes a protein or a nucleic acid used for assessing the state of a cell; e) a cDNA and/or associated guide RNA for performing genomic engineering; f) a sequence for genome editing via homologous recombination; g) a DNA sequence encoding a therapeutic RNA; h) a shRNA or an artificial miRNA delivery system; and i) a DNA sequence that influences the splicing of an endogenous gene. In some embodiments, the method as disclosed herein is a diagnostic method.
In some embodiments, the disorder or defect is one or more of chronic pain, cardiac failure, cardiac arrhythmias, Friedreich's ataxia, Huntington's disease (HD), Alzheimer's disease (AD), Parkinson's disease (PD), Amyotrophic lateral sclerosis (ALS), spinal muscular atrophy types I and II (SMA I and II), Friedreich's Ataxia (FA), Spinocerebellar ataxia, and lysosomal storage disorders that involve cells within the CNS.
In some embodiments, the protein, RNA, or DNA is delivered to the subject via intravenous administration or systemic administration. In some embodiments, the AAV vector is administered to the subject via intravascular administration or systemic administration. In some embodiments, the protein, RNA, or DNA is delivered to the subject in trans. In some embodiments, the present method provides that the protein, RNA, or DNA is delivered to the subject via a nanoparticle. In some embodiments, the RNA or DNA is delivered to the subject via a viral vector. In some embodiments, the protein delivered to the subject is a purified protein.
In some embodiments, the method provides that the protein, RNA, or DNA is delivered to the target environment first, followed by the administration of the AAV vector. In some embodiments, the delivering of the protein or RNA to the target environment and the administering of the AAV vector occur simultaneously. In some embodiments, the protein, RNA, or DNA is delivered in a targeted fashion to a target organ, region of an organ, tumor, ganglia, or to the cerebral spinal fluid of the subject.
Some aspects of the present disclosure provide a method of providing an adeno-associated virus (AAV) capsid protein; contacting the AAV capsid protein with a cell that expresses a GPI-anchored protein attached to the surface of the cell; and selecting the AAV capsid protein if it specifically binds to the GPI-anchored protein attached to the surface of the cell. Some aspects of the present disclosure provide a method of providing an adeno-associated virus (AAV) capsid protein; contacting the AAV capsid protein with a cell that expresses a protein attached to the surface of the cell; and selecting the AAV capsid protein if it specifically binds to the protein attached to the surface of the cell.
In some embodiments, the protein attached to the surface of the cell is: i) a protein that exhibits luminal surface exposure on brain endothelium; ii) a protein that is localized within lipid micro-domains; and/or iii) a protein that exhibits recycling/intracellular trafficking capabilities.
Some aspects of the present disclosure provides a method of providing a targeting peptide; incubating the targeting peptide with a GPI-anchored protein; and selecting the targeting peptide if it specifically binds to the GPI-anchored protein. In some embodiments, the method provides that the targeting peptide is contained within an adeno-associated virus (AAV) capsid protein.
Some aspects of the present disclosure provide a method of providing an adeno-associated virus (AAV) capsid protein; contacting the AAV capsid protein with a cell that expresses a surface protein; and selecting the AAV capsid protein if it specifically binds to the surface protein. In some embodiments, the surface protein is a GPI-anchored protein. In some embodiments, the GPI-anchored protein is a Ly6/uPAR protein. In some embodiments, the surface protein is a protein that traffics to the plasma membrane. In some embodiments the surface protein is expressed recombinantly in the cell. In some embodiments, next-generation sequencing is used to determine peptide disclosed herein. In some embodiments, targeting peptides disclosed herein do not have the sequence of SEQ ID NO: 10689 (YTLSQGW).
It should be appreciated that the foregoing concepts, and additional concepts discussed below, may be arranged in any suitable combination, as the present disclosure is not limited in this respect. Further, other advantages and novel features of the present disclosure will become apparent from the following detailed description of various non-limiting embodiments when considered in conjunction with the accompanying figures.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure, which can be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
Aspects of the present disclosure relate to methods for identifying targeting peptides that enhance transcytosis of AAV capsids across the blood-brain barrier via binding to target proteins such as Ly6/uPAR proteins. Accordingly, methods and compositions described herein are useful, in some embodiments, for in vivo gene therapy.
Aspects of the invention relate to adeno-associated virus (AAV) vectors and their use in gene therapy. AAV vectors described herein can be used to deliver a nucleic acid encoding a protein of interest to a subject, including delivery to the central nervous system (CNS) of a subject. AAV vectors are described further in U.S. Pat. No. 9,585,971 and US 2017/0166926, which are incorporated by reference herein in their entireties.
AAV refers to a replication-deficient Dependoparvovirus within the Parvoviridae genus of viruses. AAV can be derived from a naturally occurring virus or can be recombinant. AAV can be packaged into capsids, which can be derived from naturally occurring capsid proteins or recombinant capsid proteins. The single-stranded DNA genome of AAV includes inverted terminal repeat (ITRs), which are involved in integrating the AAV DNA into the host cell genome. In some embodiments, AAV integrates into a host cell genome, while in other embodiments, AAV is non-integrating. AAV vectors can comprise: one or more ITRs, including, for example a 5′ ITR and/or a 3′ ITR; one or more promoters; one or more nucleic acid sequences encoding one or more proteins of interest; and/or additional posttranscriptional regulator elements. AAV vectors described herein can be prepared using standard molecular biology techniques known to one of ordinary skill in the art, as described, for example, in Sambrook et al. (Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, N.Y. (2012)).
AAV vectors described herein can include sequences from any known organism and can include synthetic sequences. AAV vector sequences can be modified in any way known to one of ordinary skill in the art, such as by incorporating insertions, deletions or substitutions, and/or through the use of posttranscriptional regulatory elements, such as promoters, enhancers, and transcription and translation terminators, such as polyadenylation signals. AAV vectors can also include sequences related to replication and integration. In some embodiments, AAV vectors include a shuttle element for replication and integration.
AAV vectors can include any known AAV serotype, including, for example, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, and AAV11. In some embodiments, the AAV serotype is AAV9. Clades of AAV viruses are described in, and incorporated by reference, from Gao et al. (2004) J. Virol. 78(12):6381-6388.
AAV vectors of the present disclosure may comprise or be derived from any natural or recombinant AAV serotype. In some embodiments, the AAV vector may utilize or be based on an AAV serotype described in WO 2017/201258A1, the contents of which are incorporated herein by reference in its entirety, such as, but not limited to, AAV1, AAV2, AAV2G9, AAV3, AAV3a, AAV3b, AAV3-3, AAV4, AAV4-4, AAV5, AAV6, AAV6.1, AAV6.2, AAV6.1.2, AAV7, AAV7.2, AAV8, AAV9, AAV9.11, AAV9.13, AAV9.16, AAV9.24, AAV9.45, AAV9.47, AAV9.61, AAV9.68, AAV9.84, AAV9.9, AAV10, AAV11, AAV12, AAV16.3, AAV24.1, AAV27.3, AAV42.12, AAV42-1b, AAV42-2, AAV42-3a, AAV42-3b, AAV42-4, AAV42-5a, AAV42-5b, AAV42-6b, AAV42-8, AAV42-10, AAV42-11, AAV42-12, AAV42-13, AAV42-15, AAV42-aa, AAV43-1, AAV43-12, AAV43-20, AAV43-21, AAV43-23, AAV43-25, AAV43-5, AAV44.1, AAV44.2, AAV44.5, AAV223.1, AAV223.2, AAV223.4, AAV223.5, AAV223.6, AAV223.7, AAV1-7/rh.48, AAV1-8/rh.49, AAV2-15/rh.62, AAV2-3/rh.61, AAV2-4/rh.50, AAV2-5/rh.51, AAV3.1/hu.6, AAV3.1/hu.9, AAV3-9/rh.52, AAV3-11/rh.53, AAV4-8/r11.64, AAV4-9/rh.54, AAV4-19/rh.55, AAV5-3/rh.57, AAV5-22/rh.58, AAV7.3/hu.7, AAV16.8/hu.10, AAV16.12/hu.11, AAV29.3/bb.1, AAV29.5/bb.2, AAV106.1/hu.37. AAV114.3/hu.40, AAV127.2/hu.41, AAV127.5/hu.42, AAV128.3/hu.44, AAV130.4/hu.48. AAV145.1/hu.53, AAV145.5/hu.54, AAV145.6/hu.55, AAV161.10/hu.60, AAV161.6/hu.61. AAV33.12/hu.17, AAV33.4/hu.15, AAV33.8/hu.16, AAV52/hu.19, AAV52.1/hu.20, AAV58.2/hu.25, AAVA3.3, AAVA3.4, AAVA3.5, AAVA3.7, AAVC1, AAVC2, AAVC5, AAV-DJ, AAV-DJ8, AAVF3, AAVF5, AAVH2, AAVrh.72, AAVhu.8, AAVrh.68, AAVrh.70, AAVpi.1, AAVpi.3, AAVpi.2, AAVrh.60, AAVrh.44, AAVrh.65, AAVrh.55, AAVrh.47, AAVrh.69, AAVrh.45, AAVrh.59, AAVhu.12, AAVH6, AAVLK03, AAVH-1/hu.1, AAVH-5/hu.3, AAVLG-10/rh.40, AAVLG-4/rh.38, AAVLG-9/hu.39, AAVN721-8/rh.43, AAVCh.5, AAVCh.5R1, AAVcy.2, AAVcy.3, AAVcy.4, AAVcy.5, AAVCy.5R1, AAVCy.5R2, AAVCy.5R3, AAVCy.5R4, AAVcy.6, AAVhu.1, AAVhu.2, AAVhu.3, AAVhu.4, AAVhu.5, AAVhu.6, AAVhu.7, AAVhu.9, AAVhu.10, AAVhu.11, AAVhu.13, AAVhu.15, AAVhu.16, AAVhu.17, AAVhu.18, AAVhu.20, AAVhu.21, AAVhu.22, AAVhu.23.2, AAVhu.24, AAVhu.25, AAVhu.27, AAVhu.28, AAVhu.29, AAVhu.29R, AAVhu.31, AAVhu.32, AAVhu.34, AAVhu.35, AAVhu.37, AAVhu.39, AAVhu.40, AAVhu.41, AAVhu.42, AAVhu.43, AAVhu.44, AAVhu.44R1, AAVhu.44R2, AAVhu.44R3, AAVhu.45, AAVhu.46, AAVhu.47, AAVhu.48, AAVhu.48R1, AAVhu.48R2, AAVhu.48R3, AAVhu.49, AAVhu.51, AAVhu.52, AAVhu.54, AAVhu.55, AAVhu.56, AAVhu.57, AAVhu.58, AAVhu.60, AAVhu.61, AAVhu.63, AAVhu.64, AAVhu.66, AAVhu.67, AAVhu.14/9, AAVhu.t 19, AAVrh.2, AAVrh.2R, AAVrh.8, AAVrh.8R, AAVrh.10, AAVrh.12, AAVrh.13, AAVrh.13R, AAVrh.14, AAVrh.17, AAVrh.18, AAVrh.19, AAVrh.20, AAVrh.21, AAVrh.22, AAVrh.23, AAVrh.24, AAVrh.25, AAVrh.31, AAVrh.32, AAVrh.33, AAVrh.34, AAVrh.35, AAVrh.36, AAVrh.37, AAVrh.37R2, AAVrh.38, AAVrh.39, AAVrh.40, AAVrh.46, AAVrh.48, AAVrh.48.1, AAVrh.48.1.2, AAVrh.48.2, AAVrh.49, AAVrh.51, AAVrh.52, AAVrh.53, AAVrh.54, AAVrh.56, AAVrh.57, AAVrh.58, AAVrh.61, AAVrh.64, AAVrh.64R1, AAVrh.64R2, AAVrh.67, AAVrh.73, AAVrh.74, AAVrh8R, AAVrh8R A586R mutant, AAVrh8R R533A mutant, AAAV, BAAV, caprine AAV, bovine AAV, AAVhE1.1, AAVhEr1.5, AAVhER1.14, AAVhEr1.8, AAVhEr1.16, AAVhEr1.18, AAVhEr1.35, AAVhEr1.7, AAVhEr1.36, AAVhEr2.29, AAVhEr2.4, AAVhEr2.16, AAVhEr2.30, AAVhEr2.31, AAVhEr2.36, AAVhER1.23, AAVhEr3.1, AAV2.5T, AAV-PAEC, AAV-LK01, AAV-LK02, AAV-LK03, AAV-LK04, AAV-LK05, AAV-LK06, AAV-LK07, AAV-LK08, AAV-LK09, AAV-LK10, AAV-LK11, AAV-LK12, AAV-LK13, AAV-LK14, AAV-LK15, AAV-LK16, AAV-LK17, AAV-LK18, AAV-LK19, AAV-PAEC2, AAV-PAEC4, AAV-PAEC6, AAV-PAEC7, AAV-PAEC8, AAV-PAEC11, AAV-PAEC12, AAV-2-pre-miRNA-101, AAV-8h, AAV-8b, AAV-h, AAV-b, AAV SM 10-2, AAV Shuffle 100-1, AAV Shuffle 100-3, AAV Shuffle 100-7, AAV Shuffle 10-2, AAV Shuffle 10-6, AAV Shuffle 10-8, AAV Shuffle 100-2, AAV SM 10-1, AAV SM 10-8, AAV SM 100-3, AAV SM 100-10, BNP61 AAV, BNP62 AAV, BNP63 AAV, AAVrh.50, AAVrh.43, AAVrh.62, AAVrh.48, AAVhu.19, AAVhu.11, AAVhu.53, AAV4-8/rh.64, AAVLG-9/hu.39, AAV54.5/hu.23, AAV54.2/hu.22, AAV54.7/hu.24, AAV54.1/hu.21, AAV54.4R/hu.27, AAV46.2/hu.28, AAV46.6/hu.29, AAV128.1/hu.43, true type AAV (ttAAV), UPENN AAV 10, Japanese AAV 10 serotypes, AAV CBr-7.1, AAV CBr-7.10, AAV CBr-7.2, AAV CBr-7.3, AAV CBr-7.4, AAV CBr-7.5, AAV CBr-7.7, AAV CBr-7.8, AAV CBr-B7.3, AAV CBr-B7.4, AAV CBr-E1, AAV CBr-E2, AAV CBr-E3, AAV CBr-E4. AAV CBr-E5, AAV CBr-e5, AAV CBr-E6, AAV CBr-E7, AAV CBr-E8, AAV CHt-1, AAV CHt-2, AAV CHt-3, AAV CHt-6.1, AAV CHt-6.10, AAV CHt-6.5, AAV CHt-6.6, AAV CHt-6.7, AAV CHt-6.8, AAV CHt-P1, AAV CHt-P2, AAV CHt-P5, AAV CHt-P6, AAV CHt-P8, AAV CHt-P9, AAV CKd-1, AAV CKd-10, AAV CKd-2. AAV CKd-3, AAV CKd-4, AAV CKd-6, AAV CKd-7, AAV CKd-8, AAV CKd-B1, AAV CKd-B2, AAV CKd-B3, AAV CKd-B4, AAV CKd-B5, AAV CKd-B6, AAV CKd-B7, AAV CKd-B8, AAV CKd-H1, AAV CKd-H2, AAV CKd-H3, AAV CKd-H4, AAV CKd-H5, AAV CKd-H6, AAV CKd-N3, AAV CKd-N4, AAV CKd-N9, AAV CLg-F1, AAV CLg-F2, AAV CLg-F3, AAV CLg-F4, AAV CLg-F5, AAV CLg-F6, AAV CLg-F7, AAV CLg-F8, AAV CLv-1, AAV CLv1-1, AAV Clv1-10, AAV CLv1-2, AAV CLv-12, AAV CLv1-3, AAV CLv-13, AAV CLv1-4, AAV Clv1-7, AAV Clv1-8, AAV Clv1-9, AAV CLv-2, AAV CLv-3, AAV CLv-4. AAV CLv-6, AAV CLv-8, AAV CLv-D1, AAV CLv-D2, AAV CLv-D3, AAV CLv-D4, AAV CLv-D5, AAV CLv-D6, AAV CLv-D7, AAV CLv-D8, AAV CLv-E1, AAV CLv-K1, AAV CLv-K3, AAV CLv-K6, AAV CLv-L4, AAV CLv-L5, AAV CLv-L6, AAV CLv-M1, AAV CLv-M11, AAV CLv-M2, AAV CLv-M5, AAV CLv-M6, AAV CLv-M7, AAV CLv-M8, AAV CLv-M9, AAV CLv-R1, AAV CLv-R2, AAV CLv-R3, AAV CLv-R4, AAV CLv-R5, AAV CLv-R6, AAV CLv-R7, AAV CLv-R8, AAV CLv-R9, AAV CSp-1, AAV CSp-10, AAV CSp-11, AAV CSp-2, AAV CSp-3, AAV CSp-4, AAV CSp-6, AAV CSp-7, AAV CSp-8, AAV CSp-8.10, AAV CSp-8.2, AAV CSp-8.4, AAV CSp-8.5, AAV CSp-8.6, AAV CSp-8.7, AAV CSp-8.8, AAV CSp-8.9, AAV CSp-9, AAV.hu.48R3, AAV.VR-355, AAV3B, AAV4, AAV5, AAVF1/HSC1, AAVF11/HSC11. AAVF12/HSC12, AAVF13/HSC13, AAVF14/HSC14, AAVF15/HSC15, AAVF16/HSC16. AAVF17/HSC17, AAVF2/HSC2, AAVF3/HSC3, AAVF4/HSC4, AAVF5/HSC5, AAVF6/HSC6, AAVF7/HSC7, AAVF8/HSC8, AAVF9/HSC9, AAV-PHP.B (PHP.B), AAV-PHP.A (PHP.A), G2B-26, G2B-13, TH1.1-32 and/or TH1.1-35, and variants thereof.
AAV vectors disclosed herein comprise targeting sequences (e.g., 7-mer sequences) capable of directing the AAV vectors to specific environments within a subject, including, in some embodiments, directing the AAV vectors across the blood-brain barrier in a subject. In some embodiments, the targeting sequence is inserted into the capsid protein of the AAV vector. The targeting sequence can be inserted into any region of the capsid protein. In some embodiments, the targeting sequence is inserted at a position corresponding to the position between amino acids 588 and 589 of an AAV9 capsid protein, such as a capsid protein provided in SEQ ID NO: 730 or 731. In some embodiments, the targeting sequence is inserted at a position corresponding to a position between amino acids 586 and 592 of an AAV9 capsid protein, such as a capsid protein provided in SEQ ID NO: 730 or 731.
As used herein, a position (such as a nucleic acid residue or an amino acid residue) in sequence “X” is referred to as corresponding to a position or residue (such as a nucleic acid residue or an amino acid residue) “a” in sequence “Y” when the residue in sequence “X” is at the counterpart position of “a” in sequence “Y” when sequences X and Y are aligned using amino acid sequence alignment tools known in the art, such as, for example, Clustal Omega or BLAST®. One of ordinary skill in the art would be able to determine a position in a given protein that corresponds to the position between amino acids 588 and 589 of an AAV9 capsid protein, or a position between amino acids 586 and 592 of an AAV9 capsid protein, such as a capsid protein provided in SEQ ID NO: 730 or 731, using methods known in the art.
Aspects of the present disclosure, in some embodiments, provide an AAV vector comprising an amino acid sequence that comprises at least 4 contiguous amino acids from a sequence listed in Table 4, 5, 6, 78, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19. In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of a sequence selected from SEQ ID NOs: 316-522, 732-1909, 3088-3199, 3312-6429, 9548-10086, 10626-10688, 10690-11520, 12481-12683, 12952-20446, 27942-28880, 29819-29983, 30149-30166 and 30185-30204. In some embodiments, an AAV vector comprises a sequence selected from SEQ ID NOs: 316-30,204. In some embodiments, any sequence selected from SEQ ID NOs: 316-30,204 is compatible with aspects of the disclosure, including in some embodiments insertion into AAV vectors as described herein.
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482), RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522). In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of any of sequences listed in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19.
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of any one of: SEQ ID NO: 732-1909, SEQ ID NO: 3088-3199, SEQ ID NO: 3312-6429, SEQ ID NO: 9548-10086, 1 SEQ ID NO: 0626-10688, SEQ ID NO: 10690-11520, SEQ ID NO: 12481-12683, SEQ ID NO: 12952-20446, SEQ ID NO: 27942-28880, SEQ ID NO: 29819-29983, SEQ ID NO: 30149-30166, or SEQ ID NO: 30185-30204. In some embodiments, the AAV vector does not comprise SEQ ID NO: 10689 (YTLSQGW).
In some embodiments, the AAV vector comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
Aspects of the invention relate to AAV capsid proteins. AAV capsid proteins described herein may have a sequence that is different from the corresponding wild type AAV capsid protein sequence or is different from a reference AAV capsid protein sequence. An AAV capsid protein can include an insertion, deletion, or substitution of one or more nucleotides or one or more amino acids relative to the corresponding wild type AAV capsid protein sequence or relative to a reference AAV capsid protein sequence. The insertion, deletion, or substitution of one or more nucleotides or one or more amino acids can be at the 5′ end, the 3′ end and/or internally within the capsid sequence.
In some embodiments, the AAV capsid protein comprising at least 4, at least 5 contiguous amino acids, or at least 6 contiguous amino acids contiguous amino acids of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482). RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522).
In some embodiments, the AAV capsid protein comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
The nucleotide sequence of an AAV capsid protein can be at least about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99% or more than 99%, inclusive of all ranges and subranges therebetween, identical to a wild type AAV capsid nucleotide sequence or a reference AAV capsid nucleotide sequence. The protein sequence of an AAV capsid protein can be at least about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99% or more than 99%, inclusive of all ranges and subranges there between, identical to a wild type AAV capsid protein sequence or a reference AAV capsid protein sequence.
Also disclosed herein are libraries of AAV capsid proteins, such as AAV9 capsid proteins. As used herein, a “library” of AAV capsid proteins refers to a collection of at least two AAV capsid proteins. In some embodiments, at least one of the AAV capsid proteins within the library includes an insertion of a targeting sequence (e.g., a 7-mer). In some embodiments, at least one of the AAV capsid proteins within the library includes an insertion of a targeting sequence selected from the targeting sequences in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19.
The AAV capsid protein can, in some embodiments, include one or more amino acid substitutions relative to the corresponding wildtype AAV capsid protein provided in SEQ ID NO: 730, including but not limited to, a K449R substitution, a A587D substitution, a Q588G substitution, a A587G substitution, a Q588G substitution, a V592T substitution, a K595S substitution, a A595N substitution, a Q597P substitution, or any combination thereof. An example an AAV capsid protein comprising a K449R substitution is provided in SEQ ID NO: 731. Amino acid modifications of AAV capsid proteins are described further in, and incorporated by reference from Li et al. (2012) Journal of Virology 86(15): 7752-7759. Sequences of AAV9 capsid proteins are further described in, and incorporated by reference from U.S. Pat. No. 7,198,951, assigned to The Trustees of the University of Pennsylvania.
The targeting sequences disclosed herein, in some embodiments, can increase transduction efficiency of an AAV across the blood-brain barrier in a subject relative to an AAV that does not contain the targeting sequence. For example, the inclusion of one or more of the targeting sequences disclosed herein in an AAV can result in an increase in transduction efficiency by at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, 6-fold, 6.5-fold, 7-fold, 7.5-fold, 8-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold, or more than 100-fold, including all values in between, relative to an AAV that lacks the targeting sequence. In some embodiments, the transduction efficiency is increased for transducing AAV to the blood-brain barrier. In some embodiments, the transduction efficiency is increased for transducing AAV to the CNS. In some embodiments, the transduction efficiency is increased for transducing AAV to the PNS. In some embodiments, the transduction efficiency is increased for transducing AAV to the heart. In some embodiments, the transduction efficiency is increased for transducing AAV to cardiomyocytes, sensory neurons, dorsal root ganglia, visceral organs, or any combination thereof. In some embodiments, the transduction efficiency is increased for transducing AAV to any target environment suitable for the delivery of AAV vectors.
In some embodiments, an AAV9 capsid protein, or a library of AAV9 capsid proteins, is provided in which the AAV9 genome contains the viral replication gene (rep) and capsid gene (cap) that have been modified so as to not prevent the replication of the virus under conditions in which it could normally replicate. In some embodiments, an AAV9 capsid protein, or a library of AAV9 capsid proteins, is provided in which the AAV9 genome contains an engineered cap gene. In some embodiments, an AAV9 capsid protein, or a library of AAV9 capsid proteins, is provided in which the AAV9 genome contains the rep cap genes are flanked by ITRs. In some embodiments, an AAV genome contains the cap gene and contains rep gene sequences that are involved in regulating expression and/or splicing of the cap gene. In some embodiments, a capsid gene recombinase recognition sequence is provided, optionally with flanking ITRs.
Libraries of AAV capsid proteins, such as AAV9 capsid proteins, described herein, can be used to select for AAV capsid proteins that exhibit, e.g.: enhanced targeting to specific cells or organs; evasion of immunity; efficiency at homologous recombination; efficiency of conversion of the single stranded AAV genome to a double stranded DNA genome within a cell; and/or increased conversion of an AAV genome to a persistent, circularized form within the cell.
Aspects of the invention relate to targeting peptides that can direct AAV, e.g., to a specific target environment. In some embodiments, the target environment is a cell (e.g., neuron). In some embodiments, the target environment is neurons, astrocytes, cardiomyocytes, or a combination thereof. In some embodiments, the target environment is an organ (e.g., heart, brain). In some embodiments, the targeting peptide directs AAV to the central nervous system (CNS) of a subject. The CNS includes, e.g., brain tissue, nerves (e.g., optic nerves or cranial nerves), and fluid (e.g., cerebrospinal fluid). In some embodiments, the targeting peptide directs AAV to the peripheral nervous system (PNS) of a subject. Targeting peptides can be conjugated to other components, such as a nanoparticle or a viral capsid protein.
In some embodiments, the targeting peptide comprises an amino acid consensus motif selected from the group consisting of (T/S)-(L/I/V/M)-(A/x)-(V/x)-P-F-K, (SIT)-(V/x)-(S/T/x)-(K/R)-P-F-(L/I/V/A), x-x-x-F-K-(D/N)-(I/V/P), x-(K/R/Y)-(x/R/K/Y/F)-(G/Y/K/R/x)-(Y/W/F/L/M)-(S/A)-(S/T/A/Q), S-X-X-G-W-(V/A/S/T/I/L)-(A/P), Y-X-X-X-X-(G/S)-W, K-X-X-G/X-S-(V/I/Y/F/M)-Y, R-(F/Y)-X-(G/S)-(D/E)-(S/A/P/N/G)(S/A/G/T/V/I/Q), X-X-X-G-(Y/F/W)-S-(Q/S/T/A/M), X-X-X-P-G-V-W, G-X-X-X-G-R-W, (D/E)-(V/G/D/P/L/N/A)-(G/P/A/T/D/N/L)-S-G-R-W, S-(P/UY/E/G/T/D/A)-(G/N/S/D/V/T/H)-(D/S/G/E/P/V/Y/I)-(G/A/S/N/V/A)-R-W, X-X-Y-X-G-S-(S/T/V/A/M/Q/I/H)R-(TVL)-(S/G)-(A/S)-(G/N/x)-(S/G/M/x)-(T/S), G-S-G-T-V-(K/R)-X, Q-N-R-X-X-Y-V, Y-H-P-(L/M)-D-(V/P/I/R/K/UM/W)-(T/S), and X-X-(F/W)-X-P-P-S, where x is any amino acid.
In some embodiments, the targeting peptide comprises an amino acid consensus motif selected from the group consisting of (T/S)-(L/I/V/M)-(A/x)-(V/x)-P-F-K, (SIT)-(V/x)-(S/T/x)-(K/R)-P-F-(L/I/V/A), x-x-x-F-K-(D/N)-(I/V/P), x-(K/R/Y)-(x/R/K/Y/F)-(G/Y/K/R/x)-(Y/W/F/L/M)-(S/A)-(S/T/A/Q), F-T-(hydrophobic)-x-x-P-K, (S/T/x)-x-x-x-P-F-(R/K), G-x-(F/W)-x-P-P-x, (T/S/X)-X-X-(R/K)-P-F-(I/L/V/Q/H/S/T/M/A), P-(S/T/X)-(S/T/X)-(S/T/X)-(S/T/X)-(S/T)-W, (S/G)-X-X-G-W-A-P, L-T-(hydrophobic)-x-T-S-(V/I/K/R), X-X-(K/R)-F-E-X-(I/V/M), X-X-(F/W)-X-P-P-S, S-X-X-G-W-(V/A/S/T/I/L)-(A/P), Y-X-X-X-X-(G/S)-W, K-X-X-G/X-S-(V/I/Y/F/M)-Y, R-(F/Y)-X-(G/S)-(D/E)-(S/A/P/N/G)(S/A/G/T/V/I/Q), X-X-X-G-(Y/F/W)-S-(Q/S/T/A/M), X-X-X-P-G-V-W, G-X-X-X-G-R-W, (D/E)-(V/G/D/P/L/N/A)-(G/P/A/T/D/N/L)-S-G-R-W, S-(P/L/Y/E/G/T/D/A)-(G/N/S/D/V/T/H)-(D/S/G/E/P/V/Y/I)-(G/A/S/N/V/A)-R-W, X-X-Y-X-G-S-(S/T/V/A/M/Q/I/H), R-(TVL)-(S/G)-(A/S)-(G/N/x)-(S/G/M/x)-(T/S), G-S-G-T-V-(K/R)-X, Q-N-R-X-X-Y-V, and Y-H-P-(L/M)-D-(V/P/I/R/K/L/M/W)-(T/S), where x is any amino acid.
Targeting peptides, as described herein, may be various lengths. In some embodiments, the targeting peptide comprises 4 amino acids (e.g., 4-mer). In some embodiments, the targeting peptide comprises 5 amino acids (e.g., 5-mer). In some embodiments, the targeting peptide comprises 6 amino acids (e.g., 6-mer). In some embodiments, the targeting peptide comprises 7 amino acids (e.g., 7-mer). In some embodiments, the targeting peptide comprises 8 amino acids (e.g., 8-mer). In some embodiments, the targeting peptide comprises 9 amino acids (e.g., 9-mer). In some embodiments, the targeting peptide comprises 10 amino acids (e.g., 10-mer). In some embodiments, the targeting peptide comprises less than 4 or more than 10 amino acids. In some embodiments, the targeting peptide can be any length comprising any numbers of amino acids that are suitable for the incorporation into AAV vectors.
Targeting peptides, as described herein, may be charged or uncharged. In some embodiments, the targeting peptide is positively charged. In some embodiments, the targeting peptide is negatively charged. In some embodiments, the targeting peptide is neutrally charged. In some embodiments, the targeting peptide is uncharged.
Targeting peptides, as described herein, may comprise positively charged amino acids and negatively charged amino acids in various ratios. In some embodiments, the targeting peptide comprises positively charged amino acids and negatively charged amino acids in a 0:1 or 1:0 ratio. In some embodiments, the targeting peptide comprises positively charged amino acids and negatively charged amino acids in a 1:1, 2:1, 3:1, or 4:1 ratio. In some embodiments, the targeting peptide comprises positively charged amino acids and negatively charged amino acids in a 1:2, 1:3, or 1:4 ratio. In some embodiments, the targeting peptide comprises at least one negatively charged amino acids (e.g., arginine) and at least one hydrophobic amino acid residue (e.g., leucine). In some embodiments, the targeting peptide comprises two arginine residues and two leucine residues.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of (T/S)-(L/I/V/M)-(A/x-V/x-P-F-K) (SEQ ID NO: 30225), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 34-47.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of (S/T)-(V/x)-(S/T/x)-(K/R)P-F-(L/I/V/A) (SEQ ID NO: 30226), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 48-77. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 78-107.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of x-x-x-F-K-(D/N)-(I/V/P) (SEQ ID NO: 30227), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 108-119. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 120-131.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of x-(K/R/Y)-(x/R/K/Y/F)-(G/Y/K/R/x)-(Y/W/F/L/M)-(S/A)-(S/T/A/Q) (SEQ ID NO: 30228), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 132-218. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 219-305.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of R-(TVL)-(S/G)-(A/S)-(G/N/x)-(S/G/M/x)-(T/S) (SEQ ID NO: 30280), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 30149-30155.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of G-S-G-T-V-(K/R)-X (SEQ ID NO: 30281), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 30156-20160.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of Q-N-R-X-X-Y-V (SEQ ID NO: 30282), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 30161-30162.
In some embodiments, the targeting peptide comprises an amino acid consensus motif consisting of Y-H-P-(L/M)-D-(V/P/I/R/K/L/M/W)-(T/S) (SEQ ID NO: 30283), where x is any amino acid. In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 30185-30204.
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of a sequence selected from SEQ ID NOs: 306-310. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 311-315.
In some embodiments, the targeting peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 316-30204. In some embodiments, the targeting peptide is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 523-729.
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482), RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522).
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids of any one of: SEQ ID NO: 732-1909, SEQ ID NO: 3088-3199, SEQ ID NO: 3312-6429, SEQ ID NO: 9548-10086, 1 SEQ ID NO: 0626-10688, SEQ ID NO: 10690-11520, SEQ ID NO: 12481-12683, SEQ ID NO: 12952-20446, SEQ ID NO: 27942-28880, SEQ ID NO: 29819-29983, SEQ ID NO: 30149-30166, or SEQ ID NO: 30185-30204.
In some embodiments, the targeting peptide comprises at least 4 contiguous amino acids, at least 5 contiguous amino acids, or at least 6 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218. SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide comprises at least 5 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide comprises at least 6 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide comprises 7 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204.
In some embodiments, the targeting peptide is at least 75% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide is at least 80% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide is at least 85% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522. SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide is at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide is at least 95% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide is at least 98% identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204.
In some embodiments, the targeting peptide comprises at least 1 amino acid substitution in an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide comprises at least 2 amino acid substitutions in an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the targeting peptide comprises at least 3, at least 4, at least 5, or at least 6, or at least 7 amino acid substitutions in an amino acid sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204. In some embodiments, the at least one amino acid substitution is a conservative amino acid substitution.
In some embodiments, a targeting peptide contains one or more amino acid substitutions relative to a sequence disclosed in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19. In some embodiments, the amino acid substitution is a conservative amino acid substitution. As used herein, a “conservative amino acid substitution” refers to an amino acid substitution that does not alter the relative charge or size characteristics or functional activity of the protein in which the amino acid substitution is made. Conservative substitutions of amino acids include substitutions made amongst amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. Non-limiting examples of conservative amino acid substitutions are provided in Table 8.
In some embodiments, a targeting peptide comprises one or more of the sequences disclosed herein. In other embodiments, a targeting peptide consists of one or more of the sequences disclosed herein. In other embodiments, a targeting peptide consists essentially of one or more of the sequences disclosed herein. Targeting peptides described herein can be fused to or inserted into longer peptides. In some embodiments, targeting peptides are isolated. In some embodiments, targeting peptides are not naturally occurring.
Also disclosed herein are nucleic acid sequences that encode one or more of the targeting peptides disclosed herein. In some embodiments, a nucleic acid sequence encoding a targeting peptide comprises or consists of a sequence selected from the group consisting of SEQ ID NOs: 20-33, SEQ ID NOs: 48-77, SEQ ID NOs: 108-119, SEQ ID NOs: 132-218, SEQ ID NOs: 306-310, SEQ ID NOs: 316-522, SEQ ID NOs: 732-1909, SEQ ID NOs: 3088-3199, SEQ ID NOs: 3312-6429, SEQ ID NOs: 9548-10086, SEQ ID NOs: SEQ ID NOs: 10626-10688, SEQ ID NOs: 10690-11520, SEQ ID NOs: 12481-12683, SEQ ID NOs: 12952-20446, SEQ ID NOs: 27942-28880, SEQ ID NOs: 29819-29983, SEQ ID NOs: 30149-30166 and SEQ ID NOs: 30185-30204.
In some embodiments, the nucleic acid sequence encoding a targeting peptide comprises an amino acid sequence of: PKMTLKI (SEQ ID NO: 320), LGKKTNS (SEQ ID NO: 325), LPKYKSS (SEQ ID NO: 396), GRGNSVL (SEQ ID NO: 465), RSPRVNA (SEQ ID NO: 466), IRNPRMA (SEQ ID NO: 467), ARRPNSE (SEQ ID NO: 480), IKMLNKP (SEQ ID NO: 484), or REVLQRI (SEQ ID NO: 506).
In some embodiments, the nucleic acid sequence encoding a targeting peptide comprises an amino acid sequence of: RKPRVHD (SEQ ID NO: 317), YADTNRR (SEQ ID NO: 321), TKSVRVV (SEQ ID NO: 327), TKSSMRP (SEQ ID NO: 336), RRHLAET (SEQ ID NO: 346), RRPPSMG (SEQ ID NO: 354), KDRKVPN (SEQ ID NO: 382), KVTNRHE (SEQ ID NO: 439), DMDLGMG (SEQ ID NO: 453), IEKPTYR (SEQ ID NO: 482), RGKMELY (SEQ ID NO: 505), SKDNHRM (SEQ ID NO: 511), DIHGANL (SEQ ID NO: 512), HSVGYLD (SEQ ID NO: 514), ASLADRP (SEQ ID NO: 515), SKNDHEY (SEQ ID NO: 517), or NLGAINK (SEQ ID NO: 522).
In some embodiments, the nucleic acid sequence encoding a targeting peptide comprises an amino acid sequence of: RSMKPNN (SEQ ID NO: 316), RKPRVHD (SEQ ID NO: 317), VRKMPDY (SEQ ID NO: 318), QKPIRIV (SEQ ID NO: 319), PKMTLKI (SEQ ID NO: 320), YADTNRR (SEQ ID NO: 321), RKQMNTT (SEQ ID NO: 322), ELYKLPT (SEQ ID NO: 323), GGQLRKP (SEQ ID NO: 324), LGKKTNS (SEQ ID NO: 325), NRQTVKG (SEQ ID NO: 326), TKSVRVV (SEQ ID NO: 327), GINVRPR (SEQ ID NO: 328), KKGSIGS (SEQ ID NO: 329), LRKNPNP (SEQ ID NO: 330), NSKTVVR (SEQ ID NO: 331), VRRTQLD (SEQ ID NO: 332), KKSTILA (SEQ ID NO: 333), RSKLGSG (SEQ ID NO: 334), DRRGHDR (SEQ ID NO: 335), TKSSMRP (SEQ ID NO: 336), NRITPNR (SEQ ID NO: 337), KIQNNKQ (SEQ ID NO: 338), KSRLTQP (SEQ ID NO: 339), SQKAGGR (SEQ ID NO: 340), ARKTPDY (SEQ ID NO: 341), TRKPVVI (SEQ ID NO: 342), NLKDKRT (SEQ ID NO: 343), KRDARMN (SEQ ID NO: 344), KGSMRQA (SEQ ID NO: 345), RRHLAET (SEQ ID NO: 346), VKTHRPV (SEQ ID NO: 347), or KRNNVAA (SEQ ID NO: 348).
In some embodiments, a targeting peptide does not comprise or consist of a sequence disclosed in WO2015/038958 or WO2017/100671, which are incorporated by reference herein in their entireties.
Disclosed herein are targeting peptides capable of directing AAV to the central nervous system (CNS) via binding to at least one target protein. In some embodiments, an AAV capsid protein comprising a targeting peptide has increased transduction efficiency across the blood-brain barrier as compared to an AAV capsid protein lacking the targeting peptide. As used herein, the term “blood-brain barrier” or “BBB” refers to a network of blood vessels and tissue comprising closely spaced cells that regulate transport of substances between circulating blood from the brain and extracellular fluid in the CNS.
Target proteins that bind to targeting peptides described herein can include one or more of the following characteristics: expression in the CNS; capability of mediating transcytosis; capability of mediating endocytosis; capability of mediating intra-cellular trafficking; association with lipid rafts; and linkage to the cell surface, such as through a glycophosphatidylinositol (GPI) anchor. Characteristics of GPI-anchored proteins are described in, and incorporated by reference, from Zurzolo et al. (2016) BBA1858: 632-639; Saha et al. (2016) J. Lipid Res. 57: 159-175; Mayor et al. (2004) Nat Rev Mol Cell Biol 5, 110-120.
Target proteins, as described herein, can include, but are not limited to, members of the lymphocyte antigen-6 (Ly6)/urokinase-type plasminogen activator receptor (uPAR) protein family and GPI-anchored proteins. Notably, AAV2 has been shown to internalize in detergent-resistant GPI-anchored protein enriched endosomal compartment (GEEC), which is described in, and incorporated by reference, from: https://doi.org/10.1016/j.chom.2011.10.014. Ly6/uPAR proteins are cysteine-rich proteins characterized by a distinct disulfide bridge pattern that creates the three-finger Ly6/uPAR (LU) domain. As used herein, “Ly6/uPAR proteins” includes proteins that contain an LU domain regardless of whether they have been characterized as Ly6/uPAR proteins, and includes proteins that have been characterized as “Ly6-like” proteins, such as CD59. One of ordinary skill in the art would be able to recognize whether a protein sequence corresponds to a Ly6/uPAR protein, as used herein. For example, in some embodiments, a protein can be characterized as a Ly6/uPAR protein based on its level of homology to a protein that has been characterized as a Ly6/uPAR protein, or based on its level of homology to a protein that has been characterized as a Ly6-like protein. In other embodiments, a protein can be characterized as a Ly6/uPAR protein based on the presence of an LU domain.
The Ly6/uPAR protein family comprises at least 35 human and 61 mouse Ly6/uPAR proteins. Ly6/uPAR proteins are classified as glycophosphatidylinositol (GPI)-anchored proteins on the cell membrane or as secreted proteins based on their subcellular localization. The genes encoding Ly6/uPAR family proteins are conserved across different species and are clustered in syntenic regions on human chromosomes 8, 19, 6 and 11, and mouse Chromosomes 15, 7, 17, and 9, respectively. The Ly6/uPAR protein family is described further in Loughner et al. (2016) Human Genomics 10:10, which is incorporated by reference herein in its entirety.
Targeting peptides as described herein, in some embodiments, bind to a Ly6/uPAR protein. The Ly6/uPAR protein can be from any mammal, including humans and non-human primates. In some embodiments, the targeting peptide binds to a human Ly6 protein. In other embodiments, the targeting peptide binds to a non-human primate Ly6 protein. In other embodiments, the targeting peptide binds to a rodent Ly6/uPAR protein, such as a mouse Ly6/uPAR protein. Examples of Ly6/uPAR proteins include, but are not limited to, ACRV1, CD177, CD59A, CD59B, GML, GML2, GPIHBP1, LY6A, LY6A2, LY6C1, LY6C2, LY6D, LY6E, LY6F, LY6G, LY6G2, LY6G5B, LY6G5C, LY6G6C, LY6G6D, LY6G6E, LY6G6F, LY6G6G, LY6H, LY6I, LY6K, LY6L, LY6M, LYNX1, LYPD1, LYPD2, LYPD3, LYPD4, LYPD5, LYPD6, LYPD6B, LYPD8, LYPD9, LYPD10, LYPD11, PATE1, PATE2, PATE3, PATE4, PATE5, PATE6, PATE7, PATE8, PATE9, PATE10, PATE11, PATE12, PATE13, PATE14, PINLYP, PLAUR, PSCCA, SLURP1, SLURP2, SPACA4, and TEX101.
Human genes encoding Ly6/uPAR proteins include, but are not limited to, ACRV1, CD177, CD59, GML, GPIHBP1, LY6D, LY6E, LY6G5B, LY6G5C, LY6G6C, LY6G6D, LY6G6E, LY6G6F, LY6H, LY6K, LY6L, LYNX1, LYPD1, LYPD2, LYPD3, LYPD4, LYPD5, LYPD6, LYPD6B, LYPD8, PATE1, PATE2, PATE3, PATE4, PINLYP, PLA UR, PSCA, SLURP1, SLURP2, SPACA4, and TEX101.
Mouse genes encoding Ly6/uPAR proteins include, but are not limited to, Acrv1, Cd177, Cd59a, Cd59b, Gml, Gml2, Gpihbp1, Ly6a, Ly6a2, Ly6c1, Ly6c2, Ly6d, Ly6e, Ly6f, Ly6g, Ly6g2, Ly6g5b, Ly6g5c, Ly6g6c, Ly6g6d, Ly6g6e, Ly6g6f, Ly6g6g, Ly6h, Ly6i, Ly6k, Ly6l, Ly6m, Lynx1, Lypd1, Lypd2, Lypd3, Lypd4, Lypd5, Lypd6, Lypd6b, Lypd8, Lypd9, Lypd10, Lypd11, Pate1, Pate2, Pate3, Pate4, Pate5, Pate6, Pate7, Pate8, Pate9, Pate10, Pate11, Pate12, Pate13, Pate14, Pinlyp, Plaur, Psca, Slurp1, Slurp2, Spaca4, and Tex101.
It should be appreciated that Ly6/uPAR proteins and their expression patterns are known in the art. Information regarding the sequences of Ly6/uPAR proteins and the tissues or cells in which they are expressed is available through public databases known to one of ordinary skill in the art.
In some embodiments, the targeting peptides described herein may bind to a target protein (e.g., Ly6/uPAR protein) with a dissociation constant (Kd) lower than 20 nM (e.g., 15 nM, 10 nM, 5 nm, 1 nm, or less than 1 nm). In some embodiments, the targeting peptides described herein may bind to a Ly6/uPAR protein (e.g., human Ly6) with a dissociation constant (Kd) lower than 20 nM (e.g., 15 nM, 10 nM, 5 nm, 1 nm, or less than 1 nm). The targeting peptide may specifically bind human Ly6. Alternatively, the targeting peptides may bind to Ly6 from different species (e.g., human, non-human primate, mouse, and/or rat). It should be appreciated that any method known in the art for measuring binding activity can be compatible with aspects of the disclosure.
Targeting peptides as described herein, in some embodiments, bind to a target protein expressed in the nervous system. In some embodiments, the targeting peptide binds to a target protein expressed in the CNS. In other embodiments, the targeting peptide binds to a target protein expressed in the PNS. In other embodiments, the targeting peptide binds to a target protein expressed in a hematopoietic lineage, such as an immune cell. Accordingly, in some embodiments, targeting peptides described herein mediate delivery of nucleic acids to the CNS or PNS. In other embodiments, targeting peptides described herein mediate delivery of nucleic acids to a hematopoietic lineage, such as an immune cell.
In some embodiments, targeting peptides described herein mediate delivery of nucleic acids. In other embodiments, targeting peptides described herein mediate delivery of other biologics, such as antibodies. In some embodiments, targeting peptides described herein mediate delivery of nucleic acids or other biologics, such as antibodies, across the blood brain barrier.
In some embodiments, the targeting peptide binds to a target protein involved in cell trafficking. In some embodiments, the targeting peptide binds to a target protein involved in endocytosis. In some embodiments, the targeting peptide binds to a target protein capable of being internalized or trafficked to certain organelles. In some embodiments, the targeting peptide binds to a target protein involved in trafficking to the Golgi. In some embodiments, the targeting peptide binds to a target protein involved in transcytosis in endothelial cells. In some embodiments, the targeting peptide binds to a target protein involved in transcytosis in epithelial cells.
In some embodiments, the targeting peptide binds to a target protein associated with a lipid raft. In some embodiments, the targeting peptide binds to a target protein comprising a GPI-anchor. In some embodiments, the targeting peptide binds to a target protein comprising a typical GPI-attachment signal, e.g., a polar segment that includes the GPI-attachment site followed by a hydrophobic segment located at the C-terminus of the protein.
In some embodiments, the targeting peptide binds to a CNS endothelium protein (e.g., CD59, Ly6E, GPIHBP1) and/or a cell surface protein (e.g., PRNP). In some embodiments, the targeting peptide binds to CD59. In some embodiments, the targeting peptide binds to Ly6E. In some embodiments, the targeting peptide binds to GPIHBP1. In some embodiments, the targeting peptide binds to PRNP.
In some embodiments, the targeting peptides bind to a GPI-anchored protein. In some embodiments, the genes encoding GPI-anchored proteins can include but are not limited to the genes listed in Table 20.
Targeting peptides as described herein, in some embodiments, bind to a target protein and one or more homologues of the target protein. In some embodiments, the target protein is selected from the group consisting of a human protein, a non-human primate protein (e.g., a marmoset protein), and a rodent protein (e.g., a mouse protein). In some embodiments, the homologous target protein is selected from the group consisting of a human protein, a non-human primate protein (e.g., a marmoset protein), and a rodent protein (e.g., a mouse protein).
In some embodiments, the targeting peptide binds to a target protein and at least one homologous target protein. For example, the targeting peptide binds a human target protein and a homolog of the target protein from a non-human primate (e.g., a marmoset). In some embodiments, the targeting peptide binds a human target protein and a homolog of the target protein from a rodent (e.g., a mouse). In some embodiments, the targeting peptide binds target protein from a non-human primate (e.g., a marmoset) and a homolog of the target protein from a rodent (e.g., a mouse).
In some embodiments, the targeting peptide binds to a target protein and at least two homologous target proteins. For example, the targeting peptide binds a human target protein, a homolog of the target protein from a non-human primate (e.g., marmoset), and a homolog of the target protein from a rodent (e.g., a mouse).
In some embodiments, the targeting peptide binds a human target protein and a homolog of the target protein from marmoset. In some embodiments, the targeting peptide binds a human target protein, a homolog of the target protein from marmoset, and a homolog of the target protein from mouse. In some embodiments, the targeting peptide binds a mouse target protein and a homolog of the target protein from marmoset.
Accordingly, aspects of the invention relate to recombinant AAV capsid proteins that bind to target proteins, such as Ly6/uPAR proteins, and that can be used to mediate transport of materials across the blood-brain barrier.
Methods provided herein, in some embodiments, are useful for identifying targeting peptides, or AAV capsid proteins harboring targeting peptides, that bind target proteins. In some embodiments, the target protein is ectopically expressed on cells. In some embodiments, the target protein is a recombinant protein. In some embodiments, the target protein is endogenously expressed in a cell. In some embodiments, methods provided herein are useful for identifying AAV capsids proteins that cross specific barriers (e.g., blood-brain barrier or gut epithelium). In some embodiments, methods provided herein are useful for identifying AAV9 capsids proteins.
Targeting peptides described herein can be identified by incubating a candidate targeting peptide (e.g., an AAV capsid protein containing a targeting peptide) with a Ly6/uPAR protein; and selecting the targeting peptide if it binds to the Ly6/uPAR protein. In some embodiments, the Ly6/uPAR protein is expressed in a cell, such as on the surface of the cell, and binding of the targeting peptide (e.g., an AAV capsid protein containing a targeting peptide) to the cell that expresses the target protein on the surface of the cell is detected. Such binding assays may be performed with purified target protein (e.g., a purified Ly6/uPAR protein), or with cells naturally expressing or transfected to express a target protein (e.g., a Ly6/uPAR protein). Binding assays may be performed in various formats, including in vitro, or in cell culture, and including high-throughput formats. In some embodiments, a targeting peptide (e.g., an AAV capsid protein containing a targeting peptide) described herein can be further evaluated by monitoring its ability to mediate transcytosis across the blood-brain barrier.
In some embodiments, the target protein (e.g., a Ly6/uPAR protein) is endogenously expressed in a cell. In some embodiments, a control cell does not express a Ly6/uPAR protein. For example, expression of a Ly6/uPAR protein in some embodiments is decreased in a control cell, such as by mutating or deleting expression of the gene encoding a Ly6/uPAR protein. In some embodiments, the level of binding between a targeting peptide and a target protein (e.g., a Ly6/uPAR protein) is compared between a cell that expresses a target protein (e.g., a Ly6/uPAR protein) and a cell that does not express a target protein (e.g., a Ly6/uPAR protein).
In some embodiments, the targeting peptide disclosed herein specifically binds to a target protein, such as a human Ly6/uPAR protein. Methods to determine such specific binding are well known in the art. A targeting peptide is said to exhibit “specific binding” or to “specifically bind to a target protein” if it reacts or associates more frequently, more rapidly, with greater duration and/or with greater affinity with a particular target protein than it does with alternative target proteins. A targeting peptide that specifically binds to a first target protein may or may not specifically or preferentially bind to a second target protein.
As such, “specific binding” or “preferential binding” does not necessarily require (although it can include) exclusive binding. Generally, but not necessarily, reference to binding means preferential binding.
An AAV capsid protein is said to exhibit “specific binding” or to “specifically bind” to a protein if it reacts or associates more frequently, more rapidly, with greater duration and/or with greater affinity with the protein than it does with alternative target proteins. An AAV capsid protein that specifically binds to a protein may or may not specifically or preferentially bind to the protein. In some embodiments, the protein is a protein of the Ly6/uPAR protein family attached to the surface of a cell. In some embodiments, the protein is a GPI-anchored protein attached to the surface of a cell. In some embodiments, the protein is i) a protein that exhibits luminal surface exposure on brain endothelium; ii) a protein that is localized within lipid micro-domains; and/or iii) a protein that exhibits recycling/intracellular trafficking capabilities. In some embodiments, specific binding is determined by comparison to a control. For example, a control may involve contacting an AAV capsid protein with a cell that does not express the protein or contacting an AAV capsid protein with a cell that expresses a different protein.
For example, methods disclosed herein can comprise providing an AAV capsid protein, incubating the AAV capsid protein with a cell that recombinantly expresses a target protein attached to the surface of the cell, and selecting the AAV capsid protein if it specifically binds to the target protein attached to the surface of the cell.
In some embodiments, methods disclosed herein can comprise providing an AAV capsid protein, incubating the AAV capsid protein with a target protein that was purified from cells expressing the target protein, and selecting the AAV capsid protein if it specifically binds to the target protein.
In some embodiments, methods comprise providing an AAV capsid protein, incubating the AAV capsid protein with a cell that recombinantly expresses a Ly6/uPAR protein attached to the surface of the cell, and selecting the AAV capsid protein if it specifically binds to the Ly6/uPAR protein attached to the surface of the cell.
In some embodiments, methods comprise providing an AAV capsid protein, incubating the AAV capsid protein with a Ly6/uPAR protein, and selecting the AAV capsid protein if it specifically binds to the Ly6/uPAR protein.
In some embodiments, methods comprise screening for an AAV capsid protein that can bind to a target protein, comprising providing a library of AAV capsid proteins, incubating the library of AAV capsid proteins with a cell that recombinantly expresses a target protein attached to the surface of the cell, isolating an AAV capsid protein that binds to the cells that recombinantly express the target protein on the cell surface, and identifying the sequence of the isolated AAV capsid protein.
In some embodiments, methods comprise screening for an AAV capsid protein that can bind to a target protein, comprising providing a library of AAV capsid proteins, incubating the library of AAV capsid proteins with a target protein (e.g., a recombinant target protein or a target protein purified from cells expressing the target protein), isolating an AAV capsid protein that binds to the target protein, and identifying the sequence of the isolated AAV capsid protein.
The sequence of the isolated AAV capsid proteins may be identified using any sequencing methods known in the art. In some embodiments, AAV capsid proteins are sequenced using short read sequencing technology. In some embodiments, AAV capsid proteins are sequenced using long read sequencing technology. In some embodiments, AAV capsid proteins are sequenced using next-generation sequencing (NGS) technology or whole genome sequencing (WGS) technology.
Methods provided herein may be performed using any type of cell. Examples of cells include, but are not limited to, mammalian cells, rodent cells, yeast cells, and bacterial cells. Examples of mammalian cells include, but are not limited to, CHO (Chinese Hamster Ovary), VERO, HeLa, CVI, COS, COS-7, BHK (baby hamster kidney), MDCK, CI 27, PC 12, HEK-293, PER C6, NSO, WI38, R1610, BALBC/3T3, HAK, SP2/0, P3x63-Ag3.653, BFA-1c1BPT, RAJI, and 293 cells.
Methods provided herein may be performed using purified endogenous proteins, tagged AviTag, C-tag, Calmodulin-tag, E-tag, FLAG, HA, poly-HIS, MYC, NE, Rho1D4, S-tag, SBP, Softag, Spot-tag, T7-tag, TC, Ty, V5, VSV, Xpress, Isopeptag, SpyTag, SnoopTag, DogTag, SdyTag, BCCP, GST, GFP, Halo, SNAP, CLIP, Maltose binding protein (MBP), Nus-tag, Thioredoxin-tag, Fc-tag, CRDSAT, SUMO-tag, B2M-tag. The recombinant proteins can be purified from any cell type. Examples of cells include, but are not limited to, mammalian cells, rodent cells, yeast cells, and bacterial cells. Examples of mammalian cells include, but are not limited to, CHO (Chinese Hamster Ovary), VERO, HeLa, CVI, COS, COS-7, BHK (baby hamster kidney), MDCK, CI 27, PC 12, HEK-293, PER C6, NSO, WI38, R1610, BALBC/3T3, HAK, SP2/0, P3x63-Ag3.653, BFA-1c1BPT, RAJI, and 293 cells
Methods provided herein, in some embodiments, are useful for delivering a nucleic acid (or another biologic, such as an antibody) to a target environment (e.g., the heart, the nervous system, or a combination thereof) of a subject in need. In some embodiments, methods for delivering a nucleic acid (or another biologic, such as an antibody) to a target environment comprise delivering the nucleic acid (or another biologic, such as an antibody) to the heart, the nervous system, or a combination thereof. In some embodiments, methods for delivering a nucleic acid (or another biologic, such as an antibody) to a target environment comprise delivering the nucleic acid (or another biologic, such as an antibody) to neurons, astrocytes, cardiomyocytes, or a combination thereof. In some embodiments, methods for delivering a nucleic acid (or another biologic, such as an antibody) to a target environment comprise delivering the nucleic acid (or another biologic, such as an antibody) to a hematopoietic lineage, such as an immune cell. Methods of use of AAV vectors are described further in U.S. Pat. No. 9,585,971 and US 2017/0166926, which are incorporated by reference herein in their entireties.
In some embodiments, methods for delivering a nucleic acid to a target environment of a subject in need comprise providing a composition comprising an AAV as described herein, and administering the composition to the subject. In some embodiments, methods for delivering a nucleic acid to a target environment of a subject in need thereof comprise providing a composition comprising an AAV comprising (i) a capsid protein that comprises an amino acid sequence that comprises at least 4 contiguous amino acids of a sequence provided herein, and (ii) a nucleic acid (or another biologic, such as an antibody) to be delivered to the target environment of the subject, and administering the composition to the subject.
Methods provided herein, in some embodiments, are useful for treating a disorder or defect in a subject. In some embodiments, the methods as described herein comprise delivering a protein, RNA, or DNA to a target environment of the subject. In some embodiments, the methods as described herein comprise administering an adeno-associated virus (AAV) vector to a target environment of the subject. In some embodiments, the AAV vector comprises a nucleic acid molecule that encodes a therapeutic protein or therapeutic RNA effective in treating the disorder or defect. In some embodiments, the AAV vector comprises a capsid protein comprising at least 4 contiguous amino acids from a sequence listed in Table 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or 19.
In some embodiments, the protein, RNA, or DNA can be a Ly6/uPAR protein or gene. In one embodiment, the Ly6/uPAR is LY6. In some embodiments, the LY6/uPAR is LY6A. In some embodiments, the LY6/uPAR is LY6C1. In some embodiments, the LY6/uPAR can be any protein that is suitable to be delivered to a target environment. In some embodiments, the LY6/uPAR receptor is a murine receptor. In some embodiments, the AAV targets the Ly6/uPAR protein. In some embodiments, the AAV targets any protein that are characterized as “Ly6-like” proteins.
In some embodiments, the protein, RNA, or DNA is delivered to the subject via intravenous administration or systemic administration. In some embodiments, the protein, RNA, or DNA is delivered in trans. In some embodiments, the protein, RNA, or DNA is delivered to the subject via a nanoparticle. In some embodiments, the RNA is delivered to the subject via a viral vector. In some embodiments, the RNA is delivered to the subject via any carriers suitable for delivering nucleic acid materials. In some embodiments, the protein is a purified protein. In some embodiments, the Ly6/uPAR gene is delivered to the subject via a viral vector.
In some embodiments, the protein or RNA is delivered prior to the administration of the AAV vector. The protein or RNA (e.g. Ly6a or Ly6c1), or an ectopic receptor can be expressed in the target environment transiently. In some embodiments, the AAV vector can be administered to the subjects 12 hours, 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 7 days, 8 days, 9 days, 10 days, inclusive of all ranges and subranges therebetween, after the protein or RNA is delivered to the target environment. In some embodiments, the AAV vector can then specifically interact with the ectopic receptor (e.g. Ly6a or Ly6c1) during the timeframe of expression of the delivered ectopic receptor. “Transiently,” “transient expression,” or “transient gene expression” as described herein refers to the temporary expression of proteins or genes that are expressed for a short time after a protein or a nucleic acid (e.g., plasmid DNA encoding an expression cassette), has been introduced into the target environment.
In some embodiments, the protein or RNA can be delivered to the target environment simultaneously with the AAV vector. In some embodiments, the protein or RNA can be delivered to the target environment with the AAV vector in any order or timeframe that is suitable for treating a disorder or defect in the subject as described herein. For example, the AAV vector can be administered a few minutes after the delivery of the protein or RNA.
Any nucleic acid may be delivered to a target environment of a subject according to methods described herein. In some embodiments, a nucleic acid to be delivered to a target environment of a subject comprises one or more sequences that would be of some use of benefit to the subject. In some embodiments, the nucleic acid is delivered to dorsal root ganglia, visceral organs, astrocytes, neurons, or a combination thereof of the subject.
In a non-limiting example, the nucleic acid or nucleic acid molecule to be delivered can comprise one or more of (a) a nucleic acid sequence encoding a trophic factor, a growth factor, or a soluble protein; (b) a cDNA that restores protein function to humans or animals harboring a genetic mutation(s) in that gene; (c) a cDNA that encodes a protein that can be used to control or alter the activity or state of a cell; (d) a cDNA that encodes a protein or a nucleic acid used for assessing the state of a cell; (e) a cDNA and/or associated guide RNA for performing genomic engineering; (f) a sequence for genome editing via homologous recombination; (g) a DNA sequence encoding a therapeutic RNA; (h) a shRNA or an artificial miRNA delivery system; and (i) a DNA sequence that influences the splicing of an endogenous gene.
Any subject in need may be administered a composition comprising an AAV according to methods described herein. In some embodiments, a subject in need or a subject having a disorder or defect is a subject suffering from or at a risk to develop one or more diseases. In some embodiments, the subject in need is a subject suffering from or at a risk to develop one or more of chronic pain, cardiac failure, cardiac arrhythmias, Friedreich's ataxia, Huntington's disease (HD), Alzheimer's disease (AD), Parkinson's disease (PD), Amyotrophic lateral sclerosis (ALS), spinal muscular atrophy types I and II (SMA I and II), Friedreich's Ataxia (FA), Spinocerebellar ataxia, lysosomal storage disorders that involve cells within the CNS.
Any suitable method may be used for administering a composition comprising an AAV described herein. In some embodiments, the composition comprising the AAV is administered to the subject via intravenous administration. In some embodiments, the composition comprising the AAV is administered to the subject via or systemic administration.
Aspects of the present disclosure provide, in some embodiments, a pharmaceutical composition comprising an AAV vector as described herein and a pharmaceutically acceptable carrier. Suitable carriers may be readily selected by one of skill in the art in view of the indication for which the AAV vector is directed. For example, one suitable carrier includes saline, which may be formulated with a variety of buffering solutions (e.g., phosphate buffered saline). Other exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The selection of the carrier is not a limitation of the present disclosure. Pharmaceutical compositions comprising AAV vectors are described further in U.S. Pat. No. 9,585,971 and US 2017/0166926, which are incorporated by reference herein in their entireties.
In some embodiments, the pharmaceutical composition comprising an AAV vector comprises other pharmaceutical ingredients, such as preservatives, or chemical stabilizers. Suitable exemplary preservatives include chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, and parachlorophenol. Suitable chemical stabilizers include gelatin and albumin.
Methods described herein comprise administering AAV vector in sufficient amounts to transfect the cells of a desired tissue (e.g., heart, brain) and to provide sufficient levels of gene transfer and expression without undue adverse effects. Examples of pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to the selected organ, oral, inhalation, intraocular, intravenous, intramuscular, subcutaneous, intradermal, intratumoral, and other parental routes of administration. Routes of administration may be combined, if desired.
The dose of AAV required to achieve a particular “therapeutic effect,” e.g., the units of dose in genome copies/per kilogram of body weight (GC/kg), will vary based on several factors including, but not limited to: the route of AAV administration, the level of gene or RNA expression required to achieve a therapeutic effect, the specific disease or disorder being treated, and the stability of the gene or RNA product. One of skill in the art can readily determine a AAV dose range to treat a patient having a particular disease or disorder based on the aforementioned factors, as well as other factors.
An effective amount of AAV vector is an amount sufficient to infect an animal or target a desired tissue. The effective amount will depend primarily on factors such as the species, age, weight, health of the subject, and the tissue to be targeted, and may thus vary among animal and tissue. For example, an effective amount of AAV is generally in the range of from about 1 ml to about 100 ml of solution containing from about 109 to 1016 genome copies. In some cases, a dosage between about 1011 to 1013 AAV genome copies is appropriate. In some embodiments, an effective amount is produced by multiple doses of AAV.
In some embodiments, a dose of AAV is administered to a subject no more than once per calendar day (e.g., a 24-hour period). In some embodiments, a dose of AAV is administered to a subject no more than once per 2, 3, 4, 5, 6, or 7 calendar days. In some embodiments, a dose of AAV is administered to a subject no more than once per calendar week (e.g., 7 calendar days). In some embodiments, a dose of AAV is administered to a subject no more than bi-weekly (e.g., once in a two calendar week period). In some embodiments, a dose of AAV is administered to a subject no more than once per calendar month (e.g., once in 30 calendar days). In some embodiments, a dose of AAV is administered to a subject no more than once per six calendar months. In some embodiments, a dose of AAV is administered to a subject no more than once per calendar year (e.g., 365 days or 366 days in a leap year). In some embodiments, a dose of rAAV is administered to a subject no more than once per two calendar years (e.g., 730 days or 731 days in a leap year). In some embodiments, a dose of AAV is administered to a subject no more than once per three calendar years (e.g., 1095 days or 1096 days in a leap year).
Formulation of pharmaceutically-acceptable excipients and carrier solutions is well-known to those of skill in the art, as is the development of suitable dosing and treatment regimens for using the particular compositions described herein in a variety of treatment regimens. Typically, these formulations may contain at least about 0.1% of the active compound or more, although the percentage of the active ingredient(s) may, of course, be varied and may conveniently be between about 1 or 2% and about 70% or 80% or more of the weight or volume of the total formulation. Naturally, the amount of active compound in each therapeutically-useful composition may be prepared is such a way that a suitable dosage will be obtained in any given unit dose of the compound. Factors such as solubility, bioavailability, biological half-life, route of administration, product shelf life, as well as other pharmacological considerations will be contemplated by one skilled in the art of preparing such pharmaceutical formulations, and as such, a variety of dosages and treatment regimens may be desirable.
The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms. In many cases the form is sterile and fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.
The AAV vector compositions disclosed herein may also be formulated in a neutral or salt form. Pharmaceutically-acceptable salts, include the acid addition salts (formed with the free amino groups of the protein) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms such as injectable solutions, drug-release capsules, and the like.
As used herein, “carrier” includes any and all solvents, dispersion media, vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Supplementary active ingredients can also be incorporated into the compositions. The phrase “pharmaceutically-acceptable” refers to molecular entities and compositions that do not produce an allergic or similar untoward reaction when administered to a host.
Delivery vehicles such as liposomes, nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like, may be used for the introduction of the compositions of the present disclosure into suitable host cells. In particular, the AAV vector delivered transgenes may be formulated for delivery either encapsulated in a lipid particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like.
Such formulations may be preferred for the introduction of pharmaceutically acceptable formulations of the nucleic acids or the AAV constructs disclosed herein. The formation and use of liposomes is generally known to those of skill in the art. Recently, liposomes were developed with improved serum stability and circulation half-times (U.S. Pat. No. 5,741,516). Further, various methods of liposome and liposome like preparations as potential drug carriers have been described (U.S. Pat. Nos. 5,567,434; 5,552,157; 5,565,213; 5,738,868 and 5,795,587).
Liposomes are formed from phospholipids that are dispersed in an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also termed multilamellar vesicles (MLVs). MLVs generally have diameters of from 25 nm to 4 μm. Sonication of MLVs results in the formation of small unilamellar vesicles (SUVs) with diameters in the range of 200 to 500 A, containing an aqueous solution in the core.
Alternatively, nanocapsule formulations of the AAV vector may be used. Nanocapsules can generally entrap substances in a stable and reproducible way. To avoid side effects due to intracellular polymeric overloading, such ultrafine particles (sized around 0.1 μm) should be designed using polymers able to be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for use.
The number of diseases that are potentially treatable by gene therapy is rapidly expanding. AAV vectors are proving to be safe, versatile vehicles for in vivo gene therapy applications (1-3). However, delivery challenges impede the application of gene therapy, particularly in the context of the brain, which is protected by the blood-brain barrier (BBB). To improve gene delivery across the central nervous system (CNS), AAV capsids have been engineered using in vivo selection and directed evolution (4-11). Previously engineered AAV9 variants include AAV-PHP.B (5) and its further evolved, more efficient variant, AAV-PHP.eB (4), that cross the adult BBB and enable efficient gene transfer to the mouse CNS. Since then, AAV-PHP.B and AAV-PHP.eB have been applied across a wide range of neuroscience experiments in mice (4, 12, 13), including genetic deficit correction (14, 15) and neurological disease modeling (16).
One critical question has been whether AAV-PHP.B and AAV-PHP.eB can facilitate efficient CNS gene transfer in other species. The enhanced CNS tropism of AAV-PHP.B and AAV-PHP.eB appears to extend to rats (17, 18), whereas studies testing AAV-PHP.B or related capsids in nonhuman primates (NHPs) have yielded differing outcomes (19-21). Surprisingly, the enhanced CNS tropism of AAV-PHP.B (5, 12, 15-18, 22, 23) was starkly absent in BALB/cJ mice (19). These findings indicate that the ability of AAV-PHP.B to cross the BBB is affected by genetic factors that vary by species and mouse line. Studies described herein leverage this strain-dependence to identify LY6A as the cellular receptor responsible for the enhanced CNS tropism exhibited by the AAV-PHP.B capsid family. It was demonstrated that the LY6A-mediated mechanism of transduction is independent of known AAV9 receptors and is a unique means for AAV-PHP.B capsids to cross the mouse BBB. This has widespread implications for guiding the selection of disease models in studies utilizing AAV-PHP.B capsids, as well as ongoing efforts to rationally engineer AAVs that cross the BBB in other species.
Access to whole-genome sequencing data for 36 commercially available mouse lines made it possible to estimate the number of lines necessary to produce an adequately short list of candidate variants. Starting from the known permissive C57BL/6J non-permissive BALB/cJ strains, the Hail software library was used to simulate permissivity phenotypes among other mouse lines and compute the number of candidate variants in both the high and high-and-medium predicted functional impact classes. First, a permissivity frequency was sampled from a Beta(2, 2) distribution based on the two known mouse phenotypes. Second, one or more additional mouse lines were sampled from 26 commercially available lines (Table 1). Third, phenotypes for these mice were simulated using the generated permissivity frequency. Finally, the number of variants with perfect allelic segregation between simulated permissive and non-permissive lines was calculated and recorded. This simulation model ran for 500 iterations for 3 mice up to 12 mice, providing a distribution over the number of candidate variants at each mouse sample size, and enabling an informed decision about how many mouse lines to order and test in parallel.
AKR/J
BTBR/T+/Itpr3tf/J
C57BL/6J
C57BL/6N
C57BL/10J
C57BR/cdJ
C57L/J
DBA/1J
DBA/2J
FVB/NJ
LP/J
NZW/LacJ
MOLF/EiJ
129S1/SvlmJ
A/J
BALB/cJ
CAST/EiJ
CBA/J
C3H/HeJ
LEWES/EiJ
NOD/ShiLtJ
NZB/B1NJ
NZO/HlLtJ
PWK/PhJ
SPRET/EiJ
WSB/EiJ
The AAV-PHP.eB Rep-Cap trans plasmid was generated by gene synthesis (GenScript). AAV9, AAV-PHP.B, AAV-PHP.B2, and AAV-PHP.B3 were generated by replacing the AAV-PHP.eB variant region with that of AAV9, AAV-PHP.B, B2, or B3 using isothermal HiFi DNA Assembly (NEB). AAV-CAG-NLS-GFP and AAV-CAG-NLS-mScarlet vectors were synthesized using the N-terminal SV40 NLS sequence present in the Addgene plasmid #99130 as a gBlock (IDT) and GFP was subcloned in place of mScarlet to produce the NLS-GFP cassette. Ly6a and Ly6c1 (splice variant 1) cDNAs were synthesized as gBlocks (IDT). Reporter and Ly6 expression vectors were cloned into an AAV-CAG-WPRE-hGH pA backbone obtained from Addgene (#99122). The CMV-SaCAS9 vector (AAV-CMV::NLS-SaCas9-NLS-3×HA-bGHpA; U6::BsaI-sgRNA) was obtained from Dr. Feng Zhang through Addgene (#61591). sgRNAs specifically targeting Ly6a or Ly6c1 were cloned after the U6 promoter using a single bridge oligo for each reaction as recommended (HiFi DNA Asssembly, NEB). The Broad GPP sgRNA tool for SaCAS9 was used to identify suitable SaCAS9 target sites (1).
The following primers were used for Ly6a sgRNA cloning:
The following primers were used for Ly6c1 sgRNAs cloning:
qPCR primers used for biodistribution and in vitro binding:
AAV-PHP.eB vectors with sgRNA sequences target Ly6a and Ly6c1 were generated and purified to knockout respective gene in C57BL/6 mouse primary brain microvascular endothelial cells (CellBiologics, Cat.# C57-6023). AAV vectors (1×106 vg per cell) were used to transduce cells every 3 days for 3 times to achieve higher knockout efficiency. Cells were passaged as necessary.
HEK293T/17 (CRL-11268), Pro5 (CRL-1781), Lec2 (CRL-1736), and Lec8 (CRL-1737) were obtained from ATCC. BMVEC cells were obtained from Cell Biologics (C57-6023) and cultured as directed by the manufacturer.
Recombinant AAVs were generated by triple transfection of HEK293T cells (ATCC CRL-11268) using polyethylenimine (PEI) and purified by ultracentrifugation over iodixanol gradients as previously described (2).
The virus overlay assay was performed as previously reported (3) with some modifications. Briefly, protein lysates were separated on Bolt 4-12% Bis-Tris Plus gels and transferred onto nitrocellulose membranes. After incubation with AAV9 or PHP.eB at 5e11 vg/ml, the membranes were fixed with 4% PFA at room temperature for 20 minutes to crosslink the interaction between capsid and its target protein followed by 2M HCl treatment at 37° C. for 7 minutes to expose the internal capsid epitope for detection. The blots were then rinsed and incubated with anti-AAV VP1/VP2/VP3 (1:20; American research products, Inc. cat#03-65158) followed by incubation with a horseradish peroxidase (HRP)-conjugated secondary antibody at 1:5000. The detection of binding was by SuperSignal West Femo Maximum Sensitivity Substrate under a Bio-Rad ChemiDoc TM MP system #1708280.
All procedures were performed as approved by the Broad Institute IACUC or Massachusetts General Hospital IACUC (AAVR experiments). AKR/J (000648), BALB/cJ (000651), CBA/J (000656), CAST/EiJ (000928), C57Bl/6J (000664), C57BL/J (000668), DBA/2J (000671), FVB/NJ (001800), LP/J (000676), MOLF/EiJ (000550), NOD/ShiLtJ (001976), NZB/B1NJ (000684), and PWK/PhJ (003715) were obtained from The Jackson Laboratory (JAX). AAVR mice were a generous gift from Dr. J.E. Carette (Stanford) to Dr. Balazs. Recombinant AAV vectors were administered intravenously via the retro-orbital sinus in young adult male or female mice. Mice were randomly assigned to groups based on predetermined sample sizes. No mice were excluded from the analyses. Experimenters were not blinded to sample groups.
Mice were anesthetized with Euthasol (Broad) or ketamine (MGH) and transcardially perfused with phosphate buffered saline (PBS) at room temperature followed by 4% paraformaldehyde (PFA) in PBS. Tissues were post-fixed overnight in 4% PFA in PBS and sectioned by vibratome. IHC was performed on floating sections with antibodies diluted in PBS containing 10% donkey serum, 0.1% Triton X-100, and 0.05% sodium azide. Primary antibodies were incubated at room temperature overnight. The sections were then washed and stained with secondary (Alexa-conjugated antibodies, 1:1000) for 4 hours or overnight. Primary antibodies used were mouse anti-AAV capsid (1:20; American Research Products, 03-65158, clone B1), LY6A (1:250; BD Bioscience, 553333 or ThermoFisher, 701919), LY6C1 (1:250; Millipore-Sigmam MABN668), Glut1 (Millipore Sigma, 07-1401). To expose the internal B1 Capsid epitope in intact capsids, tissue sections or cells on coverslips were treated for 15 or 7 minutes, respectively, with 2M HCl at 37° C. The treated samples were then washed extensively prior to addition of the primary antibody.
5- to 6-week-old C57Bl/6J mice, BALB/cJ mice AAVR WT or AAVR KO mice (FVB/NJ background) were injected intravenously with 1011 vg of AAV vector packaged into the indicated capsid. One or two hours after injection, the mice were perfused with PBS and tissues were collected and frozen at −80° C. Samples were processed for AAV genome biodistribution analysis and normalized to the number of copies of mouse genomes using qPCR for the GFP element and mouse glucagon by qPCR as previously described (2). To visualize the capsid distribution, mice were perfused with 4% PFA after dosing with AAV vector and brain were section into 100 micrometer and labeled with indicated antibodies.
Images were taken on an Axio Imager.Z2 Basis Zeiss 880 laser scanning confocal microscope fitted with the following objectives: PApo 10×/0.45 M27, Plan-Apochromat 20×/0.8 M27, or Plan-APO 40×/1.4 oil DIC (UV) VIS-IR. All images compared within an experiment were acquired and processed under identical conditions.
Ly6 family members (0.5 μg/well) were transfected into HEK293T cells (3×105/well) using PEI or into CHO cells (1.5×105/well) with lipofectamine 3000 reagent (ThermoFisher, L3000001) in 24-well plates. 48 hours later, the cells were chilled to 4° C. and the media was exchanged with fresh cold media containing the indicated recombinant AAV (105 copies per cell). One hour later, cells were washed with cold PBS for 3 times, then fixed with 4% PFA for IHC or lysed for genomic DNA extraction and qPCR analyses.
For BMVECs, 2×104 cells/well were seeded in 12 well plate the day before exposure to virus. The assay was performed as above except AAV vectors were added at 106 copies/cell.
HEK293T/17 cells were seeded at 2×107 per T75 flask 12-24 hours prior to being transfected with 20 μs of cDNA encoding eGFP, Ly6a, or Ly6c1. At 24-48 hours post transfection, the cells were incubated with an AAV9 K449R library (7-mer insertion between amino acids 588 and 589) at 1011 vg/T75 at 4° C. for 2 hours. Afterwards, the media was exchanged with PBS for 3 times in order to wash away unbound viruses. The viruses that remained bound to the cells were extracted with TRIzol (Invitrogen) or with whole genomic DNA isolation reagents (DNeasy, Qiagen) in order to isolate their viral genomes. The viral genomes were then prepared for next generation sequencing (NGS) to quantify the enrichment of peptides that conferred increased capsid ability to bind cells expressing the target protein.
Ly6 family members (0.1 μg/well) were transfected into the indicated cells (HEK293/17: 4×105/well; CHO: 2.5×104/well, BMVECs: 5×103/well) in 96-well plates (PerkinElmer, 6005680) in triplicate. 48 hours later, cells were transduced with AAV-CAG-GFP-2A-Luciferase-WPRE packaged into AAV9 or AAV-PHP.eB. Luciferase assays were performed with Britelite plus Reporter Gene Assay System (PerkinElmer, 6066766). Luciferase activity was reported as relative light units (RLU) as raw data or normalized to non-transfected control wells transduced with AAV9, or a control transduced without a sgRNA (
Microsoft Excel and Prism 8 were used for data analysis. For the comparison between AAV9 and AAV-PHP.eB biodistribution, a group size of 6 per group (3 males and 3 females) was used based on prior data that indicated a large effect size (mean±SEM). No animals or samples were excluded from the analysis.
To construct an adeno-associated virus (AAV) RNA expression system for the selection of functional AAV vectors and the recovery of AAV capsid transcripts, the ubiquitous promoter cytomegalovirus (CMV) was cloned into a recombinant AAV plasmid containing inverted terminal repeats from adeno-associated virus type 2 (AAV2). Downstream of the CMV promoter a synthetic intron containing a consensus donor motif (CAGGTAAGT), consensus splice motif (TTTTTTCTACAGGT) (SEQ ID NO: 30229) and branch point sequence was cloned. Downstream of the artificial intron, the AAV5 P41 promoter along with the 3′ end of the AAV2 Rep gene, which includes the splice donor sequences for the capsid RNA was cloned. The capsid gene splice donor sequence in AAV2 Rep was modified from a non consensus donor sequence CAGGTACCA to a consensus donor sequence CAGGTAAGT. The wildtype adeno-associated virus serotype 9 (AAV9) capsid gene sequence was synthesized with nucleotide changes at S448 (TCA to TCT, silent mutation), K449R (AAG to AGA), and G594 (GGC to GGT, silent mutation) to introduce XbaI and AgeI restriction enzyme recognition sites for library fragment cloning. The AAV2 polyadenylation sequence was replaced with a simian virus 40 (SV40) late polyadenylation signal to terminate the capsid RNA transcript.
To assemble an oligonucleotide Library Synthesis Pool (oligo pool; Agilent) into an AAV genome, the oligo pool was amplified and extended using 10 ng of a DNA plasmid template containing a fragment of AAV9 and a forward primer Assembly-XbaI-F. Specifically, the reaction conditions were as follows: approximately 5 pM of the OLS pool, 0.5 μM of primer Assembly-XbaI-F for 5 cycles using Q5® High-Fidelity 2× Master Mix (NEB #M0492S) following the manufacturer's protocol. After the 5-cycle amplification and extension of the oligo pool, the reaction was spiked with 0.5 μM of primer Assembly_AgeI-R and amplified for an additional 25 cycles. The PCR product was then purified using Agencourt AMPure XP SPRI paramagnetic beads (Beckman Coulter #A63880) or column purified using a Zymo Research DNA Clean & Concentrator-5 kit (Zymo Research #D4013) following the manufacturer's protocol.
For generating 7-mer NNK libraries, the hand-mixed primer (Assembly-NNK-AAV9-588; IDT) encoding a 7mer peptide insertion between AA 588 and 589 of AAV9 was used as the reverse primer along with the Assembly-XbaI-F oligo as a forward primer in a PCR reaction using Q5® High-Fidelity 2× Master Mix (NEB #M0492S) following the manufacturer's protocol for 30 cycles with 10 ng plasmid containing AAV9 as the template. The oligo pool or 7-mer NNK PCR products were assembled into the RNA expression plasmid with previous described methods in Deverman et al. Nature Biotechnology 2016.
Recombinant AAVs were generated and titered with previously described methods in Deverman et al. Nature Biotechnology 2016.
To isolate total RNA containing AAV Cap transcripts, a RNeasy Mini Kit (Qiagen #74104), along with a QIAshredder kit (Qiagen #79654) and a RNase-Free DNase kit (Qiagen #79254) was used following the manufacturer's protocol. In some variations, TRIzol™ Reagent (Invitrogen™ #15596026) was used to isolate total RNA from homogenized tissue following the manufacturer's protocol prior to additional cleanup with the RNeasy Mini, QIAshredder and RNase-Free DNase kits listed above. Isolated RNA was resuspended in RNase free water and stored in −80 C conditions until conversion to cDNA.
RNA was reverse transcribed to cDNA using Maxima H Minus Reverse Transcriptase (Thermo Scientific™ #EP0752) following the manufacturer's protocol with an anchored oligo dT primer (IDT #51-01-15-08).
The cDNA was prepared for next-generation sequencing (NGS) with two rounds of polymerase chain reaction (PCR). In the first round of PCR (PCR1), a set of forward primers (Table 1) and reverse primers (Table 2) containing gene specific priming regions and a overhang sequence containing a portion of the Illumina Read 1 sequence (forward primers) or Illumina Read 2 sequence (reverse primers) were used to selectively amplify AAV genomes from the cDNA with Q5® High-Fidelity 2× Master Mix (NEB #M0492S), with 0.5 μM of each primer.
The forward and reverse primers contain zero or up to eight N nucleotides inserted in between the gene specific priming region and the partial Illumina Read 1 (forward primers) or Read 2 (reverse primers) overhang sequence. This is to introduce diversity into amplicon during NGS and to offset the constant region of the AAV genome to improve cluster diversity and to increase sequencing quality during Illumina NGS. The forward and reverse primers were paired to produce amplicons of the same size (i.e., SEQ1_F was paired with SEQ1_R, SEQ2_F was paired with SEQ2_R, etc.).
The number of cycles performed in PCR1 was chosen to stop before the exponential amplification phase and was determined with qPCR using FastStart Universal SYBR Green Master (Millipore Sigma #4913850001) or Q5® High-Fidelity 2× Master Mix (NEB #M0492S) with SYBR® Green I nucleic acid stain (VWR #12001-798) diluted from 10,000× to 8× per reaction. The qPCR primers used were SEQ9_F and SEQ1_R with 1 μL cDNA input.
Following PCR1, the amplified DNA was cleaned up using Agencourt AMPure XP SPRI paramagnetic beads (Beckman Coulter #A63880) or column purified using a Zymo Research DNA Clean & Concentrator-5 kit (Zymo Research #D4013) following the manufacturer's protocol. PCR1 samples were then barcoded for Illumina NGS with NEBNext Multiplex Oligos for Illumina Dual Index Primers Set 1 and 2 (NEB #E7600S and #E7780S) with 2 μL PCR1 input and amplified for 5 cycles to generate PCR2 products. The PCR2 products were again purified using Agencourt AMPure XP SPRI paramagnetic beads or column purified using a Zymo Research DNA Clean & Concentrator-5 kit (Zymo Research #D4013) following the manufacturer's protocol.
The concentrations of purified PCR2 samples were determined using a Qubit™ dsDNA HS Assay Kit (Invitrogen™ #Q32854) then diluted and pooled according to the Illumina Nextseq System Denature and Dilute Libraries Guide or MiSeq System Denature and Dilute Libraries Guide along with 10-15% PhiX Control v3 (Illumina #FC-110-3001) spiked in. The pooled samples were quantified and checked for correct sizes using an Agilent High Sensitivity DNA Kit (Agilent #5067-4626) on an Agilent 2100 Electrophoresis Bioanalyzer.
Then samples were either sequenced on an Illumina NextSeq or Miseq machine using a NextSeq 500/550 High Output Kit v2.5 (150 Cycles) (Illumina #20024907), NextSeq 500/550 Mid Output Kit v2.5 (150 Cycles) (Illumina #20024904) or MiSeq Reagent Kit v3 (150-cycle) #MS-102-3001) with the indexes read from both ends after 150 read cycles.
Following NGS, sequences were aligned to an AAV9 template with 21 N nucleotides insertion between amino acid 588 and 589 to represent the 7mer insertion using Bowtie 2. Further post processing was performed using SAMtools, Python 3, NumPy and Pandas. Briefly, the flanking regions up to the 7mer (prefix) and after the 7mer (suffix) region were clipped. The resulting sequence was checked to be 21 bp in length. The nucleotide sequences were converted to amino acid sequences and exported using Pandas. Read counts associated with each nucleotide sequence were converted to normalized read counts (reads per million) to adjust for sequencing depth differences between samples. Enrichment scores for each sequence are calculated by log 2 (normalized read count post screening/normalized read count in the initial virus library). Primersequences are listed below.
The coding regions including the signal peptide and mature protein sequences were amplified with the primers below and inserted into pCMV6-XL4 FLAG-NGRN-Fc (Addgene #115773) with EcoRV and XbaI sites.
26 million HEK293-FT cells were seated per 150 mm plate the day before transfection. The next day, complete media was changed to Pro293™a-CDM™ media with two brief rinses with Pro293™a-CDM™ media to remove serum. Cells were transduced with PEI and 40 ug DNA per plate a few hours after media change. The media was replaced 18 hours after transfection. At the second day post-transfection, cell supernatants containing secreted recombinant protein were passed through a 0.45-mm pore size filter and purified on Protein A-Sepharose. 200 ul Protein A-Sepharose were incubated with 100 ml cell culture supernatant overnight at 4 C with shaking. The next day, the beads were collected and washed 3 times with 10 ml of PBS, and the proteins were eluted in 200 ul of 100 mM glycine (pH2.7). Then 1/10 volume of 1M Tris (pH8.8) was added to the eluted protein fractions to neutralize the pH.
Virus Pull Down with Purified Fc-Tagged Protein
0.5 or 1 ug of purified recombinant Fc fusion proteins or Fc control protein was incubated with 10 ul magnetic Protein A beads for 4 hrs at 4 C with rotation in PBS with 0.05% Tween-20. Supernatant was removed and 1E10 vg of an AAV9 K449R 7-mer virus library in PBS was added into beads pellet and incubated overnight. The next day, after three washes, bound virus were released with Proteinase K treatment and the viral DNA genomes were purified with Agencourt AMPure XP. The viral genomes were then amplified by PCR and processed and indexed for NGS.
The dramatic difference in the CNS tropism of AAV-PHP.B in C57BL/6J versus BALB/cJ mice (19) extends to AAV-PHP.eB (
These results prompted a search for candidate genes associated with enhanced AAV-PHP.B CNS transduction. Studies were aimed to test the AAV-PHP.B capsids across a panel of mouse lines, and harness the natural genetic variation between mice to identify the genetic variants and, subsequently, candidate gene(s) responsible for the difference in CNS transduction by AAV-PHP.eB. Using the open-source software Hail (24), a genome-wide database of variants across 36 mouse lines (25) was analyzed. Starting from millions of genetic variants, comprised of single-nucleotide polymorphisms (SNPs) as well as insertions and deletions (indels), the analysis was narrowed to variants predicted to affect expression, splicing, or protein coding regions (Table 2). As in a genetic linkage study, the aim was to rapidly identify variants whose alleles segregate across mice with the observed phenotype (permissive or non-permissive). Using a statistical simulation framework, it was estimated that 12 mouse lines would be sufficient to narrow our search to ˜10 high/medium impact variants (
Accordingly, mice from 13 commercially available lines were acquired, including C57BL/6J and BALB/cJ, and administered 1011 vector genomes (vg)/animal of AAV-PHP.eB, which packaged an AAV genome encoding an enhanced green fluorescent protein (GFP) with a nuclear localization signal (NLS-GFP). As observed with AAV-PHP.B, intravenous administration of AAV-PHP.eB resulted in GFP expression throughout the brain of permissive lines such as C57BL/6J, but not those of nonpermissive mice such as BALB/cJ; seven permissive and six nonpermissive lines were identified (
Hail analysis reduced the number of high or medium impact gene variants to missense SNPs in the related Ly6a and Ly6c1 genes (
Based on these findings, the possibility that genetic variation within Ly6a or Ly6c1 is associated with the differential AAV-PHP.eB tropism across mouse lines was analyzed. Immunohistochemistry (IHC) assays for LY6A and LY6C1 in C57BL/6J and BALB/cJ mice were performed to assess their expression and localization (
Taken together, these results suggest an association between Ly6a gene variants, the abundance of specific forms of LY6A within brain endothelial cells, and permissivity to transduction by AAV-PHP.eB.
Whether LY6A and/or LY6C1 are necessary for the ability of AAV-PHP.eB to bind and transduce CNS endothelial cells was analyzed. To achieve this, Ly6a and Ly6c1 knockout experiments were performed in brain microvascular endothelial cells (BMVECs) from C57BL/6J mice, which express both genes and are more efficiently transduced by AAV-PHP.eB than by AAV9 (
To determine whether AAV-PHP.eB directly binds LY6A and whether either of the missense SNPs in the BALB/cJ Ly6a gene (
Whether ectopic Ly6a expression is sufficient for increased binding and transduction by AAV-PHP.eB was investigated. HEK293T cells were transiently transfected with cDNAs encoding C57BL/6J Ly6a or Ly6c1 and the effects on binding and transduction by AAV-PHP.B capsids was evaluated. Remarkably, Ly6a expression resulted in a >50-fold increase in binding by each of the AAV-PHP.B capsids to HEK293T cells, but did not increase binding by AAV9 (
To determine whether LY6A acts solely as a primary attachment factor or has additional roles in promoting the internalization and trafficking of AAV-PHP.eB, it was explored whether AAV-PHP.eB binding and transduction are dependent on known receptor interactions. AAVs typically use a cellular receptor for attachment and secondary receptors for internalization and intracellular trafficking (39); AAV9 utilizes galactose as an attachment factor (40), and, like most AAVs, relies on the AAV receptor (AAVR) for intracellular trafficking and transduction (37). First, it was tested whether LY6A influences AAV-PHP.eB binding to Chinese Hamster ovary (CHO) cells with differing levels of galactose on their surface glycoproteins; Pro5 CHO derivative cells were previously used to map the galactose binding site on the AAV9 capsid (40). The Lec2 and Lec8 models derived from the parental Pro5 CHO cell line were utilized: Lec2 cells expose excess galactose whereas Lec8 cells are unable to add galactose to the glycoproteins (41). AAV9 and AAV-PHP.B similarly bind and transduce Lec2 cells more efficiently than Lec8 or Pro5 cells (
However, this process may not require AAVR, which is essential for the intracellular trafficking of numerous AAV capsids including AAV9 (42). To test this possibility, AAVR WT and KO FVB/NJ mice (42) were injected with AAV-PHP.eB, and their brains were collected two hours later for capsid detection. As seen in C57BL/6J mice, AAV-PHP.eB capsids were detected along the vasculature of AAVR KO and control mice (
The >30-fold increase in AAV-PHP.eB binding and transduction of cells from three different species following ectopic LY6A expression highlighted the potential application of this assay for screening or selecting novel capsids that bind to specific cell surface proteins (
Taken together, these results indicate that the in vitro ectopic expression assay can rapidly and quantitatively identify binding interactions between AAV capsids and specific cell surface proteins.
TVMAPFK
TRDLPFK
TLAYPFK
SLTTPFK
TGTSPFK
SNDRPFI
SHTKPFA
SINKPFV
TDTRPFI
TSMKPFL
SEGRPFI
SIARPFV
TSFKPFP
TTMKPFN
SVSKPFS
SSEKPFQ
TKSTPFI
SLSKPFS
YLKGYSA
To validate and test 7-mer modified AAV vectors that selectively bind HEK293T cells that express Ly6a, Ly6c1, marmoset CD59, or human CD59, a new synthetic oligo pool library was generated. The oligo pool (Agilent) library comprised 7-mer-modified AAV variants that were specifically enriched on HEK293T expressing one of the above genes. In addition, in cases where motifs were found within the enriched sequences, 7-mers that maintained the motif but introduced diversity adjacent to the motif were also generated. For example, X-(K/R)-(A/D/E/F/G/H/I/L/M/N/P/Q/S/T/V/W/Y)-G-Y-S-(Q/S/T) (SEQ ID NO: 30259) was generated, where X is any amino acid, based on a common motif identified through screening for 7-mer modified capsids that were selectively enriched on HEK293T cells expressing Ly6c1. Single site-saturation mutagenesis was also used to explore which amino acids within the 7-mer are critical for the selected activity of several highly enriched sequences that did not share an obvious motif with other enriched sequences. Sequences were pooled into a single oligo pool library along with several reference sequences with characterized tropisms (e.g., AAV-PHP.B2: SVSKPFL (SEQ ID NO: 1906); AAV-PHP.B3: FTLTTPK (SEQ ID NO: 1908); AAV-PHP.A: YTLSQGW (SEQ ID NO: 10689). Two copies of each 7-mer were synthesized using different codon sets. The library contained just under 60,000 unique oligos.
The oligo pool was used to generate a PCR fragment that was cloned (as described in Deverman et al NBT 2016) into a novel AAV capsid selection plasmid. This AAV genome provides selective pressure for functional AAV variants (i.e., those that transcribe the viral capsid gene in vivo). In between the CMV enhancer and AAV p41 promoter contains a synthetic intron with a consensus donor motif (CAGGTAAGT), consensus splice motif (TTTTTTCTACAGGT) and branch point sequence. This library vector comprises a CMV enhancer upstream of the AAV p41 promoter and Cap gene. The AAV-capsid library expresses the AAV capsid gene both during virus production as well as following transduction in cultured cells and in vivo. To recover the functional capsids, cellular/tissue RNA was isolated, the capsid RNA was reverse transcribed into cDNA, and the capsid sequence containing the 7-mer was amplified by PCR. By recovering and sequencing viral RNAs, this approach applied selective pressure for functional, transcriptionally active AAV vectors.
An AAV library was generated from this oligo pool library and delivered it intravenously to two C57BL/6J and two BALB/cJ mice. It was found that more than 100 of the sequences screened on Ly6a or Ly6c1 expressing cells (or sequences derived from those sequences as described above) that were enriched in at least one of the CNS RNA samples. Furthermore, the sequences that were found to bind Ly6a expressing cells were selectively enriched in the CNS of C57BL/6J mice while many of the sequences found to bind Ly6c1 expressing cells were enriched in both C57BL/6J and BALB/cJ mice. This differential tropism is consistent with the finding that genetic changes in the BALB/cJ Ly6a gene prevent it from functioning as a receptor for AAV capsids engineered to bind Ly6a (Huang et al, bioRxiv 2019). These data provide additional validation that a significant fraction of the 7-mer modified capsids that were screened for selective binding to HEK293T cells ectopically expressing Ly6a or Ly6c exhibited the predicted enhanced tropism in vivo.
Although SNPs in Ly6c1 identified this gene as a potential factor associated with the nonpermissive AAV-PHP.eB transduction phenotype, unlike Ly6A, it remains highly expressed on endothelial cells of non-permissive strains (
These results demonstrate several findings. First, like LY6A, LY6C1 has the ability to traffic engineered viruses into the CNS, raising the possibility that additional Ly6 proteins and the wider class of GPI-anchored proteins may also facilitate CNS-wide gene delivery in other species including humans. Second, the novel ectopic expression and in vitro binding assay developed herein can enable the development of multiple AAV capsid variants that bind to specific proteins. Third, protein targets known to be present on specific cell populations of interest (e.g., brain endothelial cells) can be harnessed to enhance the transduction of those cells in vivo. This assay could enable the rapid development of capsids that are able to transduce target cell populations more efficiently and with greater specificity. Importantly, because the precise target receptor is known, the method and findings will be more translational to human gene therapy as compared to existing capsid engineering methods that rely on in vivo selections in model organisms and often result in the development of AAV capsids with species-specific tropisms.
To identify AAV capsids that selectively bind specific LY6 proteins, a purified protein pull down assay was used. To do this, a screen for viruses that interact with purified LY6A-, LY6C- or human CD59-fusion proteins was performed. This assay proved highly sensitive and resulted in the development of thousands of 7-mer modified capsid variants that selectively bind LY6A-Fc or LY6C1-Fc, but not a control Fc protein (Tables 11 and 15). A smaller number of sequences was found to specifically bind hCD59-Fc (Table 18). Encouragingly, for all three LY6-Fc fusions, a significant number of novel sequences were identified that matched motifs previously identified through HEK293T cell ectopic receptor assays and in vivo screening for each receptor (LY6A: Table 12; LY6C1: Table 16; hCD59: Table 18).
AAV vectors are commonly used to deliver genes in vivo because of their ability to provide long-term expression. In addition, many AAV vectors are able to transverse vascular barriers after intravenous administration and deliver genes to the cells throughout numerous tissues, including but not limited to the brain, heart, liver, skeletal muscle, lungs, bone, cartilage, bone marrow, adrenal gland, retina, pancreas, adipose tissue and kidney. However, it remains challenging to develop AAV vectors that target specific cell types or specific organs in humans.
Previously, nanoparticle and other novel delivery modalities were developed and directed toward the vasculature of specific organs (Sago et al., Proc Natl Acad Sci USA. 2018 Oct. 16; 115(42):E9944-E9952; Sago et al., J Am Chem Soc. 2018 Dec. 12; 140(49): 17095-17105.; Järvinen et al., Int J Mol Sci. 2015 Sep. 30; 16(10):23556-71.). While such nanoparticles can be developed to preferentially deliver siRNAs and mRNAs to endothelial cells in specific organs, it remains challenging to use nanoparticles or other nonviral delivery vehicles to deliver DNA to the nucleus for long-term gene therapeutic applications or to achieve gene delivery across vasculature barriers to reach parenchymal cells within the target tissue(s).
In the present disclosure, a two-step delivery method that overcomes these challenges is described. The first step involves the expression, preferably transient, of an ectopic receptor for an engineered virus in the target cell population of a patient. The second step involves the administration of an AAV that specifically interacts with the ectopic receptor to the same patient during the window of receptor expression. This approach is attractive because it breaks down the process of achieving stable gene expression in the cells of specific organs into two steps. The first step requires only transient delivery or expression of a receptor in the target organ endothelium, which could be achieved by delivery of an mRNA carried by a nanoparticle, a RNA or DNA virus (e.g. a recombinant lentivirus, SV40, anellovirus, or adenovirus) or protein with a targeting motif or conjugate. It is not necessary nor preferred that the delivery system achieves persistent gene expression or traverses the vascular barrier. The second step uses an engineered AAV, such as those presented here within, to efficiently target the cells that ectopically express the receptor for the modified AAV. The ectopic receptor then mediates the transcytosis of the engineered AAV across the vasculature where it can subsequently transduce cells within the target organ and provide durable transgene expression from the recombinant viral genome.
In step one, the receptor, which is absent or expressed at a level that limits transduction in the target cell population, is ectopically expressed in, or delivered to, the target cell population of a patient. The delivery of the receptor can be achieved with a nanoparticle carrying an mRNA for the receptor or a viral vector carrying RNA or DNA encoding the receptor, or targeted to cells through the administration of the purified protein. Preferably, the receptor is not otherwise found or expressed in the human patient. Preferably, the delivery of the receptor protein or the nucleic acid encoding the receptor results in transient delivery of the receptor protein or expression of the receptor in the target population of interest.
In step two, the AAV vector that exhibits selectively enhanced binding to, and transduction of, cells expressing the receptor is administered during the window of ectopic receptor expression. Preferably, the AAV vector is delivered to a patient through the intravascular route. However, the receptor-selective AAV can be delivered through any route that provides access to the cells expressing the receptor. Ideally the expression of Ly6a or Ly6c would be transient and the delivery of the AAV vector that transduces cells though binding to LY6A or LY6C1 would be delivered during the window of time that LY6A or LY6C is present within the target cell population of interest.
Provided within are examples of receptor-modified AAV pairs that can be used for the above two-step delivery approach. Examples are provided of AAV capsids that have been screened for binding to and transduction of human cells that ectopically express mouse Ly6a (Tables 4, 9, 10) and Ly6c1 (Tables 5, 12, 13) or to purified LY6A-Fc or LY6C1-Fc proteins (Table 11 and 15, respectively). These receptors are attractive as ectopic AAV receptors for several reasons: (1) No homologs of these genes exist in humans or other primates. (2) These receptors are highly expressed on mouse CNS vasculature and have a demonstrated ability to efficiently transfer a subset of 7-mer modified AAVs across the vascular barrier (i.e., the BBB) and into the CNS where they can then transduce neurons and glia (Huang et al. 2019:
It was found that Ly6a expression in human HEK293T cells results in a >50-fold increase in binding by the AAV-PHP.B caspids (AAV-PHP.B, AAV-PHP.eB, AAV-PHP.B2 and AAV-PHP.B3) as compared to control cells not expressing Ly6a, but did not increase binding to the control AAV9 (Huang et al. (2019) BioRxiv,
To determine whether ectopic receptor expression can be used to render human endothelial cells more sensitive to transduction by viruses engineered to bind specific receptors, Ly6a, Ly6c1, or a control (mScarlet) was expressed in human hCMEC cells using a 7-mer modified AAV, AAV-BI-13, that efficiently transduces several human cultured cell types including hCMEC cells. The hCMEC cells expressing Ly6a, Ly6c1 or mScarlet were then exposed to AAV vectors that specifically interact with LY6A (represented by AAV-PHP.eB; Table 1-4) or LY6C1 (represented by AAV-BI-28; Tables 5-8). Expression of Ly6a or Ly6c1 made hCMEM cells approximately 2-logs (base 10) more sensitive to transduction by AAV-PHP.eB or AAV-BI-28, respectively. Importantly, the increased efficiency is highly specific—Ly6a expression selectively improved transduction by AAV-PHP.eB and Ly6c1 expression selectively improved transduction by AAV-BI-28. No increased transduction was observed for either vector in the cells expressing mScarlet.
Using the in vitro binding assay, novel AAV capsids were identified that selectively bind cells overexpressing the human, marmoset, and/or mouse CD59 gene (
The development of AAV-PHP.B capsids provided proof-of-concept that AAV vectors with dramatically enhanced BBB crossing capabilities can be engineered, without a priori mechanistic knowledge [4,5]. AAV-PHP.B and AAV-PHP.eB are now widely used vectors for mouse neuroscience studies. However, the species-specific tropism of the AAV-PHP.B capsids reduces their appeal for human CNS gene therapy and highlights the shortcomings of performing selections and screens in model systems—the enhanced features of the identified capsids may not extend beyond the context (the genetic background) in which the selective pressure was applied. Accordingly, as compared to efforts in mice, selections in nonhuman primates (NHPs) are predicted to result in the identification of capsids whose enhanced features better translate to humans. Nonetheless, such efforts to develop clinically relevant vectors may likewise be thwarted by the identification of species- or model-specific capsids. Therefore, the pursuit of a vector that crosses the human BBB with AAV-PHP.eB-like efficiency gains will be aided by a mechanistic understanding of how naturally isolated and engineered capsids cross the BBB.
In the present disclosure, a single missense varian was rapidly identified tin Ly6a, out of a starting pool of millions of genetic variants, which segregates with efficient CNS transduction by AAV-PHP.eB. This was accomplished by first narrowing down candidates to genetic variants with a predicted high or medium impact and eliminating the bulk of the variants that did not segregate with the permissivity phenotype. This segregation study was achieved by leveraging Hail [26], the Mouse Genomes Project dataset [27], and 13 commercially available mouse lines; the code was implemented and run end-to-end on WGS data within hours, harnessing Hail's ability to scale computation across a large compute cluster, and the in vivo screening was completed in three weeks. The speed and small number of animals required for this approach is unprecedented compared to the conventional approaches of using diversity outbred lines or breeding generations of mice to determine the approximate genomic loci that segregates with a given phenotype.
After narrowing down the perfectly segregating genetic variants to two missense SNPs in two genes, molecular and biochemical studies were used to identify and validate Ly6a as the gene encoding the receptor for the AAV-PHP.B capsids. Because this approach was restricted to high and medium impact variants, the present disclosure does not rule out the possibility that other perfectly segregating noncoding variants present within Ly6a or other sites within the genome may contribute to the CNS transduction phenotype. In addition, it is possible that other genetic variants present in a subset of the nonpermissive strains within and surrounding Ly6a contribute to the nonpermissive phenotype. It is possible that one or more of these variants may influence Ly6a expression and contribute to the variation in LY6A levels and localization observed across nonpermissive strains.
The finding that Ly6a expression increases binding by the top three AAV-PHP.B variants, harboring unique peptide insertions (TLAVPFK, SVSKPFL, and FTLTTPK), identified using CREATE [5] suggests that LY6A has properties that make it an ideal receptor to engage for efficient transcytosis across the C57BL/6J BBB. Indeed, LY6A facilitates binding and transduction by AAV-PHP.eB in cells lacking either of the known AAV9 receptors, galactose and AAVR. Furthermore, ectopic expression of Ly6a is sufficient to render both human and hamster cells permissive to the enhanced binding and transduction of AAV-PHP.eB. Importantly, these findings demonstrate that AAVs can be engineered to utilize entirely new cell entry/transduction mechanisms rendering the novel capsids less dependent on interactions with the receptors that natural AAV serotypes rely on for transduction. Although there is no direct Ly6a homolog in primates, other cellular factors that share key properties with LY6A such as abundant luminal surface exposure on brain endothelium, localization within lipid micro-domains through GPI anchoring, or specific recycling/intracellular trafficking capabilities, may be prime molecular targets for gene delivery vectors in mice, NHPs, and humans. Notably, other LY6 proteins with homologs in primates are present within the CNS endothelium and can be explored and potentially harnessed for AAV capsid engineering. Developing capsids and/or other biologicals that target these receptors can open up new therapeutic avenues for treating a wide range of currently intractable neurological diseases.
In the claims articles such as “a,” “an,” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process.
Furthermore, the invention encompasses all variations, combinations, and permutations in which one or more limitations, elements, clauses, and descriptive terms from one or more of the listed claims is introduced into another claim. For example, any claim that is dependent on another claim can be modified to include one or more limitations found in any other claim that is dependent on the same base claim. Where elements are presented as lists, e.g., in Markush group format, each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. It should it be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements and/or features, certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements and/or features. For purposes of simplicity, those embodiments have not been specifically set forth in haec verba herein. It is also noted that the terms “comprising” and “containing” are intended to be open and permits the inclusion of additional elements or steps. Where ranges are given, endpoints are included. Furthermore, unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or sub-range within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise.
This application refers to various issued patents, published patent applications, journal articles, and other publications, all of which are incorporated herein by reference. If there is a conflict between any of the incorporated references and the instant specification, the specification shall control. In addition, any particular embodiment of the present invention that falls within the prior art may be explicitly excluded from any one or more of the claims. Because such embodiments are deemed to be known to one of ordinary skill in the art, they may be excluded even if the exclusion is not set forth explicitly herein. Any particular embodiment of the invention can be excluded from any claim, for any reason, whether or not related to the existence of prior art.
Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation many equivalents to the specific embodiments described herein. The scope of the present embodiments described herein is not intended to be limited to the above Description, but rather is as set forth in the appended claims. Those of ordinary skill in the art will appreciate that various changes and modifications to this description may be made without departing from the spirit or scope of the present invention, as defined in the following claims.
This application claims priority to and the benefit of U.S. Provisional Application Ser. No. 62/798,961 filed Jan. 30, 2019, the entire disclosure of which is hereby incorporated by reference.
This invention was made with government support under Grant No. NINDS UG3 NS111689-01 awarded by the National Institutes of Health Somatic Cell Genome Editing Consortium. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/015972 | 1/30/2020 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62798961 | Jan 2019 | US |