This application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. The Sequence Listing XML file, created on Jul. 5, 2023, is named “167741-049104_PCT_SL.xml” and is 183,740 bytes in size.
Engineering novel functions into proteins while retaining desired traits is a key challenge for developers of viral vectors, antibodies, and inhibitors of medical and industrial value. For instance, to be harnessed as a viable gene therapy vector, an adeno-associated virus (AAV) capsid must simultaneously exhibit high production yield and efficiently target the cell type(s) relevant to a specific disease across preclinical models to patients. A common approach for developing AAV capsids with novel tropisms is to funnel a random library of peptide-modified capsids through multiple rounds of selection to identify a few top-performing candidates. This approach has produced modified capsids that more efficiently transduce cells throughout the central nervous system (CNS), photoreceptors, brain endothelial cells, and skeletal muscle.
These rare capsids can then be diversified to screen even more enhanced tropisms, high production yield, or cross-species functionality. However, variants optimized for one trait can be difficult to optimize for other traits, and the protein sequence space is too vast to effectively sample by chance for rare variants that are enhanced across multiple traits. As a result, AAV engineering teams often devote many years and significant resources to developing capsids that ultimately fail to be optimized across multiple traits essential for preclinical and clinical translation.
Therefore, there is a need for improved adeno-associated viral vectors for multiple traits; for example, capsids that work across species to target organs of interest.
As described below, the present invention features adeno-associated viral vectors and methods of using such vectors.
In one aspect, the disclosure features an adeno-associated virus (AAV) capsid polypeptide containing an amino acid sequence with at least 85% amino acid sequence identity to one of the following amino acid sequences, containing one of the following amino acid sequences, or containing only one of one of the following amino acid sequences:
In another aspect, the disclosure features a polynucleotide encoding the AAV capsid polypeptide of any aspect of the disclosure delimited herein, or embodiments thereof.
In another aspect, the disclosure features a viral particle containing the AAV capsid polypeptide of any aspect of the disclosure delimited herein, or embodiments thereof.
In another aspect, the disclosure features a composition containing the capsid polypeptide, the polynucleotide, or the viral particle of any aspect of the disclosure delimited herein, or embodiments thereof.
In another aspect, the disclosure features a pharmaceutical composition containing the capsid polypeptide, the polynucleotide, or the viral particle of any aspect of the disclosure delimited herein, or embodiments thereof, and a pharmaceutically acceptable carrier, excipient, or diluent.
In another aspect, the disclosure features a method for delivering a payload to a liver cell in a subject, the method involving administering to the subject the viral particle of any aspect of the disclosure delimited herein, or embodiments thereof, thereby delivering the payload to a liver cell in the subject.
In another aspect, the disclosure features a vector containing a nucleotide sequence encoding a functional AAV-BI151, AAV-BI152, AAV-BI153, AAV-BI154, AAV-BI155, AAV-BI156, or AAV-BI157 polypeptide.
In another aspect, the disclosure features a host cell containing the vector of any aspect of the disclosure delimited herein, or embodiments thereof.
In another aspect, the disclosure features a method of producing a recombinant AAV particle containing an AAV-BI151, AAV-BI152, AAV-BI153, AAV-BI154, AAV-BI155, AAV-BI156, or AAV-BI157 capsid polypeptide, the method involving a) culturing a host cell. The host cell contains i) the vector of any aspect of the disclosure delimited herein, or embodiments thereof, ii) a polynucleotide containing a recombinant AAV genome containing a polynucleotide sequence flanked by ITR sequences and encoding a payload operably linked to a regulatory element for expression in a target cell, and iii) one or more polynucleotides encoding polypeptides capable of mediating production of recombinant AAV particles. The method further involves b) recovering recombinant AAV particles from the host cell.
In another aspect, the disclosure features a kit suitable for use in the method of any aspect of the disclosure delimited herein, or embodiments thereof. The kit contains the capsid polypeptide, the polynucleotide, the viral particle, the composition, or the vector of any aspect of the disclosure delimited herein, or embodiments thereof.
In any aspect of the disclosure delimited herein, or embodiments thereof, the polypeptide contains an amino acid sequence having at least about 90% amino acid sequence identity to the amino acid sequence. In any aspect of the disclosure delimited herein, or embodiments thereof, the polypeptide contains an amino acid sequence having at least about 95% amino acid sequence identity to the amino acid sequence. In any aspect of the disclosure delimited herein, or embodiments thereof, the polypeptide contains an amino acid sequence having at least about 99% amino acid sequence identity to the amino acid sequence. In any aspect of the disclosure delimited herein, or embodiments thereof, the polypeptide contains or contains only one of the amino acid sequences of any aspect of the disclosure delimited herein, or embodiments thereof.
In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains or contains only a nucleic acid sequence with at least 85%, 90%, 95%, or 99% nucleic acid sequence identity to one of the following nucleic acid sequences, contains one of the following nucleic acid sequences, or contains only one of the following nucleic acid sequences and encodes a functional AAV capsid protein:
In any aspect of the disclosure delimited herein, or embodiments thereof, the viral particle has increased transduction efficiency for a liver cell relative to a control viral particle. In embodiments, transduction efficiency is increased by at least about 10%, 25%, 50%, 100%, 200% or more relative to a control viral particle.
In any aspect of the disclosure delimited herein, or embodiments thereof, the viral particle has increased binding to a liver cell relative to a control viral particle.
In any aspect of the disclosure delimited herein, or embodiments thereof, the viral particle contains a polynucleotide. In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains a viral genome. In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains a payload. In any aspect of the disclosure delimited herein, or embodiments thereof, the payload contains a polynucleotide encoding a heterologous polypeptide or polynucleotide of interest. In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains two inverted terminal repeat (ITR) sequences, one at each of the 5′ and 3′ ends. In embodiments, the ITR sequences are AAV2 ITR sequences. In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains an element selected from one or more of a regulatory element, an untranslated region, a polyadenylation sequence, an intron, and a linker sequence, operably linked to the nucleotide sequence encoding the payload. In any aspect of the disclosure delimited herein, or embodiments thereof, the polynucleotide contains a promoter. In embodiments, the promoter is a ubiquitous promoter, a CAG promoter, or a tissue-specific promoter. In embodiments, the tissue-specific promoter is a liver promoter. In embodiments, the promoter drives expression of a polypeptide encoded by the polynucleotide in a hepatocyte.
In any aspect of the disclosure delimited herein, or embodiments thereof, the subject is a mammal. In embodiments, the mammal is a human, mouse or a macaque.
In any aspect of the disclosure delimited herein, or embodiments thereof, the vector is a plasmid.
Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this invention belongs. The following references provide one of skill with a general definition of many of the terms used in this invention: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them below, unless specified otherwise.
By “AAV-BI151 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI151 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI151 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI151 polypeptide. An exemplary AAV-BI151 nucleotide sequence is provided below.
By “AAV-BI152 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI152 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI152 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI152 polypeptide. An exemplary AAV-BI152 nucleotide sequence is provided below.
By “AAV-BI 153 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI 153 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI 153 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI153 polypeptide. An exemplary AAV-BI 153 nucleotide sequence is provided below.
By “AAV-BI154 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI154 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI 154 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI 154 polypeptide. An exemplary AAV-BI 154 nucleotide sequence is provided below.
By “AAV-BI155 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI155 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI155 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI155 polypeptide. An exemplary AAV-BI155 nucleotide sequence is provided below.
By “AAV-BI156 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI156 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI156 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI156 polypeptide. An exemplary AAV-BI156 nucleotide sequence is provided below.
By “AAV-BI157 polypeptide” is meant a protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. In embodiments, the AAV-BI157 protein comprises or consists of a sequence having at least about 90%, 95%, 99%, or 100% amino acid sequence identity with the following sequence.
By “AAV-BI 157 polynucleotide” is meant a nucleic acid molecule encoding an AAV-BI157 polypeptide. An exemplary AAV-BI 157 nucleotide sequence is provided below.
By “AAV1 polypeptide” is meant an AAV1 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV1 polynucleotide” is meant a nucleic acid molecule encoding an AAV1 polypeptide. An exemplary AAV1 nucleotide sequence is provided below.
By “AAV2 polypeptide” is meant an AAV2 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV2 polynucleotide” is meant a nucleic acid molecule encoding an AAV2 polypeptide. An exemplary AAV2 nucleotide sequence is provided below.
By “AAV3 polypeptide” is meant an AAV3 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV3 polynucleotide” is meant a nucleic acid molecule encoding an AAV3 polypeptide. An exemplary AAV3 nucleotide sequence is provided below.
By “AAV3B polypeptide” is meant an AAV3B protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV3B polynucleotide” is meant a nucleic acid molecule encoding an AAV3B polypeptide. An exemplary AAV3B nucleotide sequence is provided below.
By “AAV4 polypeptide” is meant an AAV4 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid. PP4′N,
By “AAV4 polynucleotide” is meant a nucleic acid molecule encoding an AAV4 polypeptide. An exemplary AAV4 nucleotide sequence is provided below.
By “AAV5 polypeptide” is meant an AAV5 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV5 polynucleotide” is meant a nucleic acid molecule encoding an AAV5 polypeptide. An exemplary AAV5 nucleotide sequence is provided below.
CCTCGGGAGATTGGCATTGCGATTCCACGTGGATGGGGGACAGAGTCGTCACCAAGTCCACCCG AACCTGGGTGCTGCCCAGCTACAACAACCACCAGTACCGAGAGATCAAAAGCGGCTCCGTCGAC GGAAGCAACGCCAACGCCTACTTTGGATACAGCACCCCCTGGGGGTACTTTGACTTTAACCGCT TCCACAGCCACTGGAGCCCCCGAGACTGGCAAAGACTCATCAACAACTACTGGGGCTTCAGACC CCGGTCCCTCAGAGTCAAAATCTTCAACATTCAAGTCAAAGAGGTCACGGTGCAGGACTCCACC ACCACCATCGCCAACAACCTCACCTCCACCGTCCAAGTGTTTACGGACGACGACTACCAGCTGC CCTACGTCGTCGGCAACGGGACCGAGGGATGCCTGCCGGCCTTCCCTCCGCAGGTCTTTACGCT GCCGCAGTACGGTTACGCGACGCTGAACCGCGACAACACAGAAAATCCCACCGAGAGGAGCAGC TTCTTCTGCCTAGAGTACTTTCCCAGCAAGATGCTGAGAACGGGCAACAACTTTGAGTTTACCT ACAACTTTGAGGAGGTGCCCTTCCACTCCAGCTTCGCTCCCAGTCAGAACCTGTTCAAGCTGGC CAACCCGCTGGTGGACCAGTACTTGTACCGCTTCGTGAGCACAAATAACACTGGCGGAGTCCAG TTCAACAAGAACCTGGCCGGGAGATACGCCAACACCTACAAAAACTGGTTCCCGGGGCCCATGG GCCGAACCCAGGGCTGGAACCTGGGCTCCGGGGTCAACCGCGCCAGTGTCAGCGCCTTCGCCAC GACCAATAGGATGGAGCTCGAGGGCGCGAGTTACCAGGTGCCCCCGCAGCCGAACGGCATGACC AACAACCTCCAGGGCAGCAACACCTATGCCCTGGAGAACACTATGATCTTCAACAGCCAGCCGG CGAACCCGGGCACCACCGCCACGTACCTCGAGGGCAACATGCTCATCACCAGCGAGAGCGAGAC GCAGCCGGTGAACCGCGTGGCGTACAACGTCGGCGGGCAGATGGCCACCAACAACCAGAGCTCC ACCACTGCCCCCGCGACCGGCACGTACAACCTCCAGGAAATCGTGCCCGGCAGCGTGTGGATGG AGAGGGACGTGTACCTCCAAGGACCCATCTGGGCCAAGATCCCAGAGACGGGGGCGCACTTTCA CCCCTCTCCGGCCATGGGCGGATTCGGACTCAAACACCCACCGCCCATGATGCTCATCAAGAAC ACGCCTGTGCCCGGAAATATCACCAGCTTCTCGGACGTGCCCGTCAGCAGCTTCATCACCCAGT ACAGCACCGGGCAGGTCACCGTGGAGATGGAGTGGGAGCTCAAGAAGGAAAACTCCAAGAGGTG GAACCCAGAGATCCAGTACACAAACAACTACAACGACCCCCAGTTTGTGGACTTTGCCCCGGAC AGCACCGGGGAATACAGAACCACCAGACCTATCGGAACCCGATACCTTACCCGACCCCTTTAA (SEQ ID NO: 26)
By “AAV6 polypeptide” is meant an AAV6 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV6 polynucleotide” is meant a nucleic acid molecule encoding an AAV6 polypeptide. An exemplary AAV6 nucleotide sequence is provided below.
By “AAV7 polypeptide” is meant an AAV7 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV7 polynucleotide” is meant a nucleic acid molecule encoding an AAV7 polypeptide. An exemplary AAV7 nucleotide sequence is provided below.
By “AAV8 polypeptide” is meant an AAV8 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV8 polynucleotide” is meant a nucleic acid molecule encoding an AAV8 polypeptide. An exemplary AAV8 nucleotide sequence is provided below.
By “AAV9 K549R polypeptide” is meant an AAV9 K549R protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV9 K549R polynucleotide” is meant a nucleic acid molecule encoding an AAV9 K549R polypeptide. An exemplary AAV9 K549R nucleotide sequence is provided below.
By “AAV9 polypeptide” is meant an AAV9 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAV9 polynucleotide” is meant a nucleic acid molecule encoding an AAV9 polypeptide. An exemplary AAV9 nucleotide sequence is provided below.
By “AAVrh.10 polypeptide” is meant an AAVrh.10 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAVrh.10 polynucleotide” is meant a nucleic acid molecule encoding an AAVrh.10 polypeptide. An exemplary AAVrh.10 nucleotide sequence is provided below.
By “AAVrh.8 polypeptide” is meant an AAVrh.8 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “AAVrh.8 polynucleotide” is meant a nucleic acid molecule encoding an AAVrh.8 polypeptide. An exemplary AAVrh.8 nucleotide sequence is provided below.
By “LK03 polypeptide” is meant an LK03 protein with at least about 85% amino acid sequence identity to the amino acid sequence provided below, or a fragment thereof capable of multimerization to form a capsid.
By “LK03 polynucleotide” is meant a nucleic acid molecule encoding an LK03 polypeptide. An exemplary LK03 nucleotide sequence is provided below.
By “administering” is meant giving, supplying, dispensing a composition, agent, therapeutic product, and the like to a subject, or applying or bringing the composition and the like into contact with the subject. Administering or administration may be accomplished by any of a number of routes, such as, for example, without limitation, parenteral or systemic, intravenous (IV), (injection), subcutaneous, intrathecal, intracranial, intramuscular, dermal, intradermal, inhalation, rectal, intravaginal, topical, oral, subcutaneous, intramuscular, or intraocular. In embodiments, administration is systemic, such as by inoculation, injection, or intravenous injection.
By “agent” is meant any viral particle comprising a therapeutic molecule (e.g., antibody, nucleic acid molecule, or polypeptide, or fragments thereof). A non-limiting example of an agent is an AAV of the present disclosure.
By “alteration” is meant a change in the expression levels or activity of a gene or polypeptide as detected by standard art known methods such as those described herein. The alteration can be an increase or a decrease. As used herein, an alteration includes a 10% change in expression levels, preferably a 25% change, more preferably a 40% change, and most preferably a 50% or greater change in expression levels.”
By “analog” is meant a molecule that is not identical but has analogous functional or structural features. For example, a polypeptide analog retains the biological activity of a corresponding naturally occurring polypeptide, while having certain biochemical modifications that enhance the analog's function relative to a naturally occurring polypeptide. Such biochemical modifications could increase the analog's protease resistance, membrane permeability, or half-life, without altering, for example, ligand binding. An analog may include an unnatural amino acid.
In this disclosure, “comprises,” “comprising,” “containing” and “having” and the like can have the meaning ascribed to them in U.S. Patent law and can mean “includes,” “including,” and the like; “consisting essentially of” or “consists essentially” likewise has the meaning ascribed in U.S. Patent law and the term is open-ended, allowing for the presence of more than that which is recited so long as basic or novel characteristics of that which is recited is not changed by the presence of more than that which is recited, but excludes prior art embodiments. Any embodiments specified as “comprising” a particular component(s) or element(s) are also contemplated as “consisting of” or “consisting essentially of” the particular component(s) or element(s) in some embodiments.
By “consist essentially” it is meant that the ingredients include only the listed components along with the normal impurities present in commercial materials and with any other additives present at levels which do not affect the operation of the disclosure, for instance at levels less than 5% by weight or less than 1% or even 0.5% by weight.
“Detect” refers to identifying the presence, absence, or amount of the analyte to be detected.
By “detectable label” is meant a composition that when linked to a molecule of interest renders the latter detectable, via spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include radioactive isotopes, magnetic beads, metallic beads, colloidal particles, fluorescent dyes, electron-dense reagents, enzymes (for example, as commonly used in an ELISA), biotin, digoxigenin, or haptens.
By “fragment” is meant a portion of a polypeptide or nucleic acid molecule. This portion contains at least about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the entire length of the reference nucleic acid molecule or polypeptide. A fragment may contain 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 nucleotides or amino acids.
By “gene” is meant a region of a polynucleotide that is transcribed as a single unit. Typically, a gene is transcribed to produce a single RNA molecule.
“Hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleobases. For example, adenine and thymine are complementary nucleobases that pair through the formation of hydrogen bonds.
By “increase” is meant to alter positively by at least 5% relative to a reference. An increase may be by 5%, 10%, 25%, 30%, 50%, 75%, or even by 100%.
The terms “isolated,” “purified,” or “biologically pure” refer to material that is free to varying degrees from components which normally accompany it as found in its native state. “Isolate” denotes a degree of separation from original source or surroundings. “Purify” denotes a degree of separation that is higher than isolation. A “purified” or “biologically pure” protein is sufficiently free of other materials such that any impurities do not materially affect the biological properties of the protein or cause other adverse consequences. That is, a nucleic acid or peptide of this invention is purified if it is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized. Purity and homogeneity are typically determined using analytical chemistry techniques, for example, polyacrylamide gel electrophoresis or high-performance liquid chromatography. The term “purified” can denote that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. For a protein that can be subjected to modifications, for example, phosphorylation or glycosylation, different modifications may give rise to different isolated proteins, which can be separately purified.
By “isolated polynucleotide” is meant a nucleic acid that is free of the genes which, in the naturally occurring genome of the organism from which the nucleic acid molecule of the invention is derived, flank the gene. The term therefore includes, for example, a recombinant DNA that is incorporated into a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or that exists as a separate molecule (for example, a cDNA or a genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) independent of other sequences. In addition, the term includes an RNA molecule that is transcribed from a DNA molecule, as well as a recombinant DNA that is part of a hybrid gene encoding additional polypeptide sequence.
By an “isolated polypeptide” is meant a polypeptide of the invention that has been separated from components that naturally accompany it. Typically, the polypeptide is isolated when it is at least 60%, by weight, free from the proteins and naturally occurring organic molecules with which it is naturally associated. Preferably, the preparation is at least 75%, more preferably at least 90%, and most preferably at least 99%, by weight, a polypeptide of the invention. An isolated polypeptide of the invention may be obtained, for example, by extraction from a natural source, by expression of a recombinant nucleic acid encoding such a polypeptide; or by chemically synthesizing the protein. Purity can be measured by any appropriate method, for example, column chromatography, polyacrylamide gel electrophoresis, or by HPLC analysis.
By “marker” is meant any protein or polynucleotide having an alteration in expression level or activity that is associated with a developmental state, condition, disease, or disorder.
As used herein, “obtaining” as in “obtaining an agent” includes synthesizing, purchasing, or otherwise acquiring the agent.
By “payload” or “payload region” is meant an agent to be delivered to a cell. In embodiments, the payload is a polynucleotide that encodes a heterologous polypeptide or polynucleotide (e.g., miRNA) to be expressed in a target cell.
By “polypeptide” or “amino acid sequence” is meant any chain of amino acids, regardless of length or post-translational modification. In various embodiments, the post-translational modification is glycosylation or phosphorylation. In various embodiments, conservative amino acid substitutions may be made to a polypeptide to provide functionally equivalent variants, or homologs of the polypeptide. In some aspects the invention embraces sequence alterations that result in conservative amino acid substitutions. In some embodiments, a “conservative amino acid substitution” refers to an amino acid substitution that does not alter the relative charge or size characteristics of the protein in which the conservative amino acid substitution is made. Variants can be prepared according to methods for altering polypeptide sequence known to one of ordinary skill in the art such as are found in references that compile such methods, e.g., Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989, or Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. Non-limiting examples of conservative substitutions of amino acids include substitutions made among amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. In various embodiments, conservative amino acid substitutions can be made to the amino acid sequence of the proteins and polypeptides disclosed herein.
By “manufacturability,” “production fitness,” “production,” or “produces” with reference to a capsid polypeptide is meant how well a capsid polynucleotide is expressed in a cell and the amount of viral particles produced from the expressed capsid polypeptides that are capable of delivering a payload to a cell. In embodiments, the production efficiency of a capsid polypeptide may be measured as the number of functional viral particles produced using a particular amount of a polynucleotide encoding the capsid polypeptide. In some cases, an AAV capsid with good production is an AAV capsid that yields greater or comparable levels of functional AAV viral particles relative to a reference AAV viral capsid. Production fitness of a capsid polypeptide can be assessed using methods provided herein.
The term “recombinant” as used herein in the context of proteins or nucleic acids refers to proteins or nucleic acids that do not occur in nature or in a naturally occurring protein or nucleic acid sequence, but are the product of human engineering, often or typically utilizing molecular biological or molecular genetic tools and techniques practiced by the skilled practitioner in the art. For example, in some embodiments, a recombinant protein or nucleic acid molecule comprises an amino acid or nucleotide sequence that comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, or at least eight mutations as compared to any naturally occurring sequence.
By “reduce” is meant to alter negatively by at least 5% relative to a reference. A reduction may be by 5%, 10%, 25%, 30%, 50%, 75%, or even by 100%.
By “reference” is meant a standard or control condition. In embodiments, a reference is a cell or animal that does not express a particular recombinase (e.g., Cre or FLP). In some embodiments, the reference is a cell or animal that has not been contacted with or administered a viral particle. In some cases, a reference is a capsid polypeptide that does not comprise a peptide insert of the present disclosure.
A “reference sequence” is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset of or the entirety of a specified sequence; for example, a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence. For polypeptides, the length of the reference polypeptide sequence will generally be at least about 16 amino acids, preferably at least about 20 amino acids, more preferably at least about 25 amino acids, and even more preferably about 35 amino acids, about 50 amino acids, or about 100 amino acids. For nucleic acids, the length of the reference nucleic acid sequence will generally be at least about 50 nucleotides, preferably at least about 60 nucleotides, more preferably at least about 75 nucleotides, and even more preferably about 100 nucleotides or about 300 nucleotides or any integer thereabout or therebetween.
Nucleic acid molecules useful in the methods of the invention include any nucleic acid molecule that can be transcribed into an mRNA molecule or that encodes a polypeptide of the invention or a fragment thereof. In embodiments, the mRNA contains a sequence corresponding to a barcode and/or invertible spacer of the present disclosure. In embodiments, nucleic acid molecules need not be 100% identical with an endogenous nucleic acid sequence but will typically exhibit substantial identity. Polynucleotides having “substantial identity” to an endogenous sequence are typically capable of hybridizing with at least one strand of a double-stranded nucleic acid molecule. Nucleic acid molecules useful in the methods of the invention include any nucleic acid molecule that encodes a polypeptide of the invention or a fragment thereof. Polynucleotides having “substantial identity” to an endogenous sequence are typically capable of hybridizing with at least one strand of a double-stranded nucleic acid molecule. By “hybridize” is meant pair to form a double-stranded molecule between complementary polynucleotide sequences (e.g., a gene described herein), or portions thereof, under various conditions of stringency. (See, e.g., Wahl, G. M., and S. L. Berger (1987) Methods Enzymol. 152:399; Kimmel, A. R. (1987) Methods Enzymol. 152:507). In some instances, the nucleic acid molecule encodes a polypeptide that is not endogenous to a target cell or animal. In some cases, the nucleic acid molecule encodes a capsid polypeptide of the present disclosure or a fragment thereof.
For example, stringent salt concentration will ordinarily be less than about 750 mM NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium citrate, and more preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while high stringency hybridization can be obtained in the presence of at least about 35% formamide, and more preferably at least about 50% formamide. Stringent temperature conditions will ordinarily include temperatures of at least about 30° C., more preferably of at least about 37° C., and most preferably of at least about 42° C. Varying additional parameters, such as hybridization time, the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. Various levels of stringency are accomplished by combining these various conditions as needed. In a preferred: embodiment, hybridization will occur at 30° C. in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a more preferred embodiment, hybridization will occur at 37° C. in 500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide, and 100 μg/ml denatured salmon sperm DNA (ssDNA). In a most preferred embodiment, hybridization will occur at 42° C. in 250 mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 μg/ml ssDNA. Useful variations on these conditions will be readily apparent to those skilled in the art.
For most applications, washing steps that follow hybridization will also vary in stringency. Wash stringency conditions can be defined by salt concentration and by temperature. As above, wash stringency can be increased by decreasing salt concentration or by increasing temperature. For example, stringent salt concentration for the wash steps will preferably be less than about 30 mM NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include a temperature of at least about 25° C., more preferably of at least about 42° C., and even more preferably of at least about 68° C. In a preferred embodiment, wash steps will occur at 25° C. in 30 mM NaCl, 3 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 42 C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 68° C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. Additional variations on these conditions will be readily apparent to those skilled in the art. Hybridization techniques are well known to those skilled in the art and are described, for example, in Benton and Davis (Science 196:180, 1977); Grunstein and Hogness (Proc. Natl. Acad. Sci., USA 72:3961, 1975); Ausubel et al. (Current Protocols in Molecular Biology, Wiley Interscience, New York, 2001); Berger and Kimmel (Guide to Molecular Cloning Techniques, 1987, Academic Press, New York); and Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York.
By “substantially identical” is meant a polypeptide or nucleic acid molecule exhibiting at least 50% identity to a reference amino acid sequence or nucleic acid sequence. In embodiments, such a sequence is at least 60%, more preferably 80% or 85%, and more preferably 90%, 95% or even 99% identical at the amino acid level or nucleic acid to the sequence used for comparison.
Sequence identity is typically measured using sequence analysis software (for example, Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705, BLAST, BESTFIT, GAP, or PILEUP/PRETTYBOX programs). Such software matches identical or similar sequences by assigning degrees of homology to various substitutions, deletions, and/or other modifications. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. In an exemplary approach to determining the degree of identity, a BLAST program may be used, with a probability score between e−3 and e−100 indicating a closely related sequence.
By “subject” is meant an organism. In embodiments, the organism is a mammal. Non-limiting examples of a subject include a human or non-human mammal, such as a non-human primate (e.g., a marmoset), or a non-human mammal, such as a bovine, equine, canine, ovine, or feline mammal, or a sheep, goat, llama, camel, or a rodent (rat, mouse), ferret, gerbil, hamster, or zebrafish.
Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50.
“Transduction” refers to a process by which a polynucleotide is introduced or transferred into a cell. In embodiments, a cell is transduced by a virus or viral vector. In embodiments, the transduced polynucleotide (e.g., RNA, DNA) is expressed in the transduced cell.
As used herein, the term “vehicle” refers to a solvent, diluent, or carrier component of a pharmaceutical composition.
By “viral genome” is meant a polynucleotide molecule suitable for encapsidation by a viral capsid. A non-limiting example of a viral genome is a polynucleotide (e.g., single-stranded DNA) containing and/or flanked by two adeno-associated virus inverted terminal repeats (ITR's). In some cases, a viral genome contains a rep open reading frame and/or a cap open reading frame. In embodiments, the viral capsid is an adeno-associated virus capsid or a lentivirus capsid. In various instances, the viral genome is of sufficient size for encapsidation by a viral capsid (e.g., less than 4.7 kilobases long).
Unless specifically stated or obvious from context, as used herein, the term “or” is understood to be inclusive. Unless specifically stated or obvious from context, as used herein, the terms “a”, “an”, and “the” are understood to be singular or plural.
Unless specifically stated or obvious from context, as used herein, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. Unless otherwise clear from context, all numerical values provided herein are modified by the term about.
In any of the above aspects, or embodiments thereof, the cells form part of an organoid or virtual organ. In any of the above aspects, or embodiments thereof, the cells contain two or more different cell types.
The recitation of a listing of chemical groups in any definition of a variable herein includes definitions of that variable as any single group or combination of listed groups. The recitation of an embodiment for a variable or aspect herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
Any compositions or methods provided herein can be combined with one or more of any of the other compositions and methods provided herein.
As described below, the present invention features adeno-associated viral vectors and methods of using such vectors.
The invention of the disclosure is based, at least in part, upon the design of new adeno-associated virus (AAV) capsids and libraries comprising the same. Systematically identifying protein variants with multiple enhanced traits remains a major protein engineering challenge. Focusing on adeno-associated virus (AAV) capsids, a machine learning-compatible Fit4Function library was designed that evenly samples the sequence space of high production fit variants. With this library, generalizable ML models were trained to predict gene therapy-relevant traits including production yield, in vivo biodistribution, and binding and transduction of human cells. The models were used to design a library that efficiently explored capsids predicted to possess multiple traits important for hepatocyte gene delivery. Upon validation, 90% of the library variants met all predetermined criteria. Individually tested capsids exhibited efficient cross-species hepatocyte transduction. In embodiments, the Fit4Function approach is applicable to the multi-trait enhancement of other proteins amenable to quantitative, high-throughput engineering.
As described above, the invention of the disclosure is based, at least in part, upon the development of a generalizable machine learning-guided approach to systematically and simultaneously map 7-mer-modified AAV9 capsid sequences to multiple functions. To generate high-quality data that would enable the training of accurate ML models, a low bias, high diversity library composed only of capsid variants with high production fitness was created (
In various embodiments, the capsids and/or capsid libraries of the present disclosure possess one or more of the following traits: enhanced on-target delivery; reduced delivery to common accumulation sites; resistance to pre-existing antibodies (e.g., pre-existing circulating antibodies in a subject); and/or improved or maintained manufacturability. In embodiments, the capsids and/or capsid libraries of the present disclosure are suitable for infecting human cells. In some instances, the capsids and/or capsid libraries of the present disclosure are resistant to a polyclonal response. In some embodiments, capsids and/or capsid libraries of the present invention are suitable for infecting one or more species (e.g., a mouse and a primate, such as a human). In some cases, the present disclosure provides capsids or libraries containing the same that have increased immune evasion.
In various aspects, the disclosure features capsid libraries containing polypeptides or polynucleotides encoding the same. In aspects, the disclosure features methods for screening the capsid libraries. In aspects, the present disclosure features viral particles containing capsid polypeptides. In various cases, the capsid libraries are prepared by inserting peptides of a predetermined length into a parent/reference adeno-associated virus capsid polypeptide (e.g., an AAV9 K449R polypeptide). In embodiments, the peptides are 2-mers, 3-mers, 4-mers, 5-mers, 6-mers, 7-mers, 8-mers, 9-mers, 10-mers, 11-mers, 12-mers, 13-mers, 14-mers, 15-mers, or longer n-mers. The peptides can be inserted at any of various locations in the capsid polypeptide, such as within a loop of the capsid polypeptide (e.g., Loop VIII of the polypeptide); for example, the peptide may be inserted after or before amino acid position 577, 586, 587, 588, 589, or 590 of the polypeptide. In some cases, the capsid polypeptide is an AAV1, AAV2, AAV3, AAv3b, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV9 K449R, rh.10, rh.8, or LK03 polypeptide. Non-limiting examples of representative insertion sites are provided in Table 1 below. A capsid library can contain about, or at least about 2, 5, 10, 50, 100, 500, 1e3, 5e3, 1e4, 5e4, 1e5, 2e5, 3e5, 4e5, 5e5, 6e5, 7e5, 8e5, 9e6, 1e6, 5e6, or 1e7 unique insertions.
In embodiments, all of the sequences in the capsid library are capable of forming viral particles sharing 1, 2, 3, 4, 5, 6 or more common traits selected from one or more of those described herein, such as binding a cell of interest (e.g., liver cell, hepatocyte, HepG2, THLE, T cell; HEK293 cell, brain endothelial cell; C57 brain endothelial cell; hCME CD3; kidney cell; spinal cord cell); transducing a cell of interest (e.g., liver cell, hepatocyte, HepG2, THLE, T cell; HEK293 cell, brain endothelial cell; C57 brain endothelial cell; hCME CD3; kidney cell; spinal cord cell); biodistributing to the liver of an organism (e.g., human, rodent); production fitness; heart biodistribution; spleen biodistribution; kidney biodistribution; serum biodistribution; brain biodistribution; lung biodistribution; spinal cord biodistribution; and spinal cord transduction. In embodiments, the common trait(s) is increased relative to a reference viral particle. In some cases, the reference viral particle is selected from a viral particle containing a capsid polypeptide selected from one or more of the following and not including any peptide insert: AAV1, AAV2, AAV3, AAv3b, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV9 K449R, rh.10, rh.8, or LK03. Further non-limiting examples of common traits include binding to or transfecting one or more of the following cell types: HEK293 T cells, primary mouse brain microvascular endothelial cells, primary human BMVEC cells, and/or human brain endothelial cell line hCMEC/D3 cells. In some cases, the common trait(s) include increased biodistribution relative to a reference viral particle in one or more of the following organs: liver, kidney, spleen, brain, spinal cord, serum, heart, and/or lungs. In embodiments, a capsid library is enriched for capsids capable of forming viral particles with the common trait(s) relative to reference capsid library (e.g., a randomly selected library of capsid sequences and/or a library of capsid sequences containing a random collection of 7-mer peptides inserted at a particular amino acid location).
In various embodiments, the methods of the present disclosure involve selecting from a library of capsid polypeptides those capsid polypeptides having a trait(s) of interest (e.g., binding, biodistribution, or transduction capabilities). The selection can be carried out in silico or in vivo using a selection criterion or selective pressure. In embodiments, the methods of the present disclosure allow for the simultaneous optimization of multiple capsid functions, such as production, biodistribution to a target organ in a particular species, enhanced biodistribution to a target organ in a particular species, and enhanced target cell type (e.g., a human target cell type) transduction. In embodiments, machine learning (ML) models are used to deeply sample sequence space for capsids that have traits of interest. Capsids identified as having traits of interest can then be selected to populate a multi-function library containing the in silico predicted sequences (see, e.g.,
In embodiments, a Fit4Function library contains high production capsids. It can be advantageous for the Fit4Function libraries to contain less amino acid bias than libraries constructed using alternative approaches (e.g., random selection of sequences, such as traditional NNN/NNK libraries). All sequences contained within a Fit4Function library are known and, accordingly, each library is accompanied by a member list providing a comprehensive list of all capsid sequences contained within the library. In some cases, the Fit4Function libraries facilitate more accurate machine learning (ML) models that can learn a theoretical sequence-to-function mapping. The Fit4Function libraries can enable efficient exploration of the multi-functional fitness space and/or enable data accumulation across species and experiments.
In some cases, the present disclosure provides libraries of capsid polypeptides, or polynucleotides encoding the same, where the libraries contain capsid polypeptides that satisfy a detargeting trait. Non-limiting examples of detargeting traits include reduced transduction of a target cell type or organ (e.g., reduced liver transduction) and reduced biodistribution in a particular organ (e.g., spleen biodistribution) or species.
In various embodiments, a library of capsids of the present disclosure contains a higher proportion of capsids with a trait(s) of interest than a reference library of randomly selected capsid sequences (e.g., capsids containing a random selection of 7-mer peptides inserted at a particular amino acid position within a reference capsid polypeptide sequence). In embodiments, a library of capsids contains capsids or contains only capsids having one or more (e.g., 1, 2, 3, 4, 5, or all) of the following traits: 1) high binding affinity to HepG2 cells, 2) high binding affinity to THLE cells, 3) high transduction of HepG2 cells. 4) high transduction of TILE cells, 5) high biodistribution to C57 mice liver, and 6) high production fitness.
In embodiments, viral particles containing capsids polypeptides of the present disclosure can transduce muscle, liver, brain, retina, and/or lung cells in vivo and/or in vitro. The efficiency of rAAV transduction is dependent on the efficiency at each step of AAV infection, i.e., virus binding, entry, trafficking, nuclear entry, uncoating, and second-strand synthesis.
Adeno-associated viruses (AAV) are small non-enveloped icosahedral capsid viruses of the Parvovindae family characterized by a single stranded DNA viral genome. Parvoviridae family viruses consist of two subfamilies: Parvovirinae, which infect vertebrates, and Densovirinae, which infect invertebrates. The Parvoviridae family comprises the Dependovirus genus which includes AAV, capable of replication in vertebrate hosts including, but not limited to, human, primate, bovine, canine, equine, and ovine species.
The parvoviruses and other members of the Parvoviridae family are generally described in Kenneth I Berns, “Parvoviridae: The Viruses and Their Replication,” Chapter 69 in FIELDS VIROLOGY (3d Ed. 1996), the contents of which are incorporated by reference in their entirety.
AAV have proven to be useful as a biological tool due to their relatively simple structure, their ability to infect a wide range of cells (including quiescent and dividing cells) without integration into the host genome and without replicating, and their relatively benign immunogenic profile. The genome of the virus may be manipulated to contain a minimum of components for the assembly of a functional recombinant virus, or viral particle, which is loaded with or engineered to target a particular tissue and express or deliver a desired payload.
The wild-type AAV vector genome is a linear, single-stranded DNA (ssDNA) molecule approximately 5,000 nucleotides (nt) in length. Inverted terminal repeats (ITRs) traditionally cap the viral genome at both the 5′ and the 3′ end, providing origins of replication for the viral genome. While not wishing to be bound by theory, an AAV viral genome typically comprises two ITR sequences. These ITRs have a characteristic T-shaped hairpin structure defined by a self-complementary region (145 nt in wild-type AAV) at the 5′ and 3′ ends of the ssDNA which form an energetically stable double stranded region. The double stranded hairpin structures comprise multiple functions including, but not limited to, acting as an origin for DNA replication by functioning as primers for the endogenous DNA polymerase complex of the host viral replication cell.
The wild-type AAV viral genome further comprises nucleotide sequences for two open reading frames, one for the four non-structural Rep proteins (Rep78, Rep68, Rep52, Rep40, encoded by Rep genes) and one for the three capsid, or structural, proteins (VP1. VP2, VP3, encoded by capsid genes or Cap genes). The Rep proteins are important for replication and packaging, while the capsid proteins are assembled to create the protein shell of the AAV, or AAV capsid. Alternative splicing and alternate initiation codons and promoters result in the generation of four different Rep proteins from a single open reading frame and the generation of three capsid proteins from a single open reading frame. Though it vanes by AAV serotype, as a non-limiting example, for AAV9/hu.14 (SEQ ID NO. I23 of U.S. Pat. No. 7,906,111, the contents of which are herein incorporated by reference in their entirety) VP1 refers to amino acids 1-736, VP2 refers to amino acids 138-736, and VP3 refers to amino acids 203-736. In other words. VP1 is the full-length capsid sequence, while VP2 and VP3 are shorter components of the whole. As a result, changes in the sequence in the VP3 region, are also changes to VP1 and VP2, however, the percent difference as compared to the parent sequence will be greatest for VP3 since it is the shortest sequence of the three. Though described here in relation to the amino acid sequence, the nucleic acid sequence encoding these proteins can be similarly described. Together, the three capsid proteins assemble to create the AAV capsid protein. While not wishing to be bound by theory, the AAV capsid protein typically comprises a molar ratio of 1:1:10 of VP1:VP2:VP3. As used herein, an “AAV serotype” ° is defined primarily by the AAV capsid. In some instances, the ITRs are also specifically described by the AAV serotype (e.g., AAV2/9).
For use as a biological tool, the wild-type AAV viral genome can be modified to replace the rep/cap sequences with a nucleic acid sequence comprising a payload region with at least one ITR region. Typically, in recombinant AAV viral genomes there are two ITR regions. The rep/cap sequences can be provided in trans during production to generate AAV particles.
In addition to the encoded heterologous payload. AAV vectors may comprise the viral genome, in whole or in part, of any naturally occurring and/or recombinant AAV serotype nucleotide sequence or variant. AAV variants may have sequences of significant homology at the nucleic acid (genome or capsid) and amino acid levels (capsids), to produce constructs which are generally physical and functional equivalents, replicate by similar mechanisms, and assemble by similar mechanisms. Chiorini et al., J. Vir. 71: 6823-33(1997): Srivastava et al., J. Vir. 555-64 (1983), Chiorini et al., J. Vir 73 1309-1319 (1999): Rutledge et al., J. Vir. 72:309-319 (1998; and Wu et al, J. Vir. 74: 8635-47 (2000), the contents of each of which are incorporated herein by reference in their entirety.
In certain embodiments, AAV particles of the present disclosure are recombinant AAV viral vectors which are replication defective and lacking sequences encoding functional Rep and Cap proteins within their viral genome. These defective AAV vectors may lack most or all parental coding sequences and essentially carry only one or two AAV ITR sequences and the nucleic acid of interest for delivery to a cell, a tissue, an organ, or an organism.
In certain embodiments, the viral genome of the AAV particles of the present disclosure comprises at least one control element which provides for the replication, transcription, and translation of a coding sequence encoded therein. Not all of the control elements need always be present as long as the coding sequence is capable of being replicated, transcribed, and/or translated in an appropriate host cell. Non-limiting examples of expression control elements include sequences for transcription initiation and/or termination, promoter and/or enhancer sequences, efficient RNA processing signals such as splicing and polyadenylation signals, sequences that stabilize cytoplasmic mRNA, sequences that enhance translation efficacy (e.g., Kozak consensus sequence), sequences that enhance protein stability, and/or sequences that enhance protein processing and/or secretion.
According to the present disclosure, AAV particles for use in therapeutics and/or diagnostics comprise a virus that has been distilled or reduced to the minimum components necessary for transduction of a nucleic acid payload or cargo of interest. In this manner, AAV particles are engineered as vehicles for specific delivery while lacking the deleterious replication and/or integration features found in wild-type viruses.
AAV vectors of the present disclosure may be produced recombinantly and may be based on adeno-associated virus (AAV) parent or reference sequences. As used herein, a “vector” is any molecule or moiety which transports, transduces, or otherwise acts as a carrier of a heterologous molecule such as the nucleic acids described herein.
In addition to single stranded AAV viral genomes (e.g., ssAAVs), the present invention also provides for self-complementary AAV (scAAVs) viral genomes, scAAV vector genomes contain DNA strands which anneal together to form double stranded DNA. By skipping second strand synthesis. scAAVs allow for rapid expression in the transduced cell.
In certain embodiments, the AAV particle of the present disclosure is an scAAV.
In certain embodiments, the AAV particle of the present disclosure is an ssAAV.
Methods for producing and/or modifying AAV particles are provided herein and are disclosed in the art such as pseudotyped AAV vectors (PCT Patent Publication Nos. WO200028004; WO200123001: WO2004112727: WO2005005610; and WO2005072364, the content of each of which is incorporated herein by reference in its entirety).
AAV particles may be modified by methods such as those provided herein to enhance the efficiency of delivery. Such modified AAV particles can be packaged efficiently and be used to successfully infect the target cells at high frequency and with minimal toxicity. In some embodiments, the capsids of the AAV particles are engineered according to the methods provided herein and/or those described in US Publication Number US20130195801, the contents of which are incorporated herein by reference in their entirety.
AAVs are well suited for use as vectors and vehicles for gene transfer to cells. AAVs provide safe, long-term expression in a cell (e.g., a nerve cell). AAV vectors have been highly successful in fulfilling all of the features desired for a delivery vehicle, such as the ability to attach to and enter the target cell, successful transfer to the nucleus, the ability to be expressed in the nucleus for a sustained period of time, and a general lack of pathogenicity and toxicity. Recombinant AAV (rAAV) is advantageous as a delivery vector, particularly for delivery to the central nervous system, as it is focally injectable; it exhibits stable expression over time; and it is both non-pathogenic and non-integrative into the genome of the cell into which it is transduced. Twelve human serotypes of AAV (AAV serotype 1 (AAV-1) to AAV-12) and more than 100 serotypes from nonhuman primates have been reported to date. (Daya, S. and Berns, K. I., 2008, Clin. Microbiol. Rev., 21(4):583-593). In addition, rAAV has been approved by the FDA for use as a vector in at least 38 protocols for several different human clinical trials. AAV's lack of pathogenicity, persistence and its many available serotypes have increased the potential of the virus as a delivery vehicle for a gene therapy application in accordance with the described compositions and methods.
AAV particles of the present disclosure may comprise or be derived from any natural or recombinant AAV serotype AAV serotypes may differ in traits such as, but not limited to, packaging, tropism, transduction, and immunogenic profiles. While not wishing to be bound by theory, the AAV capsid protein is often considered to be the driver of AAV particle tropism to a particular tissue.
In certain embodiments, an AAV particle may have a capsid protein and ITR sequences derived from the same parent serotype (e.g., AAV2 capsid and AAV2 ITRs). In another embodiment, the AAV particle may be a pseudo-typed AAV particle, wherein the capsid protein and ITR sequences are derived from different parent serotypes (e g., AAV9 capsid and AAV2 ITRs; AAV2/9).
The AAV particles of the present disclosure may comprise an AAV capsid protein with a targeting peptide inserted into the parent sequence. The parent capsid or serotype may comprise or be derived from any natural or recombinant AAV serotype. As used herein, a “parent” sequence is a nucleotide or amino acid sequence into which a targeting sequence is inserted (i.e., nucleotide insertion into nucleic acid sequence or amino acid sequence insertion into amino acid sequence).
In another embodiment, the parent AAV capsid nucleotide sequence is a K449R variant, wherein the codon encoding a lysine (e.g, AAA or AAG) at position 449 in the amino acid sequence is exchanged for one encoding an arginine (CGT, CGC, CGA, CGG, AGA, AGG). The K449R variant has the same function as wild-type AAV9.
The parent AAV serotype and associated capsid sequence may be any of those known in the art Non-limiting examples of such AAV serotypes include, AAV9, AAV9 K449R (or K449R AAV9), AAV1, AAVrh10, AAV-DJ, AAV-DJ8, AAV5, AAVPHP.B (PHP.B), AAVPHP.A (PHP.A), AAVG2B-26, AAVG2B-13, AAVTH1.1-32, AAVTH1.1-35, AAVPHP.B2 (PIP.B2), AAVPHP.B3 (PHP.B3), AAVPHP.N/PHP.B-DGT, AAVPHP.B-EST, AAVPHP B-GGT, AAVPHP B-ATP, AAVPHP B-ATT-T. AAVPHP.B-DGT-T, AAVPHP.B-GGT-T, AAVPHP.B-SGS, AAVPHP.B-AQP, AAVPHP.B-QQP, AAVPHP.B-SNP(3), AAVPHP.B-SNP, AAVPHP.B-QGT, AAVPHP.B-NQT, AAVPHP.B-EGS, AAVPHP.B-SGN, AAVPHP.B-EGT, AAVPHP.B-DST, AAVPHP.B-DST, AAVPHP.B-STP, AAVPHP.B-PQP, AAVPHP.B-SQP, AAVPHP.B-QLP, AAVPHP.B-TMP, AAVPHP.B-TTP, AAVPHP.S/G2A12, AAVG2A15/G2A3 (G2A3), AAVG2B4 (G2B4), AAVG2B5 (G2B5), PHP.S, AAV2, AAV2G9, AAV3, AAV3a, AAV3b. AAV3-3, AAV4, AAV4-4, AAV6, AAV6.1, AAV6.2, AAV6.1.2, AAV7, AAV7.2, AAV8, AAV9.11, AAV9.13, AAV9.16, AAV9.24, AAV9.45, AAV9.47, AAV9.61, AAV9.68, AAV9.84, AAV9.9, AAV10, AAV11, AAV12, AAV16.3, AAV24.1, AAV27.3, AAV42.12, AAV42-1b, AAV42-2, AAV42-3a, AAV42-3b, AAV42-4, AAV42-5a, AAV42-5b, AAV42-6b, AAV42-8, AAV42-10, AAV42-11, AAV42-12, AAV42-13, AAV42-15, AAV42-aa, AAV43-1, AAV43-12, AAV43-20, AAV43-21, AAV43-23, AAV43-25, AAV43-5, AAV44.1, AAV44.2, AAV44.5, AAV223.1, AAV223.2, AAV223.4, AAV223.5, AAV223.6, AAV223.7, AAV1-7/rh.48, AAV1-8/rh.49, AAV2-15/rh.62, AAV2-3/rh.61, AAV2-4/rh.50, AAV2-5/rh.51, AAV3.1/hu.6, AAV3.1/hu.9, AAV3-9/rh.52, AAV3-11/rh.53, AAV4-8/r11.64, AAV4-9/rh.54, AAV4-19/rh.55, AAV5-3,rh.57, AAV5-22/rh.58, AAV7.3/hu.7, AAV16.8/hu.10, AAV16.12/hu. 11, AAV29.3/bb.1, AAV29.5/bb.2, AAV106.1/hu.37, AAV114.3/hu.40, AAV127.2/hu.41, AAV127.5/hu.42, AAV128.3/hu.44, AAV130.4/hu.48, AAV145.1/hu.53, AAV145.5/hu.54, AAV145.6/hu.55, AAV161.10,hu.60, AAV161.6/hu.61, AAV33.12/hu.17, AAV33.4/hu.15, AAV33.8/hu.16, AAV52/hu.19, AAV52.1/hu.20, AAV58.2/hu.25, AAVA3.3, AAVA3.4, AAVA3.5, AAVA3.7, AAVC1, AAVC2, AAVC5, AAVF3, AAVF5, AAVH2, AAVrh.72, AAVhu.8, AAVrb.68, AAVrh.70, AAVpi.1, AAVpi.3, AAVpi.2, AAVrh.60, AAVrh.44, AAVrh.65, AAVrh.55, AAVrh.47, AAVrh.69, AAVrh.45, AAVrh.59, AAVhu.12, AAVH6, AAVH-1/hu.1, AAVH-5/hu.3, AAVLG-10/rh.40, AAVLG-4/rh.38, AAVLG-9/hu.39, AAVN721-8/rh.43, AAVCh.5, AAVCh.5R1, AAVcy.2, AAVcy.3, AAVcy.4, AAVcy.5, AAVCy.5R1, AAVCy.5R2, AAVCy.5R3, AAVCy.5R4, AAVcy.6, AAVhu.1, AAVhu.2, AAVhu 3, AAVhu 4, AAVhu 5, AAVhu 6, AAVhu 7, AAVhu 9, AAVhu.10, AAVhu 11, AAVhu 13, AAVhu.15, AAVhu.16, AAVhu.17, AAVhu.18, AAVhu.20, AAVhu.21, AAVhu.22, AAVhu.23.2, AAVhu.24, AAVhu.25, AAVhu.27, AAVhu.28, AAVhu.29, AAVhu.29R. AAVhu.31, AAVhu.32, AAVhu.34, AAVhu.35, AAVhu.37, AAVhu.39, AAVhu.40, AAVhu.41, AAVhu.42, AAVhu.43, AAVhu 44, AAVhu.44R1, AAVhu.44R2, AAVhu.44R3, AAVhu 45, AAVhu 46, AAVhu.47, AAVhu.48, AAVhu.48R1, AAVhu.48R2, AAVhu.48R3, AAVhu.49, AAVhu.51, AAVhu.52, AAVhu.54, AAVhu.55, AAVhu.56, AAVhu.57, AAVhu.58, AAVhu.60, AAVhu.61, AAVhu.63, AAVhu.64, AAVhu.66, AAVhu.67, AAVhu 14/9, AAVhu.t 19, AAVrh.2, AAVrh.2R, AAVrb.8, AAVrh.8R, AAVrh.10, AAVrh.12, AAVrh.13, AAVrh.13R, AAVrh.14, AAVrh.17, AAVrh.18, AAVrh.19, AAVrh.20, AAVrh.21, AAVrh.22, AAVrh.23, AAVrh.24, AAVrh.25, AAVrh.31, AAVrh.32, AAVrh.33, AAVrh.34, AAVrh.35, AAVrb.36, AAVrb.37, AAVrh.37R2, AAVrh.38, AAVrh.39, AAVrh.40, AAVrh.46, AAVrb.48, AAVrh.48.1, AAVrh.48.1.2, AAVrh.48.2, AAVrh.49, AAVrh.51, AAVrh.52, AAVrh.53, AAVrh.54, AAVrh.56, AAVrh.57, AAVrb.58, AAVrb.61, AAVrh.64, AAVrh.64R1, AAVrh.64R2, AAVrb.67, AAVrh.73, AAVrb.74, AAVrh8R, AAVrh8R A586R mutant, AAVrh8R R533A mutant, AAAV.BAAV, caprine AAV, bovine AAV, AAVhE1.1, AAVhEr1.5, AAVhER1.14, AAVhEr1.8, AAVhEr1.16, AAVhEr1.18, AAVhEr1.35, AAVhEr1.7, AAVhEr1.36, AAVhEr2.29, AAVhEr2.4, AAVhEr2.16, AAVhEr2.30, AAVbEr2.31, AAVhEr2.36, AAVhER1.23, AAVhEr3.1, AAV2.5T, AAV-PAEC, AAV-LK01, AAV-LK02, AAV-LK03, AAV-LK04, AAV-LK05, AAV-LK06, AAV-LK07, AAV-LK08, AAV-LK09, AAV-LK10, AAV-LK11, AAV-LK12, AAV-LK13, AAV-LK14, AAV-LK15, AAV-LK16, AAV-LK17, AAV-LK18, AAV-LK19, AAV-PAEC2, AAV-PAEC4, AAV-PAEC6, AAV-PAEC7, AAV-PAEC8, AAV-PAEC11, AAV-PAEC12, AAV-2-pre-miRNA-101, AAV-8h, AAV-8b, AAV-h, AAV-b, AAV SM 10-2, AAV Shuffle 100-1, AAV Shuffle 100-3, AAV Shuffle 100-7, AAV Shuffle 10-2, AAV Shuffle 10-6, AAV Shuffle 10-8, AAV Shuffle 100-2, AAV SM 10-1, AAV SM 10-8, AAV SM 100-3, AAV SM 100-10, BNP61 AAV, BNP62 AAV, BNP63 AAV. AAVrh.50, AAVrh.43, AAVrh.62, AAVrh.48, AAVhu 19, AAVhu.11, AAVhu.53, AAV4-8/rh.4, AAVLG-9/hu.39, AAV54.5/hu.23, AAV54.2,hu.22, AAV54.7/hu.24, AAV54.1/hu.21, AAV54.4R/hu.27, AAV46.2/hu.28, AAV46.6/hu.29, AAV128.1/hu.43, true type AAV (ttAAV), UPENN AAV10, Japanese AAV10 serotypes, AAV CBr-7.1, AAV CBr-7.10, AAV CBr-7.2, AAV CBr-7.3, AAV CBr-7.4, AAV CBr-7.5, AAV CBr-7.7, AAV CBr-7.8, AAV CBr-B7.3, AAV CBr-B7.4, AAV CBr-E1, AAV CBr-E2, AAV CBr-E3, AAV CBr-E4, AAV CBr-E5, AAV CBr-e5, AAV CBr-E6, AAV CBr-E7, AAV CBr-E8, AAV CHt-1, AAV CHt-2, AAV CHt-3, AAV CHt-6.1, AAV CHt-6.10, AAV CHt-6.5, AAV CHt-6.6, AAV CHt-6.7, AAV CHt-6.8, AAV CHt-P1, AAV CHt-P2, AAV CHt-P5, AAV CHt-P6, AAV CHt-P8, AAV CHt-P9, AAV CKd-1, AAV CKd-10, AAV CKd-2, AAV CKd-3, AAV CKd-4, AAV CKd-6, AAV CKd-7, AAV CKd-8, AAV CKd-B1, AAV CKd-B2, AAV CKd-B3, AAV CKd-B4, AAV CKd-B5, AAV CKd-B6, AAV CKd-B7, AAV CKd-B8, AAV CKd-H1, AAV CKd-H2, AAV CKd-H3, AAV CKd-H4, AAV CKd-H5, AAV CKd-H6, AAV CKd-N3, AAV CKd-N4, AAV CKd-N9, AAV CLg-F1, AAV CLg-F2, AAV CLg-F3, AAV CLg-F4, AAV CLg-F5, AAV CLg-F6, AAV CLg-F7, AAV CLg-F8, AAV CLv-1, AAV CLv1-1, AAV CLv1-10, AAV CLv1-2, AAV CLv-12, AAV CLv1-3, AAV CLv-13, AAV CLv1-4, AAV C1-7, AAV CLv1-8, AAV C1v-9, AAV CLv-2, AAV CLv-3, AAV CLv-4, AAV CLv-6, AAV CLv-8, AAV CLv-D1, AAV CLv-D2, AAV CLv-D3, AAV CLv-D4, AAV CLv-D5, AAV CLv-D6, AAV CLv-D7, AAV CLv-D8, AAV CLv-E1, AAV CLv-KL, AAV CLv-K3, AAV CLv-K6, AAV CLv-L4, AAV CLv-L5, AAV CLv-L6, AAV CLv-M1, AAV CLv-M11, AAV CLv-M2, AAV CLv-M5, AAV CLv-M6, AAV CLv-M7, AAV CLv-M8, AAV CLv-M9, AAV CLv-R1, AAV CLv-R2, AAV CLv-R3, AAV CLv-R4, AAV CLv-R5, AAV CLv-R6, AAV CLv-R7, AAV CLv-R8, AAV CLv-R9, AAV CSp-1, AAV CSp-10, AAV CSp-11, AAV CSp-2, AAV CSp-3, AAV CSp-4, AAV CSp-6, AAV CSp-7, AAV CSp-8, AAV CSp-8.10, AAV CSp-8.2, AAV CSp-8.4, AAV CSp-8.5, AAV CSp-8.6, AAV CSp-8.7, AAV CSp-8.8, AAV CSp-8.9, AAV CSp-9, AAV.hu.48R3, AAV.VR-355, AAV3B, AAV4, AAV5, AAVF1/HSC1, AAVF11/HSC11, AAVF12/HSC12, AAVF13/HSC13, AAVF14/HSC14, AAVF15/HSC15, AAVF16/HSC16, AAVF17/HSC17, AAVF2/HSC2, AAVF3/HSC3, AAVF4/HSC4, AAVF5/HSC5, AAVF6/HSC6, AAVF7/HSC7, AAVF8/HSC8, and/or AAVF9/HSC9 and variants thereof.
In embodiments, a capsid or capsid library of the present disclosure is derived from AAV-PHP.B (see, e.g., Deverman, et al. “Cre-dependent selection yields AAV variants for widespread gene transfer to the adult brain,” Nat Biotechnol. 2016 February; 34(2):204-209. PMCID: PMC5088052, the disclosure of which is incorporated herein by reference in its entirety for all purposes). AAV-PHP.eB (described in Deverman B E, Pravdo P L, Simpson B P, Kumar S R, Chan K Y, Banerjee A, Wu W-L, Yang B, Huber N, Pasca S P, Gradinaru V. Cre-dependent selection yields AAV variants for widespread gene transfer to the adult brain. Nat Biotechnol 2016 February; 34(2):204-209. PMCID: PMC5088052; and Chan K Y, Jang M J, Yoo B B, Greenbaum A, Ravi N. Wu W-L, Sánchez-Guardado L, Lois C, Mazmanian S K. Deverman B E, Gradinaru V. Engineered AAVs for efficient noninvasive gene delivery to the central and peripheral nervous systems. Nat Neurosci 2017 August; 20(8):1172-1179. PMCID: PMC5529245), AAVF (described in Hanlon K S, Meltzer J C, Buzhdygan T, Cheng M J, Sena-Esteves M, Bennett R E, Sullivan T P, Razmpour R, Gong Y, Ng C, Nammour J, Maiz D, Dujardin S, Ramirez S H, Hudry E, Maguire C A. Selection of an Efficient AAV Vector for Robust CNS Transgene Expression. Mol Ther Methods Clin Dev. 2019 Dec. 13; 15:320-332. PMCID: PMC6881693, the disclosure of which is incorporated herein by reference in its entirety for all purposes), AAV-PHP.B4-B8, AAV-PHP.C1-C3 (Kumar, S. R. et al. Multiplexed Cre-dependent selection yields systemic AAVs for targeting distinct brain cell types Nat Methods 17, 541-550 (2020). 9P31) or other capsids with similar properties (Nonnenmacher, M. et al. Rapid Evolution of Blood-Brain Barrier-Penetrating AAV Capsids by RNA-Driven Biopanning. Mol Ther—Methods Clin Dev (2020) doi:10.1016/j.omtm.2020.12.006), or CAP-BI0 or CAP-B22 (Goertsen, D. et al. AAV capsid variants with brain-wide transgene expression and decreased liver targeting after intravenous delivery in mouse and marmoset. Nat Neurosci 1-10 (2021) doi.10.1038/s41593-021-00969-4). Further non-limiting examples of AAV capsids suitable for encapsidation of polynucleotides include those described in PCT/US2019/044796, PCT/US2020/027708, PCT/US2020/044487, or PCT/US2020/015972, the disclosures of each of which are incorporated herein by reference in their entireties for all purposes.
In some embodiments, the serotype may be AAVDJ or a variant thereof, such as AAVDJ8 (or AAV-DJ8), as described by Grimm et al. (Journal of Virology 82(12): 5887-5911 (2008), US Publication US20140359799 and U.S. Pat. No. 7,588,772, each of which is herein incorporated by reference in its entirety) The amino acid sequence of AAVDJ8 may comprise two or more mutations in order to remove the heparin binding domain (HBD). As a non-limiting example, the AAV-DJ sequence is as described by SEQ ID NO: 1 in U.S. Pat. No. 7,588,772, the contents of which are herein incorporated by reference in their entirety, and the AAVDJ8 sequence may comprise two mutations: (1) R587Q where arginine (R; Arg) at amino acid 587 is changed to glutamine (Q; Gln) and (2) R590T where arginine (R; Arg) at amino acid 590 is changed to threonine (T; Thr) As another non-limiting example, the AAVDJ8 sequence may comprise three mutations: (1) K406R where lysine (K; Lys) at amino acid 406 is changed to arginine (R; Arg), (2) R587Q where arginine (R; Arg) at amino acid 587 is changed to glutamine (Q; Gln) and (3) R590T where arginine (R; Arg) at amino acid 590 is changed to threonine (T; Thr).
While not wishing to be bound by theory, it is understood that a parent AAV capsid sequence comprises a VP1 region. In certain embodiments, a parent AAV capsid sequence comprises a VP1, VP2 and/or VP3 region, or any combination thereof. A parent VP1 sequence may be considered synonymous with a parent AAV capsid sequence.
In certain embodiments, the initiation codon for translation of the AAV VP1 capsid protein may be CTG, TTG, or GTG as described in U.S. Pat. No. 8,163,543, the contents of which are herein incorporated by reference in their entirety.
The present disclosure refers to structural capsid proteins (including VP1, VP2 and VP3) which are encoded by capsid (Cap) genes. These capsid proteins form an outer protein structural shell (i.e. capsid) of a viral vector such as AAV. VP capsid proteins synthesized from Cap polynucleotides generally include a methionine as the first amino acid in the peptide sequence (Met1), which is associated with the start codon (AUG or ATG) in the corresponding Cap nucleotide sequence. However, it is common for a first-methionine (Met1) residue or generally any first amino acid (AA1) to be cleaved off after or during polypeptide synthesis by protein processing enzymes such as Met-aminopeptidases. This “Met/AA-clipping” process often correlates with a corresponding acetylation of the second amino acid in the polypeptide sequence (e.g., alanine, valine, serine, threonine, etc.). Met-clipping commonly occurs with VP1 and VP3 capsid proteins but can also occur with VP2 capsid proteins.
Where the Met/AA-clipping is incomplete, a mixture of one or more (one, two or three) VP capsid proteins comprising the viral capsid may be produced, some of which may include a Met1/AA1 amino acid (Met+/AA+) and some of which may lack a Met1/AA1 amino acid as a result of Met/AA-clipping (Met−/AA−). For further discussion regarding Met/AA-clipping in capsid proteins, see Jin, et al. Direct Liquid Chromatography/Mass Spectrometry Analysis for Complete Characterization of Recombinant Adeno-Associated Virus Capsid Proteins. Hum Gene Ther Methods. 2017 Oct. 28(5):255-267; Hwang, et al N-Terminal Acetylation of Cellular Proteins Creates Specific Degradation Signals. Science. 2010 Feb. 19, 327(5968): 973-977; the contents of which are each incorporated herein by reference in its entirety.
According to the present disclosure, references to capsid proteins is not limited to either clipped (Met−/AA−) or unclipped (Met+/AA+) and may, in context, refer to independent capsid proteins, viral capsids comprised of a mixture of capsid proteins, and/or polynucleotide sequences (or fragments thereof) which encode, describe, produce or result in capsid proteins of the present disclosure. A direct reference to a “capsid protein” or “capsid polypeptide” (such as VP1, VP2 or VP2) may also comprise VP capsid proteins which include a Met1/AA1 amino acid (Met+/AA+) as well as corresponding VP capsid proteins which lack the Met1/AA1 amino acid as a result of Met/AA-clipping (Met−/AA−).
Further according to the present disclosure, a reference to a specific SEQ ID NO: (whether a protein or nucleic acid) which comprises or encodes, respectively, one or more capsid proteins which include a Met1/AA1 amino acid (Met+/AA+) should be understood to teach the VP capsid proteins which lack the Met1/AA1 amino acid as upon review of the sequence, it is readily apparent any sequence which merely lacks the first listed amino acid (whether or not Met1/AA1).
As anon-limiting example, reference to a VP1 polypeptide sequence which is 736 amino acids in length and which includes a “Met1” amino acid (Met+) encoded by the AUG/ATG start codon may also be understood to teach a VP1 polypeptide sequence which is 735 amino acids in length and which does not include the “Met1” amino acid (Met−) of the 736 amino acid Met+ sequence.
As a second non-limiting example, reference to a VP1 polypeptide sequence which is 736 amino acids in length and which includes an “AA1” amino acid (AA1+) encoded by any NNN initiator codon may also be understood to teach a VP1 polypeptide sequence which is 735 amino acids in length and which does not include the “AA1” amino acid (AA1−) of the 736 amino acid AA1+ sequence.
References to viral capsids formed from VP capsid proteins (such as reference to specific AAV capsid serotypes), can incorporate VP capsid proteins which include a Met1/AA1 amino acid (Met+/AA1+), corresponding VP capsid proteins which lack the Met1/AA1 amino acid as a result of Met/AA1-clipping (Met−/AA1−), and combinations thereof (Met+/AA1+ and Met−/AA1−).
As a non-limiting example, an AAV capsid serotype can include VP1 (Met+/AA1+), VP1 (Met−/AA1−), or a combination of VP1 (Met+/AA1+) and VP1 (Met−/AA1−). An AAV capsid serotype can also include VP3 (Met+/AA1+), VP3 (Met−/AA1−), or a combination of VP3 (Met+/AA1+) and VP3 (Met−/AA1−); and can also include similar optional combinations of VP2 (Met+/AA1) and VP2 (Met−/AA1−).
In certain embodiments, the parent AAV capsid sequence may comprise an amino acid sequence with 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to any of the those amino acid sequences (e.g, 7-mer peptide sequences) provided in the Sequence Listing.
In certain embodiments, the parent AAV capsid sequence may be encoded by a nucleotide sequence with 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to any of the those nucleotide sequences provided in the Sequence Listing.
Recombinant or engineered AAV vectors have shown promise for use in therapy for the treatment of human disease. However, a need still exists for AAV particles with more specific and/or enhanced tropism for target tissues. Capsid engineering methods, including those provided herein, have been used to try to identify capsids with enhanced transduction of target tissues (e.g., brain, spinal cord, DRG). A variety of methods have been used, including mutational methods, DNA barcoding, directed evolution, random peptide insertions, and capsid shuffling and/or chimeras.
One method used to generate AAV particles with desirable traits is through the use of insertion of peptides, such as those provided herein, into a parent AAV capsid sequence according to the methods provided herein.
Rational engineering and mutational methods have been used to direct AAV to a target tissue. In rational design, structure-function relationships are used to determine regions in which changes to the capsid sequence may be made. As non-limiting examples, surface loop structures, receptor binding sites, and/or heparin binding sites may be mutated, or otherwise altered, for rational design of recombinant AAV capsids for enhanced targeting to a target tissue. In one example of rational design, AAV capsids were modified by mutation of surface exposed tyrosines to phenylalanine, in order to evade ubiquitination, reduce proteasomal degradation and allow for increased AAV particle and viral genome expression (Lochrie M A, et al, J Virol. 2006 January; 80(2):821-34: Santiago-Ortiz J L and Schaffer D V. J Control Release, 2016 Oct. 28, 240:287-301, the contents of each of which are incorporated by reference in their entirety). Rational design also encompasses the addition of targeting peptides to a parent AAV capsid sequence, wherein the targeting peptide may have an affinity for a receptor of interest within a target tissue.
In certain embodiments, rational engineering and/or mutational methods are used to identify AAV capsids and/or targeting peptides having enhanced transduction of a target tissue (e.g., CNS or PNS).
Capsid shuffling, and/or chimeras describe a method in which fragments of at least two parent AAV capsids are combined to generate a new recombinant capsid protein, the number of parent AAV capsids used may be 2-20, or more than 20.
In certain embodiments, capsid shuffling is used to identify AAV capsids and/or targeting peptides having enhanced transduction of a target tissue (e.g., CNS or PNS).
Directed evolution involves the generation of AAV capsid libraries (˜104-108) by any of a variety of mutagenesis techniques and selection of lead candidates based on response to selective pressure by properties of interest (e.g, tropism). Directed evolution of AAV capsids allows for positive selection from a pool of diverse mutants without necessitating extensive prior characterization of the mutant library. Directed evolution libraries may be generated by any molecular biology technique known in the art, and may include. DNA shuffling, random point mutagenesis, insertional mutagenesis (e.g., targeting peptides), random peptide insertions, or ancestral reconstructions. AAV capsid libraries may be subjected to more than one round of selection using directed evolution for further optimization. Directed evolution methods are most commonly used to identify AAV capsid proteins with enhanced transduction of a target tissue. Capsids with enhanced transduction of a target tissue have been identified for the targeting human airway epithelium, neural stem cells, human pluripotent stem cells, retinal cells, and other in vivo and in vivo cells.
In certain embodiments directed evolution methods are used to identify AAV capsids and/or targeting peptides having enhanced transduction of a target tissue (e.g., CNS or PNS).
One method described for high-throughput characterization of the phenotypes of a large number of AAV serotypes is known as AAV Barcode-Seq (Adachi K et al, Nature Communications 5:3075 (2014), the contents of which are herein incorporated by reference in their entirety). In this next-generation sequence (NGS) based method, AAV libraries are created comprising DNA barcode tags, which can be assessed by multi-plexed Illumina barcode sequencing. This method can be used to identify AAV variants with altered receptor binding, tropism, neutralization and or blood clearance as compared to wild-type or non-variant sequences. Amino acids of the AAV capsid that are important to these functions can also be identified in this manner.
As described in Adachi et al 2014. AAV capsid libraries were generated, wherein each mutant carried a wild-type AAV2 rep gene and an AAV cap gene derived from a series of variants or mutants, and a pair of left and right 12-nucleotide long DNA bar-codes downstream of an AAV2 polyadenylation signal (pA). In this manner, 7 different DNA barcode AAV capsid libraries were generated Capsid libraries were then provided to mice. At a pre-set timepoint, samples were collected. DNA extracted and PCR-amplified using AAV-clone specific virus bar codes and sample-specific bar code attached PCR primers. All the virus barcode PCR amplicons were Illumina sequenced and converted to raw sequence read number data by a computational algorithm. The core of the Barcode-Seq approach is a 96-nucleotide cassette comprising the DNA bar-codes (left and right) described above, three PCR primer binding sites and two restriction enzyme sites. As an exemplar, an AAV rep-cap genome was used, but the system can be applied to any AAV viral genome, including one devoid of rep and cap genes. The advantage of the Barcode Seq method is the collection of a large data set and correlation to desirable phenotype with few replicates and in a short period of time.
The DNA Barcode Seq method can be similarly applied to RNA.
In certain embodiments, the Barcode Seq method is used to identify AAV capsids and/or targeting peptides having enhanced transduction of a target tissue (e.g., CNS or PNS).
The rational design of AAV vectors that display selective tissue/organ targeting has broadened the applications of AAV as vector/vehicle for polynucleotide delivery to cells. Both direct and indirect targeting approaches have been used to enhance AAV vector cell targeting specificity and retargeting. By way of example, in direct targeting, AAV vector targeting to certain cell types is mediated by small peptides or ligands that have been directly inserted into the viral capsid sequence. This approach has been successfully employed to target endothelial cells. Direct targeting requires detailed knowledge of the capsid structure such that peptides or ligands are positioned at sites that are exposed to the capsid surface; the insertion does not significantly affect capsid structure and assembly; and the native tropism is ablated to maximize targeting to a specific cell type. In indirect targeting, AAV vector targeting is mediated by an associating molecule that interacts with both the viral surface and the specific cell surface receptor. Such associating molecules for AAV vectors may include bispecific antibodies and biotin. The advantages of indirect targeting are that different adaptors can be coupled to the capsid without resulting in significant changes in the capsid structure, and the native tropism can be easily ablated. A disadvantage of using adaptors for targeting involves a potential for decreased stability of the capsid-adaptor complex in vivo.
In addition, AAV vectors may be produced that comprise capsids that allow for the increased transduction of cells and gene transfer to the central nervous system and the brain via the vasculature (Chan, K. Y. et al., 2017, Nat. Neurosci., 20(8):1172-1179). Such vectors facilitate robust transduction of neuronal cells, including interneurons. In embodiments, AAV vectors contain an AAVF, AAV-PHP.B4, AAV-PHP.B5, AAV-PHP.C1, 9P31, or an AAV-PHP.eB capsid.
In embodiments, AAV vectors comprise or consist of an AAV-BI151, AAV-BI152, AAV-BI153, AAV-BI154, AAV-BI155, AAV-BI156 or AAV-BI157 capsid polypeptide (e.g., a polypeptide comprising or consisting of an amino acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, or 13, or functional fragments thereof, or comprising or consisting of an amino acid sequence having 85%, 90% or 95% sequence identity thereto). In embodiments, the capsid polypeptide is a VP1 polypeptide.
AAV particles of the disclosure, comprising targeting peptides, may be used for the delivery of any viral genome to a target tissue. The viral genome may encode any payload, such as but not limited to a polypeptide, an antibody, an enzyme, an RNAi agent and/or components of a gene editing system. In certain embodiments, the AAV particles of the disclosure are used to deliver a payload to cells of the CNS, after intravenous delivery. In some embodiments, the AAV particles of the disclosure are used to deliver a payload to cells of the liver, kidney, spleen, brain, spinal cord, serum, heart, or lungs. In some cases, the AAV particles of the disclosure are used to deliver a payload to a cell (e.g., HEK293, primary mouse brain microvascular endothelial cell, primary human BMVEC, and human brain endothelial cell line hCMEC/D3, human liver epithelial cells, hepatocytes, or human hepatocellular carcinoma cells (HepG2)) In embodiments, a viral particle comprising a capsid of the present disclosure has one or more traits selected from the following: 1) high binding affinity to HepG2 cells, 2) high binding affinity to THLE cells. 3) high transduction of HepG2 cells, 4) high transduction of THLE cells, 5) high biodistribution to C57 mice liver, and 6) high production fitness.
A viral genome of an AAV particle of the disclosure, comprises a nucleic acid sequence with at least one payload region encoding a payload, and at least one ITR. A viral genome typically comprises two ITR sequences, one at each of the 5′ and 3′ ends. Further, a viral genome of the AAV particles of the disclosure may comprise nucleic acid sequences for additional components, such as, but not limited to, a regulatory element (e.g., promoter), untranslated regions (UTR), a polyadenylation sequence (polyA), a filler or stuffer sequence, an intron, and/or a linker sequence for enhanced expression.
These viral genome components can be selected and/or engineered to further tailor the specificity and efficiency of expression of a given payload in a target tissue (e.g., CNS or DRG).
The AAV particles of the present disclosure comprise a viral genome with at least one ITR and a payload region. In certain embodiments, the viral genome has two ITRs. These two ITRs flank the payload region at the 5′ and 3′ ends. The ITRs function as origins of replication comprising recognition sites for replication. ITRs comprise sequence regions which can be complementary and symmetrically arranged. ITRs incorporated into viral genomes of the disclosure may be comprised of naturally occurring polynucleotide sequences or recombinantly derived polynucleotide sequences.
The ITRs may be derived from the same serotype as the capsid, selected from any of the known serotypes, or a derivative thereof. The ITR may be of a different serotype than the capsid. In certain embodiments, the AAV particle has more than one ITR In a non-limiting example, the AAV particle has a viral genome comprising two ITRs. In certain embodiments, the ITRs are of the same serotype as one another. In another embodiment, the ITRs are of different serotypes. Non-limiting examples include zero, one or both of the ITRs having the same serotype as the capsid. In certain embodiments both ITRs of the viral genome of the AAV particle are AAV2 ITRs.
Independently, each ITR may be about 100 to about 150 nucleotides in length. An ITR may be about 100-105 nucleotides in length, 106-110 nucleotides in length, 111-115 nucleotides in length, 116-120 nucleotides in length, 121-125 nucleotides in length, 126-130 nucleotides in length, 131-135 nucleotides in length. 136-140 nucleotides in length. 141-145 nucleotides in length or 146-150 nucleotides in length. In certain embodiments, the ITRs are 140-142 nucleotides in length. Non-limiting examples of ITR length are 102, 105, 130, 140, 141, 142, 145 nucleotides in length. ITRs encompassed by the present disclosure include those with at least 90% identity, at least 95% identity, at least 98% identity, or at least 99% identity to a known AAV serotype ITR sequence.
In certain embodiments, the payload region of the viral genome comprises at least one element to enhance the payload target specificity and expression (See e.g., Powell et al. Viral Expression Cassette Elements to Enhance Transgene Target Specificity and Expression in Gene Therapy. 2015 the contents of which are herein incorporated by reference in their entirety). Non-limiting examples of elements to enhance payload target specificity and expression include promoters, endogenous miRNAs, post-transcriptional regulatory elements (PREs), polyadenylation (PolyA) signal sequences and upstream enhancers (USEs), CMV enhancers and introns.
A person skilled in the art may recognize that expression of a payload in a target cell may require a specific promoter, including but not limited to, a promoter that is species specific, inducible, tissue-specific, or cell cycle-specific (Parr et al., Nat. Med 3:1145-9 (1997) the contents of which are herein incorporated by reference in their entirety).
In certain embodiments, the promoter is deemed to be efficient when it drives expression of the payload encoded by the viral genome of the AAV particle.
In certain embodiments, the promoter is a promoter deemed to be efficient when it drives expression in a cell being targeted.
In certain embodiments, the promoter is a promoter having a tropism for a cell being targeted.
In certain embodiments, the promoter drives expression of the payload for a period of time in targeted tissues, Expression driven by a promoter may be for a period of 1 hour, 2, hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours, 8 hours, 9 hours, 10 hours, 11 hours, 12 hours, 13 hours, 14 hours, 15 hours, 16 hours, 17 hours, 18 hours, 19 hours, 20 hours, 21 hours, 22 hours, 23 hours, 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week, 8 days, 9 days, 10 days, 11 days, 12 days, 13 days, 2 weeks, 15 days, 16 days, 17 days, 18 days, 19 days, 20 days, 3 weeks, 22 days, 23 days, 24 days, 25 days, 26 days, 27 days, 28 days, 29 days, 30 days, 31 days, 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 1 year, 13 months, 14 months, 15 months, 16 months, 17 months, 18 months, 19 months, 20 months, 21 months, 22 months, 23 months, 2 years, 3 years, 4 years, 5 years, 6 years, 7 years, 8 years, 9 years, 10 years or more than 10 years, Expression may be for 1-5 hours, 1-12 hours, 1-2 days, 1-5 days, 1-2 weeks, 1-3 weeks, 1-4 weeks, 1-2 months, 1-4 months, 1-6 months, 2-6 months, 3-6 months, 3-9 months, 4-8 months, 6-12 months, 1-2 years, 1-5 years, 2-5 years, 3-6 years, 3-8 years, 4-8 years, or 5-10 years. As a non-limiting example, the promoter is a selected for sustained expression of a payload in tissues and/or cells of the central or peripheral nervous system.
Promoters may be naturally occurring or non-naturally occurring Non-limiting examples of promoters include those derived from viruses, plants, mammals, or humans. In some embodiments, the promoters may be those derived from human cells or systems. In some embodiments, the promoter may be truncated or mutated.
Promoters which drive or promote expression in most tissues include, but are not limited to, the human elongation factor 1α-subunit (EF1α) promoter, the cytomegalovirus (CMV) immediate-early enhancer and/or promoter, the chicken β-actin (CBA) promoter and its derivative CAG, β glucuronidase (GUSB) promoter, or ubiquitin C (UBC) promoter Tissue-specific promoters can be used to restrict expression to certain cell types such as, but not limited to, cells of the central or peripheral nervous systems, targeted regions within (e.g., frontal cortex), and/or sub-sets of cells therein (e.g, excitatory neurons) As non-limiting examples, cell-type specific promoters may be used to restrict expression of a payload to excitatory neurons (e.g., glutamatergic), inhibitory neurons (e.g., GABA-ergic), neurons of the sympathetic or parasympathetic nervous system, sensory neurons, neurons of the dorsal root ganglia, motor neurons, or supportive cells of the nervous systems such as microglia, astrocytes, oligodendrocytes, and/or Schwann cells.
Cell-type specific promoters also exist for other tissues of the body, with non-limiting examples including, liver promoters (e.g., hAAT, TBG), skeletal muscle specific promoters (e.g, desmin, MCK, C512). B cell promoters, monocyte promoters, leukocyte promoters, macrophage promoters, pancreatic acinar cell promoters, endothelial cell promoters, lung tissue promoters, and/or cardiac or cardiovascular promoters (e.g., αMHC, cTnT, and CMV-MLC2k).
Non-limiting examples of tissue-specific promoters for targeting payload expression to central nervous system tissues and cells include synapsin (Syn), glutamate vesicular transporter (VGLUT), vesicular GABA transporter (VGAT), parvalbumin (PV), sodium channel Nav 1.8, tyrosine hydroxylase (TH), choline acetyltransferase (ChaT), methyl-CpG binding protein 2 (MeCP2), Ca2+/calmodulin-dependent protein kinase II (CaMKII), metabotropic glutamate receptor 2 (mGluR2), neurofilament light (NFL) or heavy (NFH), neuron-specific enolase (NSE), p-globin minigene np2, preproenkephalin (PPE), enkephalin (Enk) and excitatory amino acid transporter 2 (EAAT2) promoters. Non-limiting examples of tissue-specific expression elements for astrocytes include glial fibrillary acidic protein (GFAP) and EAAT2 promoters. A non-limiting example of a tissue-specific expression element for oligodendrocytes includes the myelin basic protein (MBP) promoter.
In certain embodiments, the promoter may be less than 1 kb. The promoter may have a length of 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800 or more than 800 nucleotides. The promoter may have a length between 200-300, 200-400, 200-500, 200-600, 200-700, 200-800, 300-400, 300-500, 300-600, 300-700, 300-800, 400-500, 400-600, 400-700, 400-80, 500-600, 500-700, 500-800, 600-700, 600-800 or 700-800 nucleotides.
In certain embodiments, the promoter may be a combination of two or more components of the same or different starting or parental promoters such as, but not limited to, CMV and CBA, Each component may have a length of 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800 or more than 800 nucleotides. Each component may have a length between 200-300, 200-400, 200-500, 200-600, 200-700, 200-800, 300-400, 300-500, 300-600, 300-700, 300-800, 400-500, 400-600, 400-700, 400-800, 500-600, 500-700, 500-800, 600-700, 600-800 or 700-800 nucleotides. In certain embodiments, the promoter is a combination of a 382 nucleotide CMV-enhancer sequence and a 260 nucleotide CBA-promoter sequence.
In certain embodiments, the viral genome comprises a ubiquitous promoter. Non-limiting examples of ubiquitous promoters include CMV. CBA (including derivatives CAG, CBh, etc.), EF-1α, PGK, UBC, GUSB (hGBp), and UCOE (promoter of HNRPA2B1-CBX3).
Yu et al. (Molecular Pam 2011, 7:63, the contents of which are herein incorporated by reference in their entirety) evaluated the expression of eGFP under the CAG. EF1α, PGK and UBC promoters in rat DRG cells and primary DRG cells using lentiviral vectors and found that UBC showed weaker expression than the other 3 promoters and only 10-12% glial expression was seen for all promoters. Soderblom et al. (E. Neuro 2015, 2(2). ENEURO.0001-15, the contents of which are herein incorporated by reference in their entirety) evaluated the expression of eGFP in AAV8 with CMV and UBC promoters and AAV2 with the CMV promoter after injection in the motor cortex. Intranasal administration of a plasmid containing a UBC or EF1α promoter showed a sustained airway expression greater than the expression with the CMV promoter (See e.g., Gill et al., Gene Therapy 2001. Vol. 8, 1539-1546: the contents of which are herein incorporated by reference in their entirety). Husain et al. (Gene Therapy 2009, 16(7): 927-932: the contents of which are herein incorporated by reference in their entirety) evaluated an HOH construct with a hGUSB promoter, a HSV-ILAT promoter and an NSE promoter and found that the HpH construct showed weaker expression than NSE in mouse brain. Passini and Wolfe (J Virol. 2001, 12382-12392, the contents of which are herein incorporated by reference in their entirety) evaluated the long-term effects of the HpH vector following an intraventricular injection in neonatal mice and found that there was sustained expression for at least 1 year. Low expression in all brain regions was found by Xu et al. (Gene Therapy 2001, 8, 1323-1332: the contents of which are herein incorporated by reference in their entirety) when NFL and NFH promoters were used as compared to the CMV-lacZ. CMV-luc, EF, GFAP, hENK, nAChR, PPE, PPE+wpre, NSE (0.3 kb), NSE (1.8 kb) and NSE (1.8 kb+wpre). Xu et al. found that the promoter activity in descending order was NSE (1.8 kb), EF. NSE (0.3 kb), GFAP, CMV, hENK, PPE, NFL and NFH. NFL is a 650-nucleotide promoter and NFH is a 920-nucleotide promoter which are both absent in the liver but NFH is abundant in the sensory proprioceptive neurons, brain, and spinal cord and NFH is present in the heart. SCN8A (Nav 1.6) is a 470 nucleotide promoter which expresses throughout the DRG, spinal cord and brain with particularly high expression seen in the hippocampal neurons and cerebellar Purkinje cells, cortex, thalamus and hypothalamus (See e.g., Drews et al. Identification of evolutionary conserved, functional noncoding elements in the promoter region of the sodium channel gene SCN8A. Mamm Genome (2007) 18:723-731; and Raymond et al. Expression of Alternatively Spliced Sodium Channel α-subunit genes. Journal of Biological Chemistry (2004) 279(44) 46234-46241, the contents of each of which are herein incorporated by reference in their entireties).
Any of the promoters taught by the aforementioned Yu, Soderblom. Gill, Husain. Passini, Xu, Drews or Raymond may be used herein.
In certain embodiments, the promoter is not cell specific.
In certain embodiments, the promoter is a RNA pol III promoter. As a non-limiting example, the RNA pol III promoter is U6. As a non-limiting example, the RNA pol III promoter is H1.
In certain embodiments, the viral genome comprises an enhancer element.
In certain embodiments, the viral genome comprises an engineered promoter.
In another embodiment, the viral genome comprises a promoter from a naturally expressed protein.
By definition, wild type untranslated regions (UTRs) of a gene are transcribed but not translated. Generally, the 5′ UTR starts at the transcription start site and ends at the start codon and the 3′ UTR starts immediately following the stop codon and continues until the termination signal for transcription.
Features typically found in abundantly expressed genes of specific target organs (e.g., CNS tissue or DRG) may be engineered into UTRs to enhance stability and protein production. As a non-limiting example, a 5′ UTR from mRNA normally expressed in the brain (e.g., huntingtin) may be used in the viral genomes of the AAV particles of the disclosure to enhance expression in neuronal cells or other cells of the central nervous system.
While not wishing to be bound by theory, wild-type 5′ untranslated regions (UTRs) include features which play roles in translation initiation. Kozak sequences, which are commonly known to be involved in the process by which the ribosome initiates translation of many genes, are usually included in 5′ UTRs. Kozak sequences have the consensus CCRCCAUGG, where R is a purine (adenine or guanine) three bases upstream of the start codon (ATG), which is followed by another ‘G’.
In certain embodiments, the 5′UTR in the viral genome includes a Kozak sequence.
In certain embodiments, the 5′UTR in the viral genome does not include a Kozak sequence.
While not wishing to be bound by theory, wild-type 3′ UTRs are known to have stretches of Adenosines and Uridines embedded therein. These AU rich signatures are particularly prevalent in genes with high rates of turnover. Based on their sequence features and functional properties, the AU rich elements (AREs) can be separated into three classes (Chen et al. 1995, the contents of which are herein incorporated by reference in its entirety): Class I AREs, such as, but not limited to, c-Myc and MyoD, contain several dispersed copies of an AUUUA motif within U-rich regions. Class II AREs, such as, but not limited to, GM-CSF and TNF-α, possess two or more overlapping UUAUUUA(U/A)(U/A) nonamers. Class III ARES, such as, but not limited to, c-Jun and Myogenin, are less well defined. These U rich regions do not contain an AUUUA motif Most proteins binding to the AREs are known to destabilize the messenger, whereas members of the ELAV family, most notably HuR, have been documented to increase the stability of mRNA. HuR binds to AREs of all the three classes. Engineering the HuR specific binding sites into the 3′ UTR of nucleic acid molecules will lead to HuR binding and thus, stabilization of the message in vivo.
Introduction, removal, or modification of 3′ UTR AU rich elements (AREs) can be used to modulate the stability of a polynucleotide. When engineering specific polynucleotides, e.g., payload regions of viral genomes, one or more copies of an ARE can be introduced to make polynucleotides less stable and thereby curtail translation and decrease production of the resultant protein. Likewise. AREs can be identified and removed or mutated to increase the intracellular stability and thus increase translation and production of the resultant protein.
In certain embodiments, the 3′ UTR of the viral genome may include an oligo(dT) sequence for templated addition of a poly-A tail.
In certain embodiments, the viral genome may include at least one miRNA seed, binding site or full sequence, microRNAs (or miRNA or miR) are 19-25 nucleotide noncoding RNAs that bind to the sites of nucleic acid targets and down-regulate gene expression either by reducing nucleic acid molecule stability or by inhibiting translation. A microRNA sequence comprises a “seed” region. i.e., a sequence in the region of positions 2-8 of the mature microRNA, which has perfect Watson-Crick sequence complementarity to the miRNA target sequence of the nucleic acid.
In certain embodiments, the viral genome may be engineered to include, alter, or remove at least one miRNA binding site, full sequence, or seed region.
Any UTR from any gene known in the art may be incorporated into the viral genome of the AAV particle. These UTRs, or portions thereof, may be placed in the same orientation as in the gene from which they were selected, or they may be altered in orientation or location. In certain embodiments, the UTR used in the viral genome of the AAV particle may be inverted, shortened, lengthened, made with one or more other 5′ UTRs or 3′ UTRs known in the art. As used herein, the term “altered” as it relates to a UTR, means that the UTR has been changed in some way in relation to a reference sequence. For example, a 3′ or 5′ UTR may be altered relative to a wild type or native UTR by the change in orientation or location as taught above or may be altered by the inclusion of additional nucleotides, deletion of nucleotides, swapping or transposition of nucleotides.
In certain embodiments, the viral genome of the AAV particle comprises at least one artificial UTR which is not a variant of a wild type UTR.
In certain embodiments, the viral genome of the AAV particle comprises UTRs which have been selected from a family of transcripts whose proteins share a common function, structure, feature, or property.
The viral genome of the AAV particles of the present disclosure may comprise at least one polyadenylation sequence. In certain embodiments, the viral genome of the AAV particle comprises a polyadenylation sequence between the 3′ end of the payload encoding region and the 5′ end of the 3′ITR.
In certain embodiments, the polyadenylation sequence or “polyA sequence” may range from absent to about 500 nucleotides in length. The polyadenylation sequence may be, but is not limited to, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, and 500 nucleotides in length.
In certain embodiments, the viral genome of the AAV particles of the present disclosure comprises at least one element to enhance the payload target specificity and expression (See e.g., Powell et al. Viral Expression Cassette Elements to Enhance Transgene Target Specificity and Expression in Gene Therapy, Discov. Med, 2015, 19(102) 49-57; the contents of which are herein incorporated by reference in their entirety) such as an intron. Non-limiting examples of introns include. MVM (67-97 bps), FIX truncated intron 1 (300 bps), pi-globin SD/immunoglobulin heavy chain splice acceptor (250 bps), adenovirus splice donor/immunoglobin splice acceptor (500 bps). SV40 late splice donor/splice acceptor (19S/16S) (180 bps) and hybrid adenovirus splice donor/IgG splice acceptor (230 bps).
In certain embodiments, the intron or intron portion may be 100-500 nucleotides in length. The intron may have a length of 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490 or 500 nucleotides. The intron may have a length between 80-100, 80-120, 80-140, 80-160, 80-180, 80-200, 80-250, 80-300, 80-350, 80-400, 80-450, 80-500, 200-30), 200-400, 200-500, 300-400, 300-500, or 400-500 nucleotides
In certain embodiments, the viral genome of the AAV particles of the present disclosure comprises at least one element to improve packaging efficiency and expression, such as a stuffer or filler sequence. Non-limiting examples of stuffer sequences include albumin and/or alpha-1 antitrypsin. Any known viral, mammalian, or plant sequence may be manipulated for use as a stuffer sequence.
In certain embodiments, the stuffer or filler sequence may be from about 100-3500 nucleotides in length. The stuffer sequence may have a length of about 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900 or 3000 nucleotides.
miRNA
In certain embodiments, the viral genome comprises at least one sequence encoding a miRNA to reduce the expression of the payload in an “off-target” tissue. As used herein, “off-target” indicates a tissue or cell-type unintentionally targeted by the AAV particles of the disclosure. As an example, an “off-target” tissue or cell when targeting the DRG, may be neurons of other ganglia, such as those of the sympathetic or parasympathetic nervous system, miRNAs and their targeted tissues are well known in the art. As a non-limiting example, a miR-122 miRNA may be encoded in the viral genome to reduce the expression of the viral genome in the liver.
In some embodiments, the viral genome of the AAV particles of the disclosure optionally encodes a selectable marker. The selectable marker may comprise a cell-surface marker, such as any protein expressed on the surface of the cell including, but not limited to receptors. CD markers, lectins, integrins, or truncated versions thereof.
In some embodiments, selectable marker reporter genes are described in International Publication Nos. WO 1996023810 and WO 196030540: Heim et al., Current Biology 2-178-182 (1996); Heim et al., Proc. Natl. Acad. Sci. USA (1995): or Heim et al., Science 373:663-664 (1995), the contents of each of which are incorporated herein by reference in their entirety.
In certain embodiments, the AAV particles of the disclosure may comprise a single-stranded or double-stranded viral genome. The size of the viral genome may be small, medium, large or the maximum size. As described above, the viral genome may comprise a promoter and a polyA tail.
In certain embodiments, the viral genome may be a small single stranded viral genome. A small single stranded viral genome may be 2.1 to 3.5 kb in size such as, but not limited to, about 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3.0, 3.1, 3.2, 3.3, 3.4, and 3.5 kb in size.
In certain embodiments, the viral genome may be a small double stranded viral genome. A small double stranded viral genome may be 1.3 to 1.7 kb in size such as, but not limited to, about 1.3, 1.4, 1.5, 1.6, and 1.7 kb in size.
In certain embodiments, the viral genome may be a medium single stranded viral genome. A medium single stranded viral genome may be 3.6 to 4.3 kb in size such as, but not limited to, about 3.6, 37, 3.8, 3.9, 40, 4.1, 4.2 and 4.3 kb in size.
In certain embodiments, the viral genome may be a medium double stranded viral genome. A medium double stranded viral genome may be 1.8 to 2.1 kb in size such as, but not limited to, about 1.8, 1.9, 2.0, and 2.1 kb in size.
In certain embodiments, the viral genome may be a large single stranded viral genome. A large single stranded viral genome may be 4.4 to 6.0 kb in size such as, but not limited to, about 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9 and 6.0 kb in size.
In certain embodiments, the viral genome may be a large double stranded viral genome. A large double stranded viral genome may be 2.2 to 3.0 kb in size such as, but not limited to, about 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9 and 3.0 kb in size.
The AAV particles of the present disclosure comprise a viral genome with at least one payload region. As used herein, a“payload region” is any nucleic acid sequence (e.g., within the viral genome) which encodes one or more “payloads” of the disclosure. As non-limiting examples, a payload region may be a nucleic acid sequence within the viral genome of an AAV particle, which encodes a payload, wherein the payload is a polynucleotide or polypeptide. Payloads of the present disclosure may be, but are not limited to, peptides, polypeptides, proteins, antibodies, polynucleotides, etc. including those of therapeutic benefit.
The payload region can contain a combination of coding and non-coding nucleic acid sequences.
In certain embodiments, the AAV particle comprises a viral genome with a payload region encoding more than one payload of interest. In such an embodiment, a viral genome encoding more than one payload may be replicated and packaged into a viral particle. A target cell transduced with a viral particle comprising more than one payload may express each of the payloads in a single cell.
In some embodiments of any of the aspects, a nucleic acid sequence as described herein is chemically modified to enhance stability or other beneficial characteristics. The nucleic acids described herein may be synthesized and/or modified by methods such as those described in “Current protocols in nucleic acid chemistry,” Beaucage, S. L. et al. (Edrs.), John Wiley & Sons, Inc., New York, NY, USA, which is hereby incorporated herein by reference. Modifications include, for example, (a) end modifications, e.g., 5′ end modifications (phosphorylation, conjugation, inverted linkages, etc.) 3′ end modifications (conjugation, DNA nucleotides, inverted linkages, etc.), (b) base modifications, e.g., replacement with stabilizing bases, destabilizing bases, or bases that base pair with an expanded repertoire of partners, removal of bases (abasic nucleotides), or conjugated bases, (c) sugar modifications (e.g., at the 2′ position or 4′ position) or replacement of the sugar, as well as (d) backbone modifications, including modification or replacement of the phosphodiester linkages. Specific examples of nucleic acid compounds useful in the embodiments described herein include but are not limited to nucleic acids containing modified backbones or no natural internucleoside linkages nucleic acids having modified backbones include, among others, those that do not have a phosphorus atom in the backbone.
Modified nucleic acids that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides. In some embodiments, the modified nucleic acid will have a phosphorus atom in its internucleoside backbone.
Modified nucleic acid backbones can include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those) having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms are also included. Modified nucleic acid backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatoms, and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; others having mixed N, O, S and CH2 component parts, and oligonucleosides with heteroatom backbones, and in particular —CH2-NH—CH2-, —CH2-N(CH3)-O—CH2- [known as a methylene (methylimino) or MMI backbone], —CH2-O—N(CH3)-CH2-, —CH2-N(CH3)-N(CH3)-CH2- and —N(CH3)-CH2-CH2-[wherein the native phosphodiester backbone is represented as —O—P—O—CH2-].
In other nucleic acid mimetics, both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an RNA mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar backbone of an RNA is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone.
The nucleic acid can also be modified to include one or more locked nucleic acids (LNA). A locked nucleic acid is a nucleotide having a modified ribose moiety in which the ribose moiety comprises an extra bridge connecting the 2′ and 4′ carbons. This structure effectively “locks” the ribose in the 3′-endo structural conformation. The addition of locked nucleic acids to siRNAs has been shown to increase siRNA stability in serum, and to reduce off-target effects (Elmen, J. et ah, (2005) Nucleic Acids Research 33(1):439-447; Mook, OR. et ak, (2007) Mol. Cane. Ther. 6(3):833-843; Grunweller, A. et ah, (2003) Nucleic Acids Research 31(12):3185-3193).
Modified nucleic acids can also contain one or more substituted sugar moieties. The nucleic acids described herein can include one of the following at the 2′ position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, where the alkyl, alkenyl and alkynyl may be substituted or unsubstituted Cl to CIO alkyl or C2 to CIO alkenyl and alkynyl. Exemplary suitable modifications include O[(CH2)nO]mCH3, O(CH2)nOCH3, O(CH2)nNH2, O(CH2) nCH3, O(CH2)nONH2, and O(CH2)nON[(CH2)nCH3)]2, where n and m are from 1 to about 10. In some embodiments, nucleic acids include one of the following at the 2′ position: Cl to CIO lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of a nucleic acid, or a group for improving the pharmacodynamic properties of a nucleic acid, and other substituents having similar properties. In some embodiments, the modification includes a 2′ methoxyethoxy (2′-O—CH2CH2OCH3, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al, Helv. Chim. Acta, 1995, 78:486-504) i.e., an alkoxy-alkoxy group. Another exemplary modification is 2′-dimethylaminooxyethoxy, i.e., a O(CH2)2ON(CH3)2 group, also known as 2′-DMAOE, as described in examples herein below, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH2-O—CH2-N(CH2)2).
Other modifications include 2′-methoxy (2′-OCH3), 2′-aminopropoxy (2′-OCH2CH2CH2NH2) and 2′-fluoro (2′-F). Similar modifications can also be made at other positions on the nucleic acid, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked dsRNAs and the 5′ position of 5′ terminal nucleotide. Nucleic acids may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.
A nucleic acid can also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. “Unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases can include other synthetic and natural nucleobases including but not limited to as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl anal other 8-substituted adenines and guanines, 5-halo, particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-daazaadenine and 3-deazaguanine and 3-deazaadenine. Certain of these nucleobases are particularly useful for increasing the binding affinity of the inhibitory nucleic acids featured in the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., Eds., dsRNA Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are exemplary base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications. In some embodiments, modified nucleobases can include d5SICS and dNAM, which are a non-limiting example of unnatural nucleobases that can be used separately or together as base pairs (see e.g., Leconte et. al. J. Am. Chem. Soc. 2008, 130, 7, 2336-2343; Malyshev et. al. PNAS. 2012. 109 (30) 12005-12010). In some embodiments, oligonucleotide tags (e.g., Oligopaint) comprise any modified nucleobases known in the art, i.e., any nucleobase that is modified from an unmodified and/or natural nucleobase.
The preparation of the modified nucleic acids, backbones, and nucleobases described above are known in the art.
Another modification of a nucleic acid featured in the disclosure involves chemically linking to a polynucleotide one or more ligands, moieties or conjugates that enhance the activity, cellular distribution, pharmacokinetic properties, or cellular uptake of the polynucleotide. Such moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acid. Sci. USA, 1989, 86: 6553-6556), cholic acid (Manoharan et al., Biorg. Med. Chem. Let., 1994, 4: 1053-1060), a thioether, e.g., beryl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660:306-309; Manoharan et al., Biorg. Med. Chem. Let., 1993, 3:2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20:533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et ak, EMBO J, 1991, 10: 1111-1118; Kabanov et al., LEBS Lett., 1990, 259:327-330; Svinarchuk et al., Biochimie, 1993, 75:49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-0-hexadecyl-rac-glycero-3-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36:3651-3654; Shea et al., Nucl. Acids Res., 1990, 18:3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14:969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36:3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264:229-237), or an octadecylamine or hexylamino-carbonyloxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277:923-937).
Viral production disclosed herein describes processes and methods for producing AAV particles may be used to contact a target cell to deliver a payload.
The present disclosure provides methods for the generation of AAV particles containing capsids with improved traits. In certain embodiments, the AAV particles are prepared by viral genome replication in a viral replication cell. Any method known in the art may be used for the preparation of AAV particles. In certain embodiments, AAV particles are produced in mammalian cells (e.g., HEK293). In another embodiment, AAV particles are produced in insect cells (e.g., Sf9)
Methods of making AAV particles are well known in the art and are described in e.g., U.S. Pat. Nos. 6,204,059, 5,756,283, 6,258,595, 6,261,551, 6,270,996, 6,281,010, 6,365,394, 6,475,769, 6,482,634, 6,485,966, 6,943,019, 6,953,690, 7,022,519, 7,238,526, 7,291,498 and 7,491,508, 5,064,764, 6,194,191 U.S. Pat. Nos. 6,566,118, 8,137,948; or International Publication Nos. WO1996039530, WO1998010088, WO1999014354, WO1999015685, WO1999047691, WO2000055342. WO2000075353 and WO2001023597, Methods In Molecular Biology, ed. Richard, Humana Press, NJ (1995); O'Reilly et al., Baculovirus Expression Vectors. A Laboratory Manual. Oxford Univ. Press (1994); Samulski et al., J. Vir. 63:3822-8 (1989), Kajigaya et al., Proc. Nat'l. Acad. Sci. USA 88: 4646-50 (1991); Ruffing et al., J. Vir. 66:6922-30 (1992); Kimbauer et al., Vir. 219:37-44 (1996); Zhao et al., Vir. 272:382-93 (2000); the contents of each of which are herein incorporated by reference in their entirety. In certain embodiments, the AAV particles are made using the methods described in International Patent Publication WO2015191508, the contents of which are herein incorporated by reference in their entirety.
The viral replication cell may be selected from any biological organism, including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and mammalian cells. Viral replication cells commonly used for production of recombinant AAV viral particles include, but are not limited to, HEK293 cells, COS cells, HeLa cells, KB cells, and other mammalian cell lines as described in U.S. Pat. Nos. 6,156,303, 5,387,484, 5,741,683, 5,691,176, and 5,688,676: U.S. Patent Application Publication No 2002/0081721, and International Patent Publication Nos. WO 2000047757. WO 2000024916, and WO 1996017947, the contents of each of which are herein incorporated by reference in their entirety. Viral replication cells may comprise other mammalian cells such as A549, WEH1, 3T3, 10T1/2, BHK, MDCK, COS 1, COS 7, BSC 1, BSC 40, BMT 10, VERO, W138, Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals. Viral replication cells may comprise cells derived from mammalian species including, but not limited to, human, monkey, mouse, rat, rabbit, and hamster. Viral replication cells may comprise cells derived from a cell type, including but not limited to fibroblast, hepatocyte, tumor cell, cell line transformed cell, etc.
In some embodiments, the present disclosure provides a method for producing an AAV particle in mammalian cells, comprising the steps of 1) simultaneously co-transfecting mammalian cells, such as, but not limited to HEK293 cells, with a viral genome comprising a payload region (payload construct), a viral genome comprising polynucleotide sequences for rep and cap genes (rep/cap construct) and a viral genome comprising polynucleotide sequences encoding helper components (helper construct), 2) harvesting and purifying the AAV particles comprising a viral genome. This triple transfection method of AAV particle production may be utilized to produce small lots of virus.
In certain embodiments, the AAV particles may be produced in a viral replication cell that comprises an insect cell.
Growing conditions for insect cells in culture, and production of heterologous products in insect cells in culture are well-known in the art, see U.S. Pat. No. 6,204,059, the contents of which are herein incorporated by reference in their entirety.
Any insect cell which allows for replication of parvovirus and which can be maintained in culture can be used in accordance with the present disclosure. Cell lines may be used from Spodoptera frugiperda, including, but not limited to the Sf9 or Sf21 cell lines, Drosophila cell lines, or mosquito cell lines, such as Aedes albopictus derived cell lines. Use of insect cells for expression of heterologous proteins is well documented, as are methods of introducing nucleic acids, such as vectors, e.g., insect-cell compatible vectors, into such cells and methods of maintaining such cells in culture. See, for example, Methods in Molecular Biology, ed. Richard, Humana Press, NJ (1995); O'Reilly et al., Baculovirus Expression Vectors, A Laboratory Manual, Oxford Univ. Press (1994). Samulski et al., J. Vir. 63:3822-8 (1989); Kajigaya et al., Proc. Nat'l. Acad. Sci. USA 88: 4646-50 (1991); Ruffing et al., J. Vir. 66:6922-(1992); Kimbauer et al., Vir. 219-37-44 (1996); Zhao et al, Vir. 272:382-93 (2000); and Samulski et al., U.S. Pat. No. 6,204,059, the contents of each of which is herein incorporated by reference in its entirety.
In some embodiments, the present disclosure provides a method for producing an AAV particle in a baculovirus/Sf9 system, comprising the steps of: 1) co-transfecting competent bacterial cells with a bacmid vector and either a viral construct vector and/or AAV payload construct vector, 2) isolating the resultant viral construct expression vector and AAV payload construct expression vector and separately transfecting viral replication cells. 3) isolating and purifying resultant payload and viral construct particles comprising viral construct expression vector or AAV payload construct expression vector, 4) co-infecting a viral replication cell with both the AAV payload and viral construct particles comprising viral construct expression vector or AAV payload construct expression vector, and 5) harvesting and purifying AAV particles comprising a viral genome.
Briefly, the viral construct vector and the AAV payload construct vector are each incorporated by a transposon donor/acceptor system into a bacmid, also known as a baculovirus plasmid, by standard molecular biology techniques known and performed by a person skilled in the art. Transfection of separate viral replication cell populations produces two baculoviruses, one that comprises the viral construct expression vector, and another that comprises the AAV payload construct expression vector. The two baculoviruses may be used to infect a single viral replication cell population for production of AAV particles.
Baculovirus expression vectors for producing viral particles in insect cells, including but not limited to Spodoptera frugiperda (Sf9) cells, provide high titers of viral particle product. Recombinant baculovirus encoding the viral construct expression vector and AAV payload construct expression vector initiates a productive infection of viral replicating cells. Infectious baculovirus particles released from the primary infection secondarily infect additional cells in the culture, exponentially infecting the entire cell culture population in a number of infection cycles that is a function of the initial multiplicity of infection, see Urabe, M, et al., J Virol. 2006 February; 80 (4):1874-85, the contents of which are herein incorporated by reference in their entirety.
Production of AAV particles with baculovirus in an insect cell system may address known baculovirus genetic and physical instability. In certain embodiments, the production system addresses baculovirus instability over multiple passages by utilizing a titerless infected-cells preservation and scale-up system. Small scale seed cultures of viral producing cells are transfected with viral expression constructs encoding the structural, non-structural, components of the viral particle. Baculovirus-infected viral producing cells are harvested into aliquots that may be cryopreserved in liquid nitrogen; the aliquots retain viability and infectivity for infection of large scale viral producing cell culture Wasilko D J et al., Protein Expr Purif, 2009 June; 65(2):122-32, the contents of which are herein incorporated by reference in their entirety.
A genetically stable baculovirus may be used as the source of one or more of the components for producing AAV particles in invertebrate cells. In certain embodiments, defective baculovirus expression vectors may be maintained episomally in insect cells. In such an embodiment the bacmid vector is engineered with replication control elements, including but not limited to promoters, enhancers, and/or cell-cycle regulated replication elements.
In certain embodiments, stable viral replication cells permissive for baculovirus infection are engineered with at least one stable integrated copy of any of the elements necessary for AAV replication and viral particle production including, but not limited to, the entire AAV genome, Rep and Cap genes, Rep genes, Cap genes, each Rep protein as a separate transcription cassette, each VP protein as a separate transcription cassette, the AAP (assembly activation protein), or at least one of the baculovirus helper genes with native or non-native promoters.
AAV particles described herein may be produced by triple transfection or baculovirus mediated virus production, or any other method known in the art. Any suitable permissive or packaging cell known in the art may be employed to produce the particles. Mammalian cells are often preferred. Also preferred are trans-complementing packaging cell lines that provide functions deleted from a replication-defective helper virus, e.g., 293 cells or other E1a trans-complementing cells. A packaging cell line may be used that is stably transformed to express cap and/or rep genes. Alternatively, a packaging cell line may be used that is stably transformed to express helper constructs necessary for AAV particle assembly.
Recombinant AAV virus particles are, in some cases, produced and purified from culture supernatants according to the procedure as described in US20160032254, the contents of which are incorporated by reference.
In certain embodiments, AAV particles are produced wherein all three VP proteins are expressed at a stoichiometry around 1:1:10 (VP1:VP2:VP3). While not wishing to be bound by theory, the regulatory mechanisms that allow this controlled level of expression include the production of two mRNAs, one for VP1, and the other for VP2 and VP3, produced by differential splicing.
In certain embodiments, the viral construct vector(s) used for AAV production may contain a nucleotide sequence encoding the AAV capsid proteins where the initiation codon of the AAV VP1 capsid protein is a non-ATG, i.e., a suboptimal initiation codon, allowing the expression of a modified ratio of the viral capsid proteins in the production system, to provide improved infectivity of the host cell. In a non-limiting example, a viral construct vector may contain a nucleic acid construct comprising a nucleotide sequence encoding AAV VP1, VP2, and VP3 capsid proteins, wherein the initiation codon for translation of the AAV VP1 capsid protein is CTG. TTG, or GTG, as described in U.S. Pat. No. 8,163,543, the contents of which are herein incorporated by reference in its entirety.
In certain embodiments, the viral construct vector(s) used for AAV production may contain a nucleotide sequence encoding the AAV rep proteins where the initiation codon of the AAV rep protein or proteins is a non-ATG In certain embodiments, a single coding sequence is used for the Rep78 and Rep52 proteins, wherein initiation codon for translation of the Rep78 protein is a suboptimal initiation codon, selected from the group consisting of ACG, TTG, CTG and GTG, that effects partial exon skipping upon expression in insect cells, as described in U.S. Pat. No. 8,512,981, the contents of which is herein incorporated by reference in its entirety, for example to promote less abundant expression of Rcp78 as compared to Rep52, which may be advantageous in that it promotes high vector yields. Small-scale production
In some cases, 293T cells (adhesion/suspension) are transfected with polyethyleneimine (PEI) with plasmids required for production of AAV, i e., AAV2 rep, an adenoviral helper construct and a ITR flanked payload cassette. The AAV2 rep plasmid also contains the cap sequence of the particular virus being studied. Twenty-four hours after transfection (no medium changes for suspension), which occurs in DMEM/F17 with/without serum, the medium is replaced with fresh medium with or without serum. Three (3) days after transfection, a sample is taken from the culture medium of the 293 adherent cells. Subsequently cells are scraped, or suspension cells are pelleted, and transferred into a receptacle. For adhesion cells, after centrifugation to remove cellular pellet, a second sample is taken from the supernatant after scraping. Next, cell lysis is achieved by three consecutive freeze-thaw cycles (−80 C to 37 C) or adding detergent triton. Cellular debris is removed by centrifugation or depth filtration and sample 3 is taken from the medium. The samples are quantified for AAV particles by DNase resistant genome titration by DNA qPCR. The total production yield from such a transfection is equal to the particle concentration from sample 3.
AAV particle titers are measured according to genome copy number (genome particles per milliliter). Genome particle concentrations are based on DNA qPCR of the vector DNA as previously reported (Clark et al. (1999) Hum. Gene Ther., 10:1031-1039; Veldwijk et al. (2002) Mol Ther, 6.272-278).
In some embodiments. AAV particle production may be modified to increase the scale of production Large scale viral production methods according to the present disclosure may include any of those taught in U.S. Pat. Nos. 5,756,283, 6,258,595, 6,261,551, 6,270,996, 6,281,010, 6,365,394, 6,475,769, 6,482,634, 6,485,966, 6,943,019, 6,953,690, 7,022,519, 7,238,526, 7,291,498 and 7,491,508 or International Publication Nos WO1996039530, WO1998010088, WO1999014354, WO1999015685, WO1999047691, WO2000055342, WO2000075353 and WO2001023597, the contents of each of which are herein incorporated by reference in their entirety. Methods of increasing viral particle production scale typically comprise increasing the number of viral replication cells. In some embodiments, viral replication cells comprise adherent cells. To increase the scale of viral particle production by adherent viral replication cells, larger cell culture surfaces are required. In some cases, large-scale production methods comprise the use of roller bottles to increase cell culture surfaces Other cell culture substrates with increased surface areas are known in the art. Examples of additional adherent cell culture products with increased surface areas include, but are not limited to CELLSTACK®, CELLCUBE® (Corning Corp., Corning, N.Y.) and NUNC™ CELL FACTORY™ (ThermoFisher Scientific, Waltham, Mass.). In some cases, large-scale adherent cell surfaces may comprise from about 1,000 cm2 to about 100,000 cm2. In some cases, large-scale adherent cell cultures may comprise from about 107 to about 109 cells, from about 1 to about 1010 cells, from about 109 to about 1012 cells or at least 1012 cells. In some cases, large-scale adherent cultures may produce from about 109 to about 1012, from about 1010 to about 1013, from about 1011 to about 1014, from about 1012 to about 1015 or at least 1015 viral particles.
In some embodiments, large-scale viral production methods of the present disclosure may comprise the use of suspension cell cultures. Suspension cell culture allows for significantly increased numbers of cells. Typically, the number of adherent cells that can be grown on about 10-50 cm2 of surface area can be grown in about 1 cm3 volume in suspension.
Transfection of replication cells in large-scale culture formats may be carried out according to any methods known in the art. For large-scale adherent cell cultures, transfection methods may include, but are not limited to the use of inorganic compounds (e.g. calcium phosphate), organic compounds [e.g polyethyleneimine (PEI)] or the use of non-chemical methods (e.g. electroporation.) With cells grown in suspension, transfection methods may include, but are not limited to the use of calcium phosphate and the use of PEI. In some cases, transfection of large-scale suspension cultures may be carried out according to the section entitled “Transfection Procedure” described in Feng, L. et al., 2008. Biotechnol Appl. Biochem. 50:121-32, the contents of which are herein incorporated by reference in their entirety. According to such embodiments, PEI-DNA complexes may be formed for introduction of plasmids to be transfected. In some cases, cells being transfected with PEI-DNA complexes may be ‘shocked’ prior to transfection. This comprises lowering cell culture temperatures to 4° C. for a period of about 1 hour. In some cases, cell cultures may be shocked for a period of from about 10 minutes to about 5 hours. In some cases, cell cultures may be shocked at a temperature of from about 0° C. to about 20° C.
In some cases, transfections may include one or more vectors for expression of an RNA effector molecule to reduce expression of nucleic acids from one or more AAV payload constructs. Such methods may enhance the production of viral particles by reducing cellular resources wasted on expressing payload constructs. In some cases, such methods may be carried out according to those methods taught in US Publication No. US2014/0099666, the contents of which are herein incorporated by reference in their entirety.
Provided herein are compositions containing AAV particles, AAV capsids, and/or polynucleotides encoding the same. The AAV particles may be contained in any appropriate amount in any suitable carrier substance and is/are generally present in an amount of 0.01-95% by weight of the total weight of the composition. The composition may be provided in a form that is suitable for a parenteral (e.g., subcutaneous, intravenous, intramuscular, or intraperitoneal) administration route, such that the agent, such as a viral particle described herein, is systemically delivered. In some instances, a reporter product is also encoded by the vector. The compositions may be formulated according to conventional pharmaceutical practice (see, e.g., Remington: The Science and Practice of Pharmacy (20th ed.), ed. A. R. Gennaro, Lippincott Williams & Wilkins, 2000 and Encyclopedia of Pharmaceutical Technology, eds. J. Swarbrick and J. C. Boylan, 1988-1999, Marcel Dekker, New York).
Compositions may be formulated to release the viral particles substantially immediately upon administration or at any predetermined time or time after administration. The latter types of compositions are generally known as controlled release formulations, which include (i) compositions that create a substantially constant concentration of the agent within the body over an extended period of time; (ii) compositions that after a predetermined lag time create a substantially constant concentration of the drug within the body over an extended period of time; (iii) compositions that sustain action during a predetermined time period by maintaining a relatively constant, effective level in the body with concomitant minimization of undesirable side effects associated with fluctuations in the plasma level of the active substance (sawtooth kinetic pattern); (iv) compositions that localize action by, e.g., spatial placement of a controlled release composition adjacent to or in contact with a target site or location, e.g., in a region of a tissue or organ; (v) compositions that allow for convenient dosing, such that doses are administered, for example, once every one, two, or several weeks; and (vi) compositions that target a specific tissue or cell type using carriers, chemical derivatives, or specifically designed viral particles (e.g., comprising a certain capsid composition) to deliver a payload to a cell.
The composition may be administered systemically, for example, in an acceptable buffer such as physiological saline. In an embodiment, systemic injection of an rAAV vector as described herein allows for the delivery of a payload (e.g., a polynucleotide) to a cell or organ.
Routes of administration include, for example, intracranial, parenteral, subcutaneous (s.c.), intravenous (i.v.), intraperitoneal (i.p.), intramuscular (i.m.), or intradermal administration. The amount of the vector to be administered can vary depending upon the requirements of a given screen. Generally, amounts will be in the range of those used for other viral vector-based agents employed in the delivery of polynucleotides to cells. In embodiments, about, at least about, and/or no more than about 1×10e5, 1×10e6, 1×10e7, 1×10e8, 1×10e9, 1×10e10, 1×10e11, 1×10e12, 1×10e13, 1×10e14, or 1×10e15 vector genomes are delivered to a subject (e.g., a mouse) to screen a library of enhancers. A composition is administered at a level that is effective in meeting the objectives of a screen.
The composition may be in the form of a solution, a suspension, an emulsion, an infusion device, or a delivery device for implantation, or it may be presented as a dry powder to be reconstituted with water or another suitable vehicle before use. The composition may include suitable parenterally acceptable carriers and/or excipients. The active therapeutic agent(s) may be incorporated into microspheres, microcapsules, nanoparticles, liposomes, or the like for controlled release. Furthermore, the composition may include suspending, solubilizing, stabilizing, pH-adjusting agents, tonicity adjusting agents, and/or dispersing, agents.
In some embodiments, the composition is formulated for intravenous delivery. As noted above, the compositions according to the described embodiments may be in a form suitable for sterile injection. To prepare such a composition, the suitable therapeutic(s) are dissolved or suspended in a parenterally acceptable liquid vehicle. Acceptable vehicles and solvents that may be employed include water, water adjusted to a suitable pH by addition of an appropriate amount of hydrochloric acid, sodium hydroxide or a suitable buffer, 1,3-butanediol, Ringer's solution, isotonic sodium chloride solution and dextrose solution. The aqueous formulation may also contain one or more preservatives (e.g., methyl, ethyl, or n-propyl p-hydroxybenzoate). In cases where one of the agents is only sparingly or slightly soluble in water, a dissolution enhancing or solubilizing agent can be added, or the solvent may include 10-60% w/w of propylene glycol or the like.
For direct delivery to the brain, rAAV vectors may be administered by open neurosurgical procedure or by focal injection in order to bypass the blood-brain barrier, to temporally and spatially restrict transgene expression, and to target specific areas of the brain, e.g., interneuron cells and brain tissue comprising these cells.
In some cases, an rAAV vector is delivered to a subject intravenously. In some cases, the rAAV vector is delivered to the central nervous system using the vasculature.
Systemic rAAV delivery (by intravenous injection) provides a non-invasive alternative for broad gene delivery to the nervous system. Several groups have developed rAAV capsids that enhance gene transfer to the CNS and certain tissues and cell populations after intravenous delivery. By way of example, AAV-AS capsid18 utilizes a polyalanine N-terminal extension to the AAV9.4719 VP2 capsid protein to provide higher neuronal transduction, particularly in the striatum. The AAV-BR1 capsid20, based on AAV2, may be useful for more efficient and selective transduction of brain endothelial cells. Another AAV capsid, AAV-PHP.B, comprises a capsid that transduces the majority of neurons and astrocytes across many regions of the adult mouse brain and spinal cord after intravenous injection.
Other modes of rAAV vector administration may include lipid-mediated vector delivery, hydrodynamic delivery, and a gene gun.
The virus vectors and compositions thereof as described herein may be used to screen libraries of capsid polypeptides that have specificity or particular activity levels in particular cell types or tissues (e.g., an organ).
Preparation of a library for sequencing may involve an amplification step. Amplification may involve thermocycling (e.g., PCR) or isothermal amplification (such as through the methods NEAR, RNA-Seq, RPA or LAMP). Amplification can refer to any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases, such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR In some embodiments, isolated RNA is contacted with a reverse transcriptase to produce cDNA for sequencing and/or PCR amplification.
Sequencing may be performed on any high-throughput platform. Methods of sequencing oligonucleotides and nucleic acids are well known in the art (see, e.g., WO93/23564, WO98/28440 and WO98/13523; U.S. Pat. App. Pub. No. 2019/0078232; U.S. Pat. Nos. 5,525,464; 5,202,231; 5,695,940; 4,971,903; 5,902,723; 5,795,782; 5,547,839 and 5,403,708; Sanger et al., Proc. Natl. Acad. Sci. USA 74:5463 (1977); Drmanac et al., Genomics 4:114 (1989); Koster et al., Nature Biotechnology 14:1123 (1996); Hyman, Anal. Biochem. 174:423 (1988); Rosenthal, International Patent Application Publication 761107 (1989); Metzker et al., Nucl. Acids Res. 22:4259 (1994); Jones, Biotechniques 22:938 (1997); Ronaghi et al., Anal. Biochem. 242:84 (1996); Ronaghi et al., Science 281:363 (1998); Nyren et al., Anal. Biochem. 151:504 (1985); Canard and Arzumanov, Gene 11:1 (1994); Dyatkina and Arzumanov, Nucleic Acids Symp Ser 18:117 (1987); Johnson et al., Anal. Biochem. 136:192 (1984); and Elgen and Rigler, Proc. Natl. Acad. Sci. USA 91(13):5740 (1994), all of which are expressly incorporated by reference).
The sequencing of a polynucleotide can be carried out using any suitable commercially available sequencing technology. In embodiments, the sequencing of a polynucleotide is carried out using a chain termination method of DNA sequencing (e.g., Sanger sequencing). In some embodiments, commercially available sequencing technology is a next-generation sequencing technology, including as non-limiting examples combinatorial probe anchor synthesis (cPAS), DNA nanoball sequencing, droplet-based or digital microfluidics, heliscope single molecule sequencing, nanopore sequencing (e.g., Oxford Nanopore technologies), GeneGap sequencing, massively parallel signature sequencing (MPSS), microfluidic Sanger sequencing, microscopy-based techniques (e.g., transmission electronic microscopy DNA sequencing), RNA polymerase (RNAP) sequencing, single-molecule real-time (SMRT) sequencing, SOLiD sequencing, ion semiconductor sequencing, polony sequencing, Pyrosequencing (454), sequencing by hybridization, sequencing by synthesis (e.g., Illuminam sequencing), sequencing with mass spectrometry, and tunneling currents DNA sequencing.
A computer system (or digital device) may be used to receive, transmit, display and/or store results, analyze the results, and/or produce a report of the results and analysis. A computer system may be understood as a logical apparatus that can read instructions from media (e.g., software) and/or network port (e.g., from the internet), which can optionally be connected to a server having fixed media. A computer system may comprise one or more of a CPU, disk drives, input devices such as keyboard and/or mouse, and a display (e.g., a monitor). Data communication, such as transmission of instructions or reports, can be achieved through a communication medium to a server at a local or a remote location. The communication medium can include any means of transmitting and/or receiving data. For example, the communication medium can be a network connection, a wireless connection, or an internet connection. Such a connection can provide for communication over the World Wide Web. It is envisioned that data relating to the present invention can be transmitted over such networks or connections (or any other suitable means for transmitting information, including but not limited to mailing a physical report, such as a print-out) for reception and/or for review by a receiver. One can record results of calculations (e.g., sequence analysis or a listing of hybrid capture probe sequences) made by a computer on tangible medium, for example, in computer-readable format such as a memory drive or disk, as an output displayed on a computer monitor or other monitor, or simply printed on paper. The results can be reported on a computer screen. The receiver can be but is not limited to an individual, or electronic system (e.g., one or more computers, and/or one or more servers).
In some embodiments, the computer system may comprise one or more processors. Processors may be associated with one or more controllers, calculation units, and/or other units of a computer system, or implanted in firmware as desired. If implemented in software, the routines may be stored in any computer readable memory such as in RAM, ROM, flash memory, a magnetic disk, a laser disk, or other suitable storage medium. Likewise, this software may be delivered to a computing device via any known delivery method including, for example, over a communication channel such as a telephone line, the internet, a wireless connection, etc., or via a transportable medium, such as a computer readable disk, flash drive, etc. The various steps may be implemented as various blocks, operations, tools, modules, and techniques which, in turn, may be implemented in hardware, firmware, software, or any combination of hardware, firmware, and/or software. When implemented in hardware, some or all of the blocks, operations, techniques, etc. may be implemented in, for example, a custom integrated circuit (IC), an application specific integrated circuit (ASIC), a field programmable logic array (FPGA), a programmable logic array (PLA), etc.
A client-server, relational database architecture can be used in embodiments of the invention. A client-server architecture is a network architecture in which each computer or process on the network is either a client or a server. Server computers are typically powerful computers dedicated to managing disk drives (file servers), printers (print servers), or network traffic (network servers). Client computers include PCs (personal computers) or workstations on which users run applications, as well as example output devices as disclosed herein. Client computers rely on server computers for resources, such as files, devices, and even processing power. In some embodiments of the invention, the server computer handles all of the database functionality. The client computer can have software that handles all the front-end data management and can also receive data input from users.
A machine-readable medium which may comprise computer-executable code may take many forms, including but not limited to, a tangible storage medium, a carrier wave medium or physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings. Volatile storage media include dynamic memory, such as main memory of such a computer platform. Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
The subject computer-executable code can be executed on any suitable device which may comprise a processor, including a server, a PC, or a mobile device such as a smartphone or tablet. Any controller or computer optionally includes a monitor, which can be a cathode ray tube (“CRT”) display, a flat panel display (e.g., active-matrix liquid crystal display, liquid crystal display, etc.), or others. Computer circuitry is often placed in a box, which includes numerous integrated circuit chips, such as a microprocessor, memory, interface circuits, and others. The box also optionally includes a hard disk drive, a floppy disk drive, a high-capacity removable drive such as a writeable CD-ROM, and other common peripheral elements. Inputting devices such as a keyboard, mouse, or touch-sensitive screen, optionally provide for input from a user. The computer can include appropriate software for receiving user instructions, either in the form of user input into a set of parameter fields, e.g., in a GUI, or in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different specific operations.
Also provided are kits comprising engineered AAV capsids, and/or polynucleotides encoding the same. Typically, kits will comprise sufficient amounts and/or numbers of components to allow a user to perform multiple treatments of a subject(s) and/or to perform multiple experiments.
Any of the capsid polypeptides, or polynucleotides encoding the same, of the present disclosure may be contained in a kit. In some embodiments, kits may further include reagents and/or instructions for creating and/or synthesizing compounds and/or compositions of the present disclosure. In some embodiments, kits may also include one or more buffers.
In some embodiments, kit components may be packaged either in aqueous media or in lyophilized form. The container means of the kits will generally include at least one vial, test tube, flask, bottle, syringe, or other container means, into which a component may be placed, and preferably, suitably aliquoted. Where there is more than one kit component, (labeling reagent and label may be packaged together), kits may also generally contain second, third or other additional containers into which additional components may be separately placed. In some embodiments, kits may also comprise second container means for containing sterile, pharmaceutically acceptable buffers and/or other diluents. In some embodiments, various combinations of components may be comprised in one or more vial. Kits of the present disclosure may also typically include means for containing compounds and/or compositions of the present disclosure, e.g., proteins, nucleic acids, and any other reagent containers in close confinement for commercial sale. Such containers may include injection or blow-molded plastic containers into which desired vials are retained.
In some embodiments, kit components are provided in one and/or more liquid solutions. In some embodiments, liquid solutions are aqueous solutions, with sterile aqueous solutions being particularly preferred. In some embodiments, kit components may be provided as dried powder(s). When reagents and/or components are provided as dry powders, such powders may be reconstituted by the addition of suitable volumes of solvent. In some embodiments, it is envisioned that solvents may also be provided in another container means. In some embodiments, labeling dyes are provided as dried powders. In some embodiments, it is contemplated that 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, 600, 700, 800, 900, 1000 micrograms or at least or at most those amounts of dried dye are provided in kits of the disclosure. In such embodiments, dye may then be resuspended in any suitable solvent, such as DMSO.
The kit can include instructions for use of the compositions in a method provided herein (e.g., to deliver a payload to a cell). The instructions may be printed directly on the container (when present), or as a label applied to the container, or as a separate sheet, pamphlet, card, computer-readable medium, or folder supplied in or with the container.
The practice of the present invention employs, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry, and immunology, which are well within the purview of the skilled artisan. Such techniques are explained fully in the literature, such as, “Molecular Cloning: A Laboratory Manual”, second edition (Sambrook, 1989); “Oligonucleotide Synthesis” (Gait, 1984); “Animal Cell Culture” (Freshney, 1987); “Methods in Enzymology” “Handbook of Experimental Immunology” (Weir, 1996); “Gene Transfer Vectors for Mammalian Cells” (Miller and Calos, 1987); “Current Protocols in Molecular Biology” (Ausubel, 1987); “PCR: The Polymerase Chain Reaction”, (Mullis, 1994); “Current Protocols in Immunology” (Coligan, 1991). These techniques are applicable to the production of the polynucleotides and polypeptides of the invention, and, as such, may be considered in making and practicing the invention. Particularly useful techniques for particular embodiments will be discussed in the sections that follow.
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the assay, screening, and therapeutic methods of the invention, and are not intended to limit the scope of what the inventors regard as their invention.
Experiments were undertaken to develop improved gene delivery vectors derived from AAV9, by inserting into the AAV9 capsid 7 amino acids (7-mer) between VP1 residues 588 and 589 (
It was first assessed whether codon usage impacted the production fitness of identical amino acid variants. If so, it would be necessary to train on the nucleotide sequence space (617 for NNN, 317 for NNK, where “N” represents any nucleotide and “K” represents G or T, and where 61 and 31 correspond to the number of amino-acid encoding codons with the sequence NNN or NNK, respectively), which is much larger than the amino acid sequence space (207). High correlation was observed between the fitness scores of 7-mer amino acid replicates (
The production fitness distribution of the training library was modeled by a mixture of two Gaussian distributions: a “low fitness” versus a “high fitness” distribution (
A regression model was used to capture the large variation in relative production fitness scores (±5-fold; log 2 enrichment) within the high fitness and low fitness distributions (
Using the production fitness model, the fitness of 24M AA variants was randomly generated and predicted in silico. The predicted high production fitness sequence space was then evenly sampled for 240K variants to create a “Fit4Function” library that evenly sampled only the high fit sequence space (
Fit4Function libraries were designed to enable the generation of reproducible and ML-compatible functional screening data. Specifically, the library was limited to a moderate size that enabled deeper sequencing depth and sampled only variants with high production fitness, which enabled more quantitative and reliable detection of each variant in the library. In addition, the library evenly sampled the high production fitness amino acid sequence space, which resulted in less biased ML models that generalized well across the sequence space.
The outcomes of the Fit4Function library screening strategy were compared versus an NNK library across five functional assays: (1) HEK293 cell binding, (2) primary mouse brain microvascular endothelial cell (BMVEC) binding, (3) primary human BMVEC binding, (4) human brain endothelial cell line (hCMEC/D3) binding, and (5) HEK293 transduction. Binding and transduction were measured by quantitative sequencing capsid variant abundance at the DNA and mRNA levels, respectively. The Fit4Function library consistently yielded higher replication quality data than the NNK library (one-tailed paired t-test, n=5 assays; p=0.0074;
It was next sought to examine the use of the Fit4Function library to train prediction models of in vivo AAV biodistribution after systemic administration in adult C57BL/6J mice. The replication quality was high in liver, kidney, and spleen, and moderate in the brain, spinal cord, serum, heart, and lungs (
Efficient and durable gene delivery to the liver remains challenging due to capsid antigen presentation and T cell-mediated immunity. Liver-directed therapies should benefit from the development of potent AAV vectors that can be administered at lower doses to reduce the exposure to capsid antigens. There is a need for capsids that are compatible with preclinical efficacy and safety testing. The objective was to design a ‘MultiFunction’ library consisting only of variants that were each predicted to possess multiple enhanced functions related to cross-species hepatocyte gene delivery. Toward this goal, five separate functional screens of the Fit4Function library were performed for capsids capable of cross-species hepatocyte gene delivery: (1) binding (2) transduction of the human hepatocellular carcinoma cell line (HepG2), (3) binding or (4) transduction of the human liver epithelial cell line (THLE), and (5) efficient liver biodistribution in C57BL/6J mice (
To assess the accuracy of researchers' predictions and identify the top-performing variants, the MultiFunction library was screened on the same five assays related to hepatocyte targeting and on production fitness (see replicate correlations in
The performance of seven variants that were selected from the MultiFunction library were individually assessed based on their measured production fitness, liver biodistribution and transduction in mice, and their enhanced ability to bind and transduce human HEPG2 and THLE cells (
A 100K member Fit4Function library was administered intravenously to an adult cynomolgus macaque and assessed biodistribution. Liver-targeted MultiFunction capsids, predicted with the six prior models that were trained only on human cell and mouse data and production fitness, were highly enriched in terms of macaque liver biodistribution (
Production fitness is a bottleneck for manufacturability of viral vectors. Screening randomly synthesized libraries can result in the identification of capsids optimized for function, but that are challenging to manufacture. Four experiments were, therefore, undertaken (i.e., Experiments 1-4) to assess the “manufacturability” or production fitness under defined conditions for capsid variants in a library (
AAV capsid variants had different attributes that could be assessed through in vitro and in vivo assays that measure the ability of specific capsids to bind or transduce relevant cell types including those derived from humans, mice, or other species commonly used for disease models. Accordingly, the data shown in
AAV libraries were screened to assess their in vivo biodistribution in mice (
Recombinant AAVs made using the naturally occurring AAV9 capsid transduced the C57BL/6J mouse liver with high efficiency. In some applications, it is preferable to reduce the accumulation of AAV particles within the liver and reduce the transduction of liver cells. Therefore, experiments were undertaken to identify AAV capsids that produced well but had low transduction to the liver or biodistribution to the spleen. 7-mer sequences were identified that had low transduction to the liver or low biodistribution to the spleen.
A positive control set of 3K variants was sampled from a pool of 240K variants such that each variant satisfied six traits relevant to cross-species hepatocyte targeting. Specifically, the traits were 1) high binding affinity to HepG2 cells, 2) high binding affinity to THLE cells, 3) high transduction of HepG2 cells, 4) high transduction of THLE cells, 5) high biodistribution to C57 mice liver, and 6) high production fitness. The positive set was then used in a different library of 240K along with other variants. After screening the new library for the six traits, the designed variants were considered as hits and selected only if they did not fall below the affinity/enrichment of the positive control distributions (threshold is the mean of each enrichment of each of the six traits minus 2 standard deviations) for the six traits simultaneously. Nearly all of the liver MultiFunction capsids had a 7-mer with an net charge of +1 (
Seven variants individually validated as described in the above Examples were found to transduce C57 mice livers and produce at levels comparable to AAV9 WT and transduced two human hepatocyte cell lines 10×-1000× better than AAV9 WT (see Table 2 and
The 7-mer sequences listed in Table 2 above were inserted between AAV9 K549R amino acid positions 588 and 589 and the amino acid and nucleotide sequences for the resulting capsids are provided below.
The Fit4Function pipeline presents a significant conceptual and technological advance over prior AAV engineering studies, including those that leverage ML. Conventional in vivo selections use sequential rounds to narrow the focus of sequence exploration to a handful of top candidates, which may not have other traits required for translation to preclinical and clinical trials. Simultaneously engineering multiple traits into AAV capsids or other proteins of interest has become an important but challenging goal. To date, most protein engineering efforts, including those leveraging ML, have focused on optimizing a single function, e.g. generating more efficiently produced and diversified AAV capsid libraries but stopping short of multi-trait prediction. A few groups have gone beyond single trait engineering by combining multiple previously validated functional structures into a single protein, e.g., by recombining structurally independent segments from different channelrhodopsins possessing known functions, localizations, and photocurrent properties of interest, or by applying protein design tools to filter out variants that do not meet additional characteristics such as solubility and immunogenicity. A few groups have gone beyond single trait engineering by combining multiple previously validated functional structures into a single protein, e.g., by recombining structurally independent segments from different channelrhodopsins possessing known functions, localizations, and photocurrent properties of interest, or by applying protein design tools to filter out variants that do not meet additional characteristics such as solubility and immunogenicity. However, as these strategies rely on the recombination of multiple existing functional structures into a single protein or the use of third-party protein design tools, they cannot be broadly generalized to engineer multiple de novo functions. A key obstacle to combining multiple ML models that predict different traits is the aggregated error that increases with each added model. The Fit4Function approach directly tackles this problem by leveraging a moderately sized, all viable, low-bias (ML-designed) library to generate highly reproducible data for multi-trait learning with a low false positive rate. This allows the models to be applied in different combinations with a low risk of aggregating significant error. MultiFunction libraries can thus be generated to more efficiently explore the vast sequence space for multi-trait capsids.
The Fit4Function approach can help to reduce the need for extensive screening in macaques in two ways. Firstly, the unique features of Fit4Function libraries enable the quantitative assessment of capsid biodistribution and top candidate selection in multiple organs from just a single round of screening. It is only necessary to screen a Fit4Function library once for a given function to then predict the functionality of sequences that were not contained in the original library. In contrast, it typically requires 2-6 rounds of in vivo screening to reliably identify top candidates from conventional selections, and the data from these screens cannot be used to accurately predict the traits of variants not tested in that screen. This means that the Fit4Function approach can be used to design libraries full of diverse and promising candidates for more efficient screening in macaques or other animals or assays. Secondly, unlike existing screening strategies, our approach can systematically determine the functional assays or combinations thereof that drive cross-species transferability. As the Fit4Function approach is applied to more NHP functions of interest (e.g., BBB-crossing), it will become apparent whether it is worthwhile to continue screening in mice or other animals for those functions. This can inform the choice of cell or animal models to perform screens in and develop vectors that are more likely to translate preclinically and clinically.
As with other ML-guided approaches, Fit4Function can be more challenging to implement with assays that produce low quality data due to lower detection sensitivities. For example, data reproducibility and subsequent model performance can be bottlenecked by in vivo transduction assays in some organs due to the inherent tropism of the parental capsid, inter-animal variability, and technical challenges related to tissue sampling. One approach to improve data quality with low sensitivity assays may be to use smaller Fit4Function libraries, because reducing library diversity increases the sampling of each individual variant and therefore the quality of the screening data. A second limitation that affects any multi-objective engineering effort is that variants that are maximally optimized for multiple objectives may not exist, especially in cases where performance on functions are negatively correlated. While Fit4Function cannot overcome this fundamental problem, it provides the means to efficiently search the vast production fit sequence space for variants that are reasonably well optimized for multiple traits.
With continued application across experiments and laboratories, the Fit4Function approach should enable the assembly of a vast ML atlas that can accurately predict the performance of AAV capsid variants across dozens of traits and inform the design of screening pipelines. In addition, the Fit4Function approach should translate to engineering other proteins that are amenable to quantitative, high-throughput screening of libraries that are diversified at a defined set of residues.
The following materials and methods were employed in the above examples.
The training and assessment libraries were designed to contain 150K nucleotide sequences each. The libraries were composed of 64.5K unique and 10K shared amino acid sequences generated by uniformly sampling all 20 amino acids at each position. The 74.5K variants were duplicated via 7-mer replication. 1K sequences containing stop codons were included to detect problems with cross packaging. In total, each library comprised a final set of 150K sequences.
To produce synthetic library inserts, lyophilized DNA oligonucleotide libraries (Agilent G7223A) or NNK hand mixed primers (IDT) were spun down at 8000 RCF for 1 minute, resuspended in 10 μL UltraPure DNase/RNase-Free Distilled Water (Thermo Fisher Scientific, 10977015), and incubated at 37° C. for 20 minutes. For pooled synthetic oligonucleotide libraries, the following primer format was used:
To amplify the oligonucleotide libraries and incorporate them into an AAV9 (K449R) template, 2 μL of the resuspended pooled oligonucleotide library or NNK-based library was used as an initial reverse primer along with 0.5 μM AAV9_K449R_Forward primer in a 25 μL PCR amplification reaction using Q5 Hot Start High-Fidelity 2× Master Mix (NEB, M0494S). 50 ng of a plasmid containing only AAV9 (K449R) VP1 amino acids 347-586 was used as a PCR template. PCR was performed following the manufacturer's protocol with an annealing temperature of 65° C. for 20 seconds and an extension time of 90 seconds. After six PCR cycles, 0.5 μM AAV9_K449R_Reverse (GTATTCCTTGGTTTTGAACCCAACCG (SEQ ID NO: 53) was spiked into the reaction as a reverse primer to further amplify sequences containing the oligonucleotide library for an additional 25 cycles. To remove the PCR template, 1 μL of DpnI (NEB, R0176S) was added to the PCR reaction and incubated at 37° C. for one hour. Afterwards, the PCR products were cleaned using AMPure XP beads (Beckman, A63881) following the manufacturer's protocol.
The PCR insert was assembled into 1600 ng of a linearized mRNA selection vector (AAV9-CMV-Express) with NEBuilder HiFi DNA Assembly Master Mix (NEB, E2621L) at a 3:1 insert:vector Molar ratio in a 80 μL reaction volume, incubated at 50° C. for one hour, and then at 72° C. for 5 minutes. Afterwards, 4 μL of Quick CIP (NEB, M0508S) was spiked into the reaction and incubated at 37° C. for 30 minutes to dephosphorylate unincorporated dNTPs that may inhibit downstream processes. Finally, 4 μL of T5 Exonuclease (NEB M0663S) was added to the reaction and incubated at 37° C. for 30 minutes to remove unassembled products. The final assembled products were cleaned using AMPure XP beads (Beckman, A63881) following the manufacturer's protocol and their concentrations were quantified with a Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific, Q32851) and a Qubit fluorometer.
mRNA Selection Vector
The mRNA selection vector (AAV9-CMV-Express) was designed to enrich for functional AAV capsid sequences by recovering capsid mRNA from transduced cells. AAV9-CMV-Express used a ubiquitous CMV enhancer and AAV5 p41 gene regulatory elements to drive AAV Cap expression. The AAV-Express plasmid was constructed by cloning the following elements into an AAV genome plasmid in the following order: a cytomegalovirus (CMV) enhancer-promoter, a synthetic intron and the AAV5 P41 promoter along with the 3′ end of the AAV2 Rep gene, which included the splice donor sequences for the capsid RNA. The capsid gene splice donor sequence in AAV2 Rep was modified from a non-consensus donor sequence CAGGTACCA to a consensus donor sequence CAGGTAAGT. The AAV9 capsid gene sequence was synthesized with nucleotide changes at S448 (TCA to TCT, silent mutation), K449R (AAG to AGA), and G594 (GGC to GGT, silent mutation) to introduce restriction enzyme recognition sites for oligonucleotide library fragment cloning. The AAV2 polyadenylation sequence was replaced with a simian virus 40 (SV40) late polyadenylation signal to terminate the capsid RNA transcript.
For library production, HEK293T/17 cells (ATCC, CRL-11268) were seeded at 22 million cells per 15 cm plate the day before transfection and grown in DMEM with GlutaMAX (Gibco, 10569010) supplemented with 5% FBS and 1× non-essential amino acid solution (NEAA) (Gibco, 11140050). The next day, each plate was triple transfected with 39.93 μg of total plasmid DNA encoding pHelper, RepStop encoding the AAV2 Rep genes, pUC19 at a ratio of 2:1:1, respectively, and with 10 ng of assembled library DNA. The media was exchanged for fresh DMEM with 5% FBS and 1×NEAA at 20 hours post transfection. At 60 hours, the media and cell lysates were harvested and purified following a protocol described in R. C. Challis, et al., “Systemic AAV vectors for widespread and targeted gene delivery in rodents,” Nat. Proto. 14, 379-414 (2019).
Individual recombinant AAVs were produced in suspension HEK293T cells, using F17 media (ThermoFisher Scientific). Cell suspensions were incubated at 37° C., 8% CO2, 125 RPM. 24 hours before transfection, cells were seeded in 200 mL at ˜1 million cells/mL. The day after, cells (˜2 million cells/mL) were transfected with pHelper, pRepCap and pTransgene (2:1:1 ratio, 2 ug DNA per million cells) using Transport 5 transfection reagent (Polysciences) with a 2:1 PEI:DNA ratio. Three days post-transfection, cells were pelleted at 2000 RPM for 10 minutes into Nalgene conical bottles. The supernatant was discarded, and cell pellets were stored at −20° C. until purification. Each pellet, corresponding to 200 mL of cell culture, was resuspended in 7 mL of 500 mM NaCl, 40 mM Tris-base, 10 mM MgCl2, with Salt Active Nuclease (ArcticZymes, #70920-202) at 100 U/mL. Afterwards, the lysate was clarified at 2000 RCF for 10 minutes and loaded onto a density step gradient containing OptiPrep (Cosmo Bio, AXS-1114542) at 60%, 40%, 25%, and 15% at a volume of 5, 5, 6, and 6 mL respectively in OptiSeal tubes (Beckman, 361625). The step gradients were spun in a Beckman Type 70ti rotor (Beckman, 337922) in a Sorvall WX+ ultracentrifuge (ThermoFisher Scientific, 75000090) at 69,000 RPM for 1 hour at 18° C. Afterwards, ˜4.5 mL of the 40-60% interface was extracted using a 16-gauge needle, filtered through a 0.22 μm PES filter, buffer exchanged with 100K MWCO protein concentrators (Thermo Fisher Scientific, 88532) into PBS containing 0.001% Pluronic F-68, and concentrated down to a volume of 500 μL. The concentrated virus was filtered through a 0.22 μm PES filter and stored at 4° C. or −80° C.
To determine AAV titers, 5 μL of each purified virus library were incubated with 100 μL of an endonuclease cocktail consisting of 1000 U/mL Turbonuclease (Sigma T4330-50KU) with 1× DNase I reaction buffer (NEB B0303S) in UltraPure DNase/RNase-Free distilled water at 37° C. for one hour. Next, the endonuclease solution was inactivated by adding 5 μL of 0.5M EDTA, pH 8.0 (Thermo Fisher Scientific, 15575020) and incubated at room temperature for 5 minutes and then at 70° C. for 10 minutes. To release the encapsidated AAV genomes, 120 μL of a Proteinase K cocktail consisting of 1M NaCl, 1% N-lauroylsarcosine, 100 μg/mL Proteinase K (Qiagen, 19131) in UltraPure DNase/RNase-Free distilled water was added to the mixture and incubated at 56° C. for 2 to 16 hours. The Proteinase K-treated samples were then heat-inactivated at 95° C. for 10 minutes. The released AAV genomes were serial diluted between 460-460,000× in dilution buffer consisting of 1×PCR Buffer (Thermo Fisher Scientific, N8080129), 2 μg/mL sheared salmon sperm DNA (Thermo Fisher Scientific, AM9680), and 0.05% Pluronic F68 (Thermo Fisher Scientific, 24040032) in UltraPure Water (Thermo Fisher Scientific). 2 μL of the diluted samples were used as input in a ddPCR supermix (Bio-Rad, 1863023). Primers and probes, targeting the ITR and CAG promoter region, were used for titration, at a final concentration of 900 nM and 250 nM, respectively (ITR2_Forward: GGAACCCCTAGTGATGGAGTT (SEQ ID NO: 54); ITR2_Reverse: CGGCCTCAGTGAGCGA (SEQ ID NO: 55); ITR2_Probe: CACTCCCTCTCTGCGCGCTCG [FAM/Iowa Black FQ Zen](SEQ ID NO: 56); CAG_Forward: TGTTCCCATAGTAACGCCAATAG (SEQ ID NO: 57); CAG_Reverse: GTACTTGGCATATGATACACT TGATG (SEQ ID NO: 58) (CAG_Probe: TTACGGTAAACTGCCCACTTGGCA [FAM/Iowa Black FQ Zen](SEQ ID NO: 9)). Droplets were generated using a QX100 Droplet Generator following the manufacturer's protocol. The droplets were transferred to thermocycler and cycled according to the manufacturer's protocol with an annealing/extension of 58° C. for one minute. Finally, droplets were read on a QX100 Droplet Digital System to determine titers.
To recover only encapsidated AAV genomes for downstream analysis, 101 viral genomes were extracted using the endonuclease and Proteinase K steps outlined above (AAV Titering). After Proteinase K treatment, samples were column purified using a DNA Clean and Concentrator Kit (Zymo Research, D4033) and eluted in 25 μL elution buffer for NGS preparation.
To prepare AAV libraries for sequencing, qPCR was performed on extracted AAV genomes or cDNA to determine the cycle thresholds for each sample type to prevent overamplification. PCR amplification using equal primer pairs (1-8) (Table 3; Described in Huang et al., bioRxiv 2022.10.31.514553 (2022), the disclosure of which is incorporated herein by reference in its entirety for all purposes), was used to attach partial Illumina Read 1 and Read 2 sequences using Q5 Hot Start High-Fidelity 2× Master Mix with an annealing temperature of 65° C. for 20 seconds and an extension time of 60 seconds. Round one PCR products were purified using AMPure XP beads following the manufacturer's protocol and eluted in 25 μL UltraPure Water (Thermo Fisher Scientific). 2 μL was used as input in a second round of PCR to attach on Illumina adaptors and dual index primers (NEB, E7600S) for five PCR cycles using Q5 HotStart-High-Fidelity 2× Master Mix with an annealing temperature of 65° C. for 20 seconds and an extension time of 60 seconds. The round two PCR products were purified using AMPure XP beads following the manufacturer's protocol and eluted in 25 μL UltraPure DNase/RNase-Free distilled water (Thermo Fisher Scientific).
To quantify the amount of PCR products for NGS, an Agilent High Sensitivity DNA Kit (Agilent, 5067-4626) was used with an Agilent 2100 Bioanalyzer. PCR products were pooled and diluted to 2-4 nM in 10 mM Tris-HCl, pH 8.5 and sequenced on an Illumina NextSeq 550 following the manufacturer's instructions using a NextSeq 500/550 Mid or High Output Kit (Illumina, 20024904 or 20024907), or on an Illumina NextSeq 1000 following the manufacturer's instructions using NextSeq P2 v3 kits (Illumina, 20046812). Reads were allocated as follows: I1: 8, I2: 8, R1: 150, R2: 0.
NCCAACGAAGAAGAAATTAAAACTACTAACCCG
Sequencing data was de-multiplexed with bcl2fastq (version v2.20.0.422) using the default parameters. The Read 1 sequence (excluding Illumina barcodes) was aligned to a short reference sequence of AAV9:
CCAACGAAGAAGAAATTAAAACTACTAACCCGGTAGCAACGGAGTCCTATGGACAAGTGGCCAC AAACCACCAGAGTGCCCAANNNNNNNNNNNNNNNNNNNNNGCACAGGCGCAGACCGGTTGGGTT CAAAACCAAGGAATACTTCCG (SEQ ID NO: 76). Alignment was performed with bowtie2 (version 2.4.1) (B. Langmead and S. L. Salzberg, “Fast gapped-read alignment with Bowtie 2,” Nat. Methods. 9, 357-359 (2012)) with the following parameters: --end-to-end --very-sensitive --np 0 --n-ceil L,21,0.5 -xeq -N 1 --reorder --score-min L, 0.6,−0.6 −5 8 −3 8. Resulting same files from bowtie2 were sorted by read and compressed to bam files with samtools (version 1.11-2-g26d7c73, hislib version 1.11-9-g2264113) (P. Danecek, et al, “Twelve years of SAMtools and BCFtools,” Gigascience. 10 (2021), doi:10.1093/gigascience/giab008; and H. Li, et al., “1000 Genome Project Data Processing Subgroup, The Sequence Alignment/Map format and SAMtools,” Bioinformatics. 25, 2078-2079 (2009)).
Python (version 3.8.3) scripts and pysam (version 0.15.4) were used to extract the 21 nucleotide insertion from each amplicon read. Each read was assigned to one of the following bins: Failed, Invalid, or Valid. Failed reads were defined as reads that did not align to the reference sequence, or that had an in/del in the insertion region (i.e., 20 bases instead of 21 bases). Invalid reads were defined as reads whose 21 bases were successfully extracted, but matched any of the following conditions: 1) Any one base of the 21 bases had a quality score (AKA Phred score, QScore) below 20, i.e., error probability >1/100, 2) Any one base was undetermined, i.e., “N”, 3) The 21 base sequence was not from the synthetic library (this case does not apply to NNK library). Valid reads were defined as reads that did not fit into either the Failed or Invalid bins. The Failed and Invalid reads were collected and analyzed for quality control purposes, and all subsequent analyses were performed on the Valid reads.
Count data for valid reads was aggregated per sequence, per sample, and was stored in a pivot table format, with nucleotide sequences on the rows, and samples (Illumina barcodes) on the columns. Sequences not detected in samples were assigned a count of 0.
Count data was read-per-million (RPM) normalized to the sequencing depth of each sample (Illumina barcode) with:
where r is the RPM-normalized count, k is the raw count, i=1 . . . n sequences, and j=1 . . . m samples.
As each biological sample was run in triplicate, data were aggregated for each sample by taking the mean of the RPMs:
across p replicates of sample s. Normalized variance was estimated across replicates by taking the coefficient of variaton (CV):
where sigma{i,s} is the standard deviation for variant i in sample s over p replicates. Log 2 enrichment for each sequence was defined as:
where e is the log 2 enrichment, mu is the mean of the replicate RPMs, and t is the normalization sample. For production fitness, the sample s is the variant abundance after virus production, and the normalization sample t is the variant abundance in the plasmid pool. For functional screens, the sample s is the variant abundance of the screen, and the normalization factor t is the variant abundance after virus production. To avoid dividing by 0 in e (for NNK library processing), mu_corrected is defined as:
i.e., counts of 0 across all 3 replicates for the normalization sample were adjusted to a count of 1 across all 3 replicates.
A robust ML framework was designed and used for the production fitness and Fit4Function functional mappings. A long short-term memory (LSTM) regression model with two hidden layers of 140 and 20 nodes was implemented in Keras (keras-team, GitHub—keras-team/keras: Deep Learning for humans. GitHub, (available at github.com/keras-team/keras)). RNNs, and LSTMs in particular, have been successfully applied for learning functions from biological sequence data as they are designed to capture local and distant relationships across different parts of the input sequences (D. H. Bryant, et al. “Deep diversification of an AAV capsid protein by machine learning.” Nat. Biotechnol. (2021), doi:10.1038/s41587-020-00793-4; and E. Alley, et al. “Unified rational protein engineering with sequence-based deep representation learning,” doi:10.21203/rs.2.13774/v1.). Model parameters and hyperparameters were subject to fine tuning processes but no significant performance was gained across all different functional models implemented in this study. Thus, the simplest model architecture was kept across all modeling throughout this study. The input layer was 7-mer amino acid sequences one-hot encoded into a 20×7 matrix. The target/output is the relative production (or functional) fitness score. Loss was optimized by mean-squared-error with Adam optimizer running on a learning rate of 0.001 (D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization” (2014), (available at arxiv.org/abs/1412.6980).). The batch size was set to 500 observations. To avoid overfitting, model training was controlled by a custom early stopping procedure where the training process was terminated if the ratio of training error to validation error dropped below 0.90.
For production fitness learning, the training size was optimized by training the framework on increments of 1K variants. Variants that were not detected (n=5,279) after virus production were filtered out from training. Model validation performance was reported at each training size, and a size of 24K variants was arbitrarily selected for final model training given that the model performance reached a plateau after a training size of ˜5K. The training library core variants (N˜60K, after removing the non-detected sequences) were then randomly divided into training (24K), validation (12K) and testing subsets (24K), all from the training library. The model was trained on the training set (24K), validated during the training process on the validation set (12K), and tested on the testing set (24K). The model was further tested on the unique variants from the assessment library to assess its generalization across libraries.
The Fit4Function libraries were intended to be sampled from the high production fitness space. For the Fit4Function library utilized in the Examples of the present disclosure, a set of 7-mer amino acid sequences was first uniformly sampled 100 times the required library size (240K Fit4Function variants*100=24M variants), by equally sampling each amino acid at each of the 7 positions. Duplicates were removed and the remaining sequences were scored using the production fitness model. Then, the 240K Fit4Function library variants were probabilistically sampled from the parametrized high production fitness distribution. In addition to the 240K high production fitness variants, 1K stop codon-containing variants and 3K variants from the 10K shared variants between the training and assessment libraries were added as a control set.
Fitness enrichment scores are relative across library variants due to normalization calculations; calibration is needed to make the fitness scores of two libraries of different compositions comparable for assessment or integration purposes. To calibrate the Fit4Function library production fitness, the 3K control set was used to fit an ordinary linear regression model of the measured production fitness scores between the Fit4Function library and the training library. These regression parameters were applied to the production fitness measured scores of the 240K Fit4Function variants to obtain calibrated production fitness scores. After synthesizing the Fit4Function library, the predicted fitness scores were compared to the calibrated measured fitness by means of correlation.
All mouse procedures were performed as approved by the Broad Institute Institutional Animal Care and Use Committee (IACUC), approval number 0213-06-18-1. Female C57BL/6J (000664) mice were obtained from the Jackson Laboratory (JAX). Recombinant AAV vectors were administered intravenously via the retro-orbital sinus in young adult (7- to 8-week-old) animals. Mice were randomly assigned to groups based on predetermined sample sizes. No mice were excluded from the analyses. For all assays, mice were anesthetized with EUTHASOLT™ (Virbac) and transcardially perfused with phosphate buffer saline, pH 7.4, at room temperature (RT). Experimenters were not blinded to the sample groups.
For the cynomolgus macaque experiments, the study plan involving the care and use of animals was reviewed and approved by the Charles River CR-LAV Institutional Animal Care and Use Committee (IACUC). During the study, the care and use of animals was conducted by CR-LAV with guidance from the USA National Research Council and the Canadian Council on Animal Care (CCAC). The Test Facility is accredited by the CCAC and AAALAC. Per the CCAC guidelines, this study was considered as a category of invasiveness C.
The rhesus macaque study (n=2) was conducted in the NIH Nonhuman Primate Testing Center for Evaluation of Somatic Cell Genome Editing Tools at the University of California, Davis. All procedures conformed to the requirements of the Animal Welfare Act, and protocols were approved prior to implementation by the UC Davis IACUC.
Purified virus libraries were injected at a dose of 1×1012 into C57BL/6J mice. Two hours post-injection serum was collected and organs were harvested using disposable 3 mm biopsy punches (Integra, 33-32-P/25) with a new biopsy punch used per organ per replicate. Harvested tissues were immediately frozen in dry ice. AAV genomes were recovered using a DNeasy kit (Qiagen, 69504) following the manufacturer's protocol and samples were eluted in 200 μL elution buffer for NGS preparation.
The library administered had 100K unique amino acid variants following the Fit4Function criteria (uniformly sampled from the high production fitness sequence space) in addition to a calibration set (3K), control variants, and AAV9. Each variant in the Fit4Function distribution was represented by either two or six 7-mer replicates; AAV9 was represented by two replicates. The purified virus library was injected at a dose of 4.6×1012 vg/kg into a female cynomolgus macaque that was pre-screened for NAbs against AAV9 (CRL). Six hours after systemic delivery, the animal was perfused with cold PBS and organs were harvested and snap frozen in dry ice. DNA was extracted using a DNeasy kit in a Qiagen QIAcube Connect. Samples were then processed as detailed in the NGS sample preparation section.
Approximately 3-month-old rhesus monkeys (˜1 kg; one male, one female) were screened then assigned to the project after confirming seronegative status for AAV9 antibodies. Sedation with Telazol (IM) was performed prior to IV administration of a purified virus library (1×1013 vg/kg) with blood samples collected (˜4 mL; hematology, clinical chemistry, serum, plasma; pre-administration then weekly post-administration). Animals were monitored closely during the study period and until endpoint (four weeks post-administration). They remained robust and healthy with no evidence of adverse findings (body weights, hematology and clinical chemistry panels were all in the normative range at all timepoints; data not shown). Four weeks after systemic delivery, tissues were collected and snap frozen over liquid nitrogen then placed on dry ice immediately prior to storage at ≤−80° C. RNA and DNA were extracted using TRIzol (Invitrogen, 15596026) following the manufacturer's instructions. Total RNA was cleaned up using a RNeasy kit (Qiagen, 74106) followed by on-column DNA digestion. RNA was converted to cDNA using Maxima H Minus Reverse Transcriptase (ThermoFisher Scientific, EP0751) according to the manufacturer's instructions. Samples were then processed as detailed in the NGS sample preparation section.
Neutralization assays were performed at two MOIs, 500 and 1000, in Perkin-Elmer white 96-well plates. Four-fold serial dilutions (1:4 to 1:16,384) of macaque serum samples were prepared in 96-well plates using DMEM supplemented with 5% FCS. Then, 40 μL of each dilution was transferred to a separate 96-well plate, mixed with an equal volume of AAV9.CAG-GFP-P2A-Luciferase-WPRE-SV40 vector (4-8E7 vg per 40 μL, diluted in DMEM-5% FCS), and incubated for one hour at 37° C. Following the incubation, AAV-serum samples were transferred into a new 96-well plate (20 uL triplicates) and a total of 80 μL of DMEM-5% FCS, containing 20,000 HEK293T cells, was added to each well (final volume of 100 μL). 96-well plates were incubated for 48 hours at 37° C., 5% C02. Luminescence levels were read using a Perkin Elmer Victor Luminescence Plate Reader using the britelite plus Reporter Gene Assay System (Perkin-Elmer, #6066761). Data was analyzed using the neutcurve Python package developed by the Bloom lab. The neutralizing antibody titer was measured as the concentration that resulted in a 50% reduction in luciferase activity relative to the no-serum control. Animals used in the transduction study had NAb titers <1:12 in this set of antibody screens.
HEK293T/17 (ATCC® CRL-11268™), HepG2 (ATCC® HB-8065™), THLE-2 (ATCC® CRL-2706™), hCMEC/D3 (Millipore, SCC066), and human and mouse BMVECs (Cell Biologics, H-6023 and C57-H6023) were grown in 100 mm dishes and exposed to the Fit4Function or (NNK) 7-mer library (MOI 1E4 for HEK293T/17, MOI3E4 for hCMEC/D3, MOI 6E4 for primary human and mouse BMVECs and MOI5E3 for HepG2 and THLE-2) diluted in 10 mL of growth media at 4° C. with gentle rocking for two hours. After that, cells were washed three times with DPBS, and total DNA was extracted with DNeasy kit (Qiagen) according to the manufacturer instructions. Half of the recovered DNA was used in PCR amplification for viral genome sequence recovery.
Transduction assays were performed as described above with the following exceptions. The cells were cultured in growth media containing virus for 60 hours and total RNA was then extracted with the RNeasy kit (Qiagen), 5 μg of RNA was converted to cDNA using the Maxima H Minus Reverse Transcriptase according to the manufacturer's instructions.
Sequence-to-Function Mapping Functional scores were quantified as the log 2 of the fold-change enrichment of the variant reads-per-million (RPM) after the screen relative to its RPM in the virus library, i.e. log 2 (Assay RPM/Virus RPM). Fit4Function models utilized the same design of the ML framework utilized for production fitness mapping (two-layer LSTM, custom early stopping, batch size of 500 variants, MSE error and Adam optimizer). Out of the 240K variants in the Fit4Function library, 90K were allocated for training and testing the ML function models (model construction) and 150K variants were held-out for validation of the MultiFunction approach. The training size for each function model was optimized independently. As with the production fitness model, the function models were assessed by correlation between the predicted and measured functional scores.
Using the previously generated fitness models of the production fitness and the five functional models described in the Examples provided above, an in-silico screen of 10M randomly sampled 7-mer sequences was conducted to identify variants that are highly fit for all six traits. The threshold of high fitness for each function was arbitrarily set to the 50th percentile of each functional fitness distribution from the Fit4Function screening data. The percentiles were calculated on the detected variants of each functional assay from the 90K model construction data set. To reduce false positive predictions (variants predicted above the thresholds due to model errors), the filtration thresholds were increased slightly when applied to the predictions. For example, if the measured threshold is at fitness score of 2.5, variants predicted to have fitness >2.5+ shift were considered. The shift in applied thresholds is arbitrarily set to be 5% of the fitness dynamic range of each function. The thresholds were then used to filter out the 10M variants that were run through the six functional prediction models. Out of the variants predicted to pass the six modified thresholds, 30K variants were sampled to be included in the MultiFunction library. The 30K variants were each represented by two 7-mer replicates.
The MultiFunction library also included (1) a positive control set (3K) that was drawn from the subset of the 150K Fit4Function validation set that met the six conditions on the actual measurements (without modifying the thresholds), (2) a set of 10K variants randomly sampled from the Fit4Function 240K core variants as background controls representing the high production fitness space, (3) a set of 3K calibration variants present in the Fit4Function library (and the training library) to be used as background controls representing the entire (unbiased) sequence space, and (4) 1K stop codon containing sequences.
The MultiFunction library was synthesized, virus was produced, and the five liver-related functions were screened in the same way the Fit4Function library was processed. The success rate of the MultiFunction library was quantified in terms of hit rate, i.e. out of the 30K variants predicted to meet the six criteria, what percentage satisfied the six criteria when the MultiFunction library was screened on those functions (predicted positive versus measured positive). To determine whether a variant met specific functional criteria, the distribution of that function for the MultiFunction variants against was compared the positive control set. For a variant to be considered a hit for a specific function, its measured value should be above the mean-2SD (standard deviations) of the positive control set measured in the same experiment. A variant was considered a hit in calculating the MultiFunction hit rate only if it was a hit for all six functions; a variant that met five or fewer conditions was not considered a hit.
The hit rate of the Fit4Function space was the number of non-control variants from the Fit4Function library measured to pass the six thresholds (without the prediction marginal shifts used for MultiFunction variant design) divided by the number of non-control variants in the library. The hit rate for the uniform sequence space could be estimated as the hit rate in the Fit4Function library (representing the high production fitness space—all the low production fitness variants were filtered out from the selection), relative to the percentage of the space occupied by the high production fitness variants. Uniform hit rate=Fit4Function hit rate×High production fitness ratio=7.1%×40.8%=2.9%.
Individual capsids were cloned into iCAP-AAV9 (K449R) backbone (GenScript), and administered to C57BL/6J (The Jackson Laboratory, 000664) mice at a dose of 1×1010 vg/mouse (n=5/group). Three weeks later, three separate lobes of the liver were collected for RNA extraction and a single lobe per mouse was dropped fixed into 4% PFA.
For microscopy, fixed liver tissues were sectioned at 100 μm using a Leica VT1200 vibratome. Sections were mounted with ProLong™ Gold Antifade Mountant with DAPI (ThermoFisher, P36931). Liver images were collected using the optical sectioning module on a Keyence BZ-X800 with a Plan Apochromat 20× objective (Keyence, BZ-PA20). 3 images were taken for each animal (n=5/group) and compared to a no injection control (n=3 animals). In CellProfiler, nuclei were segmented and DAPI+ nuclei were identified using a threshold on DAPI intensity determined from the no injection control. Each DAPI+ nuclei was then quantified with the median pixel intensity in the GFP channel.
For assessment of liver transduction by quantitative RT-PCR, total RNA was recovered using TRIzol (Invitrogen, 15596026) following the manufacturer's instructions. Afterwards, total RNA was cleaned up using a RNeasy kit (Qiagen, 74106) followed by on-column DNA digestion. RNA was converted to cDNA using Maxima H Minus Reverse Transcriptase (ThermoFisher Scientific, EP0751) according to manufacturer instructions. Afterwards, qPCR was used to detect AAV encoded RNA transcripts with the following primer pair (5′-GCACAAGCTGGAGTACAACTA-3′ (SEQ ID NO: 77) and 5′-TGTTGTGGCGGATCTTGAA-3′ (SEQ ID NO: 78)) and the following primer pair for GAPDH (5′-ACCACAGTCCATGCCATCAC-3′ (SEQ ID NO: 79) and 5′-TCCACCACCCTGTTGCTGTA-3′ (SEQ ID NO: 80)).
THLE and HepG2 cells were seeded in a 96 well plate the day before adding the AAVs at 5000 vg/cell. For binding assays, viruses were diluted in media and incubated with cells at 4° C. with gentle shaking for one hour. After incubation, cells were washed three times with PBS to remove unbound virus and treated with proteinase K to release viral genomes for qPCR quantification. For transduction assays, cells were incubated with the AAVs for 24 hours at 37° C. and assayed with Britelite plus (Perkin Elmer, cat #6066766) following the manufacturer's protocol.
7 Individual Variants when Tested in Macaque
The rhesus macaque study (n=2) was conducted in the NIH Nonhuman Primate Testing Center for Evaluation of Somatic Cell Genome Editing Tools at the University of California, Davis. All procedures conformed to the requirements of the Animal Welfare Act, and protocols were approved prior to implementation by the UC Davis IACUC.
Approximately 3-month-old rhesus monkeys (˜1 kg; one male, one female) were screened then assigned to the project after confirming seronegative status for AAV9 antibodies. Sedation with Telazol (IM) was performed prior to IV administration of a purified virus library (1×1013 vg/kg) with blood samples collected (˜4 mL; hematology, clinical chemistry, serum, plasma; pre-administration then weekly post-administration). Animals were monitored closely during the study period and until endpoint (four weeks post-administration). They remained robust and healthy with no evidence of adverse findings (body weights, hematology and clinical chemistry panels were all in the normative range at all timepoints; data not shown). Four weeks after systemic delivery, tissues were collected and snap frozen over liquid nitrogen then placed on dry ice immediately prior to storage at ≤−80° C. RNA and DNA were extracted using TRIzol (Invitrogen, 15596026) following the manufacturer's instructions. Total RNA was cleaned up using a RNeasy kit (Qiagen, 74106) followed by on-column DNA digestion. RNA was converted to cDNA using Maxima H Minus Reverse Transcriptase (ThermoFisher Scientific, EP0751) according to the manufacturer's instructions. Samples were then processed as detailed in the NGS sample preparation section.
Relative transduction efficiencies were assessed by measuring the enrichment of the capsid RNA for each variant in the liver relative to the starting virus. It was found that the macaque liver transduction efficiency for the seven individually characterized liver MultiFunction variants are significantly higher than that of AAV9 (
From the foregoing description, it will be apparent that variations and modifications may be made to the invention described herein to adapt it to various usages and conditions. Such embodiments are also within the scope of the following claims.
The recitation of a listing of elements in any definition of a variable herein includes definitions of that variable as any single element or combination (or subcombination) of listed elements. The recitation of an embodiment herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
All patents and publications mentioned in this specification are herein incorporated by reference to the same extent as if each independent patent and publication was specifically and individually indicated to be incorporated by reference. The disclosures of the following U.S. Provisional Patent Applications are incorporated herein by reference in their entireties for all purposes: 63/342,001, filed May 13, 2022 and 63/343,010, filed May 17, 2022.
The present application is a continuation under 35 U.S.C. § 111(a) of PCT International Patent Application No. PCT/US2023/022266, filed May 15, 2023, designating the United States and published in English, which claims priority to and the benefit of PCT Application No. PCT/IB2023/050844, filed Jan. 31, 2023, and to U.S. Provisional Application Nos. 63/476,705, filed Dec. 22, 2022, 63/343,010, filed May 17, 2022, and 63/342,001, filed May 13, 2022, the entire contents of each of which are incorporated by reference herein.
This invention was made with government support under Grant Nos. NS111689, MH120096, OD027094, and OD0101107 awarded by the National Institutes of Health. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63476705 | Dec 2022 | US | |
63343010 | May 2022 | US | |
63342001 | May 2022 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2023/022266 | May 2023 | WO |
Child | 18940565 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/IB2023/050844 | Jan 2023 | WO |
Child | PCT/US2023/022266 | US |