ADENO-ASSOCIATED VIRUS CAPSID POLYPEPTIDES AND VECTORS

Information

  • Patent Application
  • 20230093183
  • Publication Number
    20230093183
  • Date Filed
    February 25, 2021
  • Date Published
    March 23, 2023
Abstract
The present disclosure relates generally to adeno-associated virus (AAV) capsid polypeptides and encoding nucleic acid molecules. The disclosure also relates to AAV vectors comprising the capsid polypeptides, and nucleic acid vectors (e.g. plasmids) comprising the encoding nucleic acid molecules, as well as to host cells comprising the vectors. The disclosure also relates to methods and uses of the polypeptides, encoding nucleic acid molecules, vectors and host cells.
Description
RELATED APPLICATIONS

This application claims priority to Australian Provisional Application No. 2020900529 entitled “Adeno-associated virus capsid polypeptides and vectors”, filed on 25 Feb. 2020, the entire content of which is hereby incorporated herein by reference in its entirety.


FIELD OF THE DISCLOSURE

The present disclosure relates generally to adeno-associated virus (AAV) capsid polypeptides and encoding nucleic acid molecules. The disclosure also relates to AAV vectors comprising the capsid polypeptides, and nucleic acid vectors (e.g. plasmids) comprising the encoding nucleic acid molecules, as well as to host cells comprising the vectors. The disclosure also relates to methods and uses of the polypeptides, encoding nucleic acid molecules, vectors and host cells.


BACKGROUND OF THE DISCLOSURE

Gene therapy has most commonly been investigated and achieved using viral vectors, with notable recent advances being based on adeno-associated viral vectors. Adeno-associated virus (AAV) is a replication-deficient parvovirus, the single-stranded DNA genome of which is about 4.7 kb in length. The AAV genome includes inverted terminal repeats (ITRs) at both ends of the molecule, flanking two open reading frames: rep and cap. The cap gene encodes three structural capsid proteins: VP1, VP2 and VP3. The three capsid proteins typically assemble in a ratio of 1:1:8-10 to form the AAV capsid, although AAV capsids containing only VP3, or VP1 and VP3, or VP2 and VP3, have been produced. The cap gene also encodes the assembly activating protein (AAP) from an alternative open reading frame. AAP promotes capsid assembly, acting to target the capsid proteins to the nucleolus and promote capsid formation. The rep gene encodes four known regulatory proteins: Rep78, Rep68, Rep52 and Rep40. These Rep proteins are involved in AAV genome replication, packaging, genomic integration and other processes. More recently, an X gene has been identified in the 3′ end of the AAV2 genome (Cao et al. PLoS One, 2014, 9:e104596). The encoded X protein appears to be involved in the AAV life cycle, including DNA replication.


The ITRs are involved in several functions, in particular integration of the AAV DNA into the host cell genome, as well as genome replication and packaging. When AAV infects a host cell, the viral genome can integrate into the host's chromosomal DNA resulting in latent infection of the cell. Thus, AAV can be exploited to introduce heterologous sequences into cells. In nature, a helper virus (for example, adenovirus or herpesvirus) provides protein factors that allow for replication of AAV virus in the infected cell and packaging of new virions. In the case of adenovirus, genes E1A, E1B, E2A, E4 and VA provide helper functions. Upon infection with a helper virus, the AAV provirus is rescued and amplified, and both AAV and the helper virus are produced.


AAV vectors (also referred to as recombinant AAV, rAAV) that contain a genome that lacks some, most or all of the native AAV genome and instead contain one or more heterologous sequences flanked by the ITRs, have been successfully used in gene therapy settings. These AAV vectors are widely used to deliver heterologous nucleic acid to cells of a subject for therapeutic purposes, and in many instances, it is the expression of the heterologous nucleic acid that imparts the therapeutic effect. Although several AAV vectors have now been used in the clinic, there are a limited number that exhibit the required in vivo transduction efficiency of primary human cells/tissues to facilitate adequate expression of the heterologous nucleic acid for therapeutic applications. There is therefore a need to develop alternative AAV vectors that contain capsid proteins that facilitate efficient transduction of host cells in vivo.


SUMMARY OF THE DISCLOSURE

The present disclosure is predicated in part on the generation of novel AAV capsid polypeptides. In particular embodiments, the capsid polypeptides facilitate efficient transduction of human cells (such as human hepatocytes) when contained in an AAV vector. Typically, the in vivo transduction of AAV vectors comprising a capsid polypeptide of the present disclosure is improved compared to AAV vectors comprising other AAV capsid polypeptides (e.g. the prototypic AAV2 capsid set forth in SEQ ID NO:1). The capsid polypeptides of the present disclosure are therefore particularly useful in preparing AAV vectors, and in particular, AAV vectors for gene therapy uses. Similarly, AAV vectors comprising a capsid polypeptide of the present disclosure (i.e. having a capsid comprising or consisting of a capsid polypeptide of the present disclosure) are of particular use in gene therapy applications, such as for delivery of heterologous nucleic acids for the treatment of various diseases and conditions.


In one aspect, the disclosure provides a capsid polypeptide, comprising: (i) the sequence of amino acids set forth in any one of SEQ ID NOs:2-20 and 65-79 or a sequence having at least or about 90% or 95% sequence identity thereto; (ii) the sequence of amino acids at positions 138-735 of any one of SEQ ID NOs:2, 6, 7, 9, 10, 12-14, 16-20, 69, 71-74, 76 and 78, positions 138-734 of SEQ ID NO:5, 8 or 11, positions 138-736 of any one of SEQ ID NOs:3, 15, 65, 68, 75, 77 and 79, positions 138-737 of any one of SEQ ID NOs:4, 67 and 70, or positions 138-738 of SEQ ID NO:66; or a sequence having at least or about 90% or 95% sequence identity thereto; and/or (iii) the sequence of amino acids at positions 203-734 of any one of SEQ ID NOs:5, 8 and 11, positions 203-736 of SEQ ID NO:15, positions 204-735 of any one of SEQ ID NOs:2, 6, 7, 9, 10, 12-14, 16-20, 69, 71-74, 76 and 78, positions 204-736 of any one of SEQ ID NOs:3, 65, 68, 75, 77 and 79, positions 204-737 of any one of SEQ ID NOs:4, 67 and 70, or positions 204-738 of SEQ ID NO:66; or a sequence having at least or about 90% or 95% sequence identity thereto.


In one embodiment, the capsid polypeptide comprises (i) the sequence of amino acids set forth in SEQ ID NO:13 or a sequence having at least or about 90%, 95%, 96%, 97%, 98% or 99% sequence identity thereto; (ii) the sequence of amino acids at positions 138-735 of SEQ ID NO:13 or a sequence having at least or about 90%, 95%, 96%, 97%, 98% or 99% sequence identity thereto; and/or (iii) the sequence of amino acids at positions 204-735 of SEQ ID NO:13 or a sequence having at least or about 90%, 95%, 96%, 97%, 98% or 99% sequence identity thereto.


In particular examples, the capsid polypeptide comprises one or more of: a) amino acid residues S263, Q264, S265, S268 and H272, with numbering relative to SEQ ID NO:13; b) amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567, with numbering relative to SEQ ID NO:13; c) amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13; d) amino acid residues D532, S538 and V540, with numbering relative to SEQ ID NO:13; e) amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473, with numbering relative to SEQ ID NO:13; f) amino acid residues L493, S494, G505, A506, V518 and V522, with numbering relative to SEQ ID NO:13; g) the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272, with numbering relative to SEQ ID NO:13; h) the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567, with numbering relative to SEQ ID NO:13; i) the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13; j) the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540, with numbering relative to SEQ ID NO:13; k) the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473, with numbering relative to SEQ ID NO:13; and/or l) the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522, with numbering relative to SEQ ID NO:13.
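
The relationship between a contiguous sequence recited at a position range and the individual numbered residues can be checked mechanically. The following is a minimal sketch only, assuming 1-based residue numbering; the function names `residue_labels` and `has_motif_at` and the padded toy sequence are hypothetical illustrations and are not part of the disclosure.

```python
def residue_labels(motif, start):
    """Label each residue of a motif with its 1-based position, e.g. 'S263'."""
    return [f"{aa}{start + i}" for i, aa in enumerate(motif)]


def has_motif_at(sequence, motif, start):
    """Return True if `motif` occupies positions start..start+len(motif)-1 (1-based)."""
    begin = start - 1  # convert the 1-based position to a 0-based index
    return sequence[begin:begin + len(motif)] == motif


# SEQ ID NO:58 at positions 263-272, as recited in the text.
labels_58 = residue_labels("SQSGASNDNH", 263)

# Toy sequence (NOT a real capsid): 262 placeholder residues, then the motif.
toy_sequence = "X" * 262 + "SQSGASNDNH" + "X" * 10
motif_present = has_motif_at(toy_sequence, "SQSGASNDNH", 263)
```

Under these assumptions, `labels_58` contains S263, Q264, S265, S268 and H272 among its entries, which corresponds to the residues recited in item a).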


Another aspect of the disclosure relates to a capsid polypeptide, comprising: (i) the sequence of amino acids set forth in SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; (ii) the sequence of amino acids at positions 138-735 of SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; and/or (iii) the sequence of amino acids at positions 204-735 of SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; wherein the capsid polypeptide comprises: a) amino acid residues S263, Q264, S265, S268 and H272, with numbering relative to SEQ ID NO:13; and b) amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567, with numbering relative to SEQ ID NO:13; and/or amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13.


In some embodiments, the capsid polypeptide comprises a) the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272, with numbering relative to SEQ ID NO:13; and b) the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567, with numbering relative to SEQ ID NO:13 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. In further embodiments, the capsid polypeptide comprises a) the sequence of amino acids ISSQSGASNDNH (SEQ ID NO:80) at positions 261-272, with numbering relative to SEQ ID NO:13; and b) the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567, with numbering relative to SEQ ID NO:13 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13.


The capsid polypeptide may comprise amino acid residues D532, S538 and V540, with numbering relative to SEQ ID NO:13. In some embodiments, the capsid polypeptide comprises the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540, with numbering relative to SEQ ID NO:13. In further embodiments, the capsid polypeptide comprises the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540, with numbering relative to SEQ ID NO:13.


In some examples, the capsid polypeptide comprises amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473, with numbering relative to SEQ ID NO:13. In one embodiment, the capsid polypeptide comprises the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473, with numbering relative to SEQ ID NO:13. In further embodiments, the capsid polypeptide comprises the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473, with numbering relative to SEQ ID NO:13.


In further examples, the capsid polypeptide comprises amino acid residues L493, S494, G505, A506, V518 and V522, with numbering relative to SEQ ID NO:13. In some embodiments, the capsid polypeptide comprises the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522, with numbering relative to SEQ ID NO:13. In further embodiments, the capsid polypeptide comprises the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522, with numbering relative to SEQ ID NO:13.


In another aspect, the disclosure provides a capsid polypeptide, comprising: (i) the sequence of amino acids set forth in SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; (ii) the sequence of amino acids at positions 138-735 of SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; and/or (iii) the sequence of amino acids at positions 204-735 of SEQ ID NO:13 or a sequence having at least or about 85% sequence identity thereto; wherein the capsid polypeptide comprises amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472, A473, L493, S494, G505, A506, V518, V522, D532, S538, V540, T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566, P567, S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13.


In some embodiments, the capsid polypeptide comprises the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473; the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522; the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540; the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567; and the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. In further embodiments, the capsid polypeptide comprises the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473; the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522; the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540; the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567, with numbering relative to SEQ ID NO:13; and the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. In one example, the capsid polypeptide further comprises a) an insertion of NG after position 262 and residues T263, S264, G265, T268, and T272, with numbering relative to SEQ ID NO:13; or b) an insertion of NG after position 262 and the sequence of amino acids TSGGATNDNT at positions 263-272, with numbering relative to SEQ ID NO:13.


In one embodiment, the capsid polypeptide comprises at least or about 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96% or 97% sequence identity to the sequence of amino acids set forth in SEQ ID NO:13, the sequence of amino acids at positions 138-735 of SEQ ID NO:13, or the sequence of amino acids at positions 204-735 of SEQ ID NO:13.
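
As an illustration of how a percentage sequence identity threshold might be evaluated, the following is a minimal sketch only. It assumes the two sequences have already been aligned to equal length with `-` marking gaps, and it counts identity as matching columns divided by aligned columns; this definition, the function names and the toy sequences are assumptions for illustration and may differ from the alignment method intended by the disclosure.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of aligned columns holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold_percent):
    """Return True if the identity is at least the stated threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent


# Toy 10-residue sequences (hypothetical) differing at one position.
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
```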


In another aspect, the disclosure provides an AAV vector, comprising a capsid polypeptide described herein.


In some examples, the vector exhibits increased in vivo transduction efficiency compared to an AAV vector comprising a capsid polypeptide comprising the sequence of amino acids set forth in SEQ ID NO:1. In particular examples, the vector exhibits increased in vivo transduction efficiency of human hepatocytes compared to an AAV vector comprising a capsid polypeptide comprising the sequence of amino acids set forth in SEQ ID NO:1. In one embodiment, transduction efficiency is increased by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400% or 500%.


In further examples, the AAV vector exhibits increased resistance to neutralization by pooled human immunoglobulins compared to an AAV vector comprising a capsid polypeptide comprising the sequence of amino acids set forth in SEQ ID NO:1. In one embodiment, resistance to neutralization is increased by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400% or 500%.


The AAV vector of the present disclosure may further include a heterologous coding sequence, such as one that encodes a peptide, polypeptide or polynucleotide. In some examples, the peptide, polypeptide or polynucleotide is a therapeutic peptide, polypeptide or polynucleotide.


In further aspects, provided is an isolated nucleic acid molecule encoding a capsid polypeptide described herein, and a vector comprising the nucleic acid molecule. In some examples, the vector is selected from among a plasmid, cosmid, phage and transposon. A host cell comprising an AAV vector, a nucleic acid molecule or a vector described above and herein is also provided.


Also provided is a method for introducing a heterologous coding sequence into a host cell, comprising contacting a host cell with an AAV vector of the present disclosure that comprises a heterologous coding sequence. In some examples, the host cell is a hepatocyte. In some embodiments of the method, contacting a host cell with the AAV vector comprises administering the AAV vector to a subject. In other embodiments, the method is in vitro or ex vivo.


In another aspect, provided is a method for producing an AAV vector, comprising culturing a host cell comprising a nucleic acid molecule encoding a capsid polypeptide of the present disclosure, an AAV rep gene, a heterologous coding sequence flanked by AAV inverted terminal repeats, and helper functions for generating a productive AAV infection, under conditions suitable to facilitate assembly of an AAV vector comprising a capsid comprising the capsid polypeptide, wherein the capsid encapsidates the heterologous coding sequence. In some examples, the host cell is a hepatocyte.


In a further aspect, provided is a method for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector, comprising:


a) identifying a reference capsid polypeptide for transducing human hepatocytes in vivo;


b) modifying the sequence of the reference capsid polypeptide at one or more of positions 263, 264, 265, 268, 272, 546, 547, 549, 550, 551, 552, 553, 554, 555, 556, 558, 559, 561, 566, 567, 580, 581, 585, 586, 590, 592, 593, 594 and 597, with numbering relative to SEQ ID NO:13, to thereby produce a modified capsid polypeptide that comprises: i) amino acid residues S263, Q264, S265, S268 and H272, with numbering relative to SEQ ID NO:13; and ii) amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567, with numbering relative to SEQ ID NO:13; and/or amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13; and


c) vectorising the modified capsid polypeptide to thereby produce a modified AAV vector.
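
Step b) above can be pictured as a set of point substitutions applied to a reference sequence. The following is a minimal sketch only, assuming the reference sequence is already numbered in register with SEQ ID NO:13 (no insertions or deletions); the names `VR1_TARGETS`, `apply_substitutions` and `has_residues`, and the placeholder reference sequence, are hypothetical.

```python
# Target residues from step b) i), with numbering relative to SEQ ID NO:13.
VR1_TARGETS = {263: "S", 264: "Q", 265: "S", 268: "S", 272: "H"}


def apply_substitutions(reference, targets):
    """Return a copy of `reference` with each 1-based position set to its target residue."""
    residues = list(reference)
    for position, amino_acid in targets.items():
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        residues[position - 1] = amino_acid
    return "".join(residues)


def has_residues(sequence, targets):
    """Return True if every target residue is present at its 1-based position."""
    return all(
        position <= len(sequence) and sequence[position - 1] == amino_acid
        for position, amino_acid in targets.items()
    )


# Placeholder reference (NOT a real capsid sequence), used only to run the sketch.
toy_reference = "A" * 300
modified = apply_substitutions(toy_reference, VR1_TARGETS)
```

The same dictionary-based approach would extend to the other recited positions by adding their position and residue pairs.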


In some embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 532, 538 and 540, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues D532, S538 and V540, with numbering relative to SEQ ID NO:13. In further embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 451, 456, 457, 460, 462, 466, 469, 470, 472 and 473, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473, with numbering relative to SEQ ID NO:13. In other embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 493, 494, 505, 506, 518 and 522, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues L493, S494, G505, A506, V518 and V522, with numbering relative to SEQ ID NO:13.


In another aspect, provided is a method for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector, comprising:


a) identifying a reference capsid polypeptide for transducing human hepatocytes in vivo;


b) modifying the sequence of the reference capsid polypeptide at one or more of positions 263-272, 546-567 and 582-597 with numbering relative to SEQ ID NO:13, to thereby produce a modified capsid polypeptide that comprises: i) the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272, with numbering relative to SEQ ID NO:13; and ii) the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567, with numbering relative to SEQ ID NO:13 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13; and


c) vectorising the modified capsid polypeptide to thereby produce a modified AAV vector.


In some embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 532-540, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540, with numbering relative to SEQ ID NO:13. In further embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 451-473, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473, with numbering relative to SEQ ID NO:13. In other embodiments, the method further comprises modifying the sequence of the reference capsid polypeptide at one or more of positions 493-522, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522, with numbering relative to SEQ ID NO:13.


In some examples of the methods for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector, the reference capsid polypeptide comprises at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO:13. In particular embodiments, the methods further comprise assessing the transduction efficiency of the modified AAV vector in an in vivo system that utilises human hepatocytes (e.g. an in vivo system that comprises a small animal (e.g. a mouse) with a chimeric liver comprising human hepatocytes, such as the hFRG mouse model). In particular examples, the modified AAV vector produced by the methods has an in vivo transduction efficiency that is enhanced by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 200%, 300% or more compared to a reference AAV vector comprising the reference capsid polypeptide.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the disclosure are described herein, by way of non-limiting example only, with reference to the following drawings.



FIG. 1 is an alignment of AAV capsid polypeptides.



FIG. 2 is a representation of the in vivo performance of various AAV vectors. A humanised Fah−/−/Rag2−/−/Il2rg−/− (hFRG) mouse harbouring human primary and mouse primary hepatocytes in the liver was injected with 1.8×10¹¹ vg of each of the barcoded AAV vectors. Prototypic AAV2 and AAV8 vectors, as well as bioengineered LK03 and NP59 vectors, were also injected. One week after injection the chimeric liver of the mouse was perfused and human and murine hepatocytes were separated using cell sorting. DNA and RNA were recovered from the human population of hepatocytes and Illumina Next Generation Sequencing (NGS) of the barcoded transgene in each of the AAV vectors was performed. The number of NGS reads specific for the barcodes, and thus each vector, at the DNA and RNA (cDNA) levels were then quantified, and expressed as a proportion of the total reads. The DNA reads were also normalised to the preinjection mix, which was also quantified using NGS of the same barcode region. (A) DNA from human hepatocytes, normalised to pre-injection reads. (B) cDNA from human hepatocytes. (C) DNA from mouse hepatocytes, normalised to pre-injection reads. (D) cDNA from mouse hepatocytes.
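
The read quantification described for FIG. 2 can be sketched as follows. This is an illustrative sketch only: the text does not give the exact normalisation formula, so dividing each vector's proportion of reads in the sample by its proportion in the pre-injection mix is an assumption, and the function names, vector names and read counts are hypothetical.

```python
def proportions(read_counts):
    """Express each barcode's read count as a proportion of the total reads."""
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("no reads to quantify")
    return {name: count / total for name, count in read_counts.items()}


def normalise_to_preinjection(sample_counts, preinjection_counts):
    """Divide each sample proportion by the matching pre-injection proportion (assumed formula)."""
    sample = proportions(sample_counts)
    pre = proportions(preinjection_counts)
    return {name: sample[name] / pre[name] for name in sample}


# Hypothetical read counts for three barcoded vectors.
pre_mix = {"vector_A": 500, "vector_B": 250, "vector_C": 250}
liver_dna = {"vector_A": 200, "vector_B": 600, "vector_C": 200}
enrichment = normalise_to_preinjection(liver_dna, pre_mix)
```

With these toy counts, a value above 1 indicates a vector that is over-represented in the sample relative to its share of the injected mix.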



FIG. 3 is a graphical representation of the in vivo transduction of hepatocytes of select AAV vectors. AAVC11.01, AAVC11.04, AAVC11.05, AAVC11.06, AAVC11.07, AAVC11.09, AAVC11.11, AAVC11.12, AAVC11.13 and AAVC11.15, AAV2, AAV8, LK03, NP59, packaged with 5×barcoded transgene/capsid (BC A-E) were mixed at equal ratio (1×10¹⁰ vg/capsid) and injected into a single hFRG mouse. Human and murine hepatocytes were isolated and sorted after one week. DNA and RNA were extracted and NGS performed on the DNA and cDNA. The graph shows Human Expression Index (HEXI), representing cDNA reads normalized to DNA reads.
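
The expression index described for FIG. 3 is a ratio of cDNA reads to DNA reads per capsid. The following is a minimal sketch only, assuming both inputs are fractions of total mapped reads for the same capsid; the function name `expression_index`, the capsid names and the read fractions are hypothetical.

```python
def expression_index(cdna_fraction, dna_fraction):
    """Ratio of a capsid's cDNA read fraction to its DNA read fraction."""
    if dna_fraction <= 0:
        raise ValueError("DNA read fraction must be positive")
    return cdna_fraction / dna_fraction


# Hypothetical read fractions for two capsids in human hepatocytes.
dna = {"capsid_A": 0.10, "capsid_B": 0.40}
cdna = {"capsid_A": 0.30, "capsid_B": 0.20}
hexi = {name: expression_index(cdna[name], dna[name]) for name in dna}
```

In this toy case capsid_A has the higher index even though capsid_B delivers more vector DNA, which is the distinction between physical and functional transduction that the index is intended to capture.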



FIG. 4 provides graphical representations of the transduction efficiency of AAV vectors in vivo in the presence of IVIg. Three hFRG mice were passively immunized with injections of 1 mg, 5 mg or 20 mg of soluble IVIg, followed by injection with a mix of barcoded AAVC11.01, AAVC11.04, AAVC11.07, AAVC11.09, AAVC11.11-AAVC11.13 and AAVC11.15 vectors and assorted controls. A fourth hFRG mouse that did not receive IVIg injection (the hFRG mouse from FIG. 3) was used as control. DNA and RNA were extracted and NGS performed on the DNA and cDNA. (A) Percentage of NGS reads mapped to each barcode in human hepatocytes at the DNA level (cell entry, physical transduction) in control mouse (i.e. no IVIg). (B) Percentage of NGS reads mapped to each barcode in human hepatocytes at the cDNA level (expression, functional transduction) in control mouse. (C) Estimated reduction in vector genomes per AAV capsid in the presence of IVIg. Values express the logarithm of the quotient between vector genomes of the IVIg conditions (hFRGs #2-4) and the no-IVIg control (hFRG #1). (D) Quantification of the percentage of transduced human hepatocytes per human cluster, n=10 clusters/mouse. For (A-B), data are mean±SD. Statistical significance among means was calculated using the Kruskal-Wallis test, and Dunnett's multiple comparison test was used to compare AAV variants with control AAV-NP59 (*P≤0.05, **P≤0.01, ***P≤0.001, ****P≤0.0001, n.s. P value>0.05). For (D), data are mean±SD. Statistical significance among means was calculated using one-way ANOVA, and Dunnett's multiple comparison test was used to compare AAV-SYDs with the control AAV-NP59 (**** P≤0.0001, n.s. P value>0.05).
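
The quantity plotted in panel (C) of FIG. 4 is described as the logarithm of a quotient of vector genomes. The following is a minimal sketch only; the base of the logarithm is not stated in the text, so base 10 is an assumption here, and the function name `log_reduction` and the input values are hypothetical.

```python
import math


def log_reduction(vg_with_ivig, vg_control):
    """Logarithm (base 10 assumed) of vector genomes with IVIg over the no-IVIg control."""
    if vg_with_ivig <= 0 or vg_control <= 0:
        raise ValueError("vector genome values must be positive")
    return math.log10(vg_with_ivig / vg_control)


# Hypothetical values: a 100-fold drop in vector genomes in the presence of IVIg.
example = log_reduction(1e3, 1e5)
```

Under this convention, a negative value indicates fewer vector genomes in the presence of IVIg than in the control, and zero indicates no change.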



FIG. 5 provides graphical representations of the transduction efficiency of AAV vectors in vivo. An NGS-based comparison of AAVC11.12 and relevant AAV variants in FRG mice engrafted with hepatocytes from different human donors was performed. (A-C) Combined transduction of the barcoded AAV-mix encompassing the ten serotypes in N=32 hFRGs (N=31 for vector copy number). Each data point represents an independent mouse. (A) Percentage of GFP+ cells on FAC-sorted human hepatocytes and murine liver cells. (B) Percentage of GFP+ cells on FAC-sorted human hepatocytes engrafted with male and female donors. (C) Vector copy number per diploid human hepatocyte on FAC-sorted human hepatocytes. For (A-C), data are mean±SD. Statistical significance among means was calculated using a paired t-test, an unpaired t-test and an unpaired t-test with Welch's correction, respectively (* P≤0.05, **** P≤0.0001, n.s. P value>0.05). (D) Percentage of NGS reads mapped to each AAV capsid (sum of n=5 barcodes/capsid) in human hepatocytes at the DNA (cell entry, physical transduction) level, normalized to the pre-injection mix, is shown. (E) Percentage of NGS reads mapped to each AAV capsid (sum of n=5 barcodes/capsid) in human hepatocytes at the cDNA (expression, functional transduction) level, normalized to the pre-injection mix, is shown. For (D-E), each data point represents percentage in an independent mouse (N=31 hFRGs analysed for DNA and N=32 for cDNA). Data are mean±SD. Statistical significance among means was calculated using one-way ANOVA, and Dunnett's multiple comparison test was used to compare AAV-SYD12 with all other AAV variants (**** P≤0.0001, n.s. P value>0.05). (F) Average percentage of mapped NGS reads per AAV capsid in FAC-sorted human hepatocytes at the DNA (N=31 hFRGs) and cDNA (N=32 hFRGs) level. The expression index is defined as the quotient between the average cDNA and DNA read percentages.



FIG. 6 is a schematic representation of analysis of the parental contribution to the AAV capsid protein sequences. Library parents are depicted as horizontal dotted lines (from top to bottom: AAV1, AAV2, AAV3b, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11 and AAV12). Large dots represent 100% parental match (i.e. the position in question matches only one parent) and small dots represent more than one parental match (i.e. the position matches more than one parent) at each position. The solid line for each chimera represents the library parents identified within the sequence between crossovers. A set of thin horizontal parallel lines between crossovers indicates multiple parents match at an equal probability.
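
The per-position classification described for FIG. 6 (a position matching only one parent versus more than one parent) can be sketched as follows. This is an illustrative sketch only, assuming the chimera and parental sequences are pre-aligned to equal length; the function names, parent names and four-residue toy sequences are hypothetical, and the crossover analysis itself is not reproduced.

```python
def parental_matches(chimera, parents):
    """For each position, list the parents whose residue equals the chimera's residue."""
    for name, sequence in parents.items():
        if len(sequence) != len(chimera):
            raise ValueError(f"parent {name} is not aligned to the chimera")
    return [
        [name for name, sequence in parents.items() if sequence[index] == residue]
        for index, residue in enumerate(chimera)
    ]


def classify(matching_parents):
    """'unique' for exactly one parental match, 'shared' for several, 'none' otherwise."""
    if len(matching_parents) == 1:
        return "unique"
    if len(matching_parents) > 1:
        return "shared"
    return "none"


# Toy aligned sequences (hypothetical, not real capsid sequences).
toy_parents = {"P1": "ACDE", "P2": "ACFG", "P3": "TCFE"}
toy_chimera = "TCDG"
calls = [classify(m) for m in parental_matches(toy_chimera, toy_parents)]
```

In the terms used for FIG. 6, a "unique" call corresponds to a large dot and a "shared" call to a small dot.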



FIG. 7 is a schematic representation of analysis of the parental contribution to the AAVC11.12 capsid protein sequence. The thick solid line represents the most probable parental origin of each region based on the longest sequence of identity to parental variants in a 5′ to 3′ direction. Parental AAVs are in horizontal dotted lines (AAV1-12, from top to bottom). VR-I and VR-IV to VR-VIII from AAVC11.12 are shown in blocks with an indication of parental origin (AAV2, AAV10, or AAV7).



FIG. 8 provides graphical representations of the transduction efficiency of AAV vectors in vivo. A barcoded NGS comparison of AAVC11.12 with parental AAV2, AAV7, and AAV10 using two humanised FRG mice (hFRG #31 and hFRG #44) was performed. Percentage of NGS reads mapped to each barcode in human and murine hepatocytes at the DNA (cell entry, physical transduction) and cDNA (expression, functional transduction) level, normalized to the pre-injection mix, is shown. (A) Human hepatocyte entry (DNA). (B) Human hepatocyte expression (cDNA). (C) Mouse hepatocyte entry (DNA). (D) Mouse hepatocyte expression (cDNA). Data for hFRG #31 are on the left and data for hFRG #44 are on the right of each entry for each mouse on the graph. Data are mean±SD. Statistical significance among means was calculated using the Kruskal-Wallis test, and Dunnett's multiple comparison test was used to compare AAV-SYD12 and parental AAV variants with control AAV8 (*P≤0.05, **P≤0.01, ***P≤0.001, ****P≤0.0001, n.s. P value>0.05).



FIG. 9 is a schematic representation of AAV variable regions swapped into the AAV8 capsid scaffold.



FIG. 10 is an alignment of the sequences of the AAV8 and AAVC11.12 capsid polypeptides. Variable region (VR)-I, VR-IV, VR-V, VR-VI, VR-VII and VR-VIII are shown, with residues making up those regions bolded and in italics in the AAV8 polypeptide. Residues from AAVC11.12 that were used to replace the corresponding residue in AAV8 are underlined, and the region spanning the first and last replacement for each variable region is shaded in grey.



FIG. 11 is a representation of the in vivo performance of AAVC11.12, AAV8 and Swaps 1-7 in hFRG mice (N=2). The percentage of NGS reads mapped to each AAV capsid (sum of n=5 barcodes/capsid) in human hepatocytes and in the murine liver at the DNA (cell entry, physical transduction) and cDNA (expression, functional transduction) level, normalized to the pre-injection mix, is shown. Variable region origin for each capsid is shown for reference in the bottom panel, with variable regions of AAVC11.12 origin in dark grey and variable regions of AAV8 origin in light grey.



FIG. 12 is a representation of the in vivo performance of AAVC11.12, AAV8 and Swaps 1-15 in hFRG mice (N=2). The percentage of NGS reads mapped to each AAV capsid (sum of n=5 barcodes/capsid) in human hepatocytes and in the murine liver at the DNA (cell entry, physical transduction) and cDNA (expression, functional transduction) level, normalized to the pre-injection mix, is shown. Variable region origin for each capsid is shown for reference in the bottom panel, with variable regions of AAVC11.12 origin in dark grey and variable regions of AAV8 origin in light grey.



FIG. 13 is a representation of the in vivo performance of AAVC11.12, AAV8 and Swaps 1-7 in highly engrafted hFRG mice (N=2). The percentage of NGS reads mapped to each AAV capsid (sum of n=5 barcodes/capsid) in human hepatocytes at the DNA (cell entry, physical transduction) and cDNA (expression, functional transduction) level, normalized to the pre-injection mix, is shown. Variable region origin for each capsid is shown for reference in the bottom panel, with variable regions of AAVC11.12 origin in dark grey and variable regions of AAV8 origin in light grey.
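
The normalization to the pre-injection mix referred to in the descriptions of FIGS. 8 and 11-13 can be illustrated with a minimal sketch. It assumes that normalization means dividing each capsid's share of reads in the tissue sample by its share of reads in the pre-injection mix and then rescaling so that the values sum to 100%; the disclosure does not set out the exact formula, and the function name and read counts below are hypothetical.

```python
def normalized_read_percentages(sample_counts, premix_counts):
    """Return per-capsid read percentages corrected for input representation.

    Assumption: each capsid's fraction of reads in the sample is divided by
    its fraction of reads in the pre-injection mix, then rescaled to 100%.
    """
    sample_total = sum(sample_counts.values())
    premix_total = sum(premix_counts.values())
    ratios = {}
    for capsid, count in sample_counts.items():
        sample_fraction = count / sample_total
        premix_fraction = premix_counts[capsid] / premix_total
        ratios[capsid] = sample_fraction / premix_fraction
    ratio_total = sum(ratios.values())
    return {capsid: 100.0 * r / ratio_total for capsid, r in ratios.items()}


# Hypothetical read counts (already summed over the barcodes of each capsid).
sample = {"AAVC11.12": 600, "AAV8": 300, "AAV2": 100}
premix = {"AAVC11.12": 250, "AAV8": 500, "AAV2": 250}
percentages = normalized_read_percentages(sample, premix)
```

Under this assumption, a capsid that is under-represented in the pre-injection mix but abundant in the sample receives a proportionally larger normalized percentage.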





DETAILED DESCRIPTION

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which the disclosure belongs. All patents, patent applications, published applications and publications, databases, websites and other published materials referred to throughout the entire disclosure, unless noted otherwise, are incorporated by reference in their entirety. In the event that there is a plurality of definitions for terms, those in this section prevail. Where reference is made to a URL or other such identifier or address, it is understood that such identifiers can change and particular information on the internet can come and go, but equivalent information can be found by searching the internet. Reference to the identifier evidences the availability and public dissemination of such information.


As used herein, the singular forms “a”, “an” and “the” also include plural aspects (i.e. at least one or more than one) unless the context clearly dictates otherwise. Thus, for example, reference to “a polypeptide” includes a single polypeptide, as well as two or more polypeptides.


In the context of this specification, the term “about” is understood to refer to a range of numbers that a person of skill in the art would consider equivalent to the recited value in the context of achieving the same function or result.


Throughout this specification and the claims that follow, unless the context requires otherwise, the word “comprise”, and variations such as “comprises” and “comprising”, will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.


As used herein, a “vector” includes reference to both polynucleotide vectors and viral vectors, each of which is capable of delivering a transgene contained within the vector into a host cell. Vectors can be episomal, i.e., do not integrate into the genome of a host cell, or can integrate into the host cell genome. The vectors may also be replication competent or replication deficient. Exemplary polynucleotide vectors include, but are not limited to, plasmids, cosmids and transposons. Exemplary viral vectors include, for example, AAV, lentiviral, retroviral, adenoviral, herpes viral and hepatitis viral vectors.


As used herein, “adeno-associated viral vector” or “AAV vector” refers to a vector in which the capsid is derived from an adeno-associated virus, including without limitation, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12 or AAV13, AAV from other clades or isolates, or is derived from synthetic, bioengineered or modified AAV capsid proteins, including chimeric capsid proteins. In particular embodiments, the AAV vector has a capsid comprising a capsid polypeptide of the present disclosure. When referring to AAV vectors, both the source of the genome and the source of the capsid can be identified, where the source of the genome is the first number designated and the source of the capsid is the second number designated. Thus, for example, a vector in which both the capsid and genome are derived from AAV2 is more accurately referred to as AAV2/2. A vector with an AAV6-derived capsid and an AAV2-derived genome is most accurately referred to as AAV2/6. A vector with the bioengineered DJ capsid and an AAV2-derived genome is most accurately referred to as AAV2/DJ. For simplicity, and because most vectors use an AAV2-derived genome, it is understood that reference to an AAV6 vector generally refers to an AAV2/6 vector, reference to an AAV2 vector generally refers to an AAV2/2 vector, etc. An AAV vector may also be referred to herein as “recombinant AAV”, “rAAV”, “recombinant AAV virion”, “rAAV virion”, “AAV variant”, “recombinant AAV variant”, and “rAAV variant”, terms which are used interchangeably and refer to a replication-defective virus that includes an AAV capsid shell encapsidating an AAV genome. The AAV vector genome (also referred to as vector genome, recombinant AAV genome or rAAV genome) comprises a transgene flanked on both sides by functional AAV ITRs. Typically, one or more of the wild-type AAV genes have been deleted from the genome in whole or part, preferably the rep and/or cap genes.
Functional ITR sequences are necessary for the rescue, replication and packaging of the vector genome into the rAAV virion.
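
The genome/capsid naming convention described above can be sketched as a small helper. This is an illustration only: the function name is hypothetical, and the default of an AAV2-derived genome for shorthand names simply follows the convention stated in the preceding paragraph.

```python
def expand_aav_name(name, default_genome="2"):
    """Expand an AAV vector name to the explicit 'AAV<genome>/<capsid>' form.

    Shorthand names such as 'AAV6' are assumed to carry an AAV2-derived
    genome, so 'AAV6' becomes 'AAV2/6'. Names that already specify both
    parts, such as 'AAV2/DJ', are returned unchanged.
    """
    if not name.startswith("AAV") or len(name) == 3:
        raise ValueError(f"not an AAV vector name: {name!r}")
    body = name[3:]
    if "/" in body:
        genome, capsid = body.split("/", 1)
    else:
        genome, capsid = default_genome, body
    return f"AAV{genome}/{capsid}"


full_names = [expand_aav_name(n) for n in ("AAV6", "AAV2", "AAV2/DJ")]
```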


The term “ITR” refers to an inverted terminal repeat at either end of the AAV genome. This sequence can form hairpin structures and is involved in AAV DNA replication and rescue, or excision, from prokaryotic plasmids. ITRs for use in the present disclosure need not be the wild-type nucleotide sequences, and may be altered, e.g., by the insertion, deletion or substitution of nucleotides, as long as the sequences provide for functional rescue, replication and packaging of rAAV.


As used herein, “functional” with reference to a capsid polypeptide means that the polypeptide can self-assemble or assemble with different capsid polypeptides to produce the proteinaceous shell (capsid) of an AAV virion. It is to be understood that not all capsid polypeptides in a given host cell assemble into AAV capsids. Preferably, at least 25%, at least 50%, at least 75%, at least 85%, at least 90%, or at least 95% of all AAV capsid polypeptide molecules assemble into AAV capsids. Suitable assays for measuring this biological activity are described e.g. in Smith-Arica and Bartlett (2001), Curr Cardiol Rep 3(1): 43-49.


“AAV helper functions” or “helper functions” refer to functions that allow AAV to be replicated and packaged by a host cell. AAV helper functions can be provided in any of a number of forms, including, but not limited to, as a helper virus or as helper virus genes which aid in AAV replication and packaging. Helper virus genes include, but are not limited to, adenoviral helper genes such as E1A, E1B, E2A, E4 and VA. Helper viruses include, but are not limited to, adenoviruses, herpesviruses, poxviruses such as vaccinia, and baculovirus. The adenoviruses encompass a number of different subgroups, although Adenovirus type 5 of subgroup C (Ad5) is most commonly used. Numerous adenoviruses of human, non-human mammalian and avian origin are known and are available from depositories such as the ATCC. Viruses of the herpes family, which are also available from depositories such as ATCC, include, for example, herpes simplex viruses (HSV), Epstein-Barr viruses (EBV), cytomegaloviruses (CMV) and pseudorabies viruses (PRV). Baculoviruses available from depositories include Autographa californica nuclear polyhedrosis virus.


As used herein, the term “transduction” refers to entry of AAV vector into one or more particular cell types and transferal of the DNA contained within the AAV vector into the cell. Transduction can be assessed by measuring the amount of AAV DNA or RNA expressed from the AAV DNA in a cell or population of cells, and/or by assessing the number of cells in a population that contain AAV DNA or RNA expressed from the DNA. Where the presence or amount of RNA is assessed, the type of transduction assessed is referred to herein as “functional transduction”, i.e. the ability of the AAV to transfer DNA to the cell and have that DNA expressed. The term “transduction efficiency” and grammatical variations thereof refers to the ability of an AAV vector to transduce host cells, and more particularly the efficiency with which an AAV vector transduces host cells. In particular embodiments, the transduction efficiency is in vivo transduction efficiency, and refers to the ability of an AAV vector to transduce host cells in vivo following administration of the vector to the subject. Transduction efficiency can be assessed in a number of ways known to those in the art, including assessing the number of host cells transduced following exposure to, or administration of, a given number of vector particles (e.g. as assessed by expression of a reporter gene from the vector genome, such as GFP or eGFP, using microscopy or flow cytometry techniques); the amount of vector DNA (e.g. number of vector genomes) in a population of host cells following exposure to a given number of vector particles; the amount of vector RNA in a population of host cells following exposure to a given number of vector particles; and the level of protein expression from a reporter gene (e.g. GFP or eGFP) in the vector genome in a population of host cells following exposure to, or administration of, a given number of vector particles.
The population of host cells can represent a particular number of host cells, a volume or weight of tissue, or an entire organ (e.g. liver). In vivo transduction efficiency can reflect the ability of an AAV vector to access host cells, such as hepatocytes in the liver; the ability of an AAV vector to enter host cells; and/or expression of a heterologous coding sequence contained in the vector genome upon host cell entry.


As used herein, “corresponding nucleotides”, “corresponding amino acid residues” or “corresponding positions” refer to nucleotides, amino acids or positions that occur at aligned loci. The sequences of related or variant polynucleotides or polypeptides are aligned by any method known to those of skill in the art. Such methods typically maximize matches (e.g. identical nucleotides or amino acids at positions), and include methods such as using manual alignments and by using the numerous alignment programs available (for example, BLASTN, BLASTP, ClustalW, ClustalW2, EMBOSS, LALIGN, Kalign, etc.) and others known to those of skill in the art. By aligning the sequences of polynucleotides, one skilled in the art can identify corresponding nucleotides. For example, by aligning the prototypic AAV2 capsid polypeptide set forth in SEQ ID NO:1 with another AAV capsid polypeptide (e.g. as shown in FIG. 1), one of skill in the art can identify regions or amino acid residues within the other AAV polypeptide that correspond to various regions or residues in the AAV polypeptide set forth in SEQ ID NO:1. For example, the methionine at position 204 of SEQ ID NO:2 is the corresponding amino acid of, or corresponds to, the methionine at position 203 of SEQ ID NO:1. In another example, and with reference to the alignment of the capsid polypeptides of AAV8 and AAVC11.12 in FIG. 10, position 262 of the AAVC11.12 capsid polypeptide aligns with, or corresponds to, position 264 of the AAV8 capsid polypeptide, and the serine at position 262 of the AAVC11.12 capsid polypeptide corresponds to, or is the corresponding amino acid of, the threonine at position 264 of the AAV8 capsid polypeptide. Thus, when amino acid residues or positions are referred to herein with respect to a particular capsid polypeptide, it is understood that, where appropriate, the reference is also to the corresponding amino acid residue or position in another capsid polypeptide.
For example, reference to a capsid polypeptide comprising “S264 with numbering relative to SEQ ID NO:13” encompasses not only the AAVC11.12 capsid polypeptide set forth in SEQ ID NO:13 having a serine at position 264, but also other capsid polypeptides having a serine at the position that corresponds to position 264 of SEQ ID NO:13. This includes, for example, capsid polypeptides such as the AAV8Swap1 (SEQ ID NO:65) capsid polypeptide, where the position in AAV8Swap1 that corresponds to position 264 of SEQ ID NO:13 is position 264 and is occupied by a serine; and the AAVC11.12 VP3 protein, where the position in the AAVC11.12 VP3 protein that corresponds to position 264 of SEQ ID NO:13 is position 60 (and is of course also occupied by a serine). In another example, reference to a capsid polypeptide comprising “S580 with numbering relative to SEQ ID NO:13” refers to the AAVC11.12 capsid polypeptide set forth in SEQ ID NO:13 having a serine at position 580 and to other capsid polypeptides having a serine at the position that corresponds to position 580 of SEQ ID NO:13, such as the AAV8Swap3 capsid polypeptide (SEQ ID NO:67), where the position in AAV8Swap3 that corresponds to position 580 of SEQ ID NO:13 is position 582 and is occupied by a serine.
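
The identification of corresponding positions from an alignment can be sketched as follows. The example assumes the two sequences have already been aligned by one of the programs mentioned above and are supplied as equal-length strings with "-" marking gaps; the function name and the short toy sequences are hypothetical and are not taken from the sequence listing.

```python
def corresponding_position(aligned_a, aligned_b, pos_a, gap="-"):
    """Map a 1-based residue position in sequence A to sequence B.

    Both arguments are rows of the same pairwise alignment. Returns the
    1-based position in B occupying the same alignment column, or None if
    the residue in A is aligned to a gap in B.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    index_a = index_b = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a != gap:
            index_a += 1
        if res_b != gap:
            index_b += 1
        if res_a != gap and index_a == pos_a:
            return index_b if res_b != gap else None
    raise IndexError(f"position {pos_a} is outside sequence A")


# Toy alignment: B has one extra residue near the start, A has one later on.
row_a = "MA-DGYLT"
row_b = "MAADG-LS"
mapped = corresponding_position(row_a, row_b, 3)
```

In this toy alignment, residue 3 of the first sequence falls in the same column as residue 4 of the second, which mirrors how an insertion or deletion upstream shifts the numbering between two capsid polypeptides.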


A “heterologous coding sequence” as used herein refers to a nucleic acid sequence present in a polynucleotide, vector, or host cell that is not naturally found in the polynucleotide, vector, or host cell or is not naturally found at the position that it is at in the polynucleotide, vector, or host cell, i.e. is non-native. A “heterologous coding sequence” can encode a peptide or polypeptide, or a polynucleotide that itself has a function or activity, such as an antisense or inhibitory oligonucleotide, including antisense DNA and RNA (e.g. miRNA, siRNA, and shRNA). In some examples, the heterologous coding sequence is a stretch of nucleic acids that is essentially homologous to a stretch of nucleic acids in the genomic DNA of an animal, such that when the heterologous coding sequence is introduced into a cell of the animal, homologous recombination between the heterologous sequence and the genomic DNA can occur. In one example, the heterologous coding sequence is a functional copy of a gene for introduction into a cell that has a defective/mutated copy.


As used herein, the term “operably-linked” with reference to a promoter and a coding sequence means that the transcription of the coding sequence is under the control of, or driven by, the promoter.


The term “host cell” refers to a cell, such as a mammalian cell, into which exogenous DNA, such as a vector or other polynucleotide, has been introduced. The term includes the progeny of the original cell into which the exogenous DNA has been introduced. Thus, a “host cell” as used herein generally refers to a cell that has been transfected or transduced with exogenous DNA.


As used herein, “isolated” with reference to a polynucleotide or polypeptide means that the polynucleotide or polypeptide is substantially free of cellular material or other contaminating proteins from the cells from which the polynucleotide or polypeptide is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized.


The term “subject” as used herein refers to an animal, in particular a mammal and more particularly a primate including a lower primate and even more particularly, a human who can benefit from the present invention. A subject, regardless of whether a human or non-human animal or embryo, may be referred to as an individual, subject, animal, patient, host or recipient. The present disclosure has both human and veterinary applications. For convenience, an “animal” specifically includes livestock animals such as cattle, horses, sheep, pigs, camelids, goats and donkeys, as well as domestic animals, such as dogs and cats. With respect to horses, these include horses used in the racing industry as well as those used recreationally or in the livestock industry. Examples of laboratory test animals include mice, rats, rabbits, guinea pigs and hamsters. Rabbits and rodent animals, such as rats and mice, provide a convenient test system or animal model as do primates and lower primates. In some embodiments, the subject is human.


As used herein, the term “conservative sequence modifications” or “conservative substitution” refers to amino acid modifications that do not significantly affect or alter the characteristics of a vector containing the amino acid sequence. Such conservative modifications include amino acid substitutions, additions and deletions. Modifications can be introduced into a vector that are compatible with various embodiments by standard techniques known in the art, such as site-directed mutagenesis and PCR-mediated mutagenesis. Conservative amino acid substitutions are ones in which an amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, tryptophan), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, one or more amino acid residues within a capsid can be replaced with other amino acid residues from the same side chain family and the altered capsid can be tested for tropism and/or the ability to deliver a payload using the functional assays described herein.
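
The side chain families listed above lend themselves to a simple lookup. The following is a minimal sketch that encodes those families with one-letter amino acid codes and treats a substitution as conservative when both residues share at least one family; the names used are hypothetical, and whether a given substitution actually preserves vector function would still need to be confirmed with the functional assays described herein.

```python
# Side chain families as listed in the definition above (one-letter codes).
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged polar": set("GNQSTYCW"),
    "nonpolar": set("AVLIPFM"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative_substitution(original, replacement):
    """Return True if both residues belong to at least one common family."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return False  # same residue, so not a substitution at all
    return any(
        original in family and replacement in family
        for family in SIDE_CHAIN_FAMILIES.values()
    )


lys_to_arg = is_conservative_substitution("K", "R")  # both basic
lys_to_asp = is_conservative_substitution("K", "D")  # basic vs acidic
```

Because some residues appear in more than one family (for example, threonine is both uncharged polar and beta-branched), the check accepts a match in any family.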


It will be appreciated that the above described terms and associated definitions are used for the purpose of explanation only and are not intended to be limiting.









TABLE 1
Brief Description of the Sequences

SEQ ID NO.  Description
1           Prototypic AAV2 capsid polypeptide
2           AAVC11.01 capsid polypeptide (VP1)
3           AAVC11.02 capsid polypeptide (VP1)
4           AAVC11.03 capsid polypeptide (VP1)
5           AAVC11.04 capsid polypeptide (VP1)
6           AAVC11.05 capsid polypeptide (VP1)
7           AAVC11.06 capsid polypeptide (VP1)
8           AAVC11.07 capsid polypeptide (VP1)
9           AAVC11.08 capsid polypeptide (VP1)
10          AAVC11.09 capsid polypeptide (VP1)
11          AAVC11.10 capsid polypeptide (VP1)
12          AAVC11.11 capsid polypeptide (VP1)
13          AAVC11.12 capsid polypeptide (VP1)
14          AAVC11.13 capsid polypeptide (VP1)
15          AAVC11.14 capsid polypeptide (VP1)
16          AAVC11.15 capsid polypeptide (VP1)
17          AAVC11.16 capsid polypeptide (VP1)
18          AAVC11.17 capsid polypeptide (VP1)
19          AAVC11.18 capsid polypeptide (VP1)
20          AAVC11.19 capsid polypeptide (VP1)
21          AAVC11.01 capsid polynucleotide
22          AAVC11.02 capsid polynucleotide
23          AAVC11.03 capsid polynucleotide
24          AAVC11.04 capsid polynucleotide
25          AAVC11.05 capsid polynucleotide
26          AAVC11.06 capsid polynucleotide
27          AAVC11.07 capsid polynucleotide
28          AAVC11.08 capsid polynucleotide
29          AAVC11.09 capsid polynucleotide
30          AAVC11.10 capsid polynucleotide
31          AAVC11.11 capsid polynucleotide
32          AAVC11.12 capsid polynucleotide
33          AAVC11.13 capsid polynucleotide
34          AAVC11.14 capsid polynucleotide
35          AAVC11.15 capsid polynucleotide
36          AAVC11.16 capsid polynucleotide
37          AAVC11.17 capsid polynucleotide
38          AAVC11.18 capsid polynucleotide
39          AAVC11.19 capsid polynucleotide
40          Shuffling_Rescue-F primer
41          Shuffling_Rescue-R primer
42          BB_GAR-F primer
43          BB_GAR-R primer
44          CapRescue-F primer
45          CapRescue-R primer
46          pHelperF primer
47          pHelperR primer
48          GFP-F1 primer
49          GFP-R1 primer
50          rep-F1 primer
51          rep-R2 primer
52          BC_F primer
53          BC_R primer
54          External_5_Seq primer
55          External_3_Seq primer
56          human_AAAVC._F primer
57          human_AAAVC._R primer
58          SQSGASNDNH (residues 263-272 of SEQ ID NO: 13)
59          TGATNKTTLENVLMTNEEEIRP (residues 546-567 of SEQ ID NO: 13)
60          SSNLQAANTAAQTQVVNN (residues 582-597 of SEQ ID NO: 13)
61          DRFFPSSGV (residues 532-540 of SEQ ID NO: 13)
62          STGGTQGTQQLLFSQAGPANMSA (residues 451-473 of SEQ ID NO: 13)
63          LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (residues 493-522 of SEQ ID NO: 13)
64          AAV8 capsid polypeptide (VP1)
65          AAV8 Swap 1 capsid polypeptide
66          AAV8 Swap 2 capsid polypeptide
67          AAV8 Swap 3 capsid polypeptide
68          AAV8 Swap 4 capsid polypeptide
69          AAV8 Swap 5 capsid polypeptide
70          AAV8 Swap 6 capsid polypeptide
71          AAV8 Swap 7 capsid polypeptide
72          AAV8 Swap 8 capsid polypeptide
73          AAV8 Swap 9 capsid polypeptide
74          AAV8 Swap 10 capsid polypeptide
75          AAV8 Swap 11 capsid polypeptide
76          AAV8 Swap 12 capsid polypeptide
77          AAV8 Swap 13 capsid polypeptide
78          AAV8 Swap 14 capsid polypeptide
79          AAV8 Swap 15 capsid polypeptide
80          ISSQSGASNDNH (residues 261-272 of SEQ ID NO: 13)
81          KTGATNKTTLENVLMTNEEEIRP (residues 545-567 of SEQ ID NO: 13)
82          AMATHKDDEDRFFPSSGV (residues 523-540 of SEQ ID NO: 13)
83          QSTGGTQGTQQLLFSQAGPANMSA (residues 450-473 of SEQ ID NO: 13)
84          RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (residues 488-522 of SEQ ID NO: 13)
85          AAV8 Swap 1 capsid polynucleotide
86          AAV8 Swap 2 capsid polynucleotide
87          AAV8 Swap 3 capsid polynucleotide
88          AAV8 Swap 4 capsid polynucleotide
89          AAV8 Swap 5 capsid polynucleotide
90          AAV8 Swap 6 capsid polynucleotide
91          AAV8 Swap 7 capsid polynucleotide
92          AAV8 Swap 8 capsid polynucleotide
93          AAV8 Swap 9 capsid polynucleotide
94          AAV8 Swap 10 capsid polynucleotide
95          AAV8 Swap 11 capsid polynucleotide
96          AAV8 Swap 12 capsid polynucleotide
97          AAV8 Swap 13 capsid polynucleotide
98          AAV8 Swap 14 capsid polynucleotide
99          AAV8 Swap 15 capsid polynucleotide









Capsid Polypeptides

The present disclosure is predicated in part on the identification of novel AAV capsid polypeptides. Typically, the capsid polypeptides, when present in the capsid of an AAV vector, facilitate efficient transduction of human cells (such as human hepatocytes). The in vivo transduction of cells by AAV vectors having a capsid comprising a capsid polypeptide of the present disclosure is generally increased or enhanced compared to AAV vectors comprising a reference AAV capsid polypeptide (e.g. the prototypic AAV2 capsid set forth in SEQ ID NO:1). Transduction or transduction efficiency of AAV vectors can be increased by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 1000% or more, e.g. an AAV vector comprising a capsid polypeptide of the present disclosure can be at least or about 1.2×, 1.5×, 2×, 3×, 4×, 5×, 6×, 7×, 8×, 9×, 10×, 11×, 12×, 13×, 14×, 15×, 16×, 17×, 18×, 19×, 20×, 30×, 40×, 50×, 60×, 70×, 80×, 90×, 100× or more efficient at transducing cells in vivo compared to a reference AAV capsid polypeptide (e.g. one set forth in SEQ ID NO:1). In particular examples, the increased transduction or transduction efficiency is observed in human liver tissue or human hepatocytes.
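
The relationship between a percentage increase and a fold difference in transduction efficiency is simple arithmetic and can be sketched as below. The function names and the example measurements are hypothetical; the only assumption is that both vectors are measured with the same assay and in the same units.

```python
def fold_change(test_value, reference_value):
    """Efficiency of the test vector relative to the reference vector."""
    if reference_value <= 0:
        raise ValueError("reference value must be positive")
    return test_value / reference_value


def percent_increase(fold):
    """Convert a fold difference to a percentage increase (2x -> 100%)."""
    return (fold - 1.0) * 100.0


def fold_from_percent_increase(percent):
    """Convert a percentage increase to a fold difference (100% -> 2x)."""
    return 1.0 + percent / 100.0


# Hypothetical measurements, e.g. vector genomes per cell for each vector.
fold = fold_change(30.0, 10.0)
increase = percent_increase(fold)
```

On this arithmetic, an increase of 100% corresponds to a vector that is 2× as efficient as the reference, and an increase of 1000% corresponds to 11×.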


AAV vectors comprising a capsid of the present disclosure may also exhibit enhanced or increased resistance to neutralization by pooled human immunoglobulins (also referred to as intravenous immunoglobulin or IVIg). The resistance to IVIg neutralization can be observed in vivo or in vitro using well-known assays, such as those described in the Examples below. The resistance to IVIg neutralization can be increased by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 1000% or more, e.g. the resistance to IVIg neutralization of the AAV vector comprising a capsid polypeptide of the present disclosure can be at least or about 1.2×, 1.5×, 2×, 3×, 4×, 5×, 6×, 7×, 8×, 9×, 10×, 11×, 12×, 13×, 14×, 15×, 16×, 17×, 18×, 19×, 20×, 30×, 40×, 50×, 60×, 70×, 80×, 90×, 100× or more than the resistance to IVIg neutralization of an AAV vector comprising a reference AAV capsid polypeptide (e.g. one set forth in SEQ ID NO:1).


The capsid polypeptides of the present disclosure are therefore particularly useful in preparing AAV vectors, and in particular AAV vectors for gene therapy uses. In exemplary embodiments, the capsid polypeptides of the present disclosure are particularly useful in preparing AAV vectors that transduce hepatocytes, and in particular, human hepatocytes, and are thus useful for gene therapy applications targeting the liver.


Provided herein are polypeptides, including isolated polypeptides, comprising all or a portion of an AAV capsid polypeptide set forth in any one of SEQ ID NOs: 2-20 and 65-79, including all or a portion of the VP1 protein (comprising amino acid residues corresponding to those at positions 1-735 of SEQ ID NO:1), VP2 protein (comprising amino acid residues corresponding to those at positions 138-735 of SEQ ID NO:1) and/or the VP3 protein (comprising amino acid residues corresponding to those at positions 203-735 of SEQ ID NO:1), and variants thereof, including variants comprising at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP1, VP2 or VP3 proteins described herein.
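
Two operations implied by the preceding paragraph, taking the VP2 or VP3 portion of a VP1 sequence by residue range and computing percent sequence identity, can be sketched as follows. The sketch makes two assumptions that are not taken from the disclosure: identity is computed over a pre-computed pairwise alignment, and the denominator is the number of alignment columns (other conventions exist). The function names and the toy sequences are hypothetical.

```python
def subsequence(sequence, start, end=None):
    """Return residues start..end (1-based, inclusive) of a polypeptide.

    With end=None the slice runs to the C-terminus, as for a VP2 or VP3
    protein defined by its first residue within VP1.
    """
    if start < 1:
        raise ValueError("positions are 1-based")
    return sequence[start - 1:end]


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns holding identical residues.

    Assumption: columns where both rows are gaps are ignored, and any
    column with a gap in only one row counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no residues")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Toy 10-residue sequences, not real capsid sequences.
toy_vp1 = "MAADGYLPDW"
toy_tail = subsequence(toy_vp1, 8)  # residues 8 to the end
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
```

A variant would satisfy an "at least 85%" identity threshold under this convention if the value returned is 85.0 or greater.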


Capsid polypeptides of the disclosure include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:2 (also referred to as AAVC11.01) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:2 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:2 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:2 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:2 or a functional fragment thereof.


Capsid polypeptides of the disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:3 (also referred to as AAVC11.02) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-736 of SEQ ID NO:3 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-736 of SEQ ID NO:3 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-736 of SEQ ID NO:3 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-736 of SEQ ID NO:3 or a functional fragment thereof.


Exemplary capsid polypeptides of the disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:4 (also referred to as AAVC11.03) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-737 of SEQ ID NO:4 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-737 of SEQ ID NO:4 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-737 of SEQ ID NO:4 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-737 of SEQ ID NO:4 or a functional fragment thereof.


Also provided herein are capsid polypeptides comprising all or a portion of the VP1 protein set forth in SEQ ID NO:5 (also referred to as AAVC11.04) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:5 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:5 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:5 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:5 or a functional fragment thereof.


Capsid polypeptides of the disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:6 (also referred to as AAVC11.05) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:6 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:6 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:6 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:6 or a functional fragment thereof.


Capsid polypeptides of the disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:7 (also referred to as AAVC11.06) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:7 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:7 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:7 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:7 or a functional fragment thereof.


Other exemplary capsid polypeptides of the disclosure include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:8 (also referred to as AAVC11.07) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:8 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:8 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:8 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:8 or a functional fragment thereof.


Further exemplary capsid polypeptides of the disclosure include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:9 (also referred to as AAVC11.08) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:9 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:9 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:9 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:9 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:10 (also referred to as AAVC11.09) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:10 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:10 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:10 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:10 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:11 (also referred to as AAVC11.10) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:11 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-734 of SEQ ID NO:11 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:11 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 203-734 of SEQ ID NO:11 or a functional fragment thereof.


Exemplary capsid polypeptides of the present disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:12 (also referred to as AAVC11.11) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:12 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:12 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:12 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:12 or a functional fragment thereof.


Further exemplary capsid polypeptides of the present disclosure include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:13 (also referred to as AAVC11.12) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:13 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:13 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:13 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:13 or a functional fragment thereof.


Also provided are capsid polypeptides that comprise all or a portion of the VP1 protein set forth in SEQ ID NO:14 (also referred to as AAVC11.13) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:14 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:14 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:14 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:14 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those that comprise all or a portion of the VP1 protein set forth in SEQ ID NO:15 (also referred to as AAVC11.14) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-736 of SEQ ID NO:15 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-736 of SEQ ID NO:15 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 203-736 of SEQ ID NO:15 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 203-736 of SEQ ID NO:15 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those that comprise all or a portion of the VP1 protein set forth in SEQ ID NO:16 (also referred to as AAVC11.15) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:16 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:16 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:16 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:16 or a functional fragment thereof.


Exemplary capsid polypeptides of the present disclosure also include those that comprise all or a portion of the VP1 protein set forth in SEQ ID NO:17 (also referred to as AAVC11.16) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:17 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:17 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:17 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:17 or a functional fragment thereof.


Exemplary capsid polypeptides also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:18 (also referred to as AAVC11.17) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:18 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:18 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:18 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:18 or a functional fragment thereof.


Further exemplary capsid polypeptides of the present disclosure include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:19 (also referred to as AAVC11.18) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:19 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:19 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:19 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:19 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those comprising all or a portion of the VP1 protein set forth in SEQ ID NO:20 (also referred to as AAVC11.19) or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:20 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:20 or a functional fragment thereof; and capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:20 or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:20 or a functional fragment thereof.


Capsid polypeptides of the present disclosure also include those comprising all or a portion of the VP1 protein set forth in any one of SEQ ID NOs:65-79 or a polypeptide having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto. Thus, also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP2 protein set forth as amino acids 138-735 of any one of SEQ ID NOs: 69, 71-74, 76 and 78, amino acids 138-736 of any one of SEQ ID NOs: 65, 68, 75, 77 and 79, amino acids 138-737 of SEQ ID NO: 67 or 70, or amino acids 138-738 of SEQ ID NO:66; or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the aforementioned VP2 protein or a functional fragment thereof. Also included in the present disclosure are capsid polypeptides comprising all or a portion of the VP3 protein set forth as amino acids 204-735 of any one of SEQ ID NOs: 69, 71-74, 76 and 78, amino acids 204-736 of any one of SEQ ID NOs: 65, 68, 75, 77 and 79, amino acids 204-737 of SEQ ID NO: 67 or 70, or amino acids 204-738 of SEQ ID NO:66; or comprising a sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the aforementioned VP3 protein or a functional fragment thereof.
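The VP2 and VP3 ranges recited above are amino acid positions within the corresponding VP1 sequence, read here as 1-based and inclusive. The following Python sketch illustrates extracting such a range and computing a simple percent identity between two pre-aligned sequences of equal length. The helper names `subrange` and `percent_identity` and the toy sequence are hypothetical and are not taken from the sequence listing; for real capsid sequences, identity would ordinarily be determined with a sequence alignment program.

```python
def subrange(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, using 1-based inclusive positions."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range falls outside the sequence")
    return seq[start - 1:end]


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions in two pre-aligned, equal-length sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Toy stand-in for a 735-residue VP1 (hypothetical, not a real capsid sequence).
toy_vp1 = "M" + "A" * 734
toy_vp2 = subrange(toy_vp1, 138, 735)  # VP2 as amino acids 138-735
toy_vp3 = subrange(toy_vp1, 204, 735)  # VP3 as amino acids 204-735
```

Under this reading, a VP2 spanning amino acids 138-735 is 598 residues long and a VP3 spanning amino acids 204-735 is 532 residues long.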


In some examples, the capsid polypeptides described above and herein comprise all or a portion of one or more variable regions having a sequence that is the same as the sequence of the corresponding variable region present in the AAVC11.12 polypeptide (SEQ ID NO:13). The variable regions of AAV capsid polypeptides have been described (see e.g. Drouin and Agbandje-McKenna, 2013, Future Virol. 8(12): 1183-1199) and include VR-I, spanning positions 260-267; VR-II, spanning positions 326-330; VR-III, spanning positions 380-384; VR-IV, spanning positions 449-467; VR-V, spanning positions 487-504; VR-VI, spanning positions 522-538; VR-VII, spanning positions 544-557; VR-VIII, spanning positions 580-592; and VR-IX, spanning positions 703-711 with numbering relative to AAV2. The AAVC11.12 polypeptide, which was generated from a DNA shuffled library, contains a VR-I of AAV2 origin, VR-IV and VR-V of AAV10 origin, and VR-VI, VR-VII, and VR-VIII of AAV7 origin (when using the VR regions as defined above and in Drouin and Agbandje-McKenna, 2013, the VR-I spans positions 261-268; the VR-IV spans positions 450-468; the VR-V spans positions 488-505; the VR-VI spans positions 523-539; the VR-VII spans positions 545-557; and the VR-VIII spans positions 580-592 of the AAVC11.12 polypeptide set forth in SEQ ID NO:13). Thus, in some examples, the capsid polypeptides of the present disclosure comprise all or a portion of one or more of the VR-I, VR-IV, VR-V, VR-VI, VR-VII and VR-VIII of the AAVC11.12 polypeptide. In some embodiments, capsid polypeptides have at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to all or a portion of one or more of the VR-I, VR-IV, VR-V, VR-VI, VR-VII and VR-VIII of the AAVC11.12 polypeptide.
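For illustration only, the variable region spans and parental origins recited above for the AAVC11.12 polypeptide can be held in a small lookup table. In the Python sketch below, the names `AAVC11_12_VR` and `region_at` are hypothetical; the spans and origins are those stated above, with numbering relative to SEQ ID NO:13.

```python
# Variable regions of AAVC11.12 as recited above: (first position, last position, origin).
AAVC11_12_VR = {
    "VR-I": (261, 268, "AAV2"),
    "VR-IV": (450, 468, "AAV10"),
    "VR-V": (488, 505, "AAV10"),
    "VR-VI": (523, 539, "AAV7"),
    "VR-VII": (545, 557, "AAV7"),
    "VR-VIII": (580, 592, "AAV7"),
}


def region_at(position: int):
    """Return the name of the variable region containing position, or None."""
    for name, (first, last, _origin) in AAVC11_12_VR.items():
        if first <= position <= last:
            return name
    return None
```

With these spans, position 263 falls within VR-I, whereas position 272 falls just outside it.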


In one example, the capsid polypeptides of the present disclosure (e.g. a capsid polypeptide comprising a sequence having at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP1, VP2 or VP3 protein of any one of SEQ ID NOs: 2-20 or 65-79) comprise amino acid residues S263, Q264, S265, S268 and H272 (i.e. including residues in or near the VR-I of AAVC11.12); amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473 (i.e. including residues in and/or near the VR-IV of AAVC11.12); amino acid residues L493, S494, G505, A506, V518 and V522 (i.e. including residues in or near the VR-V of AAVC11.12); amino acid residues D532, S538 and V540 (i.e. including residues in or near the VR-VI of AAVC11.12); amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567 (i.e. including residues in or near the VR-VII of AAVC11.12); and/or amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597 (i.e. including residues in or near the VR-VIII of AAVC11.12); with numbering relative to SEQ ID NO:13.
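Residue labels of the form recited above (e.g. S263) pair a one-letter amino acid code with a 1-based position. A minimal Python sketch of checking a candidate sequence against such a list follows. The function name `has_residues` and the toy sequence are hypothetical; the toy sequence is filler ("X") into which the sequences SQSGASNDNH and DRFFPSSGV have been placed at positions 263-272 and 532-540, respectively, and is not a real capsid sequence.

```python
import re


def has_residues(seq: str, labels) -> bool:
    """True if seq carries every labelled residue (e.g. 'S263') at its 1-based position."""
    for label in labels:
        match = re.fullmatch(r"([A-Z])(\d+)", label)
        if match is None:
            raise ValueError(f"unrecognised residue label: {label}")
        residue, position = match.group(1), int(match.group(2))
        if position < 1 or position > len(seq) or seq[position - 1] != residue:
            return False
    return True


VR_I_RESIDUES = ["S263", "Q264", "S265", "S268", "H272"]
VR_VI_RESIDUES = ["D532", "S538", "V540"]

# Toy 735-residue sequence: filler plus two motifs at their stated positions.
toy = ["X"] * 735
toy[262:272] = "SQSGASNDNH"  # positions 263-272
toy[531:540] = "DRFFPSSGV"   # positions 532-540
toy_seq = "".join(toy)
```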


In further examples, the capsid polypeptides comprise the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272; the sequence of amino acids ISSQSGASNDNH (SEQ ID NO:80) at positions 261-272; the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473; the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473; the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522; the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522; the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540; the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540; the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567; the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567; and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597; with numbering relative to SEQ ID NO:13.
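The shorter and longer sequences recited above for a given variable region are related: each longer sequence ends with the corresponding shorter one, so that a polypeptide carrying the longer sequence at its stated positions also carries the shorter one. The Python sketch below illustrates this for a subset of the sequences, using a hypothetical filler sequence; the names `motif_at` and `build_toy` are likewise hypothetical.

```python
# Longer sequences, keyed by first position (numbering relative to SEQ ID NO:13).
EXTENDED_MOTIFS = {
    261: "ISSQSGASNDNH",                         # SEQ ID NO:80
    450: "QSTGGTQGTQQLLFSQAGPANMSA",             # SEQ ID NO:83
    488: "RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV",  # SEQ ID NO:84
    523: "AMATHKDDEDRFFPSSGV",                   # SEQ ID NO:82
    545: "KTGATNKTTLENVLMTNEEEIRP",              # SEQ ID NO:81
}

# Shorter sequences, keyed by first position.
CORE_MOTIFS = {
    263: "SQSGASNDNH",                      # SEQ ID NO:58
    451: "STGGTQGTQQLLFSQAGPANMSA",         # SEQ ID NO:62
    493: "LSQNNNSNFAWTGATKYHLNGRNSLVNPGV",  # SEQ ID NO:63
    532: "DRFFPSSGV",                       # SEQ ID NO:61
    546: "TGATNKTTLENVLMTNEEEIRP",          # SEQ ID NO:59
}


def motif_at(seq: str, motif: str, first: int) -> bool:
    """True if motif occupies seq starting at 1-based position first."""
    return seq[first - 1:first - 1 + len(motif)] == motif


def build_toy(length: int, motifs: dict) -> str:
    """Filler sequence of 'X' with each motif written at its first position."""
    residues = ["X"] * length
    for first, motif in motifs.items():
        residues[first - 1:first - 1 + len(motif)] = motif
    return "".join(residues)


# Hypothetical 735-residue sequence carrying only the longer sequences.
toy_seq = build_toy(735, EXTENDED_MOTIFS)
all_core_present = all(motif_at(toy_seq, m, p) for p, m in CORE_MOTIFS.items())
```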


In a particular example, the capsid polypeptides of the present disclosure (e.g. a capsid polypeptide comprising a sequence having at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP1, VP2 or VP3 protein of any one of SEQ ID NOs: 2-20 or 65-79) comprise all or a portion of the VR-I of AAVC11.12, and all or a portion of the VR-VII and/or VR-VIII of AAVC11.12. Thus, in one example, the polypeptides comprise a) amino acid residues S263, Q264, S265, S268 and H272; and b) amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567; and/or amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13. In further examples, the capsid polypeptides comprise a) the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272; and b) the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. In other examples, the capsid polypeptides comprise a) the sequence of amino acids ISSQSGASNDNH (SEQ ID NO:80) at positions 261-272; and b) the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. Such capsid polypeptides can further include all or a portion of the VR-VI of AAVC11.12 (e.g. amino acid residues D532, S538 and V540; the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540; and/or the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540), all or a portion of the VR-IV of AAVC11.12 (e.g. comprising amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473; the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473, and/or the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473), and/or all or a portion of the VR-V of AAVC11.12 (e.g. comprising amino acid residues L493, S494, G505, A506, V518 and V522, the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522, and/or the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522), with numbering relative to SEQ ID NO:13.


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 58 and include at least one substitution at any of positions 264-272 (e.g., at least one conservative substitution, e.g., at least two, three, four, or five substitutions). In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 58 (e.g., at least one conservative substitution, e.g., at least two, three, four, or five substitutions) and include at least one substitution at any of positions 266, 267, 269, 270, and 271. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 58 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise S at position 263, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise Q at position 264, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 265, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 268, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise H at position 272, or a conservative substitution thereof.
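Because SEQ ID NO:58 is ten residues long, each substitution within it lowers identity to SEQ ID NO:58 by ten percentage points. The Python sketch below maps the differences between SEQ ID NO:58 and a variant back to positions numbered relative to SEQ ID NO:13. The function names and the variant sequence are hypothetical and serve only to illustrate the arithmetic.

```python
SEQ_ID_NO_58 = "SQSGASNDNH"  # positions 263-272
FIRST_POSITION = 263


def substituted_positions(reference: str, variant: str, first_position: int):
    """Positions (numbered from first_position) at which variant differs from reference."""
    if len(reference) != len(variant):
        raise ValueError("substitution-only comparison needs equal lengths")
    return [
        first_position + offset
        for offset, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]


def percent_identity(reference: str, variant: str) -> float:
    """Percent of positions at which variant matches reference."""
    differences = substituted_positions(reference, variant, 1)
    return 100.0 * (len(reference) - len(differences)) / len(reference)


# Hypothetical variant with substitutions at positions 266 and 270.
variant = "SQSAASNENH"
changed = substituted_positions(SEQ_ID_NO_58, variant, FIRST_POSITION)
identity = percent_identity(SEQ_ID_NO_58, variant)
```

In this example the two substitutions fall at positions 266 and 270, leaving S263, Q264, S265, S268 and H272 unchanged, and the variant retains 80% identity to SEQ ID NO:58.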


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 59 and include at least one substitution at any of positions 545-567 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, or seven substitutions). In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 59 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, or seven substitutions) and include at least one substitution at any of positions 545, 548, 557, 560, 562, 563, 564, or 565. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 59 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise T at position 546, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise G at position 547, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise T at position 549, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise N at position 550, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise K at position 551, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise T at position 552, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise T at position 553, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise L at position 554, or a conservative substitution thereof. 
In some embodiments, capsid polypeptides may comprise E at position 555, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise N at position 556, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise L at position 558, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise M at position 559, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise N at position 561, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise R at position 566, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise P at position 567, or a conservative substitution thereof.


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 60 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, or nine substitutions) and include at least one substitution at any of positions 581-597. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 60 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, or nine substitutions) and include at least one substitution at any of positions 582, 583, 584, 587, 588, 589, 591, 595, or 596. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 60 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise S at position 580, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 581, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise A at position 585, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise A at position 586, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise A at position 590, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise T at position 592, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise Q at position 593, or a conservative substitution thereof. 
In some embodiments, capsid polypeptides may comprise V at position 594, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise N at position 597, or a conservative substitution thereof.


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 30%, 40%, 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 61 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, or six substitutions) and include at least one substitution at any of positions 532-540. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 30%, 40%, 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 61 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, or six substitutions) and include at least one substitution at any of positions 533, 534, 535, 536, 537, or 539. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 30%, 40%, 50%, 60%, 70%, 80%, or 90% sequence identity to SEQ ID NO: 61 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise D at position 532, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 538, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise V at position 540, or a conservative substitution thereof.


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 62 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen substitutions) and include at least one substitution at any of positions 451-473. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 62 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen substitutions) and include at least one substitution at any of positions 452, 453, 454, 455, 458, 459, 461, 463, 464, 465, 467, 468, or 471. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 62 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise S at position 451, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise Q at position 456, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise G at position 457, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise Q at position 460, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise L at position 462, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise A at position 466, or a conservative substitution thereof. 
In some embodiments, capsid polypeptides may comprise A at position 469, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise N at position 470, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 472, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise A at position 473, or a conservative substitution thereof.


In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 63 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty one, twenty two, twenty three, or twenty four substitutions) and include at least one substitution at any of positions 493-522. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 63 (e.g., at least one conservative substitution, e.g., at least two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty one, twenty two, twenty three, or twenty four substitutions) and include at least one substitution at any of positions 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 519, 520, or 521. In some embodiments, capsid polypeptides of the present disclosure comprise a sequence of amino acids having at least about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% sequence identity to SEQ ID NO: 63 and include at least one deletion or insertion. In some embodiments, capsid polypeptides may comprise L at position 493, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise S at position 494, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise G at position 505, or a conservative substitution thereof. 
In some embodiments, capsid polypeptides may comprise A at position 506, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise V at position 518, or a conservative substitution thereof. In some embodiments, capsid polypeptides may comprise V at position 522, or a conservative substitution thereof.
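By way of illustration only, the percent identity of a candidate region to SEQ ID NO: 63 can be sketched with a simple ungapped comparison. The sketch below assumes that the candidate has already been aligned to positions 493-522 and has the same length as SEQ ID NO: 63, so insertions and deletions are not handled. The function names and the example variant are hypothetical and are not taken from the disclosure; in practice a sequence alignment program would be used.

```python
# Simplified, ungapped percent-identity check against SEQ ID NO: 63.
# Assumption: candidate and reference are pre-aligned and of equal length.

SEQ_ID_NO_63 = "LSQNNNSNFAWTGATKYHLNGRNSLVNPGV"  # positions 493-522 per the text
REGION_START = 493


def percent_identity(candidate: str, reference: str) -> float:
    """Return the percentage of identical residues over the reference length."""
    if len(candidate) != len(reference):
        raise ValueError("sequences must be pre-aligned to the same length")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)


def substituted_positions(candidate: str, reference: str, start: int) -> list[int]:
    """List the 1-based capsid positions at which the candidate differs."""
    return [start + i for i, (a, b) in enumerate(zip(candidate, reference)) if a != b]


# Hypothetical variant with a single substitution (Q to N) at position 495.
variant = SEQ_ID_NO_63[:2] + "N" + SEQ_ID_NO_63[3:]

identity = percent_identity(variant, SEQ_ID_NO_63)
changed = substituted_positions(variant, SEQ_ID_NO_63, REGION_START)
print(f"{identity:.1f}% identity, substitutions at {changed}")
```

In this hypothetical case, 29 of the 30 residues match, so the variant satisfies the "at least about 95%" identity threshold while carrying a substitution at one of the listed positions.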


In a particular example, the capsid polypeptides of the present disclosure (e.g. a capsid polypeptide comprising a sequence having at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP1, VP2 or VP3 protein of any one of SEQ ID NOs: 2-20 or 65-79) comprise all or a portion of the VR-IV, VR-V, VR-VI, VR-VII and VR-VIII of AAVC11.12. Thus, in one example, the polypeptides comprise amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472, A473, L493, S494, G505, A506, V518, V522, D532, S538, V540, T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566, P567, S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13. In particular examples, the capsid polypeptides comprise the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473; the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522; the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540; the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567; and the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. In still further examples, the polypeptides comprise the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473; the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522; the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540; the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567, with numbering relative to SEQ ID NO:13; and the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13. Typically, such polypeptides do not have the VR-I from AAVC11.12 (i.e. do not have the AAV2 VR-I).
These polypeptides may have a VR-I from AAV8. For example, the polypeptides may have an insertion of NG after position 262, and contain residues T263, S264, G265, T268, and T272, with numbering relative to SEQ ID NO:13. In particular examples, the polypeptide contains an insertion of NG after position 262 and the sequence of amino acids TSGGATNDNT at positions 263-272, with numbering relative to SEQ ID NO:13.
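The relationship between a recited segment and the individual residues recited at numbered positions can be illustrated with a short sketch. It assumes simple 1-based numbering in which the first residue of each segment sits at the stated start position; the helper names are hypothetical, and only the segments and start positions are taken from the text above.

```python
# Map a segment placed at a given start position to residue labels such as "S451".
# Assumption: 1-based numbering, first residue of the segment at `start`.

SEGMENTS = {
    "SEQ ID NO:62": (451, "STGGTQGTQQLLFSQAGPANMSA"),
    "SEQ ID NO:63": (493, "LSQNNNSNFAWTGATKYHLNGRNSLVNPGV"),
    "SEQ ID NO:61": (532, "DRFFPSSGV"),
    "SEQ ID NO:59": (546, "TGATNKTTLENVLMTNEEEIRP"),
}


def residue_at(start: int, segment: str, position: int) -> str:
    """Return a label like 'Q456' for the residue at a numbered position."""
    offset = position - start
    if not 0 <= offset < len(segment):
        raise IndexError(f"position {position} lies outside the segment")
    return f"{segment[offset]}{position}"


def residue_labels(start: int, segment: str, positions: list[int]) -> list[str]:
    """Return labels for several numbered positions within one segment."""
    return [residue_at(start, segment, p) for p in positions]


start_62, seq_62 = SEGMENTS["SEQ ID NO:62"]
vr_iv_labels = residue_labels(
    start_62, seq_62, [451, 456, 457, 460, 462, 466, 469, 470, 472, 473]
)
print(vr_iv_labels)
```

Applied to SEQ ID NO:62 placed at positions 451-473, the sketch recovers the residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473 recited above.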


Also provided are nucleic acid molecules, including isolated nucleic acid molecules, encoding a capsid polypeptide described herein. Thus, for example, amongst the nucleic acid molecules provided herein are those encoding the VP1, VP2 and/or VP3 of any one of the capsid polypeptides described herein. Non-limiting examples of nucleic acid molecules therefore include those set forth in SEQ ID NOs:21-39 and 85-99, those having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity thereto, and those that hybridize with medium or high stringency to nucleic acid molecules comprising a sequence set forth in any one of SEQ ID NOs:21-39 and 85-99.


Vectors

The present disclosure also provides vectors comprising a nucleic acid molecule that encodes a capsid polypeptide described herein, and vectors comprising a capsid polypeptide described herein. The vectors include nucleic acid vectors that comprise a nucleic acid molecule that encodes a capsid polypeptide described herein, and AAV vectors that have a capsid comprising a capsid polypeptide described herein.


Nucleic Acid Vectors

Vectors of the present disclosure include nucleic acid vectors that comprise a polynucleotide that encodes all or a portion of a capsid polypeptide described herein, e.g. that encodes a polypeptide comprising an amino acid sequence set forth in any one of SEQ ID NOs:2-20 or an amino acid sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity to a sequence set forth in any one of SEQ ID NOs:2-20, or a fragment thereof (e.g. all or a portion of the VP2 or VP3 protein), as described above. The vectors can be episomal vectors (i.e., that do not integrate into the genome of a host cell) or can be vectors that integrate into the host cell genome. Exemplary vectors that comprise a nucleic acid molecule encoding a capsid polypeptide include, but are not limited to, plasmids, cosmids, transposons and artificial chromosomes. In particular examples, the vectors are plasmids.


Vectors, such as plasmids, suitable for use in bacterial, insect and mammalian cells are widely described and well-known in the art. Those skilled in the art would appreciate that vectors of the present disclosure may also contain additional sequences and elements useful for the replication of the vector in prokaryotic and/or eukaryotic cells, selection of the vector and the expression of a heterologous sequence in a variety of host cells. For example, the vectors of the present disclosure can include a prokaryotic replicon (that is, a sequence having the ability to direct autonomous replication and maintenance of the vector extra-chromosomally in a prokaryotic host cell, such as a bacterial host cell). Such replicons are well known in the art. In some embodiments, the vectors can include a shuttle element that makes the vectors suitable for replication and integration in both prokaryotes and eukaryotes. In addition, vectors may also include a gene whose expression confers a detectable marker such as a drug resistance gene, which allows for selection and maintenance of the host cells. Vectors may also have a reportable marker, such as a gene encoding a fluorescent or other detectable protein. The nucleic acid vectors will likely also comprise other elements, including any one or more of those described below. Most typically, the vectors will comprise a promoter operably linked to the nucleic acid encoding the capsid protein.


The nucleic acid vectors of the present disclosure can be constructed using known techniques, including, without limitation, the standard techniques of restriction endonuclease digestion, ligation, transformation, plasmid purification, in vitro or chemical synthesis of DNA, and DNA sequencing. The vectors of the present disclosure may be introduced into a host cell using any method known in the art. Accordingly, the present disclosure is also directed to host cells comprising a vector or nucleic acid described herein.


AAV Vectors

Provided herein are AAV vectors comprising a capsid polypeptide described herein, such as a polypeptide comprising all or a portion of an AAV capsid protein (e.g. a polypeptide comprising the amino acid sequence set forth in any one of SEQ ID NOs:2-20 or an amino acid sequence having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity to a sequence set forth in any one of SEQ ID NOs:2-20, or a fragment thereof (e.g. all or a portion of the VP2 or VP3 protein)).


Methods for vectorizing a capsid protein are well known in the art and any suitable method can be employed for the purposes of the present disclosure. For example, the cap gene can be recovered (e.g. by PCR or digest with enzymes that cut upstream and downstream of cap) and cloned into a packaging construct containing rep. Any AAV rep gene may be used, including, for example, a rep gene from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12 or AAV13 and any variants thereof. Typically, the cap gene is cloned downstream of rep so the rep p40 promoter can drive cap expression. This construct does not contain ITRs. This construct is then introduced into a packaging cell line with a second construct containing ITRs, typically flanking a heterologous coding sequence. Helper functions or a helper virus are also introduced, and recombinant AAV comprising a capsid generated from capsid proteins expressed from the cap gene, and encapsidating a genome comprising the transgene flanked by the ITRs, is recovered from the supernatant of the packaging cell line. Various types of cells can be used as the packaging cell line. For example, packaging cell lines that can be used include, but are not limited to, HEK293 cells, HeLa cells, and Vero cells, for example as disclosed in US20110201088. The helper functions may be provided by one or more helper plasmids or helper viruses comprising adenoviral helper genes. Non-limiting examples of the adenoviral helper genes include E1A, E1B, E2A, E4 and VA, which can provide helper functions to AAV packaging. Helper viruses of AAV are known in the art and include, for example, viruses from the family Adenoviridae and the family Herpesviridae. Examples of helper viruses and helper vectors of AAV include, but are not limited to, the SAdV-13 helper virus and SAdV-13-like helper virus described in US20110201088, and the helper vector pHELP (Applied Viromics).
A skilled artisan will appreciate that any helper virus or helper plasmid of AAV that can provide adequate helper function to AAV can be used herein.


In some instances, rAAV virions are produced using a cell line that stably expresses some of the necessary components for AAV virion production. For example, a plasmid (or multiple plasmids) comprising the nucleic acid containing a cap gene identified as described herein and a rep gene, and a selectable marker, such as a neomycin resistance gene, can be integrated into the genome of a cell (the packaging cells). The packaging cell line can then be transfected with an AAV vector and a helper plasmid or transfected with an AAV vector and co-infected with a helper virus (e.g., adenovirus providing the helper functions). The advantages of this method are that the cells are selectable and are suitable for large-scale production of the recombinant AAV. As another non-limiting example, adenovirus or baculovirus rather than plasmids can be used to introduce the nucleic acid encoding the capsid polypeptide, and optionally the rep gene, into packaging cells. As yet another non-limiting example, the AAV vector is also stably integrated into the DNA of producer cells, and the helper functions can be provided by a wild-type adenovirus to produce the recombinant AAV.


In still further instances, the AAV vectors are produced synthetically, by synthesising AAV capsid proteins and assembling and packaging the capsids in vitro.


Typically, the AAV vectors of the present disclosure also comprise a heterologous coding sequence. The heterologous coding sequence may be operably linked to a promoter to facilitate expression of the sequence. The heterologous coding sequence can encode a peptide or polypeptide, such as a therapeutic peptide or polypeptide, or can encode a polynucleotide or transcript that itself has a function or activity, such as an antisense or inhibitory oligonucleotide, including antisense DNA and RNA (e.g. miRNA, siRNA, and shRNA). In some examples, the heterologous coding sequence is a stretch of nucleic acids that is essentially homologous to a stretch of nucleic acids in the genomic DNA of an animal, such that when the heterologous coding sequence is introduced into a cell of the animal, homologous recombination between the heterologous coding sequence and the genomic DNA can occur. As would be appreciated, the nature of the heterologous coding sequence is not essential to the present disclosure. In particular embodiments, the vectors comprising the heterologous coding sequence(s) will be used in gene therapy.


In particular examples, the heterologous coding sequence encodes a peptide or polypeptide, or polynucleotide, whose expression is of therapeutic use, such as, for example, for the treatment of a disease or disorder. For example, expression of a therapeutic peptide or polypeptide may serve to restore or replace the function of the endogenous form of the peptide or polypeptide that is defective (i.e. gene replacement therapy). In other examples, expression of a therapeutic peptide or polypeptide, or polynucleotide, from the heterologous sequence serves to alter the levels and/or activity of one or more other peptides, polypeptides or polynucleotides in the host cell. Thus, according to particular embodiments, the expression of a heterologous coding sequence introduced by a vector described herein into a host cell can be used to provide a therapeutic amount of a peptide, polypeptide or polynucleotide to ameliorate the symptoms of a disease or disorder. In other instances, the heterologous coding sequence is a stretch of nucleic acids that is essentially homologous to a stretch of nucleic acids in the genomic DNA of an animal, such that when the heterologous sequence is introduced into a cell of the animal, homologous recombination between the heterologous coding sequence and the genomic DNA can occur. Accordingly, the introduction of a heterologous sequence by an AAV vector described herein into a host cell can be used to correct mutations in genomic DNA, which in turn can ameliorate the symptoms of a disease or disorder.


In non-limiting examples, the heterologous coding sequence encodes an expression product that, when delivered to a subject, and in particular the liver of a subject, treats a liver-associated disease or condition. In illustrative embodiments, the liver-associated disease or condition is selected from among a urea cycle disorder (UCD; including N-acetylglutamate synthase deficiency (NAGSD), carbamylphosphate synthetase 1 deficiency (CPS1D), ornithine transcarbamylase deficiency (OTCD), argininosuccinate synthetase deficiency (ASSD), argininosuccinate lyase deficiency (ASLD), arginase 1 deficiency (ARG1D), citrin or aspartate/glutamate carrier deficiency and the mitochondrial ornithine transporter 1 deficiency causing hyperornithinemia-hyperammonemia-homocitrullinuria syndrome (HHH syndrome)), organic acidopathy (or organic acidemia, including methylmalonic acidemia, propionic acidemia, isovaleric acidemia, and maple syrup urine disease), aminoacidopathy, glycogenoses (Types I, III and IV), Wilson's disease, Progressive Familial Intrahepatic Cholestasis, primary hyperoxaluria, complementopathy, coagulopathy (e.g. hemophilia A, hemophilia B, von Willebrand disease (VWD)), Crigler Najjar syndrome, familial hypercholesterolaemia, α-1-antitrypsin deficiency, mitochondria respiratory chain hepatopathy, and citrin deficiency. Those skilled in the art would readily be able to select an appropriate heterologous coding sequence useful for treating such diseases. In some examples, the heterologous coding sequence comprises all or a part of a gene that is associated with the disease, such as all or a part of a gene set forth in Table 2. Introduction of such a sequence to the liver can be used for gene replacement or gene editing/correction, e.g. using CRISPR-Cas9. In particular examples, the heterologous coding sequence encodes a protein encoded by a gene that is associated with the disease, such as a gene set forth in Table 2.










TABLE 2

Exemplary liver-associated diseases              Exemplary associated genes
Urea cycle disorders (UCDs)                      OTC, ASS, CPS1, ASL, ARG1
Organic acidopathies                             PCCA, PCCB, MMUT
Aminoacidopathies                                PAH, FAH
Glycogenoses (Types I, III and IV)               SLC37A4
Wilson's Disease                                 ATP7B
Progressive Familial Intrahepatic Cholestasis    ABCB4, ABCB11, ATP8B1
Primary Hyperoxaluria                            AGXT
Complementopathies                               CFH, CFI
Coagulopathies                                   F8, F9, VWF
Crigler Najjar syndrome                          UGT1A1
Familial Hypercholesterolaemia                   LDLR
α-1-antitrypsin Deficiency                       SERPINA1
Mitochondria Respiratory Chain Hepatopathies     POLG
Citrin Deficiency                                SLC25A13









The heterologous coding sequence in the AAV vector is flanked by 3′ and 5′ AAV ITRs. AAV ITRs used in the vectors of the disclosure need not have a wild-type nucleotide sequence, and may be altered, e.g., by the insertion, deletion or substitution of nucleotides. Additionally, AAV ITRs may be derived from any of several AAV serotypes, including without limitation, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12 or AAV13. Such ITRs are well known in the art.


As will be appreciated by a skilled artisan, any method suitable for purifying AAV can be used in the embodiments described herein to purify the AAV vectors, and such methods are well known in the art. For example, the AAV vectors can be isolated and purified from packaging cells and/or the supernatant of the packaging cells. In some embodiments, the AAV is purified by separation method using a CsCl or iodixanol gradient centrifugation. In other embodiments, AAV is purified as described in US20020136710 using a solid support that includes a matrix to which an artificial receptor or receptor-like molecule that mediates AAV attachment is immobilized.


Additional Elements in the Vectors

The vectors of the present disclosure can comprise promoters. In instances where the vector is a nucleic acid vector comprising nucleic acid encoding the capsid polypeptide, the promoter may facilitate expression of the nucleic acid encoding the capsid polypeptide. In instances where the vector is an AAV vector, the promoter may facilitate expression of a heterologous coding sequence, as described above.


In some examples, the promoters are AAV promoters, such as the p5, p19 or p40 promoter. In other examples, the promoters are derived from other sources. Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer), the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerate kinase (PGK) promoter, and the EF1α promoter. Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Non-limiting examples of inducible promoters regulated by exogenously supplied compounds include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system, the ecdysone insect promoter, the tetracycline-repressible system, the tetracycline-inducible system, the RU486-inducible system and the rapamycin-inducible system. Still other types of inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only. In some embodiments, tissue specific promoters are used.
Non-limiting examples of such promoters include the liver-specific thyroxin binding globulin (TBG) promoter, insulin promoter, glucagon promoter, somatostatin promoter, pancreatic polypeptide (PPY) promoter, synapsin-1 (Syn) promoter, creatine kinase (MCK) promoter, mammalian desmin (DES) promoter, an α-myosin heavy chain (α-MHC) promoter, a cardiac Troponin T (cTnT) promoter, beta-actin promoter, and hepatitis B virus core promoter. The selection of an appropriate promoter is well within the ability of one of ordinary skill in the art.


The vectors can also include transcriptional enhancers, translational signals, and transcriptional and translational termination signals. Examples of transcriptional termination signals include, but are not limited to, polyadenylation signal sequences, such as bovine growth hormone (BGH) poly(A), SV40 late poly(A), rabbit beta-globin (RBG) poly(A), thymidine kinase (TK) poly(A) sequences, and any variants thereof. In some embodiments, the transcriptional termination region is located downstream of the posttranscriptional regulatory element. In some embodiments, the transcriptional termination region is a polyadenylation signal sequence.


The vectors can include various posttranscriptional regulatory elements. In some embodiments, the posttranscriptional regulatory element can be a viral posttranscriptional regulatory element. Non-limiting examples of viral posttranscriptional regulatory elements include woodchuck hepatitis virus posttranscriptional regulatory element (WPRE), hepatitis B virus posttranscriptional regulatory element (HBVPRE), RNA transport element (RTE), and any variants thereof. The RTE can be a rev response element (RRE), for example, a lentiviral RRE. A non-limiting example is bovine immunodeficiency virus rev response element (RRE). In some embodiments, the RTE is a constitutive transport element (CTE). Examples of CTE include, but are not limited to, Mason-Pfizer Monkey Virus CTE and Avian Leukemia Virus CTE.


A signal peptide sequence can also be included in the vector to provide for secretion of a polypeptide from a mammalian cell. Examples of signal peptides include, but are not limited to, the endogenous signal peptide for HGH and variants thereof; the endogenous signal peptide for interferons and variants thereof, including the signal peptide of type I, II and III interferons and variants thereof; and the endogenous signal peptides for known cytokines and variants thereof, such as the signal peptide of erythropoietin (EPO), insulin, TGF-β1, TNF, IL1-α, and IL1-β, and variants thereof. Typically, the nucleotide sequence of the signal peptide is located immediately upstream of the heterologous sequence (e.g., fused at the 5′ of the coding region of the protein of interest) in the vector.


In further examples, the vectors can contain a regulatory sequence that allows, for example, the translation of multiple proteins from a single mRNA. Non-limiting examples of such regulatory sequences include internal ribosome entry site (IRES) and 2A self-processing sequence, such as a 2A peptide site from foot-and-mouth disease virus (F2A sequence).


Host Cells

Also provided herein are host cells comprising a nucleic acid molecule or vector of the present disclosure. In some instances, the host cells are used to amplify, replicate, package and/or purify a polynucleotide or vector. In other examples, the host cells are used to express a heterologous sequence, such as one packaged within an AAV vector. Exemplary host cells include prokaryotic and eukaryotic cells. In some instances, the host cell is a mammalian host cell. It is well within the skill of a skilled artisan to select an appropriate host cell for the expression, amplification, replication, packaging and/or purification of a polynucleotide, vector or rAAV virion of the present disclosure. Exemplary mammalian host cells include, but are not limited to, HEK293 cells, HeLa cells, Vero cells, HuH-7 cells, and HepG2 cells. In particular examples, the host cell is a hepatocyte or cell-line derived from a hepatocyte.


Compositions

Also provided are compositions comprising the nucleic acid molecules, polypeptides and/or vectors of the present disclosure. In particular examples, provided are pharmaceutical compositions comprising the AAV vectors disclosed herein and a pharmaceutically acceptable carrier. The compositions can also comprise additional ingredients such as diluents, stabilizers, excipients, and adjuvants.


The carriers, diluents and adjuvants can include buffers such as phosphate, citrate, or other organic acids; antioxidants such as ascorbic acid; low molecular weight polypeptides (e.g., less than about 10 residues); proteins such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as Tween™, Pluronics™ or polyethylene glycol (PEG). In some embodiments, the physiologically acceptable carrier is an aqueous pH buffered solution.


Methods

The AAV vectors of the present disclosure, and compositions containing the AAV vectors, may be used in methods for the introduction of a heterologous coding sequence into a host cell. Such methods involve contacting the host cell with the AAV vector. This may be performed in vitro, ex vivo or in vivo. In particular embodiments, the host cell is a hepatocyte (e.g. a human hepatocyte).


When the methods are performed ex vivo or in vivo, typically the introduction of the heterologous sequence into the host cell is for therapeutic purposes, whereby expression of the heterologous sequence results in the treatment of a disease or condition. Thus, the AAV vectors disclosed herein can be administered to a subject (e.g., a human) in need thereof, such as a subject with a disease or condition amenable to treatment with a protein, peptide or polynucleotide encoded by a heterologous sequence described herein.


When used in vivo, titers of AAV vectors to be administered to a subject will vary depending on, for example, the particular recombinant virus, the disease or disorder to be treated, the mode of administration, the treatment goal, the individual to be treated, and the cell type(s) being targeted, and can be determined by methods well known to those skilled in the art. Although the exact dosage will be determined on an individual basis, recombinant viruses of the present disclosure can typically be administered to a subject at a dose of between 1×10^10 genome copies of the recombinant virus per kg of the subject and 1×10^14 genome copies per kg. In other examples, less than 1×10^10 genome copies may be sufficient for a therapeutic effect. In other examples, more than 1×10^14 genome copies may be required for a therapeutic effect.
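As a worked illustration of the per-kilogram dose range stated above, the total number of genome copies for a given subject is the dose per kg multiplied by the body weight. The sketch below is arithmetic only; the function name and the 70 kg body weight are hypothetical, and nothing in it is a dosing recommendation.

```python
# Convert a per-kg dose (genome copies per kg) into total genome copies.
# Assumption: the 70 kg body weight is a hypothetical example value.

LOW_DOSE_GC_PER_KG = 1e10   # lower bound of the range given in the text
HIGH_DOSE_GC_PER_KG = 1e14  # upper bound of the range given in the text


def total_genome_copies(dose_gc_per_kg: float, body_weight_kg: float) -> float:
    """Return the total genome copies for a subject of the given weight."""
    if dose_gc_per_kg <= 0 or body_weight_kg <= 0:
        raise ValueError("dose and body weight must be positive")
    return dose_gc_per_kg * body_weight_kg


weight_kg = 70.0  # hypothetical subject
low_total = total_genome_copies(LOW_DOSE_GC_PER_KG, weight_kg)
high_total = total_genome_copies(HIGH_DOSE_GC_PER_KG, weight_kg)
print(f"{low_total:.1e} to {high_total:.1e} genome copies")
```

For the hypothetical 70 kg subject, the stated range corresponds to between 7×10^11 and 7×10^15 total genome copies.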


The route of administration is not particularly limited. For example, a therapeutically effective amount of the AAV vector can be administered to the subject via, for example, intramuscular, intravaginal, intravenous, intraperitoneal, subcutaneous, epicutaneous, intradermal, rectal, intraocular, pulmonary, intracranial, intraosseous, oral, buccal, or nasal routes. The AAV vector can be administered as a single dose or multiple doses, and at varying intervals.


Also provided are methods for producing an AAV vector described above and herein, i.e. one comprising a capsid polypeptide of the present disclosure. Such methods comprise culturing a host cell comprising a nucleic acid molecule encoding a capsid polypeptide of the present disclosure, an AAV rep gene, a heterologous coding sequence flanked by AAV inverted terminal repeats, and helper functions for generating a productive AAV infection, under conditions suitable to facilitate assembly of an AAV vector comprising a capsid polypeptide of the present disclosure, wherein the capsid encapsidates the heterologous coding sequence.


In further aspects, provided are methods for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector. As demonstrated herein, some variable regions, and combinations of capsid variable regions, are important for efficient transduction of human hepatocytes by an AAV vector. In particular, the presence of all or a part of VR-VII and/or VR-VIII from AAV7 in a capsid polypeptide imparts enhanced transduction by AAV vectors of a human hepatocyte in vivo. VR-I from AAV2 can also enhance the transduction by AAV vectors of a human hepatocyte in vivo.


Thus, provided herein are methods for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector (or producing an AAV vector with enhanced in vivo human hepatocyte transduction efficiency), which include the steps of modifying the sequence of a reference capsid polypeptide at one or more of positions 263, 264, 265, 268, 272, 546, 547, 549, 550, 551, 552, 553, 554, 555, 556, 558, 559, 561, 566, 567, 580, 581, 585, 586, 590, 592, 593, 594 and 597, with numbering relative to SEQ ID NO:13, to thereby produce a modified capsid polypeptide that comprises: i) amino acid residues S263, Q264, S265, S268 and H272, with numbering relative to SEQ ID NO:13; and ii) amino acid residues T546, G547, T549, N550, K551, T552, T553, L554, E555, N556, L558, M559, N561, R566 and P567, with numbering relative to SEQ ID NO:13; and/or amino acid residues S580, S581, A585, A586, A590, T592, Q593, V594, and N597, with numbering relative to SEQ ID NO:13. Additional modifications can optionally be made at or adjacent to one or more other variable regions, such as VR-IV, VR-V and VR-VI. For example, modifications can be made at one or more of positions 532, 538 and 540, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues D532, S538 and V540, with numbering relative to SEQ ID NO:13. In another example, modifications can be at one or more of positions 451, 456, 457, 460, 462, 466, 469, 470, 472 and 473, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues S451, Q456, G457, Q460, L462, A466, A469, N470, S472 and A473, with numbering relative to SEQ ID NO:13. In a further example, modifications can be made at one or more of positions 493, 494, 505, 506, 518 and 522, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises amino acid residues L493, S494, G505, A506, V518 and V522, with numbering relative to SEQ ID NO:13.


Methods for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector (or producing an AAV vector with enhanced in vivo human hepatocyte transduction efficiency) also include those methods that include the steps of modifying the sequence of a reference capsid polypeptide at one or more of positions 263-272, 546-567 and 582-597 with numbering relative to SEQ ID NO:13, to thereby produce a modified capsid polypeptide that comprises: i) the sequence of amino acids SQSGASNDNH (SEQ ID NO:58) at positions 263-272, with numbering relative to SEQ ID NO:13; and ii) the sequence of amino acids TGATNKTTLENVLMTNEEEIRP (SEQ ID NO:59) at positions 546-567, with numbering relative to SEQ ID NO:13 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13.


Methods for enhancing the in vivo human hepatocyte transduction efficiency of an AAV vector (or producing an AAV vector with enhanced in vivo human hepatocyte transduction efficiency) also include those methods that include the steps of modifying the sequence of a reference capsid polypeptide at one or more of positions 261-272, 545-567 and 582-597 with numbering relative to SEQ ID NO:13, to thereby produce a modified capsid polypeptide that comprises: i) the sequence of amino acids ISSQSGASNDNH (SEQ ID NO:80) at positions 261-272, with numbering relative to SEQ ID NO:13; and ii) the sequence of amino acids KTGATNKTTLENVLMTNEEEIRP (SEQ ID NO:81) at positions 545-567, with numbering relative to SEQ ID NO:13 and/or the sequence of amino acids SSNLQAANTAAQTQVVNN (SEQ ID NO:60) at positions 582-597, with numbering relative to SEQ ID NO:13.


Additional modifications can optionally be made at or adjacent to one or more other variable regions, such as VR-IV, VR-V and VR-VI. For example, modifications can be made at one or more of positions 532-540, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids DRFFPSSGV (SEQ ID NO:61) at positions 532-540, with numbering relative to SEQ ID NO:13; at one or more of positions 523-540, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids AMATHKDDEDRFFPSSGV (SEQ ID NO:82) at positions 523-540, with numbering relative to SEQ ID NO:13; at one or more of positions 451-473, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids STGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:62) at positions 451-473, with numbering relative to SEQ ID NO:13; at one or more of positions 450-473, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids QSTGGTQGTQQLLFSQAGPANMSA (SEQ ID NO:83) at positions 450-473, with numbering relative to SEQ ID NO:13; at one or more of positions 493-522, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids LSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:63) at positions 493-522, with numbering relative to SEQ ID NO:13; and/or at one or more of positions 488-522, with numbering relative to SEQ ID NO:13, wherein the modified capsid polypeptide comprises the sequence of amino acids RVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGV (SEQ ID NO:84) at positions 488-522, with numbering relative to SEQ ID NO:13.


It will be understood that any modification or combination of modifications, e.g. amino acid replacement or substitution, amino acid deletion and/or amino acid insertion, will result in a change of amino acid sequence in the modified capsid polypeptide compared to the reference capsid polypeptide. Thus, for example, reference to modification does not include within its scope amino acid substitutions where one amino acid residue is substituted with the same amino acid residue, or modifications when an amino acid deletion is accompanied by an insertion of that deleted amino acid, such that there is no difference in the amino acid sequence of the modified capsid polypeptide compared to the reference capsid polypeptide sequence, i.e. the amino acid sequence of the modified capsid polypeptide can not be the same as (or must be different to) the amino acid sequence of the reference capsid polypeptide sequence.


Typically, the methods include an initial step of first identifying a reference capsid polypeptide for transducing human hepatocytes in vivo. The reference capsid polypeptide may be any AAV polypeptide, such as an AAV1, AAV2, AAV3, AAV3B, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12 or AAV13 capsid polypeptide, or a synthetic or chimeric capsid polypeptide. In illustrative embodiments, the reference polypeptide comprises at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO:13. Reference capsid polypeptides include those comprising all or a portion of the VP1 protein, VP2 protein or VP3 protein. Thus, in some embodiments, the reference capsid polypeptide comprises all or a portion of a VP1 protein having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO:13 (also referred to as AAVC11.12); all or a portion of a VP2 protein having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP2 protein set forth as amino acids 138-735 of SEQ ID NO:13; and all or a portion of a VP3 protein having at least or about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the VP3 protein set forth as amino acids 204-735 of SEQ ID NO:13.


Methods for modifying the sequence of a reference capsid polypeptide or polynucleotide so as to produce a modified capsid polypeptide or polynucleotide are well known in the art, and any such method can be utilised so as to perform the methods of the present disclosure. For example, the modification of the sequence of the reference capsid polynucleotide to produce a modified capsid polynucleotide can be performed using any method known in the art, including recombinant and synthetic methods, performed (either in part or in whole) in silico and/or in vitro. In a particular example, the modification of the sequence is performed in silico, followed by de novo synthesis of the modified capsid polynucleotide having the modified sequence (e.g. by gene synthesis methods such as those involving the chemical synthesis of overlapping oligonucleotides followed by gene assembly).


The modified capsid polynucleotides may be contained in a nucleic acid vector, such as a plasmid, for subsequent expression, replication, amplification and/or manipulation. Vectors suitable for use in bacterial, insect and mammalian cells are widely described and well known in the art. Those skilled in the art would appreciate that the vectors may also contain additional sequences and elements useful for the replication of the vector in prokaryotic and/or eukaryotic cells, selection of the vector and the expression of a heterologous sequence in a variety of host cells. For example, the vectors can include a prokaryotic replicon, which is a sequence having the ability to direct autonomous replication and maintenance of the vector extrachromosomally in a prokaryotic host cell, such as a bacterial host cell. Such replicons are well known in the art. In some embodiments, the vectors can include a shuttle element that makes the vectors suitable for replication and integration in both prokaryotes and eukaryotes. In addition, vectors may also include a gene whose expression confers a detectable marker such as a drug resistance gene, which allows for selection and maintenance of the host cells. Vectors may also have a reportable marker, such as a gene encoding a fluorescent or other detectable protein. The nucleic acid vectors will likely also comprise other elements, including any one or more of those described below. Most typically, the vectors will comprise a promoter operably linked to the nucleic acid encoding the capsid protein.


The nucleic acid vectors can be constructed using known techniques, including, without limitation, the standard techniques of restriction endonuclease digestion, ligation, transformation, plasmid purification, in vitro or chemical synthesis of DNA, and DNA sequencing. The vectors comprising a modified capsid polynucleotide may be introduced into a host cell using any method known in the art.


Following modification, the modified capsid polypeptides are then vectorised. Methods for vectorising a capsid polypeptide are well known in the art and non-limiting examples are described above.


The AAV vector produced by these methods typically has an in vivo transduction efficiency that is enhanced compared to a reference AAV vector having a capsid comprising the reference capsid polypeptide. The transduction efficiency can be enhanced by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 1000%, or more, e.g. the transduction efficiency of the AAV vector can be at least or about 2×, 3×, 4×, 5×, 6×, 7×, 8×, 9×, 10×, 12×, 13×, 14×, 15×, 16×, 17×, 18×, 19×, 20×, 30×, 40×, 50×, 60×, 70×, 80×, 90×, 100× or more efficient at transducing cells in vivo.


Thus, also provided are AAV vectors produced by the methods of the present disclosure.


In order that the invention may be readily understood and put into practical effect, particular preferred embodiments will now be described by way of the following non-limiting examples.


The reference in this specification to any prior publication (or information derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that that prior publication (or information derived from it) or known matter forms part of the common general knowledge in the field of endeavour to which this specification relates.


EXAMPLES
Example 1. Materials and Methods
Shuffled AAV Capsid Plasmid Library Generation

Parental AAV cap genes (AAV1 through 12, AAV-mAAV1 (WO2019227168) and AAV-EVE1 (WO2017192699)) were cloned into the plasmid p-RescueVector (pRV 1-12), a construct based on the pGEM-T Easy Vector System (catalog [Cat] #A1360; Promega) modified to harbor trimethoprim resistance and randomized ends flanking the capsids, for optimal Gibson Assembly (GA). Individual clones were Sanger sequenced (Garvan Molecular Genetics). Capsid genes (serotypes 1-12) were excised using SwaI and NsiI (NEB), mixed at a 1:1 molar ratio, and digested with 1:10 prediluted DNaseI (Cat #M030S; NEB) for 2-5 min. The pool of fragments was separated on a 1% (w/v) agarose gel and fragments ranging from 200 to 1,000 bp were recovered using the Zymoclean Gel DNA Recovery Kit (Cat #D4001T; Zymogen). For each primer-less PCR reassembly reaction, 500 ng of gel-extracted fragments was used, and fully reassembled capsids were amplified in a second PCR with primers (Shuffling Rescue-F/R, Table 3) binding the cap gene and carrying overlapping ends to pRV plasmids. A GA reaction was performed by mixing an equal volume of 2× GA Master Mix (Cat #E2611L; NEB) with 1 pmol of PCR-amplified and DpnI-treated pRV (BB GAR-F/R, Table 3) and 1 pmol of the recovered shuffled capsids, at 50° C. for 30 min. DNA was ethanol precipitated and electroporated into SS320 electrocompetent E. coli (Cat #60512-2; Lucigen). The total number of transformants was calculated by preparing and plating five 10-fold serial dilutions of the electroporated bacteria. The pool of transformants was grown overnight in 250 mL of Luria-Bertani media supplemented with trimethoprim (10 mg/mL). Total pRV library plasmids were purified with an EndoFree Maxiprep Kit (Cat #12362; QIAGEN). pRV-based libraries were then digested overnight with SwaI and NsiI, and 1.4 μg of insert was ligated at 16° C. with T4 DNA ligase (Cat #M0202; NEB) for 16 hr into 1 μg of a replication-competent AAV2-based plasmid platform (p-Replication-Competent [p-RC]) containing ITR-2 and rep2, and unique SwaI and NsiI sites flanking a 1-kb randomized stuffer [ITR2-rep2-(SwaI)-stuffer-(NsiI)-ITR2]. Ligation reactions were concentrated by ethanol precipitation, electroporated into SS320 electrocompetent bacteria, and grown as described above. Total pRC library plasmids were purified with an EndoFree Maxiprep Kit (Cat #12362; QIAGEN).


In Vivo Selection of AAV Library

A humanised FRG (hFRG) mouse was injected with 1×10¹¹ vg of replication-competent RC-AAVC11 by i.v. tail vein administration. 5×10⁹ PFU of wild-type human adenovirus-5 (hAd5; ATCC, VR-5, Lot #70010153) were administered intraperitoneally (i.p.) 24 hr later. The xenograft liver was harvested 72 hr after hAd5 administration, homogenised and snap frozen in liquid nitrogen. To extract AAV particles, a liver fragment of approximately 0.3 g was subjected to three freeze-thaw cycles and mechanical homogenisation in the presence of 2× w/v of PBS. The sample was subsequently centrifuged for 30 min at 4° C. at top speed in a table-top centrifuge to separate the virus-containing supernatant from cellular debris. To inactivate hAd5, the virus-containing supernatant was incubated at 65° C. for 30 min. Following titration by qPCR, 200 μL of the virus-containing supernatant was administered i.p. into an hFRG mouse for the subsequent round of selection. A total of five rounds of selection were performed.


Vectorisation of AAV Cap Candidates

After round five of selection, AAV capsid sequences were recovered from the supernatant by PCR using primers flanking the capsid region (CapRescue-F/R, Table 3). PCR-amplified cap genes were cloned by GA in-frame downstream of the rep2 gene in a recipient pHelper packaging plasmid that had been opened by PCR amplification using primers pHelper-F/R (Table 3) and DpnI treated. Individual clones containing full-length cap candidates were then Sanger sequenced.


AAV Vector Packaging and Viral Production

AAV constructs were packaged into AAV capsids using HEK293 cells and a helper-virus-free system as previously described (Xiao et al., 1998, J Virol, 72(3):2224-32). Genomes were packaged in capsid serotypes AAV2, AAV8, LK03 and NP59 using packaging plasmid constructs pAAV2, pAAV8, pLK03 and pAAVNP59, respectively. Replication-competent (RC) library AAVC11 was packaged by co-transfection of a corresponding plasmid containing the full-length AAV genome (ITR2-rep2-cap-ITR2) and pAd5 into HEK-293T cells.


All vectors and viruses were purified using iodixanol gradient ultracentrifugation as previously described (Khan et al., 2011, Nat Protoc, 6(4):482-501). AAV preparations were titred by real-time quantitative PCR (qPCR) using eGFP-specific qPCR primers GFP-qPCR-For/Rev or AAV2-rep-specific qPCR primers Rep-qPCR-For/Rev (Table 3). For in vivo testing of capsid candidates (Example 2), n=4 independent barcoded transgenes were packaged per capsid using two different concentrations (n=2 barcoded transgenes at high dose: 10 μg/transgene per preparation, and n=2 barcoded transgenes at low dose: 1 μg/transgene per preparation). The presence of the two distinct populations was confirmed by next-generation sequencing (NGS) of the pre-injection mix. For further comparisons, n=5 barcoded transgenes were packaged at increasing concentration by co-transfecting 2, 4, 8, 12 and 16 μg per barcode per preparation. NGS analysis of the vector mix confirmed the presence of the five barcoded populations per capsid.


Mouse Studies

All animal care and experimental procedures were approved by the joint Children's Medical Research Institute (CMRI) and The Children's Hospital at Westmead Animal Care and Ethics Committee. CMRI's established Fah−/−/Rag2−/−/Il2rg−/− (FRG) mouse colony was used to breed recipient animals. FRG mice were housed in individually ventilated cages with 2-(2-nitro-4-trifluoro-methylbenzoyl)-1,3-cyclohexanedione (NTBC) supplemented in the drinking water. FRG mice, 6 to 8 weeks old, were engrafted with human hepatocytes (Lonza Group Ltd., Basel, Switzerland) as described previously (Azuma et al., 2007, Nat Biotechnol. 25(8):903-10). Humanised FRG (hFRG) mice were placed on 10% NTBC 1 week prior to transduction with vectors and were maintained on 10% NTBC until harvest.


The vector for injection was made up to a final volume of 150 μL using saline. Mice were randomly selected and transduced by intravenous injection (lateral tail vein) with the indicated vectors at a dose of 1×10¹⁰ vg/vector for NGS comparison, and at a dose of 2×10¹¹ vg/vector for immunohistochemistry. For in vivo IVIg screening, 5 mg or 20 mg of IVIg (Intragam 10, CSL Behring) were injected into hFRG mice (i.v.) 24 h prior to vector injection. Mice were euthanized by CO2 inhalation 2 weeks after transduction for immunohistochemistry and 1 week after transduction for barcoded Next-Generation Sequencing (NGS) analysis. Hepatocytes for flow cytometry analysis were obtained by collagenase perfusion of the liver (see below).


Isolation of Human Hepatocytes by Collagenase Perfusion

To perfuse the mouse liver and obtain a single-cell suspension, the inferior vena cava (IVC) was cannulated, and the solutions were pumped with an osmotic minipump (Gilson Minipuls 3) in the following order: 25 mL of Hank's balanced salt solution (HBSS) (−/−) (cat #H9394; Sigma), 25 mL of HBSS (−/−) supplemented with 0.5 mM EDTA, 25 mL of HBSS (−/−), and 25 mL of HBSS (−/−) supplemented with 5 mM CaCl2, 0.05% wt/vol collagenase IV (Sigma) and 0.01% wt/vol DNase I (Sigma).


Following perfusion, the liver was carefully removed and placed in a Petri dish containing 25 mL of Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% foetal bovine serum (FBS). The blunt end of a scalpel blade was used to break the liver capsule to release the cells into the medium. After collection, the cells were spun down at 50× g for 3 min at 4° C. The pellet was resuspended in 21 mL of DMEM and passed through a 100-μm nylon cell strainer. Isotonic Percoll (9 mL) (1 part of 10×PBS (−/−) with 9 parts of Percoll; GE Healthcare) was added to the cell suspension to separate live and dead cells. Live cells were pelleted at 650× g for 10 min at 4° C. and the pellet was resuspended in FACS buffer (PBS (−/−) with 5% FBS and 5 mM EDTA). To distinguish between mouse liver cells and human hepatocytes, cells were labelled with phycoerythrin (PE)-conjugated anti-human-HLA-ABC (clone W6/32; Invitrogen 12-9983-42; 1:20), biotin-conjugated anti-mouse-H2Kb (clone AF6-88.5, BD Pharmingen 553568; 1:100) and allophycocyanin (APC)-conjugated streptavidin (eBioscience 17-4317-82; 1:500). GFP-positive labelled samples were sorted to a minimum of 95% purity using a BD Influx cell sorter. Sorting of the GFP-positive population was included to enrich for murine hepatocytes among non-parenchymal cells, given the hepatocyte-restricted expression of the pLSP1-GFP-WPRE-BGHpA AAV construct. Flow cytometry was performed in the Flow Cytometry Facility, Westmead Institute for Medical Research, Westmead, NSW, Australia. The data were analysed using FlowJo 7.6.1 (FlowJo, LLC).


Human Albumin ELISA


Levels of human cell engraftment in chimeric mice were assessed by measuring the presence of human albumin in peripheral blood, using the Human Albumin ELISA Quantitation Kit (Bethyl, cat #E80-129) as previously reported (Azuma et al., 2007, Nat Biotechnol. 25(8):903-10).


Adeno-Associated Virus Transgene Constructs

AAV transgene constructs were cloned using standard molecular biological techniques. All of the vectors used in the study contain AAV2 ITR sequences. The AAV construct pLSP1-eGFP-WPRE-BGHpA, which encodes eGFP under the transcriptional control of a heterologous promoter containing one copy of the SERPINA1 (hAAT) promoter and two copies of the APOE enhancer element, has been previously reported (Dane et al., 2009, Mol Ther, 17(9):1548-54). Eighty-four (n=84) versions of the pLSP1-eGFP-BC-WPRE-BGHpA construct were produced by cloning n=84 unique 6-nucleotide-long barcodes (BC) downstream of eGFP.


DNA and RNA Isolation

To extract DNA from sorted cells, the cells were resuspended in 200 μL lysis buffer (100 mM Tris-HCl pH 8.5 (Astral Scientific, BioSD8141-450ML), 5 mM EDTA (ThermoFisher), 0.2% (w/v) sodium dodecyl sulphate (Sigma-Aldrich), 200 mM NaCl (Sigma-Aldrich)) containing 50 μg/mL of proteinase K (Bioline). Samples were incubated overnight at 56° C. DNA was extracted using a standard phenol:chloroform protocol with phenol:chloroform:isoamyl alcohol (25:24:1) (Sigma-Aldrich), followed by ethanol precipitation of the DNA.


RNA from sorted cells was extracted using the Direct-Zol kit (Zymogen Cat #R2062) and treated with TURBO DNase (ThermoFisher, Cat #AM2238). cDNA was synthesised using the SuperScript IV First-Strand Synthesis System, following the manufacturer's instructions (ThermoFisher, Cat #18091050).


Cell Culture, Vector Transduction and Heparin Competition Assay

HEK293 cells were validated and provided by ATCC. HuH-7 cells were provided by Dr Jerome Laurence (The University of Sydney). All cells were cultured in Dulbecco's Modified Eagle Medium (DMEM) (Gibco, 11965-092) supplemented with 10% FBS (Sigma Aldrich, F9423-500 mL, Lot #16K598), 100 Units/mL Penicillin, 100 μg/mL Streptomycin (Sigma Aldrich, P4458) and passaged using TrypLE Express Enzyme (Gibco, 12604-21). For HuH-7 cultures, media were also supplemented with non-essential amino acids (Gibco, 11140-050). All cells were tested for mycoplasma and were mycoplasma-free. For transduction studies, cells were plated into 24-well plates in complete DMEM at 2×10⁵ cells per well and incubated overnight in a tissue-culture incubator at 37° C./5% CO2. Sixteen hours later, the vector stock was diluted in 1 mL of complete DMEM and added to cells at the indicated vector genome copies per cell (vgc/cell). When indicated, serial 2-fold dilutions of intravenous immunoglobulin (IVIg) (Intragam 10, CSL Behring) were mixed with vectors for 1 h at 37° C. prior to cell transduction.


After a 72-h incubation, the cells were harvested using TrypLE Express (Gibco) and analysed for GFP using a BD LSRFortessa cell analyser. The data were analysed using FlowJo 7.6.1.


Barcode Amplification, Next-Generation Sequencing and Distribution Analysis

The 150 base pair region surrounding the 6-mer barcode was amplified with Q5 High-Fidelity DNA Polymerase (NEB, Cat #M0491L) using BC_F and BC_R primers (Table 3). Next-generation sequencing library preparations and sequencing using a 2×150 paired-end (PE) configuration were performed by Genewiz (Suzhou, China) using an Illumina MiSeq instrument. A workflow was written in Snakemake (5.6) (Koster et al. 2012 Bioinformatics 28:2520-2522) to process reads and count barcodes. Paired reads were merged using BBMerge and then filtered for reads of the expected length in a second pass through BBDuk, both from BBTools 38.68. The merged, filtered fastq files were passed to a Perl (5.26) script that identified barcodes corresponding to AAV variants.
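The barcode-counting step can be illustrated with a short sketch. The workflow described above used a Perl script whose details are not given; the Python below is only a stand-in for that step, and the flanking sequences, the barcode-to-capsid map and the reads are hypothetical placeholders rather than sequences from this disclosure.

```python
import re
from collections import Counter


def count_barcodes(reads, left_flank, right_flank, barcode_to_capsid, barcode_len=6):
    """Count reads per capsid by locating a fixed-length barcode between two flanks."""
    pattern = re.compile(
        re.escape(left_flank) + "([ACGT]{%d})" % barcode_len + re.escape(right_flank)
    )
    counts = Counter()
    for read in reads:
        match = pattern.search(read)
        if match is None:
            continue  # read lacks the expected flanks or has ambiguous bases
        capsid = barcode_to_capsid.get(match.group(1))
        if capsid is not None:
            counts[capsid] += 1  # barcodes not assigned to a capsid are ignored
    return counts


# Hypothetical flanks, barcodes and merged reads, for illustration only.
LEFT, RIGHT = "GGATCC", "GAATTC"
BARCODES = {"ACGTAC": "capsid_A", "TTGCAA": "capsid_B"}
reads = [
    "AAGGATCCACGTACGAATTCTT",
    "CCGGATCCTTGCAAGAATTCAA",
    "AAGGATCCACGTACGAATTCGG",
    "AAGGATCCNNNNNNGAATTCGG",  # ambiguous barcode, skipped
    "AAGGATCCGGGGGGGAATTCGG",  # unassigned barcode, skipped
]
counts = count_barcodes(reads, LEFT, RIGHT, BARCODES)
```

Anchoring the search on both flanks, rather than on a fixed read offset, is one simple way to tolerate small length differences between merged reads.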


Immunohistochemical Analysis of Mouse Livers

Mouse livers were fixed with 4% (w/v) paraformaldehyde, cryo-protected in 10-30% (w/v) sucrose before freezing in O.C.T. (Tissue-Tek; Sakura Finetek USA, Torrance, Calif.). Frozen liver sections (5 μm) were permeabilised in −20° C. methanol, then room temperature 0.1% Triton X-100, and then reacted with anti-human GAPDH antibody (Abcam, Cat #ab215227, Clone AF674), and DAPI (Invitrogen, D1306) at 0.08 ng/mL. After immunolabelling, the images were captured and analysed on a Zeiss Axio Imager.M1 using ZEN 2 software. The percentage of transduced human hepatocytes per field of view was determined by counting total human GAPDH-positive cells and eGFP/human GAPDH double-positive cells.


Sanger Sequencing

When specified, clones were Sanger-sequenced at the Garvan Molecular Genetics facility of the Garvan Institute of Medical Research (Darlinghurst, NSW, Australia) with External_Seq_F/R primers (Table 3).


Vector DNA Copy Number Per Cell

Vector copy numbers were measured with primers GFP-qPCR-For/Rev using Droplet Digital (dd)PCR (Bio-Rad, Berkeley, US) with QX200 ddPCR EvaGreen Supermix (Bio-Rad, Cat #1864034), following the manufacturer's instructions. Vector genomes were normalised to human albumin copy number using primers human_Alb_F/R_ddPCR.
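As an illustration of this normalisation, the sketch below converts ddPCR copy numbers into vector copies per diploid cell. It assumes that albumin is present at two copies per diploid genome; that factor, the function name and the example values are assumptions made for illustration and are not stated in the text.

```python
def vector_copies_per_diploid_cell(vector_copies, albumin_copies, albumin_copies_per_cell=2):
    """Normalise vector genome copies to the number of diploid cells assayed.

    Assumes (hypothetically) that each diploid cell carries
    `albumin_copies_per_cell` copies of the albumin reference locus.
    """
    if albumin_copies <= 0:
        raise ValueError("albumin copy number must be positive")
    cells = albumin_copies / albumin_copies_per_cell
    return vector_copies / cells


# Hypothetical ddPCR readout: 5000 vector copies and 2000 albumin copies
# measured in the same amount of template.
vcn = vector_copies_per_diploid_cell(5000.0, 2000.0)
```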









TABLE 3

Primer sequences

| SEQ ID NO | Name | Sequence |
|---|---|---|
| 40 | Shuffling_Rescue-F | GTCGGAAAGCATATGCCGCG |
| 41 | Shuffling_Rescue-R | GACGTCGCATGCAACTAGTAT |
| 42 | BB_GAR-F | ACTTGTTCACTTTGATGGCGAGG |
| 43 | BB_GAR-R | CTGCACACGACATGACATCACG |
| 44 | CapRescue-F | CCCTGCAGACAATGCGAGAGAATGAATCAGAATTCAAATATCTGC |
| 45 | CapRescue-R | ATGCATATGGAAACTAGATAAGAAAGAAATACG |
| 46 | pHelperF | CGCATTGTCTGCAGGGAAACAGCATC |
| 47 | pHelperR | TTTCTTTCTTATCTAGTTTCCATATGCATGTAGATAAGTAGCATGGCGGG |
| 48 | GFP-F1 | TCAAGATCCGCCACAACATC |
| 49 | GFP-R1 | TTCTCGTTGGGGTCTTTGCT |
| 50 | rep-F1 | CTCAACCCGTTTCTGTCGTC |
| 51 | rep-R2 | CACATTGACCAGATCGCAGG |
| 52 | BC_F | GCTGGAGTTCGTGACCGCCG |
| 53 | BC_R | CAACATAGTTAAGAATACCAGTCAATCTTTCACAAATTTTGTAATCCAGAGG |
| 54 | External_5_Seq | TGTGGATTTGGATGACTGC |
| 55 | External_3_Seq | GACCAAAGTTCAACTGAAACG |
| 56 | human_Alb_F | TGCTGTCATCTCTTGTGGGCTG |
| 57 | human_Alb_R | AACTCATGGGAGCTGCTGGTTC |









Example 2. Generation and Assessment of Novel Capsids

A shuffled DNA library was generated as described in Example 1. Replication-competent virus was produced from the library and injected into a hFRG mouse, and 5 rounds of selection were performed as described above to identify sixteen AAV capsid polypeptides: AAVC11.01 (SEQ ID NO:2), AAVC11.02 (SEQ ID NO:3), AAVC11.03 (SEQ ID NO:4), AAVC11.04 (SEQ ID NO:5), AAVC11.05 (SEQ ID NO:6), AAVC11.06 (SEQ ID NO:7), AAVC11.07 (SEQ ID NO:8), AAVC11.08 (SEQ ID NO:9), AAVC11.09 (SEQ ID NO:10), AAVC11.10 (SEQ ID NO:11), AAVC11.11 (SEQ ID NO:12), AAVC11.12 (SEQ ID NO:13), AAVC11.13 (SEQ ID NO:14), AAVC11.14 (SEQ ID NO:15), AAVC11.15 (SEQ ID NO:16), and AAVC11.16 (SEQ ID NO:17) (Table 4).


Four barcoded AAV transgenes (Liver Specific Promoter (LSP)-GFP-Barcode-WPRE-BGHpA) were packaged into each capsid (AAVC11.01 to AAVC11.16, AAV2, AAV8, LK03 and NP59) to produce vectors. As the yield from AAVC11.03, AAVC11.10 and AAVC11.16 vectors was lower than that of AAV2, these were excluded from further testing. The remaining vectors were co-injected (1×10¹⁰ vg/capsid; a total of 1.8×10¹¹ vg) into a hFRG mouse for comparison of function. One week after injection, the chimeric liver from the mouse was perfused and human and murine hepatocytes were single-cell sorted. DNA and RNA were recovered from the mouse and human populations of hepatocytes, and NGS of the barcoded transgene was performed on the DNA and RNA (cDNA).


As shown in FIG. 2, the majority of the novel vectors, including AAVC11.01, AAVC11.04, AAVC11.05, AAVC11.06, AAVC11.07, AAVC11.09, AAVC11.11, AAVC11.12, AAVC11.13 and AAVC11.15, were very effective at entering human hepatocytes and expressing the transgene, and these vectors were selected for further analysis.


AAVC11.01, AAVC11.04, AAVC11.05, AAVC11.06, AAVC11.07, AAVC11.09, AAVC11.11, AAVC11.12, AAVC11.13 and AAVC11.15, as well as AAV2, AAV8, LK03 and NP59, were re-packaged with five barcoded transgenes per capsid at increasing barcode concentration with the aim of studying the ratio of DNA to RNA conversion. The AAV-DJ vector was also included as a titer control. For each capsid, five 15-cm plates of HEK293T cells (approximately 20 million cells, 15 mL media) were independently transfected, processed and titered.


The vectors (excluding AAV-DJ) were then mixed at an equal ratio (1×10¹⁰ vg/capsid) and injected into a single hFRG mouse. Human and murine hepatocytes were isolated and sorted after one week. DNA and RNA were extracted and NGS performed on the DNA and cDNA. NGS of the pre-injection mix was also performed for validation, and the DNA and RNA (cDNA) reads from hepatocytes were normalized to pre-injection reads. This normalization is expressed as ‘Human Entry Index’ (HEI), which is a constant for each capsid in a given experiment and expresses how efficient a given capsid is at physically transducing human hepatocytes in relation to the other capsids included in the experiment. It was observed that regardless of initial barcode concentration, the HEI for each capsid remained constant (data not shown).


cDNA reads were then normalized to DNA reads. This normalization is expressed as ‘Human Expression Index’ (HEXI), which is a constant for each capsid in a given experiment and indicates how efficient a given capsid is at functionally transducing human hepatocytes, i.e. converting DNA reads into RNA reads. This is an important property, as some AAV capsids (e.g. AAV2) are relatively efficient at entering the hepatocytes but relatively deficient at functional transduction (i.e. transgene expression). FIG. 3 shows the HEXI for each vector.


The HEI and HEXI were converted into a normalized percentage read to analyze the overall functional transduction power of the tested capsids. This data is shown in FIGS. 4A and B.
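The two indices can be sketched numerically as follows. This is a minimal illustration that assumes each normalization is a ratio of read shares (a capsid's fraction of all reads in a sample); the exact arithmetic used in the study is not spelled out here, and the capsid names and read counts are hypothetical.

```python
def read_shares(counts):
    """Convert raw read counts per capsid into fractions of the sample total."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample contains no reads")
    return {capsid: n / total for capsid, n in counts.items()}


def entry_and_expression_indices(pre_injection, dna, cdna):
    """Return (HEI, HEXI) per capsid from raw barcode read counts.

    HEI  = DNA read share / pre-injection read share   (cell entry)
    HEXI = cDNA read share / DNA read share            (expression per entered genome)
    Normalising both cDNA and DNA to the pre-injection mix first gives the
    same HEXI, because the pre-injection share cancels in the ratio.
    """
    pre, d, r = read_shares(pre_injection), read_shares(dna), read_shares(cdna)
    hei = {c: d[c] / pre[c] for c in pre}
    hexi = {c: r[c] / d[c] for c in pre}
    return hei, hexi


# Hypothetical read counts for two capsids.
pre_mix = {"capsid_A": 500, "capsid_B": 500}
dna_reads = {"capsid_A": 750, "capsid_B": 250}
cdna_reads = {"capsid_A": 600, "capsid_B": 400}
hei, hexi = entry_and_expression_indices(pre_mix, dna_reads, cdna_reads)
```

In this made-up example capsid_A enters cells more efficiently (HEI above 1) but capsid_B expresses better per entered genome (HEXI above 1), which mirrors the AAV2 versus AAV8 contrast described in the text.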


It has been observed that the rate of DNA to RNA conversion follows a linear trend, with a slope corresponding to each specific HEXI (RNA/DNA). Non-normalized DNA reads vs non-normalized RNA reads were plotted, where the x-axis extension gives an estimate of how efficient a capsid is at human entry, and the slope gives the approximate ratio of DNA to RNA conversion. When doing such an analysis, it becomes apparent that AAV2 is relatively better than AAV8 at human entry, but AAV8 is relatively better than AAV2 at expression (functional transduction) (data not shown). This analysis was performed with NP59 and AAVC11.04, AAVC11.06, AAVC11.11, AAVC11.12 and AAVC11.13, and demonstrated that each of AAVC11.04, AAVC11.06, AAVC11.11, AAVC11.12 and AAVC11.13 is comparable to NP59, a highly efficient capsid described previously (Paulk et al., 2018, Mol Ther 26:289-303).
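The slope of such a DNA-versus-RNA plot can be estimated with a simple least-squares fit. The sketch below assumes a line forced through the origin (zero DNA reads giving zero RNA reads), which is an illustrative modelling choice and not necessarily the fit used in the study; the read counts are hypothetical.

```python
def slope_through_origin(dna_reads, rna_reads):
    """Least-squares slope of rna = slope * dna, with the intercept fixed at zero."""
    if len(dna_reads) != len(rna_reads) or not dna_reads:
        raise ValueError("need two non-empty sequences of equal length")
    sxx = sum(x * x for x in dna_reads)
    if sxx == 0:
        raise ValueError("all DNA read counts are zero")
    sxy = sum(x * y for x, y in zip(dna_reads, rna_reads))
    return sxy / sxx


# Hypothetical reads for five barcodes of one capsid packaged at increasing dose.
dna = [100, 200, 400, 600, 800]
rna = [150, 300, 600, 900, 1200]
slope = slope_through_origin(dna, rna)  # approximates the RNA/DNA ratio
x_extent = max(dna)                     # rough proxy for entry efficiency
```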


Example 3. IVIg Neutralization Resistance

Having identified the most functional AAVC11 variants, their relative in vivo performance in human hepatocytes in the presence of pooled human immunoglobulins was investigated. To do so, following a method recently reported (Cabanes-Creus et al. 2020, Mol Ther Methods Clin Dev, 17:1139-1154), five barcoded AAV-LSP1-eGFP cassettes were packaged at increasing concentrations in the selected AAV variant capsids. AAV2, AAV8, AAV-LK03, and AAV-NP59 were included as controls. Three hFRG animals were passively immunized by intravenous administration of increasing doses of pooled human IgGs 24 h before AAV administration (1×10¹⁰ vg/capsid). A control hFRG animal that received no IVIg was also included (the same animal as used for the study shown in FIG. 3). One week later, human hepatocytes were sorted and the vector copy number per diploid genome determined. An IVIg dose-dependent reduction of vector genomes per cell was observed, leading to a >500-fold difference between the no-IVIg control (hFRG #1=321.25 vc/dc) and the hFRG mouse pre-injected with 20 mg of human immunoglobulin (hFRG #4=0.63 vc/dc). hFRG mice pre-injected with 1 mg (hFRG #2) or 5 mg (hFRG #3) of human immunoglobulin also showed reduced vector genomes (hFRG #2=81.16 vc/dc; hFRG #3=10.62 vc/dc).


The relative performance of the individual AAV variants in the human hepatocytes harvested from hFRG #1 (the no-IVIg control) was then analysed. As shown in FIG. 4, all AAV variants, except for AAVC11.09, transduced hepatocytes with high efficiency compared to benchmark AAV-NP59, as measured at the DNA (cell entry) and RNA/cDNA (transgene expression) levels. Since the percentage of DNA reads ultimately indicates the contribution of each AAV variant to the final vector copy number per cell, it is possible to empirically estimate the IVIg neutralization effect for each capsid (FIG. 4C). The reduction in vector genome copies per capsid was calculated and expressed as a logarithm of the quotient between the IVIg and the no-IVIg control (i.e., a value of −1 indicates a 10-fold reduction on vector genomes/capsid, FIG. 4C). AAV8 was found to be the most resistant to neutralization by human IVIg. Interestingly, in contrast to previous reports (Lisowski et al. 2014, Nature, 506(7488):382-6; Cabanes-Creus et al. 2020, Mol Ther Methods Clin Dev, 17:1139-1154), bioengineered AAV-LK03 and AAV-NP59 (AAV3b- and AAV2-like, respectively) were also strongly neutralized at the IVIg concentrations tested in this in vivo model. All AAVC11 variants presented intermediate resistance between AAV8 and AAV-NP59 at all IVIg doses tested.
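The per-capsid neutralization estimate described above can be sketched as follows. The sketch assumes that a capsid's vector genomes per cell equal the total vector copy number multiplied by that capsid's share of DNA reads; the function name and the example values are hypothetical and are not data from this study.

```python
import math


def log10_reduction(total_vc_ivig, share_ivig, total_vc_control, share_control):
    """log10 of (per-capsid vector genomes with IVIg / without IVIg).

    Per-capsid vector genomes are approximated as total vector copies per
    diploid cell multiplied by the capsid's fraction of DNA reads.
    A result of -1 corresponds to a 10-fold reduction.
    """
    with_ivig = total_vc_ivig * share_ivig
    without_ivig = total_vc_control * share_control
    if with_ivig <= 0 or without_ivig <= 0:
        raise ValueError("vector genome estimates must be positive")
    return math.log10(with_ivig / without_ivig)


# Hypothetical capsid holding 20% of DNA reads in both animals, with the total
# vector copy number falling from 100 to 10 copies per diploid cell.
reduction = log10_reduction(10.0, 0.2, 100.0, 0.2)
```

A capsid whose read share rises in the IVIg-treated animal has a less negative value than the overall drop in vector copy number, which is how relative resistance to neutralization shows up in this metric.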


As a final validation, the top three performers (AAVC11.06, AAVC11.11, and AAVC11.12) were injected into individual humanised FRG mice, using AAV-NP59 as a control (2×10¹¹ vg/hFRG). As shown in FIG. 4D, AAVC11.12 was found to be significantly more functional than the AAV-NP59 control. Based on these results, AAVC11.12 was evaluated further. Because the ability to study vector function in preclinical models can have a substantial influence on its clinical development, the performance of AAVC11.12 in non-engrafted FRG mice using the same dose as in hFRG studies (2×10¹¹ vector genomes/mouse) was evaluated. It was observed that AAVC11.12 can functionally transduce murine liver cells, although with substantially lower efficiency than human hepatocytes (data not shown), consistent with the observations shown in FIG. 2 and described in Example 4.


Example 4. Immunohistochemical Analysis

AAVC11.12 and AAVC11.13 were injected into individual hFRG mice at 2×10¹¹ vg/mouse. Livers were harvested two weeks after injection and processed for immunohistochemistry. DAPI (blue) was used to stain all cells (murine/human) and an antibody against human GAPDH (hGAPDH, red) was used to stain only human cells. eGFP (green) expressed from the AAV indicated cells that were functionally transduced with rAAV. It was observed that AAVC11.12 and AAVC11.13 preferentially transduced human hepatocytes (data not shown).


Example 5. Further Assessment of AAVC11.12

The inventors then investigated whether relative transduction efficiency among AAV variants is dependent on the origin of the engrafted human hepatocytes. To do so, an equimolar mix of barcoded AAVs was produced that, in addition to AAVC11.12, contained prototypical variants (AAV2, AAV3b, AAV5, AAV8), bioengineered variants (AAV-LK03, AAV-NP59, AAV2-N496D (Cabanes-Creus et al. 2020, Mol Ther Methods Clin Dev, 17:1139-1154), AAV2-RC01), as well as the naturally occurring human variant AAV-hu.Lvr02 (Australian provisional patent no. 2020904687 and Cabanes-Creus et al. 2020, Sci Transl Med, 12(560):eaba3312). FRG mice were engrafted with hepatocytes from seventeen different human donors, varying in age, gender, and ethnicity (n=2 hFRGs per donor, n=1 for donors 13 and 16). The level of liver repopulation was assessed by measuring the concentration of human albumin in the blood, with the aim of performing the barcoded NGS-based comparison at mid-levels of engraftment (average of 3.6 mg human albumin/mL blood, which corresponds to a 20-60% level of human engraftment). Although there was evident variability in the engraftment rate between donors, a positive correlation between the concentration of human albumin and the percentage of human hepatocytes in harvested livers was observed (data not shown). Each animal was injected i.v. with 1×10¹¹ vg, which corresponds to a dose of 1×10¹⁰ vg per capsid variant. One week post-injection, the chimeric livers were perfused, human GFP-positive hepatocytes were sorted, and the vector copy number per cell and the barcode composition for each sample were analysed. It was observed that the AAV vector mix transduces human hepatocytes more efficiently than murine cells, as estimated by the respective GFP-positive population in live cells (FIG. 5A). No significant difference in AAV transduction between male and female human donors was found when assessed based on the percentage of GFP+ cells (FIG. 5B), although the vector copy number per diploid cell was found to be marginally higher in female hepatocytes (FIG. 5C, which is in agreement with recently published data from Zou et al. 2020, Mol Ther Methods Clin Dev 18:189-198). The normalized percentages corresponding to the overall share of NGS reads per AAV capsid are shown in FIGS. 5D-F. The relative performance of the AAV vectors analysed appeared unaffected by the source of primary human hepatocytes in this model. More specifically, bioengineered variants AAV-NP59, AAV2-N496D, AAV2-RC01 and AAVC11.12, and the naturally occurring AAV-hu.Lvr02, entered human hepatocytes, as measured at the DNA level, more efficiently than vectors based on prototypical capsids (AAV2/3b/5/8) and bioengineered AAV-LK03 (FIG. 5D). The average physical transduction was higher for AAV-hu.Lvr02 and AAVC11.12, and these differences were significant when compared to the other variants (FIG. 5D). Analysis of the barcoded transgenes at the cDNA level, which estimates the functional performance, revealed substantial differences between individual variants, with AAVC11.12 emerging as the most functional variant among the cohort tested (FIG. 5E). To gain a better understanding of relative vector fitness, the relative differences between cell entry (DNA, FIG. 5D) and expression (RNA/cDNA, FIG. 5E) were analysed and reported as an expression index (FIG. 5F). Interestingly, the analysis revealed that while AAVC11.12 had an expression index >1, and thus accounted for a larger fraction of RNA/cDNA reads than DNA reads, the other vectors, especially AAV2, AAV3b, and AAV-hu.Lvr02, lost relative share of reads at the RNA/cDNA level (FIG. 5F), highlighting differences between physical transduction and vector function (transgene expression). Consistent with previous reports, AAV-NP59 functionally transduced human hepatocytes with high efficiency. Of interest, AAV8 also had an expression index >1, suggesting that the relatively inferior performance of this variant in human hepatocytes may be caused by suboptimal cell entry (FIG. 5F).
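The expression index described above can be illustrated with a minimal sketch. It assumes the index is the ratio of a capsid's share of RNA/cDNA reads to its share of DNA reads, which is consistent with the statement that an index >1 means a larger fraction of RNA/cDNA reads than DNA reads, although the exact calculation is not spelled out here. The function names, capsid labels and read counts are hypothetical and are not data from the study.

```python
def read_shares(counts):
    """Convert raw barcode read counts into each capsid's fraction of all reads."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads to normalise")
    return {capsid: n / total for capsid, n in counts.items()}


def expression_index(dna_counts, cdna_counts):
    """Ratio of cDNA read share to DNA read share per capsid (assumed definition)."""
    dna = read_shares(dna_counts)
    cdna = read_shares(cdna_counts)
    return {capsid: cdna[capsid] / dna[capsid] for capsid in dna}


# Made-up read counts for two placeholder capsids; not data from the study.
dna_reads = {"capsid_A": 500, "capsid_B": 500}
cdna_reads = {"capsid_A": 750, "capsid_B": 250}

index = expression_index(dna_reads, cdna_reads)
for capsid, value in index.items():
    print(capsid, value)
```

In this toy case the two capsids enter cells equally (equal DNA shares), but capsid_A gains read share at the cDNA level and so has an index above 1, while capsid_B falls below 1.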


Example 6. Identification of Additional Capsids

It was observed that the three top capsids based on RNA reads (AAVC11.06, AAVC11.12, AAVC11.13) were part of a phylogenetic cluster. Four additional clones from the same selection that clustered with AAVC11.06, AAVC11.12 and AAVC11.13 were sequenced and named AAVC11.17 (SEQ ID NO:18), AAVC11.18 (SEQ ID NO:19), and AAVC11.19 (SEQ ID NO:20) (Table 5).


Example 7. Phylogenetic Analysis of Capsids

Phylogenetic analysis and analysis of the parental contribution were performed. As shown in FIG. 6, multiple parental capsids contributed to the sequence of each of the new capsids (see FIG. 6A of Australian Provisional Application No. 2020900529 for phylogenetic analysis).


Example 8. In Vivo Functional Comparison of AAVC11.12 to Parental Variants

Given the substantially superior performance of AAVC11.12 when compared to other liver-tropic vectors, studies were performed to investigate which capsid regions were the main determinants of human hepatocyte tropism in the hFRG model. Because AAVC11.12 was selected from a DNA-family shuffled library, it harbours regions of multiple parental variants (AAV1/AAV6, AAV2, AAV3b, AAV7, AAV10, and AAV12) as depicted in detail in FIG. 7. Interestingly, all of the functional AAVC11 variants described herein share high sequence identity and common parental capsid regions for Variable Region (VR) I (AAV2), VRs IV and V (AAV10), and VRs VI to VIII (AAV7), except for AAVC11.13 in which the region from parental AAV7 extended to VR-V (FIGS. 1 and 5B). A barcoded NGS comparison of AAVC11.12 with parental AAV2, AAV7, and AAV10 using two humanised FRG mice was performed. AAV8 was included as a positive control for the transduction of murine cells. As shown in FIG. 8, AAVC11.12 was found to significantly outperform all parental variants at human hepatocyte physical (DNA) and functional (RNA/cDNA) transduction. Of interest, AAVC11.12 was observed to physically transduce the murine liver at an efficiency similar to AAV7, AAV8, and AAV10. However, as observed before, this physical transduction was associated with relatively weak functional transduction of murine cells when compared to the parental variants. These data suggest that the superior function of AAVC11.12 in human hepatocytes results from a unique combination of parental features that in isolation are not sufficient to provide the benefit to any of the parental AAVs.


Example 9. Identification of Variable Regions Important for Human Hepatocyte Tropism

Given the differential performance of AAVC11.12 (SEQ ID NO:13) and AAV8 (SEQ ID NO:64) in human and murine cells, and to understand which functional capsid domains are responsible for the superior function of AAVC11.12, a series of domain swaps between the two AAVs was generated. As schematically shown in FIG. 9, combinations of variable regions I (AAV2 origin), IV-V (AAV10 origin), and VI-VIII (AAV7 origin) from AAVC11.12 were systematically cloned into the AAV8 capsid scaffold. Specific amino acid changes between AAV8 and the swapped variants are shown in Table 4. FIG. 10 provides an alignment between AAVC11.12 (SEQ ID NO:13) and AAV8 (SEQ ID NO:64), also showing the residues from AAVC11.12 that were substituted into AAV8. The amino acid and nucleic acid sequences of the resulting capsid polypeptides (i.e. Swap1-Swap15) are provided in Table 5, below.









TABLE 4

Amino acid changes between AAV8 and the Variable region swaps. Each change is given as the AAV8 residue and position followed by the residue present in the swap variant (del = deleted).

Swap1 (Changes = 7): N263del, G264del, T265S, S266Q, G267S, T270S, T274H

Swap2 (Changes = 16): T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V

Swap3 (Changes = 28): E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap4 (Changes = 23): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V

Swap5 (Changes = 35): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V

Swap6 (Changes = 44): T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap7 (Changes = 51): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap8 (Changes = 41): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap9 (Changes = 45): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap10 (Changes = 48): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap11 (Changes = 35): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N

Swap12 (Changes = 42): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P

Swap13 (Changes = 26): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, E534D, N540S, I542V

Swap14 (Changes = 39): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E, D559N, M561L, L562M, S564N, K569R, T570P

Swap15 (Changes = 32): N263del, G264del, T265S, S266Q, G267S, T270S, T274H, T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A, T495L, G496S, A507G, G508A, A520V, I524V, A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N
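Each entry of Table 4 pairs an AAV8 residue and position with either a replacement residue or a deletion, which can be written compactly as, for example, T265S or N263del. The following is a minimal sketch of how such a list could be applied to a parental sequence in silico. It is an illustration only and not the cloning procedure used to generate the swaps; the function name, the change-string format and the short toy sequence are assumptions introduced here, since the AAV8 sequence (SEQ ID NO:64) is not reproduced in this section.

```python
import re

# One change: parental residue, 1-based parental position, then new residue or "del".
CHANGE_PATTERN = re.compile(r"^([A-Z])(\d+)(del|[A-Z])$")


def apply_changes(parent, changes):
    """Apply substitutions and deletions, all numbered against the parent sequence."""
    residues = list(parent)
    for change in changes:
        match = CHANGE_PATTERN.match(change)
        if match is None:
            raise ValueError(f"unrecognised change: {change!r}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        index = position - 1
        if index >= len(residues) or residues[index] != old:
            raise ValueError(f"{change}: parent does not have {old} at position {position}")
        # Deleted residues become empty strings, so later positions keep parental numbering.
        residues[index] = "" if new == "del" else new
    return "".join(residues)


# Toy seven-residue parent with hypothetical positions; not a real capsid sequence.
toy_parent = "MNGTSGA"
toy_changes = ["N2del", "G3del", "T4S", "S5Q", "G6S"]
toy_swap = apply_changes(toy_parent, toy_changes)
print(toy_swap)
```

Checking the stated parental residue before editing guards against numbering drift, which matters here because deletions such as N263del and G264del shift every downstream position in the resulting variant.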









Two independent barcoded-AAV NGS comparisons among these variants were then performed. In the first experiment (N=2 hFRGs, hFRG #1 and #2), AAV8-Swap1 to AAV8-Swap7 were compared, with AAVC11.12 and AAV8 included as controls. As shown in FIG. 11, the introduction of AAV2's VR-I and AAV7's VR-VI to VR-VIII was sufficient to significantly enhance the performance of AAV8 in human hepatocytes (AAV8-Swap5, FIG. 11). In contrast, VRs IV-V from AAV10 appeared not to have any substantial effect on the transduction of human cells (compare AAV8-Swap5 and AAV8-Swap7, FIG. 10). AAV8-Swap6, which maintained AAV8's VR-I, displayed lower entry into human hepatocytes than AAV8, although the substantial increase in read share in the cDNA population suggests an outstanding performance at DNA to RNA conversion (FIG. 11). The phenotype of AAV8-Swap6 was even more pronounced in murine hepatocytes (FIG. 11). In these cells, the inclusion of VRs VI-VIII from AAV7 enhanced entry and expression of AAV8 (AAV8-Swap3, FIG. 11).


In the second comparison (N=2 hFRGs, hFRGs #3 and #4, FIG. 12), the inventors extended the barcoded-AAV mix to include all fifteen AAV8 swaps. The same relative trend was confirmed as in study #1 for Swap5, Swap6, and Swap7. Additionally, the analysis of results from systematic reversion of variable regions back to AAV8 (Swap8 to Swap15) suggested that VR-VI (of AAV7 origin) was not essential for enhancing human performance (compare Swap7 and Swap10). In contrast, the reversion of VR-VII and VR-VIII affected both entry and expression in human cells. Regarding the murine sample, the highly efficient DNA to RNA transcription for AAV8-Swap6 was confirmed in this larger comparison pool.
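The Swap7 versus Swap10 comparison can be made concrete by listing which Table 4 changes are present in one variant but not the other. The sketch below hard-codes the two change lists as they appear in Table 4 and takes the difference. The helper names are hypothetical, and reading the three resulting positions as the VR-VI reversion is an inference from the text, since the table does not state the boundaries of each variable region.

```python
def parse_changes(text):
    """Split a comma-separated list of changes (e.g. 'T265S, N263del') into a list."""
    return [item.strip() for item in text.replace("\n", " ").split(",") if item.strip()]


def changes_only_in(first, second):
    """Changes present in `first` but absent from `second`, in table order."""
    second_set = set(second)
    return [change for change in first if change not in second_set]


# Change lists as given for Swap7 and Swap10 in Table 4.
SWAP7 = parse_changes("""
N263del, G264del, T265S, S266Q, G267S, T270S, T274H,
T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A,
T495L, G496S, A507G, G508A, A520V, I524V,
E534D, N540S, I542V,
Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E,
D559N, M561L, L562M, S564N, K569R, T570P,
A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N
""")
SWAP10 = parse_changes("""
N263del, G264del, T265S, S266Q, G267S, T270S, T274H,
T453S, A458Q, N459G, T462Q, G464L, G468A, N471A, T472N, A474S, N475A,
T495L, G496S, A507G, G508A, A520V, I524V,
Q548T, N549G, A551T, R552del, D553N, N554K, A555T, D556T, Y557L, S558E,
D559N, M561L, L562M, S564N, K569R, T570P,
A583S, D584S, Q588A, Q589A, P593A, I595T, G596Q, T597V, S600N
""")

print(changes_only_in(SWAP7, SWAP10))  # ['E534D', 'N540S', 'I542V']
print(changes_only_in(SWAP10, SWAP7))  # []
```

The same helper could be pointed at any other pair of rows in Table 4, for example Swap7 against Swap11 or Swap12, to see which block of changes each reversion removes.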


To validate these results, a multiplexed immunofluorescence comparison of AAV8+Swap5 and AAV8+Swap6 was performed in two independent hFRGs. Briefly, to allow visualisation of transduction patterns of two AAVs in the same animal, two AAV cassettes expressing the Cerulean or the Venus fluorescent reporters under the control of a liver-specific promoter were cloned. 1×10¹¹ vg of AAV8-Cerulean with Swap5-Venus was mixed with AAV8-Cerulean with Swap6-Venus and injected into two independent hFRG mice. The immunofluorescence experiments confirmed the NGS results, with Swap5 transducing human hepatocytes substantially better than AAV8, and Swap6 displaying poor cell entry and strong expression in both human and murine hepatocytes (data not shown).


In a further validation of the results, the same barcoded mix from the first experiment (i.e. AAVC11.12 and AAV8, as well as AAV8-Swaps1-7) was injected in two highly engrafted mice. The highly engrafted mice had an average of 11 mg human albumin per mL blood, compared to the “low engraftment” mice from the previous experiments, which had an average of 1.8 mg human albumin per mL blood. The relative NGS reads mapped to each capsid were analyzed as previously for DNA and cDNA populations. As shown in FIG. 13, the overall trend was similar to that observed with the low engraftment mice, although the percentages flattened. This might reflect an increase in vector availability for AAV8, Swap3 and Swap6, which each contain VR-I from AAV8. The VR-I from AAV8 appears to impart a preference for murine hepatocytes, such that when murine hepatocytes are present, a portion of the vectors enter murine hepatocytes rather than human hepatocytes. When fewer murine hepatocytes are present, such as in the high engraftment mice, there is greater observed entry of these vectors into the human hepatocytes.


In summary, it appears that VR-VII (in particular) and VR-VIII, both from AAV7, alone or in combination, are important for efficient transduction of human hepatocytes (as evidenced by the reduction in transduction for Swap11 and Swap12 compared to Swap7). Conversely, it appears that VR-VI (also from AAV7) is dispensable for improving AAV8 performance in humans (see Swap5 compared to Swap10). VR-I, which is from AAV2, may be important for entry into human hepatocytes, such that the combination of the AAVC11.12 VR-I and VR-VII and/or VR-VIII appears to impart good entry into human hepatocytes and also good expression. In contrast, the combination present in Swap6, i.e. VR-I from AAV8, VR-IV and V from AAV10, and VR-VI, VR-VII and VR-VIII from AAV7, appears to impart much poorer entry into human hepatocytes but strong expression nonetheless, a phenotype that may have some advantages in the context of gene therapy (e.g. comparable expression with less physical transduction, potentially lessening concerns around DNA integration).









TABLE 5







Capsid Sequences









SEQ ID NO    Name    Sequence





 1
AAV2
MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFNGLD



prototypic
KGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSFGGNLGRAVFQ



capsid-VP1
AKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAGQQPARKRLNFGQTGDAD



(protein)
SVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPC




YRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQSGVLIF




GKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVLPG




MVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTTFSAA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFTVDTNGVYSEPRP




IGTRYLTRNL





 2
AAVC11.01
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTDNNGVKTIANNLTSTVQVFTDSDYQLPYVLGSAH




QGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPF




HSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPG




PCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGV




LIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFSQ




AKLASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPR




PIGTRYLTRPL





 3
AAVC11.02
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYSFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLARTQSNPGGTAGNRELQFYQGGPSTMAEQAKNWLPG




PCFRQQRVSKTLDQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGV




LIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPAEFSA




TKFASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPR




PIGTRYLTRPL





 4
AAVC11.03
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPATPAAVGPTTMASGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSETAGSTNDNTYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEEVPFH




SSYAHSQSLDRLMNPLIDQYLYYLNRTQNQSGSAQNKDLLFSRGSPAGMSVQPKNWLPGP




CYRQQRVSKTKTDNNNSNFTWTGASKYNLNGRESIINPGTAMASHKDDEDKFFPMSGVMI




FGKESAGASNTALDNVMITDEEEIKATNPVATERFGTVAVNFQSSSTDPATGDVHVMGALP




GMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFSQ




AKLASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPR




PIGTRYLTRPL





 5
AAVC11.04
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAGPSGLGSGTVAAGGGAPMADNNEGADGVGNSSGNWHCDSQWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





 6
AAVC11.05
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTDNNGVKTIANNLTSTVQVFTDSDYQLPYVLGSAH




EGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPF




HSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPG




PCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGV




LIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTP




AKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPR




PIGTRYLTRNL





 7
AAVC11.06
MAADGYLPDWLEDTLSEGIREWWALKPGAPQPKANQQHQDNGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNEADAAALEHDKAYDKQLEQGDNPYLKYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRILEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKKLSFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





 8
AAVC11.07
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAGPSGLGSGTVAAGGGAPMADNNEGADGVGNSSGNWHCDSQWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





 9
AAVC11.08
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTDNNGVKTIANNLTSTVQVFTDSDYQLPYVLGSAH




EGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPF




HSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPG




PCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGV




LIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPAEFSA




TKFASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPR




PIGTRYLTRPL





10
AAVC11.09
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTDNNGVKTIANNLTSTVQVFTDSDYQLPYVLGSAH




EGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPF




HSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPG




PCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGV




LIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFSQ




AKLASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPR




PIGTRYLTRPL





11
AAVC11.10
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAGPSGLGSGTVAAGGGAPMADNNEGADGVGNSSGNWHCDSQWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFSQA




KLASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPRP




IGTRYLTRNL





12
AAVC11.11
MAADGYLPDWLEDTLSEGIREWWALKPGAPQPKANQQHQDNGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNEADAAALEHDKAYDKQLEQGDNPYLKYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYSFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKNPPPQILIKNTPVPANPPAEFSAT




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPRP




IGTRYLTRPL





13
AAVC11.12
MAADGYLPDWLEDTLSEGIREWWALKPGAPQPKANQQHQDNGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNEADAAALEHDKAYDKQLEQGDNPYLKYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRILEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





14
AAVC11.13
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRILEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CFRQQRVSKTLDQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





15
AAVC11.14
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEQSPQEPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAGPSGLGSGTVASGGGAPMADNNEGADGVGNSSGNWHCDSQWLGD




RVITTSTRTWALPTYNNHLYKQISNSTSGGSSNDNAYFGYSTPWGYFDFNRFHCHFSPRD




WQRLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGS




AHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDV




PFHSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWL




PGPCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSS




GVLIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGA




LPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPAEF




SATKFASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTE




PRPIGTRYLTRPL





16
AAVC11.15
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAGPSGLGSGTVAAGGGAPMADNNEGADGVGNSSGNWHCDSQWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKNPPPQILIKNTPVPANPPAEFSAT




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPRP




IGTRYLTRPL





17
AAVC11.16
MAADGYLPDWLEDTLSEGIREWWALKPGAPQPKANQQHQDNGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNEADAAALEHDKAYDKQLEQGDNPYLKYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRILEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLNFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFSQA




KLASFITQYSTGQVSVEIEWELQKENSKRWNPEVQYTSNYAKSANVDFTVDNNGLYTEPRP




IGTRYLTRPL





18
AAVC11.17
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEAAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKKLRFKLFNIQVKEVTTNDGVTTIANNLTSTIQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYSFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





19
AAVC11.18
MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL



(protein)
DKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSSGIGKTGQQPAKKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTTNDGVTTIANNLTSTVQVFSDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





20
AAVC11.19 (protein)
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNEADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGVGNASGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKKLSFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFEFSYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPANPPEVFTPA




KFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVSVDFTVDTNGVYSEPRP




IGTRYLTRNL





21
AAVC11.01 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCAGACTCC




TCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCTAAGCGACTCAACT




TCAAGCTCTTCAACATTCAGGTCAAAGAGGTTACGGACAACAATGGAGTCAAGACCATC




GCCAATAACCTTACCAGCACGGTCCAGGTCTTCACGGACTCAGACTATCAGCTCCCGTA




CGTGCTCGGGTCGGCTCACCAGGGCTGCCTCCCGCCGTTCCCAGCGGACGTCTTCATG




ATTCCTCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATC




CTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTACCTT




CAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTTTGG




ACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCA




CAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATG




TCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCA




CGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGA




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCA




CCCGTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTGATCA




AGAACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGCGAAATTGGCTTCC




TTCATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGA




AGGAAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCT




GCCAACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGC




ACCCGTTACCTTACCCGTCCCCTGTAA





22
AAVC11.02 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCAGACTCC




TCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAACT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGTTGCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGAT




TCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCCT




TTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTGAGTTCA




GCTACAGCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCCTGGAC




CGGCTGATGAATCCCCTCATCGACCAGTACTTGTACTACCTGGCCAGAACACAGAGTAA




CCCAGGAGGCACAGCTGGCAATCGGGAACTGCAGTTTTACCAGGGCGGGCCTTCAACT




ATGGCCGAACAAGCCAAGAATTGGTTACCTGGACCTTGCTTCCGGCAACAAAGAGTCTC




CAAAACGCTGGATCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCA




CCTGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACG




ACGAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACT




AACAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAAT




CCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGC




AGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACC




GGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTT




CACCCGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTCATC




AAAAACACGCCTGTTCCTGCGAATCCTCCGGCGGAGTTTTTCAGCTACAAAGTTTGCTTCA




TTCATCACCCAATACTCCACAGGACAAGTGAGCGTGGAGATTGAATGGGAGCTGCAGAA




AGAAAACAGCAAACGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCTG




CCAACGTTGATTTCACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCA




CCCGTTACCTCACCCGTCCCCTGTAA





23
AAVC11.03 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAAGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGTCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTAAATTTCGGTC




AGACTGGCGACTCAGAGTCAGTCCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAACC




CCCGCTGCTGTGGGACCTACTACAATGGCTTCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCCGCACCTGGGCCTTGCCCACCTACAA




CAACCACCTCTACAAGCAAATCTCCAGTGAAACTGCAGGTAGTACCAACGACAACACCTA




CTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTC




ACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGGCTCA




ACTTCAAACTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACAACC




ATCGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCG




TACGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCAT




GATTCCGCAATACGGCTACCTGACGCTCAACAATGGCAGCCAAGCCGTGGGACGTTCAT




CCTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAACAACTTTACCT




TCAGCTACACCTTTGAGGAAGTGCCTTTCCACAGCAGCTACGCGCACAGCCAGAGCCTG




GACCGGCTGATGAATCCTCTCATCGACCAATACCTGTATTACCTGAACAGAACTCAAAAT




CAGTCCGGAAGTGCCCAAAACAAGGACTTGCTGTTTAGCCGTGGGTCTCCAGCTGGCAT




GTCTGTTCAGCCCAAAAACTGGCTACCTGGACCCTGTTATCGGCAGCAGCGCGTTTCTA




AAACAAAAACAGACAACAACAACAGCAATTTTACCTGGACTGGTGCTTCAAAATATAACC




TTAATGGGCGTGAATCTATAATCAACCCTGGCACTGCTATGGCCTCACACAAAGACGAC




GAAGACAAGTTCTTTCCCATGAGCGGTGTCATGATTTTTGGAAAAGAGAGCGCCGGAGC




TTCAAACACTGCATTGGACAATGTCATGATTACAGACGAAGAGGAAATTAAAGCCACTAA




CCCTGTGGCCACCGAAAGATTTGGGACCGTGGCAGTCAATTTCCAGAGCAGCAGCACAG




ACCCTGCGACCGGAGATGTGCATGTTATGGGAGCCTTACCTGGAATGGTGTGGCAAGA




CAGAGACGTATACCTGCAGGGTCCTATTTGGGCCAAAATTCCTCACACGGATGGACACT




TTCACCCGTCTCCTCTCATGGGCGGCTTTGGACTTAAGCACCCGCCTCCTCAGATCCTCA




TCAAAAACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGCGAAATTGGCT




TCCTTCATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGC




AGAAGGAAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAA




TCTGCCAACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATT




GGCACCCGTTACCTTACCCGTCCCCTGTAA





24
AAVC11.04 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTTTCTGAAGGCATTCGT




GAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAGG




ACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTC




GACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCC




TACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTA




AGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCTC




CTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAGA




CTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCACCAGCAGGCCC




CTCTGGTCTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAAT




AACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAATG




GCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAACA




ACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTTCG




GCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACCAC




GTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCTTC




AAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATCGC




TAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTACGT




CCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGATTC




CGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCTTT




TACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAGCT




ACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACCGA




CTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAGGA




GGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCGGCT




CAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGACAC




TGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGAACG




GCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGAGGA




CCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAAAAC




TACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGTAGC




CACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCCAGA




CACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGACGT




GTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACCCGT




CTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGAACA




CTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCATCA




CACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGAAAA




CAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTAGTG




TGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACCAGAT




ACCTGACTCGTAATCTGTAA





25
AAVC11.05 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCAGACTCC




TCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCTAAGCGACTCAACT




TCAAGCTCTTCAACATTCAGGTCAAAGAGGTTACGGACAACAATGGAGTCAAGACCATC




GCCAATAACCTTACCAGCACGGTCCAGGTCTTCACGGACTCAGACTATCAGCTCCCGTA




CGTGCTCGGGTCGGCTCACGAGGGCTGCCTCCCGCCGTTCCCAGCGGACGTCTTCATG




ATTCCTCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATC




CTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTACCTT




CAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTTTGG




ACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCA




CAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATG




TCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCA




CGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCA




CCCGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAA




GAACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTT




CATCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAG




GAAAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGT




TAGTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCAC




CAGATACCTGACTCGTAATCTGTAA





26
AAVC11.06 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCTCCACAACCCAAGGCCAACCAACAGCATCAG




GACAACGGCAGGGGTCTTGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAGGC




CTACGACAAGCAGCTCGAGCAGGGGGACAACCCGTACCTCAAGTACAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTTCAAGAAGATACGTCTTTTGGGGGCAACCTTGGCAGAGC




AGTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTTGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCGTCACCTCAGCGTTCCCCCGACTC




CTCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGT




CAGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGC




GCCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGAC




AATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCAC




ATGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACA




ACAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACT




TTGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCATTTCTCAC




CACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAAACTCAGC




TTCAAGCTCTTCAACATCCAAGTTAAAGAGGTCACGCAGAACGATGGCACGACGACTATT




GCCAATAACCTTACCAGCACGGTTCAAGTGTTTACGGACTCGGAATACCAGCTGCCGTA




CGTCCTCGGCTCCGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGATGTCTTCATGA




TTCCCCAGTACGGCTACCTGACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTCCTCC




TTCTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCA




GCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGAC




CGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACA




GGAGGAACTCAAGGTACCCAGCAATGTTATTTTCTCAAGCTGGGCCTGCAAACATGTC




GGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACG




ACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTG




AACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAGGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





27
AAVC11.07 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAAGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGTCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCT




CCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAG




ACTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCACCAGCAGGCC




CCTCTGGTCTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGATGTCTTCATGAT




TCCCCAGTACGGCTACCTGACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTCCTCCT




TCTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAG




CTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACC




GACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAG




GAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCG




GCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGA




CACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAGGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCTGTTCCTGCGAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





28
AAVC11.08 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCAGACTCC




TCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCTAAGCGACTCAACT




TCAAGCTCTTCAACATTCAGGTCAAAGAGGTTACGGACAACAATGGAGTCAAGACCATC




GCCAATAACCTTACCAGCACGGTCCAGGTCTTCACGGACTCAGACTATCAGCTCCCGTA




CGTGCTCGGGTCGGCTCACGAGGGCTGCCTCCCGCCGTTCCCAGCGGATGTCTTCATG




ATTCCTCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATC




CTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTACCTT




CAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTTTGG




ACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCA




CAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATG




TCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCA




CGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGA




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCA




CCCGTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTCATCAA




AAACACGCCTGTTCCTGCGAATCCTCCGGCGGAGTTTTCAGCTACAAAGTTTGCTTCATT




CATCACCCAATACTCCACAGGACAAGTGAGTGTGGAAATTGAATGGGAGCTGCAGAAAG




AAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCTGCC




AACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACC




CGTTACCTCACCCGTCCCCTGTAA





29
AAVC11.09 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCAGACTCC




TCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCTAAGCGACTCAACT




TCAAGCTCTTCAACATTCAGGTCAAAGAGGTTACGGACAACAATGGAGTCAAGACCATC




GCCAATAACCTTACCAGCACGGTCCAGGTCTTCACGGACTCAGACTATCAGCTCCCGTA




CGTGCTCGGGTCGGCTCACGAGGGCTGCCTCCCGCCGTTCCCAGCGGACGTCTTCATG




ATTCCTCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATC




CTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTACCTT




CAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTTTGG




ACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCA




CAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATG




TCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCA




CGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGA




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCA




CCCGTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTGATCA




AGAACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGCGAAATTGGCTTCC




TTCATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGA




AGGAAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCT




GCCAACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGC




ACCCGTTACCTTACCCGTCCCCTGTAA





30
AAVC11.10 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCT




CCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAG




ACTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCACCAGCAGGCC




CCTCTGGTCTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGAT




TCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCT




TTTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAG




CTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACC




GACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAG




GAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCG




GCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGA




CACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGAGAC




GTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCACCC




GTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTGATCAAGA




ACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGCGAAATTGGCTTCCTTC




ATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGG




AAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCTGCC




AACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





31
AAVC11.11 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCTCCACAACCCAAGGCCAACCAACAGCATCAG




GACAACGGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAGGC




CTACGACAAGCAGCTCGAGCAGGGGGACAACCCGTACCTCAAGTACAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCGTCACCTCAGCGTTCCCCCGACT




CCTCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGG




TCAGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAG




CGCCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGA




CAATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCA




CATGGCTGGGCGACAGAGTCATTACCACCAGCACCCGAACCTGGGCCCTGCCCACCTAC




AACAACCACCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTAC




TTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCA




CCACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAGACTCAA




CTTCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCA




TCGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGTTGCCGT




ACGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATG




ATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTC




CTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTGAGTT




CAGCTACAGCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGG




ACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCA




CAGGAGGAACTCAAGGTACCCAGCAATGTTATTTTCTCAAGCTGGGCCTGCAAACATG




TCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCA




CGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATTTGGGCCAAAATTCCTCACACAGATGGACACTTTCA




CCCGTCTCCTCTTATGGGCGGCTTTGGACTCAAGAACCCGCCTCCTCAGATCCTCATCAA




AAACACGCCTGTTCCTGCGAATCCTCCGGCGGAGTTTTCAGCTACAAAGTTTGCTTCATT




CATCACCCAGTATTCCACAGGACAAGTGAGCGTGGAGATTGAATGGGAGCTGCAGAAA




GAAAACAGCAAACGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCTGC




CAACGTTGATTTCACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCAC




CCGTTACCTTACCCGTCCCCTGTAA





32
AAVC11.12 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCTCCACAACCCAAGGCCAACCAACAGCATCAG




GACAACGGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAGGC




CTACGACAAGCAGCTCGAGCAGGGGGACAACCCGTACCTCAAGTACAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTTCAAGAAGATACGTCTTTTGGGGGCAACCTTGGCAGAGC




AGTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTTGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCCGACTCC




TCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGTC




AGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACCTCTTCATGAT




TCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCT




TTTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAG




CTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACC




GACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAG




GAGGAACTCAAGGTACCCAGCAATGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCG




GCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGA




CACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





33
AAVC11.13 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTTGGCAGAGCA




GTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTTGGTCTGGTTGAGGAAGCTGCTAA




GACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCCGACTCCT




CCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGTCA




GACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATTACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAA




CAACCACCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAGACTCAACT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGAT




TCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCT




TTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTGAGTTCA




GCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGAC




CGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACA




GGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTC




GGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTTCCGGCAACAAAGAGTCTCCAAAA




CGCTGGATCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





34
AAVC11.14 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAAGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGTCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCT




CCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAG




ACTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCACCAGCAGGCC




CCTCTGGTCTGGGATCTGGTACAGTGGCTTCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAA




CAATCACCTCTACAAGCAAATCTCCAACAGCACATCTGGAGGATCTTCAAATGACAACGC




CTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTCAACAGATTCCACTGCCATTT




CTCACCACGTGACTGGCAGCGACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGAC




TCAACTTCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACG




ACCATCGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGTTG




CCGTACGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTT




CATGATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCT




CCTCCTTTTACTGCCTGGAATATTTCCCATCTCAAATGCTGCGAACTGGAAACAATTTTGA




ATTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCT




TGGACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGT




CCACAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAAC




ATGTCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCT




CCACGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATC




ACCTGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGAC




GACGAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAAC




TAACAAAACTACATTGGAAAATGTGTTAATGACAAATGAGGAAGAAATTCGTCCTACTAA




TCCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTG




CAGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAAC




CGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTT




TCACCCGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGAT




CAAGAACACTCCTGTTCCTGCGAATCCTCCGGCAGAGTTTTCGGCTACAAAGTTTGCTTC




ATTCATCACCCAATACTCCACAGGACAAGTGAGTGTGGAAATTGAATGGGAGCTGCAGA




AAGAAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCT




GCCAACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGC




ACCCGTTACCTTACCCGTCCCCTGTAA





35
AAVC11.15 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCACCAGCAGG




CCCCTCTGGTCTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGAC




AATAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCA




ATGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACA




ACAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACT




TCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCAC




CACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGC




TTCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCAT




CGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTA




CGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCC




TTTTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCA




GCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGAC




CGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACA




GGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTC




GGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACG




ACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTG




AACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATTTGGGCCAAAATTCCTCACACAGATGGACACTTTCACCC




GTCTCCTCTTATGGGCGGCTTTGGACTCAAGAACCCGCCTCCTCAGATCCTCATCAAAAA




CACGCCTGTTCCTGCGAATCCTCCGGCGGAGTTTTCAGCTACAAAGTTTGCTTCATTCAT




CACCCAGTATTCCACAGGACAAGTGAGCGTGGAGATTGAATGGGAGCTGCAGAAAGAA




AACAGCAAACGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCTGCCAAC




GTTGATTTCACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACCCGT




TACCTTACCCGTCCCCTGTAA





36
AAVC11.16 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCTCCACAACCCAAGGCCAACCAACAGCATCAG




GACAACGGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAGGC




CTACGACAAGCAGCTCGAGCAGGGGGACAACCCGTACCTCAAGTACAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTTCAAGAAGATACGTCTTTTGGGGGCAACCTTGGCAGAGC




AGTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTTGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCCGACTCC




TCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGTC




AGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATTACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAA




CAACCACCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




TGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCACC




ACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAGACTCAACT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGAT




TCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCT




TTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTGAGTTCA




GCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGAC




CGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACA




GGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTC




GGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACG




ACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTG




AACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGAGAC




GTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCACCC




GTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTGATCAAGA




ACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGCGAAATTGGCTTCCTTC




ATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGG




AAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCTGCC




AACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACC




CGTTACCTTACCCGTCCCCTGTAA





37
AAVC11.17 (nucleic acid)
ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAAGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGTCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTCC




TCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTCA




GACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTGGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATTACCACCAGCACCCGAACCTGGGCCCTGCCCACTTACAAC




AACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTTT




GGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCACCA




CGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAAGCTGCGGTT




CAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGACGAATGACGGCGTTACGACCATCG




CTAATAACCTTACCAGCACGATTCAGGTATTCTCGGACTCGGAATACCAGCTGCCGTACG




TCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGATT




CCGCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCATCCTT




TTACTGCCTGGAGTACTTCCCCTCTCAGATGCTGAGAACGGGCAACAACTTTGAGTTCAG




CTACAGCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACC




GACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAG




GAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCG




GCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGA




CACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





38
AAVC11.18 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTCC




TCCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTCGGTCA




GACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGC




CCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAA




TAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACAT




GGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATC




GCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTAC




GTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGAT




TCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCT




TTTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAG




CTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACC




GACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAG




GAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCG




GCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGA




CACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGA




ACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





39
AAVC11.19 (nucleic acid)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAGAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAAGC




CTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACG




CCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCATTTGGGGGCAACCTCGGGCGAGC




AGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTA




AGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCGTCACCTCAGCGTTCCCCCGACTC




CTCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGT




CAGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGC




GCCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGAC




AATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCAC




ATGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACA




ACAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACT




TTGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCATTTCTCAC




CACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAAACTCAGC




TTCAAGCTCTTCAACATCCAAGTTAAAGAGGTCACGCAGAACGATGGCACGACGACTATT




GCCAATAACCTTACCAGCACGGTTCAAGTGTTTACGGACTCGGAATACCAGCTGCCGTA




CGTCCTCGGCTCCGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGATGTCTTCATGA




TTCCCCAGTACGGCTACCTGACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTCCTCC




TTCTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCA




GCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGAC




CGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACA




GGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTC




GGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACG




ACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTG




AACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGA




GGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAA




AACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGT




AGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCC




AGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGA




CGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACC




CGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGA




ACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAAGTTTGCTTCGTTCA




TCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGA




AAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACTTCAAACTACAACAAGTCTGTTA




GTGTGGACTTTACTGTAGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACC




AGATACCTGACTCGTAATCTGTAA





64
AAV8
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYFDFNRFHCHFSPRD




WQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSA




HQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVP




FHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGFSQGGPNTMANQAKNWLP




GPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLANPGIAMATHKDDEERFFPSNGI




LIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGA




LPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTF




NQSKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSE




PRPIGTRYLTRNL





65
AAV8 Swap 1
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGFSQGGPNTMANQAKNWLPGP




CYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLANPGIAMATHKDDEERFFPSNGILIF




GKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQ




SKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPR




PIGTRYLTRNL





66
AAV8 Swap 2
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYFDFNRFHCHFSPRD




WQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSA




HQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVP




FHSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLP




GPCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEERFFPSNGI




LIFGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGA




LPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTF




NQSKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSE




PRPIGTRYLTRNL





67
AAV8 Swap 3
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYFDFNRFHCHFSPRD




WQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSA




HQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVP




FHSSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGFSQGGPNTMANQAKNWLP




GPCYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLANPGIAMATHKDDEDRFFPSSG




VLIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGAL




PGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFN




QSKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEP




RPIGTRYLTRNL





68
AAV8 Swap 4
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEERFFPSNGILI




FGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQ




SKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPR




PIGTRYLTRNL





69
AAV8 Swap 5
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGFSQGGPNTMANQAKNWLPGP




CYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLANPGIAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





70
AAV8 Swap 6
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISNGTSGGATNDNTYFGYSTPWGYFDFNRFHCHFSPRD




WQRLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSA




HQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVP




FHSSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLP




GPCYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSG




VLIFGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGAL




PGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFN




QSKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEP




RPIGTRYLTRNL





71
AAV8 Swap 7
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





72
AAV8 Swap 8
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQTTGGTANTQTLGFSQGGPNTMANQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





73
AAV8 Swap 9
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTTGQNNNSNFAWTAGTKYHLNGRNSLANPGIAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





74
AAV8 Swap 10
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEERFFPSNGILI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





75
AAV8 Swap 11
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQ




SKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPR




PIGTRYLTRNL





76
AAV8 Swap 12
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





77
AAV8 Swap 13
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEDRFFPSSGVLI




FGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQ




SKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPR




PIGTRYLTRNL





78
AAV8 Swap 14
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEERFFPSNGILI




FGKTGATNKTTLENVLMTNEEEIRPTNPVATEEYGIVADNLQQQNTAPQIGTVNSQGALPG




MVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQS




KLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPRPI




GTRYLTRNL





79
AAV8 Swap 15
MAADGYLPDWLEDNLSEGIREWWALKPGAPKPKANQQKQDDGRGLVLPGYKYLGPFNGL




DKGEPVNAADAAALEHDKAYDQQLQAGDNPYLRYNHADAEFQERLQEDTSFGGNLGRAVF




QAKKRVLEPLGLVEEGAKTAPGKKRPVEPSPQRSPDSSTGIGKKGQQPARKRLNFGQTGDS




ESVPDPQPLGEPPAAPSGVGPNTMAAGGGAPMADNNEGADGVGSSSGNWHCDSTWLGD




RVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQ




RLINNNWGFRPKRLSFKLFNIQVKEVTQNEGTKTIANNLTSTIQVFTDSEYQLPYVLGSAHQ




GCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFQFTYTFEDVPFH




SSYAHSQSLDRLMNPLIDQYLYYLSRTQSTGGTQGTQQLLFSQAGPANMSAQAKNWLPGP




CYRQQRVSTTLSQNNNSNFAWTGATKYHLNGRNSLVNPGVAMATHKDDEERFFPSNGILI




FGKQNAARDNADYSDVMLTSEEEIKTTNPVATEEYGIVSSNLQAANTAAQTQVVNNQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIKNTPVPADPPTTFNQ




SKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYYKSTSVDFAVNTEGVYSEPR




PIGTRYLTRNL





85
AAV8 Swap 1 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGACTCAAACAAC




AGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGGCCTAATACAATG




GCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCCAACAACGCGTCTCAAC




GACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTGCTGGGACCAAATACCATC




TGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCTATGGCAACACACAAGGACGAC




GAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAACAAAATGCTGCCAGA




GACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAACCACTAA




CCCTGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAAACACGG




CTCCTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAAC




CGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTT




CCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGA




TCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAAC




TCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCA




GAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAAT




CTACAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTG




GCACCCGTTACCTCACCCGTAATCTGTAA





86
AAV8 Swap 2 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG




CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACA




ACAACCACCTCTACAAGCAAATCTCCAACGGGACATCGGGAGGAGCCACCAACGACAAC




ACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCAC




TTTTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAG




ACTCAGCTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCA




AGACCATCGCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAG




CTGCCGTACGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGT




GTTCATGATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGAC




GCTCCTCCTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACT




TCCAGTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGA




GCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTC




AGTCCACAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCA




AACATGTCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAG




TCTCCACGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAAT




ATCACCTGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAG




GACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAACAAAATGCT




GCCAGAGACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAAC




CACTAACCCTGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAA




ACACGGCTCCTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGG




CAGAACCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACG




GCAACTTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAG




ATCCTGATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAA




GCTGAACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGG




AGCTGCAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTAC




TACAAATCTACAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGC




CCCATTGGCACCCGTTACCTCACCCGTAATCTGTAA





87
AAV8 Swap 3 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACA




ACAACCACCTCTACAAGCAAATCTCCAACGGGACATCGGGAGGAGCCACCAACGACAAC




ACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCAC




TTTTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAG




ACTCAGCTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCA




AGACCATCGCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAG




CTGCCGTACGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGT




GTTCATGATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGAC




GCTCCTCCTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACT




TCCAGTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGA




GCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGACTC




AAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGGCCTAA




TACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCCAACAACGCG




TCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTGCTGGGACCAAA




TACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCTATGGCAACACACAAG




GACGACGAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGC




AACTAACAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTAC




TAATCCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATA




CTGCAGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAG




AACCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCA




ACTTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATC




CTGATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCT




GAACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGC




TGCAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTAC




AAATCTACAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCC




ATTGGCACCCGTTACCTCACCCGTAATCTGTAA





88
AAV8 Swap 4 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAACAAAATGCTGCCAGA




GACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAACCACTAA




CCCTGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAAACACGG




CTCCTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAAC




CGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTT




CCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGA




TCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAAC




TCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCA




GAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAAT




CTACAAGTGTGGACTTTGCnCTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTG




GCACCCGTTACCTCACCCGTAATCTGTAA





89
AAV8 Swap 5 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGACTCAAACAAC




AGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGGCCTAATACAATG




GCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTTGTTACCGCCAACAACGCGTCTCAAC




GACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTGCTGGGACCAAATACCATC




TGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCTATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





90
AAV8 Swap 6 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACA




ACAACCACCTCTACAAGCAAATCTCCAACGGGACATCGGGAGGAGCCACCAACGACAAC




ACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCAC




TTTTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAG




ACTCAGCTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCA




AGACCATCGCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAG




CTGCCGTACGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGT




GTTCATGATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGAC




GCTCCTCCTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACT




TCCAGTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGA




GCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTC




AGTCCACAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCA




AACATGTCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAG




TCTCCACGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAAT




ATCACCTGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAG




GACGACGAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGC




AACTAACAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTAC




TAATCCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATA




CTGCAGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAG




AACCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCA




ACTTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATC




CTGATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCT




GAACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGC




TGCAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTAC




AAATCTACAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCC




ATTGGCACCCGTTACCTCACCCGTAATCTGTAA





91
AAV8 Swap 7 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





92
AAV8 Swap 8 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGACCAC




AGGAGGAACTGCAAATACCCAGACATTGGGATTTTCTCAAGGTGGGCCTAACACCATGG




CGAATCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





93
AAV8 Swap 9 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACAACGGGGCAAAACAACAACAGCAACTTTGCTTGGACTGCTGGCACCAAATATCACC




TGAACGGCAGAAACTCGTTGGCTAATCCCGGCATCGCCATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAG




CCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





94
AAV8 Swap 10 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGAGCGCTTTTTCCCATCCAACGGAATCCTGATTTTTGGAAAAACTGGAGCAACTAAC




AAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCT




GTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGC




CCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





95
AAV8 Swap 11 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAACAGAATGCAGCAAG




GGACAACGCTGACTACTCAGATGTGATGTTGACAAGTGAAGAAGAAATTAAGACTACTA




ATCCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACT




GCAGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAA




CCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAAC




TTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCT




GATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGA




ACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTG




CAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAA




ATCTACAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCAT




TGGCACCCGTTACCTCACCCGTAATCTGTAA





96
AAV8 Swap 12 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAA




CAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCC




TGTAGCCACGGAAGAATACGGGATAGTCGCCGACAACTTACAACAGCAGAATACTGCAC




CCCAGATAGGAACTGTCAACAGCCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCG




GGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCC




ACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATC




AAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTC




TTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGA




AGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCT




ACAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGG




CACCCGTTACCTCACCCGTAATCTGTAA





97
AAV8 Swap 13 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGACCGTTTTTTTCCCAGTAGCGGGGTCCTGATTTTTGGCAAACAAAATGCTGCCAG




AGACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAACCACTA




ACCCTGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAAACACG




GCTCCTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAA




CCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAAC




TTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCT




GATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGA




ACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTG




CAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAA




ATCTACAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCAT




TGGCACCCGTTACCTCACCCGTAATCTGTAA





98
AAV8 Swap 14 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAAACTGGTGCCACAAAC




AAAACGACTTTGGAGAATGTCTTGATGACCAACGAGGAAGAAATCAGACCCACTAACCC




TGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAAACACGGCTC




CTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGG




GACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCA




CCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCA




AGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAACTCTT




TCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAA




GGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTA




CAAGTGTGGACTTTGCTTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGC




ACCCGTTACCTCACCCGTAATCTGTAA





99
AAV8 Swap 15 (nt)
ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCG
CGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAG




GACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT




CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGC




CTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC




GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAG




CAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCT




AAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTC




CTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTC




AGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCG




CCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACA




ATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACA




TGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAA




CAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTT




CGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACC




ACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCT




TCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATC




GCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTA




CGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGA




TTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCC




TTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTT




ACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGA




CCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCAC




AGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGT




CGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCAC




GACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCT




GAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACACACAAGGACGAC




GAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAACAAAATGCTGCCAGA




GACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAACCACTAA




CCCTGTGGCTACAGAGGAATACGGTATCGTGTCATCTAACTTGCAGGCGGCAAACACGG




CTGCTCAAACTCAAGTTGTCAACAACCAGGGGGCCTTACCCGGTATGGTCTGGCAGAAC




CGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTT




CCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGA




TCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTCAAAGCTGAAC




TCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCA




GAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAAT




CTACAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTG




GCACCCGTTACCTCACCCGTAATCTGTAA








Claims
  • 1. A capsid polypeptide, comprising:
    (i) the sequence of amino acids set forth in any one of SEQ ID NOs:2-20 and 65-79, or a sequence having at least or about 95% sequence identity thereto;
    (ii) the sequence of amino acids at positions 138-735 of any one of SEQ ID NOs:2, 6, 7, 9, 10, 12-14, 16-20, 69, 71-74, 76 and 78, positions 138-734 of any one of SEQ ID NOs:5, 8 and 11, positions 138-736 of any one of SEQ ID NOs:3, 15, 65, 68, 75, 77 and 79, positions 138-737 of any one of SEQ ID NOs:4, 67 and 70, or positions 138-738 of SEQ ID NO:66; or a sequence having at least or about 95% sequence identity thereto; and/or
    (iii) the sequence of amino acids at positions 203-734 of any one of SEQ ID NOs:5, 8 and 11, positions 203-736 of SEQ ID NO:15, positions 204-735 of any one of SEQ ID NOs:2, 6, 7, 9, 10, 12-14, 16-20, 69, 71-74, 76 and 78, positions 204-736 of any one of SEQ ID NOs:3, 65, 68, 75, 77 and 79, positions 204-737 of any one of SEQ ID NOs:4, 67 and 70, or positions 204-738 of SEQ ID NO:66; or a sequence having at least or about 95% sequence identity thereto.
  • 2-52. (canceled)
Priority Claims (1)
Number | Date | Country | Kind
2020900529 | Feb 2020 | AU | national
PCT Information
Filing Document | Filing Date | Country | Kind
PCT/AU2021/050158 | 2/25/2021 | WO |