OPTIMIZED VACCINE COMPOSITIONS AND METHODS FOR MAKING THE SAME

Information

  • Patent Application
  • 20220088175
  • Publication Number
    20220088175
  • Date Filed
    August 10, 2021
    3 years ago
  • Date Published
    March 24, 2022
    2 years ago
Abstract
Described herein are compositions of and methods of making vaccines which can provide broad serological reactivity an inverse dose response, and a swarm effect.
Description
SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Aug. 9, 2021, is named 60333-703.301_SL.txt and is 1,484,708 bytes in size.


BACKGROUND

Pathogenic agents such as, infectious bacteria, parasites, fungi, viruses, and cancers, have evolved various strategies to evade detection and neutralization by host immune response. Such strategies can often undermine and complicate the development of successful vaccines towards these pathogens. Rapid mutations can complicate vaccine design. Vaccination can enable a host to generate protective antibodies prior to infection. This can be accomplished by exposing the immune system of a host to an antigen or antigens of a pathogen or pathogens. When this is done for a relatively slowly mutating vaccine, a multi-year window for a single vaccine to match and provide protection can be afforded, and a vaccine can be successful.


Vaccination against rapidly mutating pathogens can provide limited protection, for example, because the immune system response against the vaccine can become obsolete. In some cases, the vaccine can become obsolete as antigens change over time. This can result in the requirement for a new vaccine on a periodic or semi-periodic basis, such as every year. In some cases, protection provided by such vaccines can vary, and some vaccines can for some periods or years provide poor protection. This can be due to incorrect predictions of viral strains likely to be in circulation the following season. In some cases, such as for the common cold, such rapid mutation can prevent the effectiveness of a vaccine.


SUMMARY

Provided in this disclosure are vaccines comprising a set of antigens that are representative of at least 60% of fifth order clades of a microbe.


Also provided in this disclosure are vaccines comprising a set of antigens of a microbe wherein the smallest pairwise edit distance between two of the antigens is at least 10% of the average size of the antigens and the largest pairwise edit distance between two of the antigens is no more than 98% of the average size of the antigens.


Also provided in this disclosure are vaccines comprising a set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


In some embodiments, an antigen is representative of a clade if its sequence is up to 40% of the average size of antigens of the represented clade different than other members of the represented clade. In some embodiments, an antigen is representative of a clade if the edit distance is up to 5% of the size of another strain of the represented clade. In some embodiments, an antigen is representative of a clade if its sequence is up to 20% of the average size of antigens of the represented clade different than at least 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade. In some embodiments, an antigen is representative of a clade if its sequence is up to 10% of the average size of antigens of the represented clade different than at least 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade. In some embodiments, an antigen is representative of a clade if the edit distance is up to 5% of the size of 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade. In some embodiments, an antigen is representative of a clade if its sequence is up to 25 edit distance from all strains of the represented clade. In some embodiments, an antigen is representative of a clade if its sequence is up to 100 edit distance from all strains of the represented clade. In some embodiments, an antigen is representative of a clade if its sequence is present in the clade.


Also provided in this disclosure are vaccine compositions comprising a set of antigens, wherein the antigens are derived from a library of variants of a microbe wherein: a) two antigens of the set with the greatest edit distance have an edit distance S; b) two antigens of the library with the greatest edit distance have an edit distance L; and c) S is at least 60% of L.


Also provided in this disclosure are vaccine compositions comprising at least 4 influenza virus hemagglutinin antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus hemagglutinin antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus hemagglutinin antigen polypeptides.


Also provided in this disclosure are vaccine compositions comprising at least 4 influenza virus neuraminidase antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus neuraminidase antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus neuraminidase antigen polypeptides.


Also provided in this disclosure are vaccine compositions comprising at least 4 HIV gp160 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp160 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp160 antigen polypeptides.


Also provided in this disclosure are vaccine compositions comprising at least 4 HIV gp120 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp120 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp120 antigen polypeptides.


Also provided in this disclosure are vaccine compositions comprising at least 4 HIV gp41 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp41 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp41 antigen polypeptides.


Also provided in this disclosure are vaccine compositions comprising a set of antigens that activate an immune response in a subject to at least 6 strains identified in Table 1.


In some embodiments, the immune response is detectable using head-specific antibodies in a hemagglutinin inhibition assay. In some embodiments, the immune response is at least 2-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay. In some embodiments, the immune response is at least 10-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay. In some embodiments, the immune response is at least 100-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay.


In some embodiments, the microbe is a bacterium. In some embodiments, the microbe is a virus. In some embodiments, the virus is influenza. In some embodiments, the influenza is type A. In some embodiments, the influenza is type B. In some embodiments, the type A influenza is H1N1, H1N2, H3N1, H3N2, or H2N3. In some embodiments, the type A influenza is H1N1. In some embodiments, the type A influenza is H3N2. In some embodiments, the virus is human immunodeficiency virus (HIV). In some embodiments, the HIV is HIV-1. In some embodiments, the HIV-1 is of subclade A, subclade B, or subclade C.


In some embodiments, the antigen is a broadly neutralizing antigen of surface exposed residues adjacent in tertiary space. In some embodiments, the broadly neutralizing antigen is in a stem of hemagglutinin. In some embodiments, the broadly neutralizing antigen is in a head of hemagglutinin. In some embodiments, the broadly neutralizing antigen is in a neuraminidase. In some embodiments, the broadly neutralizing antigen is in a gp160. In some embodiments, the broadly neutralizing antigen is in a gp120. In some embodiments, the broadly neutralizing antigen is in a gp41.


In some embodiments, the antigen is a broadly conserved fragment of hemagglutinin. In some embodiments, the antigen is a broadly conserved fragment of a head of hemagglutinin. In some embodiments, the antigen is a broadly conserved fragment of a stem of hemagglutinin. In some embodiments, the antigen is a broadly conserved fragment of neuraminidase. In some embodiments, the antigen is a broadly conserved fragment of gp160.


In some embodiments, the vaccine comprises at least one antigen selected from SEQ ID NOs:1-87. In some embodiments, the vaccine comprises at least one antigen that is at least 60% identical to any one of SEQ ID NOs:1-87. In some embodiments, the vaccine comprises at least one antigen that is at least 70% identical to any one of SEQ ID NOs:1-87. In some embodiments, the vaccine comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-87. In some embodiments, the vaccine comprises at least one antigen that is at least 90% identical to any one of SEQ ID NOs:1-87. In some embodiments, the vaccine comprises at least one antigen selected from SEQ ID NOs:88-127. In some embodiments, the vaccine comprises at least one antigen that is at least 60% identical to any one of SEQ ID NOs:88-127. In some embodiments, the vaccine comprises at least one antigen that is at least 70% identical to any one of SEQ ID NOs:88-127. In some embodiments, the vaccine comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs:88-127. In some embodiments, the vaccine comprises at least one antigen that is at least 90% identical to any one of SEQ ID NOs:88-127. In some embodiments, the vaccine comprises at least one antigen selected from SEQ ID NOs:128-171. In some embodiments, the vaccine comprises at least one antigen that is at least 60% identical to any one of SEQ ID NOs:128-171. In some embodiments, the vaccine comprises at least one antigen that is at least 70% identical to any one of SEQ ID NOs:128-171. In some embodiments, the vaccine comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs:128-171. In some embodiments, the vaccine comprises at least one antigen that is at least 90% identical to any one of SEQ ID NOs:128-171. In some embodiments, the vaccine comprises at least one antigen selected from SEQ ID NOs:172-267. In some embodiments, the vaccine comprises at least one antigen that is at least 60% identical to any one of SEQ ID NOs:172-267. In some embodiments, the vaccine comprises at least one antigen that is at least 70% identical to any one of SEQ ID NOs:172-267. In some embodiments, the vaccine comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs:172-267. In some embodiments, the vaccine comprises at least one antigen that is at least 90% identical to any one of SEQ ID NOs:172-267.


In some embodiments, the set of antigens comprise at least 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900 or 1000 different antigens. In some embodiments, the set of antigens comprises at least 30 antigens. In some embodiments, the set of antigens comprises at least 50 antigens. In some embodiments, the antigens have an average edit distance from each of the other antigens that is at least 5% of the average size of antigens in the clade.


In some embodiments, a smallest pairwise edit distance between two or more of the antigens in the set is no more than 1, and the largest pairwise edit distance is at least 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100. In some embodiments, a smallest pairwise edit distance between two or more antigens in the set is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens.


In some embodiments, the clades are clades of a phylogenetic tree is a neighbor joining clustering tree or maximum parsimony tree. In some embodiments, a first order clade is a clade that is not subsumed in whole or in part within another higher-level clade. In some embodiments, each X-order clade is phylogenetically below X−1 number branch nodes in a phylogenetic tree of the microbe. In some embodiments, the vaccine comprises antigens that are representative of at least 60% of each of third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, or fifteenth order clades of the microbe. In some embodiments, a represented clade is a clade Y that is subsumed by a clade Y−1. In some embodiments, a representative clade is a node-based clade. In some embodiments, a representative clade is a stem-based clade. In some embodiments, a representative clade is apomorphy-based clade. In some embodiments, an average number of branches between each of the antigens in the set is at least 1, 3, 5, or 10. In some embodiments, a smallest number of branches between any two antigens in the set is no more than 1 and a largest number of branches between any two antigens in the set is at least 5, 10, or 15.


In some embodiments, each OTU comprises sequences that are at least 95% homologous of one another. In some embodiments, each OTU comprises of at least 3 different sequences.


In some embodiments, each of the antigens in the set shares at least 90%, 95, or 99% sequence identity to at least one other antigen in the set. In some embodiments, each of the antigens in the set shares at least 90%, 95, or 99% % sequence identity to at least one other antigen in the set over a length of at least 100 amino acids. In some embodiments, each of the antigens in the set differs from each of the other antigens in the set by at least 5% sequence identity. In some embodiments, each of the antigens in the set differs from each of the other antigens in the set by no more than 75% sequence identity.


In some embodiments, each antigen is a peptide comprising at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 85, 100, 110, or 120 amino acids. In some embodiments, the antigens are selected from Table 2, Table 3, Table 4, Table 5, or a fragment or homologue thereof. In some embodiments, the vaccine comprises 2 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof. In some embodiments, the vaccine comprises 3 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof. In some embodiments, the vaccine comprises 5 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof. In some embodiments, the vaccine comprises 10 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof. In some embodiments, the fragment comprises at most 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids. In some embodiments the fragment comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids. In some embodiments the homologue comprises a sequence having at least 90% sequence identity to the antigen of Table 1.


In some embodiments the library comprises at least 1×104, 1×105, 1×106 different variants of the microbe. In some embodiments the library comprises at least 90% of all known sequences of the microbe.


Also provided in this disclosure are pharmaceutical compositions comprising the vaccine composition of any of the preceding claims and a pharmaceutically acceptable diluent, adjuvant, excipient, or any combinations thereof.


In some embodiments the vaccine is in a form of an aerosol formulation. In some embodiments the vaccine is in a form of an injectable formulation.


In some embodiments each of the antigens is at a concentration that alone does not provide a significant prophylactic immune response to the broadly neutralizing antigen in a subject; and collectively, the antigens have a combined concentration that provides an immune response to the broadly neutralizing antigen in the subject. In some embodiments the subject is a bird. In some embodiments the subject is a mammal. In some embodiments the subject is a human. In some embodiments the subject is a pig.


Also provided in this disclosure are virus like particles (VLPs) comprising the vaccine composition of any of the preceding claims.


Also provided in this disclosure are recombinant expression vectors comprising a nucleic acid molecule encoding: a) a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; b) set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 25 and the largest pairwise edit distance between two of the antigens is at least 300; c) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or d) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Also provided in this disclosure are recombinant expression vector comprising a nucleic acid molecule encoding: a) a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; b) a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigen; c) a set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or d) a set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Also provided in this disclosure are methods for treating or reducing the likelihood of an infection in a subject, comprising administrating to the subject a vaccine composition of any of the preceding claims.


Also provided in this disclosure are methods for treating or reducing the likelihood of a flu infection in a subject, comprising administrating to the subject a vaccine composition that results in an immune activation effective against seasonal flu for at least 3, 4, 5, 6, 7, 8, 9, or 10 years.


In some methods, the subject is a human. In some methods, the subject is a domesticated animal.


Also provided in this disclosure are methods for making a vaccine composition, comprising: selecting a set of antigens that are (a) representative of at least 60% of each of first order clade and second order clades of a microbe; (b) set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens; (c) representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or (d) representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


In some embodiments, the method further comprises obtaining a plurality of antigen sequences from a library of strains of the microbe; and aligning the plurality of antigen sequences to create a phylogenetic tree of the antigen.


INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.





BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings of which:



FIG. 1 illustrates examples of what is and is not a clade on a phylogenic tree.



FIG. 2 illustrates examples of three types of clades: a node-based clade, a branch-based clade, and an apomorphy-based clade.



FIG. 3 illustrates clade order in a phylogenic tree, with a third order clade being subsumed by a second order clade, which is subsumed by a first order clade.



FIG. 4 illustrates a model of the influenza antigen-antibody interaction space.



FIG. 5 depicts the probabilities of conserved epitopes by categories (i.e., universal, 90% of strains, all H1 strains, all H3 strains, 90% of all H1 strains, and 90% of all H3 strains).



FIG. 6 illustrates a concentration conservation coupling vaccination strategy that can shift epitope conservation distribution



FIG. 7 illustrates ELISA signals indicating a swarm effect observed on inoculating pigs with a vaccine as described herein.



FIG. 8 illustrates ELISA signal per HA class after inoculating a cohort of pigs three times with a vaccine as described herein.



FIGS. 9A-9B illustrate ELISA signal % max and fold change after inoculating a cohort of pigs three times with a vaccine as described herein. FIG. 9A illustrates % max OD 450 nm for Vehicle-control, BIV, C3-500, C3-100, and C3-50 after 3rd immunization. FIG. 9B illustrates Log 2 fold change (% max OD 450 nm) for Vehicle-control, BIV, C3-500, C3-100, and C3-50 after 3rd immunization.



FIG. 10 illustrates a heatmap showing serum titers after a second boost of a vaccine composition on a panel of antigen variants for each member of each cohort.



FIG. 11 illustrates a peripheral B cell repertoire occupied by top clonotypes separated by isotype. IGHG results are shown in the top row for vehicle control (top left), BIV (top middle), and C3-50 (top right). IGHM results are shown in the top row for vehicle control (bottom left), BIV (bottom middle), and C3-50 (bottom right).



FIG. 12 illustrates a sequence alignment of broadly neutralizing pig monoclonal antibody 9C5 isolated through phage display.



FIG. 13 illustrates minimum antibody concentrations of three broadly neutralizing human (CR9114, F10, and C05) and one broadly neutralizing pig antibody (9C5) sufficient for in vitro neutralization of a panel of 5 influenza A strains.



FIGS. 14A-14D illustrate data showing a conservation concentration coupling approach can yield serum with improved strain coverage and neutralization breadth compared to a bivalent control. FIG. 14A provides ELISA signals (% max OD450 nm) of pig sera from the first in vivo study to 30 HA antigens (H1, H2, H3, H5, H6, H7, and H17 spanning 1918-2014) across immunization cohorts (each n=7 pigs) of C3-50 (30 HAs at 50 ng/HA), BIV cohort (H1N1 2007 and H3N2 1997 each at 5,000 ng) and vehicle control (PBS+squalene). FIG. 14B provides neutralization breadth shown as average number of strains neutralizing per pig by cohort and as number of positive tests per cohort are shown as scatterplot. FIG. 14C provides neutralization strength for study #2 reported as reciprocal neutralizing dilution per strain across cohorts using bar charts. Bars indicate the mean across all pigs (n=4) of a cohort and error bars indicate standard deviation. “Future” strains tested that are not part of the 2008 C3 formulation are indicated by arrows. FIG. 14D provides neutralization strength for study #3 reported as reciprocal neutralizing dilution per strain across cohorts using bar charts. Bars indicate the mean across all pigs (n=5) of a cohort and error bars indicate standard deviation. Points show the individual value per pig. “Future” strains tested that are not part of the 2008 C3 formulation are indicated by arrows.





DETAILED DESCRIPTION

Rapidly evolving pathogens can pose a fundamental challenge to vaccine design, as mutations in such pathogens can render previous vaccine responses obsolete. For example, annual influenza vaccine redesigns can attempt to anticipate evolved influenza variants that may be in common circulation in the following year. Such attempts can often be unsuccessful. Conserved epitopes on the influenza coat proteins which have been identified can be missed by the antibodies elicited by most vaccine recipients.


Provided herein are immune-generating compositions (e.g., vaccines) that can elicit responses against one or more pathogens. In some cases, vaccines provided herein incorporate antigens that can elicit an immune response that can yield broadly neutralizing antibodies. The present disclosure also provides methods for using and making vaccines as provided herein.


Broadly neutralizing antibodies can provide for “universal” or longer-term vaccines. Such broadly neutralizing antibodies can be against a broadly conserved residue, for example, a residue of an antigen. A broadly conserved antigen can be conserved over multiple strains of a microbe. Microbes, even rapidly mutating pathogens, can comprise broadly conserved antigens. Antibodies that can identify and/or bind these conserved antigens or sets of antigens can confer protection against many strains of a given virus.


Since non-conserved residues can be immunodominant because they can be more abundant (e.g., vastly more abundant) than broadly conserved residues, this disclosure provides an approach that can shift the distribution of abundance of the residues of antigens to favor responses to more conserved antigens. By incorporating broadly conserved antigens representing a spectrum of variations of a microbe (e.g., mutants, different strains, or different clades) into vaccines, immunity can be conferred to a subject to many or substantially all variations of a microbe.


I. Microbes

A vaccine composition as described herein can elicit an immune response against a microbe, or against a strain or subset of strains of a microbe. In some cases, such vaccines can provide broad immunity (immunity not limited to a particular strain) to a plurality of strains of a microbe. Such vaccines can comprise a set of antigens, which can comprise several or many different antigens from several or many different strains of a microbe.


A microbe can be an agent that a subject can be vaccinated against. In some cases, a microbe can be pathogenic, or can lead to a disease, a condition, or death. A microbe can be a bacterium, a virus, a fungus, a parasite, or a derivative thereof, such as a nucleic acid, protein, toxin, or peptide secreted by or isolated from a microbe.


A bacterium can be a single-celled prokaryotic microorganism. A bacterium can be a gram positive bacterium or a gram negative bacterium. A bacterium can be infectious.


A fungus can be a spore producing organism. A fungus can cause disease in a subject on the skin, in the lungs, in the blood, or another organ, tissue, or biological fluid. In some cases, a fungus can be from the genus Aspergillus, Blastomyces, Candida, Coccidioides, or Cryptococcus (e.g., C. neoformans and C. gattii). Other suitable fungi are also within the scope of the present disclosure.


A parasite can be a protozoa, a helminth, an ectoparasite, or any other suitable parasite. In some cases, a parasite can be a Plasmodium parasite. In some cases, a parasite can be a Plasmodium parasite or a Trypanosoma parasite. In the case of a Plasmodium parasite, the vaccine might be against a malaria parasite. In the case of a Trypanosoma parasite, the vaccine might be against a Chagas parasite, a Sleeping Sickness parasite, or a Leishmaniasis parasite.


A protozoa can be a one-celled organism that can live and/or multiply in the blood or tissue of a subject. Examples of protozoa can include, but are not limited to, Sarcodina (e.g., Entamoeba), mastigophore (e.g., Giardia and Leishmania), ciliophoran (e.g., Balantidium), and sporozoan (e.g., Plasmodium, Cryptosporidium, etc.).


A helminth can be a multicellular organism. Helminths can be divided into three groups: platyhelminths, acanthocephalins, and nematodes. Examples of helminths can include, but are not limited to, parasitic flatworms, flukes, tapeworms, thorny-headed worms, roundworms, and pinworms. Helminths can live in a gastrointestinal tract, blood, lymphatic system, or other tissue.


An ectoparasite can be an organism that can live on the surface of a subject, and in some cases, can attach or burrow into the skin of a subject. Examples of ectoparasites can include ticks, fleas, lice, and mites.


In some instances, a virus can be a lentivirus, a flavivirus, a filovirus, a coronavirus, or a paramyxovirus. In the case of a lentivirus, the vaccine might be against a human immunodeficiency (HIV) virus. In the case of a flavivirus, the vaccine might be against a Dengue virus, a Zika virus, or a West Nile virus. In the case of a filovirus, the vaccine might be against an Ebola virus, a Marburg virus, or a Ravn virus. In the case of a coronavirus, the vaccine might be against a Middle East Respiratory Syndrome (MERS) virus, a Severe Acute Respiratory Syndrome (SARS) virus, or a novel coronavirus (e.g., 2019-nCoV). In the case of a paramyxovirus, the vaccine might be against a Respiratory Syncytial Virus (RSV) or a Nipah virus.


A vaccine can be against a flu virus of any type, such as, but not limited to, any one or more of the flu viruses described herein. A flu virus can be influenza type A, influenza type B, influenza type C, or influenza type D. Influenza A viruses can be divided into subtypes based on two proteins on the surface of the virus: hemagglutinin (HA) and neuraminidase (NA). Different flu strains can have different subtypes of hemagglutinin, neuraminidase, or both. There can be at least 198 predicted influenza A subtype combinations. Influenza B viruses can be further classified into lineages, which can include, for example, B/Yamagata and B/Victoria.


If a virus is a type A flu virus, it may be of any subtype, including, but not limited to, H1N1, H1N2, H1N3, H2N2, H3N2, H3N8, H4N2, H4N4, H4N6, H4N8, H5N1, H5N2, H5N3, H5N8, H6N1, H6N4, H6N5, H6N6, H6N8, H7N1, H7N2, H7N3, H7N7, H7N8, H7N9, H8N4, H9N2, H9N5, H9N8, H10N3, H10N4, H10N7, H10N8, H10N9, H11N2, H11N6, H11N9, H12N1, H12N3, H12N5, H13N6, H13N8, H14N5, H15N2, H15N8, H16N3, H17N, H18N11, or a variant thereof. If a virus is a type B flu virus, it can be any strain of type B flu virus or a variant thereof. A non-limiting list of flu viruses is included in Table 1.


A flu virus can be of a strain listed in Table 1. Some vaccines can comprise a set of antigens that can activate an immune response in a subject to at least 80% of strains in Table 1. In some instances, a vaccine can comprise a set of antigens that can activate an immune response in a subject to at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least at least 90%, at least at least 95%, at least at least 99%, or 100% of the strains in Table 1.









TABLE 1





Flu virus strains
















H1N1
A/Albany/12/1951



A/Beijing/22808/2009



A/Beijing/262/1995



A/Brevig Mission/1/1918



A/Brisbane/59/2007



A/California/04/2009



A/California/06/2009



A/California/07/2009



A/Chile/1/1983



A/England/195/2009



A/England/42/1972



A/New Caledonia/20/1999



A/New York/06/2009



A/New York/1/1918



A/New York/18/2009



A/New Jersey/8/1976



A/Ohio/07/2009



A/Ohio/UR06-0091/2007



A/Puerto Rico/8/1934



A/Puerto Rico/8/34/Mount Sinai



A/Solomon Islands/3/2006



A/swine/Belgium/1/1998



A/Swine/Wisconsin/136/1997



A/Taiwan/01/1986



A/Texas/05/2009



A/Texas/36/1991



A/USSR/90/1977



A/USSR/92/1977



A/WSN/1933


H1N2
A/swine/Guangxi/13/2006


H1N3
A/duck/NZL/160/1976


H2N2
A/Ann Arbor/6/1960



A/Canada/720/2005



A/Guiyang/1/1957



A/Japan/305/1957


H3N2
A/Aichi/2/1968



A/Babol/36/2005



A/Brisbane/10/2007



A/California/7/2004



A/Chiang Rai/277/2011



A/Christchurch/4/1985



A/Fujian/411/2002



A/Guangdong-Luohu/1256/2009



A/Hong Kong/1/1968



A/Hong Kong/CUHK31987/2011



A/Indiana/07/2012



A/Memphis/1/68



A/Moscow/10/1999



A/New York/55/2004



A/Perth/16/2009



A/reassortant/IVR-155



A/Sydney/5/1997



A/Texas/50/2012



A/Victoria/208/2009



A/Victoria/210/2009



A/Victoria/3/1975



A/Victoria/361/2011



A/Wisconsin/15/2009



A/Wisconsin/67/X-161/2005



A/Wyoming/03/2003



A/X-31


H3N8
A/canine/New York/145353/2008



A/equine/Gansu/7/2008


H4N2
A/duck/Hunan/8-19/2009


H4N4
A/mallard duck/Alberta/299/1977


H4N6
A/mallard/Ohio/657/2002



A/Swine/Ontario/01911-1/99


H4N8
A/chicken/Alabama/1/1975


H5N1
A/Anhui/1/2005



A/bar-headed goose/Qinghai/14/2008



A/bar-headed goose/Qinghai/1A/2005



A/barnswallow/Hong Kong/D10-1161/2010



A/Cambodia/R0405050/2007



A/Cambodia/S1211394/2008



A/chicken/Egypt/2253-1/2006



A/chicken/India/NIV33487/2006



A/chicken/Jilin/9/2004



A/chicken/VietNam/NCVD-016/2008



A/chicken/Yamaguchi/7/2004



A/Common magpie/Hong Kong/2256/2006



A/common magpie/Hong Kong/5052/2007



A/Duck/Hong Kong/p46/97



A/duck/Hunan/795/2002



A/duck/Laos/3295/2006



A/Egypt/2321-NAMRU3/2007



A/Egypt/3300-NAMRU3/2008



A/Egypt/N05056/2009



A/goose/Guangdong/1/96



A/goose/Guiyang/337/2006



A/Hong Kong/213/03



A/Hong Kong/483/97



A/Hubei/1/2010



A/Hubei/2011



A/Hubei/2011-CDC



A/Indonesia/5/2005



A/Japanese white-eye/Hong Kong/1038/2006



A/Thailand/1(KAN-1)/2004



A/turkey/Turkey/1/2005



A/Vietnam/UT31413II/2008



A/whooper swan/Mongolia/244/2005



A/Xinjiang/1/2006


H5N2
A/American green-winged teal/California/HKWF609/07



A/ostrich/South Africa/AI1091/2006


H5N3
A/duck/Hokkaido/167/2007


H5N8
A/breeder duck/Korea/Gochang1/2014



A/broiler duck/Korea/Buan2/2014



A/duck/Jiangsu/k1203/2010



A/duck/NY/191255-59/2002



A/duck/Zhejiang/6D18/2013



A/duck/Zhejiang/W24/2013



A/turkey/Ireland/1378/1983


H5N9
A/chicken/Italy/22A/1998


H6N1
A/northern shoveler/California/HKWF115/2007


H6N4
A/chicken/HongKong/17/77


H6N5
A/shearwater/Australia/1/1973


H6N6
A/duck/Eastern China/11/2009


H6N8
A/mallard/Ohio/217/1998


H7N1
A/turkey/Italy/4602/99


H7N2
A/ruddy turnstone/New Jersey/563/2006


H7N3
A/chicken/SK/HR-00011/2007



A/turkey/Italy/214845/2002


H7N7
A/chicken/Netherlands/1/03



A/equine/Kentucky/1a/1975



A/Netherlands/219/2003


H7N8
A/mallard/Netherlands/33/2006


H7N9
A/Anhui/1/2013



A/Anhui/PA-1/2013



A/chicken/Zhejiang/DTID-ZJU01/2013



A/Hangzhou/1/2013



A/Hangzhou/3/2013



A/Huzhou/10/2013



A/Pigeon/Shanghai/S1069/2013



A/Shanghai/1/2013



A/Shanghai/4664T/2013



A/Shanghai/Patient3/2013



A/Zhejiang/1/2013



A/Zhejiang/DTID-ZJU10/2013


H8N4
A/pintail duck/Alberta/114/1979


H9N2
A/brambling/Beijing/16/2012



A/Chicken/Hong Kong/G9/1997



A/duck/Hong Kong/448/78



A/Guinea fowl/Hong Kong/WF10/99



A/Hong Kong/1073/99



A/Hong Kong/2108/2003



A/Hong Kong/3239/2008



A/Hong Kong/35820/2009


H9N5
A/shorebird/DE/261/2003


H9N8
A/chicken/Korea/164/04


H10N3
A/duck/Hong Kong/786/1979



A/duck/Hunan/S11205/2012



A/mallard/Minnesota/Sg-00194/2007


H10N4
A/mink/Sweden/3900/1984


H10N7
A/blue-winged teal/Louisiana/Sg-00073/2007


H10N8
A/duck/Guangdong/E1/2012



A/Jiangxi-Donghu/346/2013


H10N9
A/duck/Hong Kong/562/1979



A/duck/HongKong/562/1979


H11N2
A/duck/Yangzhou/906/2002



A/thick-billed murre/Newfoundland/031/2007


H11N6
A/duck/England/1/1956


H11N9
A/mallard/Alberta/294/1977


H12N1
A/mallard duck/Alberta/342/1983


H12N3
A/bar headed goose/Mongolia/143/2005


H12N5
A/green-winged teal/ALB/199/1991


H13N6
A/black-headed gull/Sweden/1/1999


H13N8
A/black-headed gull/Netherlands/1/00


H14N5
A/Mallard/Astrakhan(Gurjev)/263/1982


H15N2
A/Australian shelduck/Western Australia/1756/1983


H15N8
A/duck/AUS/341/1983


H16N3
A/black-headed gull/Sweden/5/99


H17N10
A/little yellow-shouldered bat/Guatemala/164/2009


H18N11
A/flat-faced bat/Peru/033/2010


Influenza B
B/Brisbane/3/2007



B/Brisbane/60/2008



B/Florida/07/2004



B/Florida/4/2006



B/Hong Kong/05/1972



B/Malaysia/2506/2004



B/Massachusetts/03/2010



B/Ohio/01/2005



B/PHUKET/3073/2013



B/Utah/02/2012



B/Victoria/02/1987



B/Victoria/504/2000



B/Wisconsin/01/2012



B/Yamagata/16/1988









A flu vaccine can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or more antigens per hemagglutinin antigen class (i.e., H1, H2, H3, etc.)


A vaccine can be against an HIV virus of any species, for example, HIV-1 and HIV-2. HIV-1 can be of group M (main), group 0 (outlier), or group N (non-M or O). HIV-1 group M can be of subtype A, B, C, D, F, G, H, J, or another subtype. HIV-2 can be of subtype A or subtype B. In some cases, a vaccine against HIV can be against one of HIV-1 or HIV-2. In some cases, a vaccine against HIV can be against both HIV-1 and HIV-2.


A vaccine can be against a variant of HIV or in some cases against simian immunodeficiency virus (SIV). In some cases, a vaccine against HIV can also be against SW. In some cases, a vaccine against HIV can also offer protection against a variant of SW.


An HIV virus can be of a strain listed in SEQ ID NOS: 172-267. Some vaccines can comprise a set of antigens that can activate an immune response in a subject to at least 80% of strains in SEQ ID NOS: 172-267. In some instances, a vaccine can comprise a set of antigens that can activate an immune response in a subject to at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least at least 90%, at least 95%, at least 99%, or 100% of the strains in SEQ ID NOS: 172-267.


A microbe can have variants. A variant of a microbe can comprise a microbe having a variation in its genome, a microbe having a variation in its post-translational modification, a microbe having a variation in its epigenome, a microbe having a variation in its protein expression, a microbe having a variation in its RNA expression, a microbe having a variation in its antigen content, or a combination thereof. In some cases, a microbe having a variation in its antigen content can be of particular interest.


A microbe can have a plurality of variants which can be represented using a phylogenic tree. A phylogenic tree can be a model or a tool, which can be used to explore relationships. In some cases, a phylogenetic tree can be a diagrammatic representation of phylogeny, or evolutionary history, in the sense of a minimally connected graph. A phylogenic tree can comprise a number of clades, wherein a clade can be a grouping of nested branches. A clade can be a grouping that includes a common ancestor and all the descendants (living and extinct) of that ancestor. Examples of clades in a phylogenic tree are provided in FIG. 1. Examples of groupings in a phylogenetic tree that do not comprise clades are also provided in FIG. 1. A vaccine can comprise antigens that are representative of clades of a microbe.


A group of microbes can be divided into clades, or groups in a phylogenetic tree. In some cases, a phylogenetic tree can be an organization of microbes that can indicate how closely related individual microbes are to one another.


Clades can be arranged based on similarity of one or more sequences of antigens of the microbes. Clades and subclades on a phylogenetic tree can be shown as groups that have similar genetic changes, such as nucleotide or amino acid changes, and have a single common ancestor (e.g., a common node or branch).


Clades that are genetically different can be antigenically different. Antigenically different clades can refer to clades whose members have differences that can impact immunity in a subject.


A clade can be a clade of a phylogenic tree which can be a neighbor-joining clustering tree or a maximum parsimony tree. In some cases, a phylogenic tree can be modeled by a Bayesian model, a maximum likelihood model, a weighted pair group method with arithmetic mean (WPGMA), or an unweighted pair group method with arithmetic mean (UPGMA). A phylogenetic tree as provided herein can also be modeled by other suitable statistical models. A neighbor-joining clustering tree can be created using an agglomerative clustering method. A maximum parsimony tree can be created to minimize the total number of character state changes.


A clade can comprise nodes and branches. A node can be a point or vertex on a phylogenetic tree that can represent the split of one lineage of a microbe to form two or more lineages (e.g., an internal node within a phylogenetic tree) or the lineage at the present time (e.g., a terminal node). A branch can be a line on a phylogenetic tree that can be used to represent a lineage, whether ancestral or terminal.


A clade can be a node-based clade, a branch-based clade, or an apomorphy-based clade. A node-based clade can be a clade originating from a particular node on a phylogenic tree and can encompass that node and the branches and nodes descending from that node. An example of a node-based clade is illustrated in the top panel of FIG. 2. A branch-based clade can be a clade originating from a particular branch on a phylogenic tree and can encompass that branch and the branches and nodes descending from that branch. An example of a branch-based clade is illustrated in the middle panel of FIG. 2. In some cases, a clade can be an apomorphy-based clade. An apomorphy-based clade can be a clade originating from an ancestor in which a particular character state, for example, a sequence or structure feature originated. An example of an apomorphy-based clade is illustrated in the bottom panel of FIG. 2. The horizontal line in the apomorphy-based clade illustrated in FIG. 2 indicates where a particular character state occurred.


In some cases, a clade can be a clade of a phylogenic tree which can be a rooted tree, an unrooted tree, or a bifurcating tree. A phylogenic tree can be a rooted tree if any unique node can correspond to the most recent common ancestor of all entities on the tree. A phylogenic tree can be an unrooted tree if it can be created without making any substantial assumptions about ancestry. A phylogenic tree can be a bifurcating tree if each node has exactly two descendants.


A clade can be described by its order. The order of a clade can be a description of how many clades it subsumes. A clade of order X can subsume clades of order X+1. For example, a clade of order 1 can subsume clades of order 2, 3, 4, etc., and a clade of order 2 can subsume clades of order 3, 4, 5, etc. An illustration of a first order clade, a second order clade, and a third order clade in the same phylogenic tree is provided in FIG. 3. Here, the first order clade subsumes the second order clade, and the second order clade subsumes the third order clade.


Likewise, a clade of order Y can be subsumed by a clade of order Y−1. For example, a clade of order 5 can be subsumed by a clade of order 4, a clade of order 6 can be subsumed by a clade of order 5, and a clade of order 7 can be subsumed by a clade of order 6, etc.


If a clade is an X-order clade, then the X-order clade may be phylogenetically below X−1 branch nodes in a phylogenic tree of the microbe. For example, a clade of order 5 can be phylogenetically below 4 branch nodes, a clade of order 6 can be phylogenetically below 5 branch nodes, and a clade of order 7 can be phylogenetically below 6 branch nodes.


A clade of a microbe can be first order, second order, third order, fourth order, fifth order, sixth order, seventh order, eighth order, ninth order, tenth order, eleventh order, twelfth order, thirteenth order, fourteenth order, fifteenth order, or any other suitable order. If a clade is a first order clade, then that clade can be not be subsumed in whole or in part within another higher-level clade.


For example, flu subtypes (e.g., influenza type A) can be broken down into different genetic clades and sub-clades. A flu clade can be a subdivision of flu viruses based on the similarity of a gene sequence of the flu viruses (e.g., hemagglutinin sequences).


As another example, HIV subtypes (e.g., HIV-1 and/or HIV-2) can be broken down into different genetic clades and sub-clades. An HIV clade can be a subdivision of HIV viruses based on the similarity of a gene sequence of the HIV viruses (e.g., the env gene, which encodes the gp160 protein).


II. Antigens

A vaccine composition as described herein can comprise antigens. Antigens in a vaccine composition can be of a microbe or a strains of a microbe that the vaccine composition can elicit an immune response against.


An antigen can be a molecule that can be recognized by the immune system, which can include antibodies, B cells and/or T cells. For example, an antigen can be a foreign substance that can induce an immune response. In some cases, the induced immune response can comprise the production of antibodies. In some cases, an antibody can recognize, attach to, or bind to an antigen.


An antigen can be any antigenic determinant on an immunogen, for example, any primary immunogen, to which an antibody binds through an antigenic binding site. Determinants or antigenic determinants on an antigen generally consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and generally have specific three-dimensional structural characteristics, as well as specific charge characteristics.


An antigen can be a neoantigen, an autoantigen, an endogenous antigen, an exogenous antigen, a viral antigen, a bacterial antigen, a fungal antigen, a parasitic antigen, a toxin, or a tumor antigen. Some vaccines can comprise antigens from one source, while some vaccines can comprise antigens from two, three, four, or more sources.


A vaccine can comprise an antigen of a strain in Table 1. Some vaccines can comprise one or more homologues of one or more antigens of strains in Table 1. In some cases, such a homologue can comprise at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity to an antigen in Table 1.


An antigen can be a flu antigen. In some cases, a flu antigen can be hemagglutinin or neuraminidase. In some cases, a flu antigen can be a fragment of, a derivative of, or a modified hemagglutinin or neuraminidase. In certain cases, a vaccine can comprise a combination of hemagglutinin and neuraminidase, or fragments, derivatives, or modified versions thereof. In various cases, an antigen can be a hemagglutinin-neuraminidase combined protein, or a fragment, derivative, or modified version thereof.


Hemagglutinin can be a homotrimeric glycoprotein that can be found on the surface of a flu virus. Hemagglutinin can be a class 1 fusion protein, and can have multifunctional activity as an attachment factor and/or a membrane fusion protein. In some cases, hemagglutinin can play a role in binding a flu virus to sialic acid on the surface of a target or host cell. After this binding, a flu virus can be internalized. In some cases, hemagglutinin can play a role in the fusion of the viral envelope of a flu virus with a late endosomal membrane in a low pH (e.g., 5.0-5.5) environment.


Hemagglutinin can be of a type A or a type B flu virus. Hemagglutinin can have a structure comprising a head and a stem, and can comprise three identical monomers. A monomer can comprise an intact HA0 single polypeptide chain with HA1 and HA2 regions that can be linked by two disulfide bridges. An HA2 region can have an alpha helical coiled coil structure, and can sit on top of an HA1 region. An HA1 region can be a small globular domain that can comprise a mix of a/13 structures.


A vaccine composition can comprise a head of hemagglutinin, a stem of hemagglutinin, or a combination or fragment thereof. In some cases, a vaccine composition can comprise a broadly conserved fragment of hemagglutinin, a broadly conserved fragment of a head of hemagglutinin, a broadly conserved fragment of a stem of hemagglutinin, or a combination thereof. A broadly conserved sequence can be an amino acid sequence or a nucleic acid sequence that can be identical or similar across species or strains of a microbe.


A vaccine composition can comprise an HA0 region, an HAL an HA2, region, or a combination or fragment thereof. A hemagglutinin antigen can be of a subtype of hemagglutinin. For example, hemagglutinin in a type A flu virus hemagglutinin antigen can be of 18 or more subtypes. Subtypes of hemagglutinin can be H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, H17, or H18.


Some vaccine compositions can comprise at least 5, at least 10, at least 20, or at least 30 influenza virus hemagglutinin antigen polypeptides. Such a hemagglutinin antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine, wherein each polypeptide comprises a target segment that is at least 90% identical among the influenza virus hemagglutinin antigen polypeptides. In some cases, such a hemagglutinin antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In certain cases, such a hemagglutinin antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In various cases, such a hemagglutinin antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 98% identical, to the other polypeptides in the vaccine. In some cases, such a hemagglutinin antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 98% identical, to the other polypeptides in the vaccine.


Neuraminidase can be a protein found on the surface of a flu virus. Neuraminidase can enable flu virus to be released from a host cell. Neuraminidase can be a member of the glycoside hydrolase family 34. Neuraminidase can be a mushroom shaped protein. Neuraminidase can comprise a head portion, which can comprise four co-planar and roughly spherical subunits and a hydrophobic region. In some cases, the hydrophobic region can be embedded within the interior of a viral membrane. Neuraminidase can comprise a single polypeptide chain. A neuraminidase polypeptide can be a single chain of six conserved polar amino acids, which can be followed by variable amino acids.


A vaccine can comprise a spherical subunit of neuraminidase, a hydrophobic region of neuraminidase, or a combination thereof. In some cases, a vaccine composition can comprise a broadly conserved fragment of neuraminidase, a broadly conserved fragment of a spherical subunit of neuraminidase, a broadly conserved fragment of a hydrophobic region of neuraminidase, or a combination thereof.


In some cases, a neuraminidase antigen can be of a subtype of neuraminidase. A flu virus neuraminidase antigen can be of 11 or more subtypes. Subtypes of neuraminidase can be N1, N2, N3, N4, N5, N6, N7, N8, N9, N10, or N11. In some cases, a neuraminidase antigen can be of a virus that is not a flu virus. In some cases, a neuraminidase can be of a bacterium. For example, neuraminidase can be of Bacteroides fragilis or Pseudomonas aeruginosa or another bacterium.


Some vaccine compositions can comprise at least 5, at least 10, at least 20, or at least 30 influenza virus neuraminidase antigen polypeptides. Such a neuraminidase antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine, wherein each polypeptide comprises a target segment that is at least 90% identical among the influenza virus neuraminidase antigen polypeptides. In some cases, such a neuraminidase antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In certain cases, such a neuraminidase antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In various cases, such a neuraminidase antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 98% identical, to the other polypeptides in the vaccine. In some cases, such a neuraminidase antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 98% identical, to the other polypeptides in the vaccine.


An antigen can be an HIV antigen. In some cases, an HIV antigen can be the glycoprotein gp160. In some cases, an HIV antigen can be a fragment of, a derivative of, or a modified gp160. In certain cases, a vaccine can comprise a combination of gp160, or fragments, derivatives, or modified versions thereof.


Gp160 can be a protein encoded by the env gene in the HIV virus that can form a homotrimer and can be found on the surface of an HIV virus. Gp160 can be cleaved into gp120 and gp41 by a protease such as furin in a subject. Gp120 and gp41 can be transported to the plasma membrane of the host cell of the subject, where gp41 can anchor gp120 to the membrane of the infected cell. In some cases, gp120 can be an antigen. In some cases, gp41 can be an antigen.


Gp160 can be of HIV-1 or HIV-2. Gp160 can comprise three subunits that can have an outward facing glycoprotein portion attached to a membrane-associated portion. The interface between GP160 subunits can be polar.


A vaccine composition can comprise a glycoprotein segment of gp160, a membrane-associated portion of gp160, a broadly conserved fragment of gp160, a broadly conserved fragment of a glycoprotein segment of gp160, a broadly conserved fragment of a membrane-associated portion of gp160, or a combination thereof.


Some vaccine compositions can comprise at least 5, at least 10, at least 20, or at least 30 HIV gp160 antigen polypeptides. Such a gp160 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine, wherein each polypeptide comprises a target segment that is at least 90% identical among the gp160 antigen polypeptides. In certain cases, such a gp160 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In various cases, such a gp160 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In some cases, such a gp160 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 98% identical, to the other polypeptides in the vaccine. In certain cases, such a gp160 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 98% identical, to the other polypeptides in the vaccine.


Gp120 can be of HIV-1 or HIV-2. Gp120 can comprise three subunits that can have an outward facing glycoprotein portion attached to a membrane-associated portion. The interface between GP120 subunits can be polar.


A vaccine composition can comprise a glycoprotein segment of gp120, a membrane-associated portion of gp120, a broadly conserved fragment of gp120, a broadly conserved fragment of a glycoprotein segment of gp120, a broadly conserved fragment of a membrane-associated portion of gp120, or a combination thereof.


Some vaccine compositions can comprise at least 5, at least 10, at least 20, or at least 30 HIV gp120 antigen polypeptides. Such a gp120 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine, wherein each polypeptide comprises a target segment that is at least 90% identical among the gp120 antigen polypeptides. In certain cases, such a gp120 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In various cases, such a gp120 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In some cases, such a gp120 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 98% identical, to the other polypeptides in the vaccine. In certain cases, such a gp120 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 98% identical, to the other polypeptides in the vaccine.


Gp41 can be of HIV-1 or HIV-2. Gp41 can comprise three subunits that can have an outward facing glycoprotein portion attached to a membrane-associated portion. The interface between GP41 subunits can be polar.


A vaccine composition can comprise a glycoprotein segment of gp41, a membrane-associated portion of gp41, a broadly conserved fragment of gp41, a broadly conserved fragment of a glycoprotein segment of gp41, a broadly conserved fragment of a membrane-associated portion of gp41, or a combination thereof.


Some vaccine compositions can comprise at least 5, at least 10, at least 20, or at least 30 HIV gp41 antigen polypeptides. Such a gp41 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine, wherein each polypeptide comprises a target segment that is at least 90% identical among the gp41 antigen polypeptides. In certain cases, such a gp41 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In various cases, such a gp41 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 95% identical, to the other polypeptides in the vaccine. In some cases, such a gp41 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 20% identical, but not more than 98% identical, to the other polypeptides in the vaccine. In certain cases, such a gp41 antigen polypeptide can be represented by a sequence (e.g., an amino acid sequence) that is at least 10% identical, but not more than 98% identical, to the other polypeptides in the vaccine.


An antigen can be broadly neutralizing. A broadly neutralizing antigen can be identified by a broadly neutralizing antibody. A broadly neutralizing antibody can be an antibody which can affect multiple strains of a virus. Some broadly neutralizing antigens can be of at least 80%, 85%, 90%, 95%, 98%, 99%, or 100% of strains.


A broadly neutralizing antigen can comprise surface exposed residues adjacent in tertiary space. That is, a broadly neutralizing antigen can be neutralizing based on its surface properties. In some cases, a broadly neutralizing antigen can comprise neutralizing residues on a non-surface region of the antigen, such as in a pocket, in an active site, or in the interior of an antigen.


An antigen, such as a broadly neutralizing antigen, can be of a flu virus. Such a broadly neutralizing antigen can be located on hemagglutinin. Such a broadly neutralizing antigen can be in a stem of hemagglutinin, in a head of hemagglutinin, or in a portion thereof. In some cases, a broadly neutralizing antigen can be similar to a stem of a hemagglutinin, head of hemagglutinin, or similar to a portion thereof. In some cases, such a portion can be broadly neutralizing. For example, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a head of a hemagglutinin or a stem of a hemagglutinin. In some cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a portion of a head of a hemagglutinin or a portion of a stem of a hemagglutinin. In some cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a broadly conserved fragment of hemagglutinin, a broadly conserved fragment of a head of a hemagglutinin or a broadly conserved fragment of a stem of a hemagglutinin.


In some cases, a broadly neutralizing antigen can be located on neuraminidase or a portion thereof. In some cases, a broadly neutralizing antigen can be similar to neuraminidase. For example, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to neuraminidase or a portion thereof. In some cases, such a portion can be broadly neutralizing. In some cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a portion of neuraminidase. In some cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a broadly conserved fragment of neuraminidase.


A broadly neutralizing antigen can be of an HIV virus. In some cases, a broadly neutralizing antigen can be located on gp160 or a portion thereof. In some cases, a broadly neutralizing antigen can be similar to gp160. For example, a broadly neutralizing antigen can be at least about 30% similar to, at least about 40% similar to, at least about 50% similar to, at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to gp160 or a portion thereof. In certain cases, such a portion can be broadly neutralizing. In various cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a portion of gp160. In some cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a broadly conserved fragment of gp160.


A broadly neutralizing antigen can be of an HIV virus. In some cases, a broadly neutralizing antigen can be located on gp120 or a portion thereof. In some cases, a broadly neutralizing antigen can be similar to gp120. For example, a broadly neutralizing antigen can be at least about 30% similar to, at least about 40% similar to, at least about 50% similar to, at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to gp120 or a portion thereof. In certain cases, such a portion can be broadly neutralizing. In various cases, a broadly neutralizing antigen can be at least about 30% similar to, at least 40% similar to, at least 50% similar to, at least 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a portion of gp120. In some cases, such a portion can be a broadly conserved fragment.


A broadly neutralizing antigen can be of an HIV virus. In some cases, a broadly neutralizing antigen can be located on gp41 or a portion thereof. In some cases, a broadly neutralizing antigen can be similar to gp41. For example, a broadly neutralizing antigen can be at least about 30% similar to, at least about 40% similar to, at least about 50% similar to, at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to gp41 or a portion thereof. In certain cases, such a portion can be broadly neutralizing. In various cases, a broadly neutralizing antigen can be at least about 60% similar to, at least about 70% similar to, at least about 80% similar to, at least about 85% similar to, or at least about 90% similar to a portion of gp41. In some cases, such a portion can be a broadly conserved fragment.


In some cases, a vaccine can comprise antigens corresponding to more than one microbe. Some vaccines can comprise antigens against two, three, four, five, six, or more microbes. Antigens can each be present in the vaccine at a lower dose than in currently available vaccines. The combination of diverse antigens can allow for B cells and/or T cells which can broadly identify many strains of the microbe, even strains which are not represented in the vaccine. In some cases, this can be referred to as a swarm effect. The swarm effect can reduce the amount of dose of each antigen as well as reduce the total antigen dose required to elicit an immune response. For example, antigens representative of a particular number of clades of a given order of a clade of a microbe can be included in a vaccine. Including representatives of diverse clades may allow the vaccine to be effective against evolutionarily distinct strains. As another example, antigens representative of a particular amount of operational taxonomic units (OTUs) of a microbe can be included in a vaccine. An OTU can be used to classify groups of closely related entities, such as antigens.


An OTU can be a group of microbes. Microbes in a given OTU can be closely related. An overview of OTUs can be found in Sokal, et al. (Principles of Numerical Taxonomy. W.H. Freeman and Co., San Francisco and London (1963)) and Schloss, et al. (Applied and Environmental Microbiology, October 2006, p. 6773-6779, Vol. 72, No. 10).


An OTU can refer generally to a cluster of antigens grouped by sequence similarity and can sometimes be a pragmatic proxy for “species” at different taxonomic levels. Administering to a subject a vaccine comprising antigens representing clades of a variety of OTUs can confer immunity across a genetically diverse set of antigens.


In some cases, an OTU can be based on a particular nucleic acid or amino acid sequence of a microbe. For example, an OTU can be based on a sequence of 16S rRNA or a sequence of a gene encoding 16S rRNA. Such an OTU can be referred to as a 16S OUT. In various cases, an OTU can be based on a sequence of 18S rRNA or a sequence of a gene encoding 18S rRNA.


An OTU can be a group of microbes having a threshold similarity. For example, an OUT can be a group of microbes having at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% similarity. In some cases, different microbes can have a similarity higher than the threshold. Such microbes can be in a merged OTU, which can contain multiple species. In certain cases, a single species of a microbe can have paralogs that are less similar than the threshold. Such paralogs can be split across two or more OTUs.


Methods to assign OTUs can include rarefaction curves. Such methods can assess species richness and alpha and beta diversity estimators, implicitly assume that OTUs are observations of organisms with negligible error, and/or the number of observations correlates well with a total number of species or monophyletic groups.


In some cases, a cluster can be spurious, for example, due to artifacts, such as read errors and/or chimeras. In certain cases, an OTU can be based on a genetic distance between sequences of microbes within the OTU. In various cases, an OTU can provide an estimate of richness and/or diversity of a microbial population. In some instances, microbes belonging to a given OTU can have structural similarities. In certain instances, microbes belonging to different OTUs can have different structures.


Microbial sequences can be assigned to an OTU using a software or a tool. Examples of such tools include, but are not limited to, the Distance-Based OTU and Richness (DOTUR) tool, a shared OTUs and similarity (SONS) tool, a LIBSHUFF/∫-LIBSHUFF tool, a TreeClimber tool, a UniFrac tool, or an analysis of molecular variance (AMOVA) tool.


A tool can assign a microbial sequence to an OTU based on an input comprising information of a plurality of microbial sequences. Examples of such inputs can include a distance matrix, a phylogenetic tree, and/or a distance matrix. A DOTUR tool can assign sequences to OTUs by using a furthest, average, or nearest neighbor algorithm for one, several, or each distance level. A DOTUR tool can assign OTUs based on a distance matrix input. A SONS tool can calculate collector's curves for estimates of the fraction and richness of OTUs shared between communities. A SONS algorithm can assign OTUs based on an OTU designation input.


A LIBSHUFF/∫-LIBSHUFF tool can use a Cramer-von Mises statistic to test whether structures of two communities are the same, different, or subsets of one another. A LIBSHUFF/∫-LIBSHUFF tool can assign OTUs based on a distance matrix input. A TreeClimber tool can implement a parsimony-based test to determine whether the community structures of two or more communities are significantly different. A TreeClimber tool can assign OTUs based on a phylogenetic tree input.


A UniFrac tool can compare phylogenetic distances between pairs of communities to describe the similarity of their structures. A UniFrac tool can assign OTUs based on a phylogenetic tree input. An AMOVA tool can use an analysis of variance-type formulations to determine whether the genetic diversities of two or more community structures are significantly different. An AMOVA tool can assign OTUs based on a distance matrix input.


An OTU can refer generally to a cluster of antigens grouped by sequence similarity, and can sometimes be a pragmatic proxy for “species” at different taxonomic levels. Including representative clades of a variety of OTUs can result in conferred across a genetically diverse set of antigens. Each OTU can comprise microbial sequences which are similar. In some cases, each OTU can comprise microbial sequences which are at least 90%, at least 91%, at least 92%, at least 93%, or at least 94% homologous of one another. In some cases, each OTU can comprise microbial sequences which are at least 95%, at least 96%, at least 97%, at least 98%, at least 99% homologous of one another.


In an even further example, a set of antigens derived from a library of variants such that the set of antigens broadly represents the library of variants can be included. In some cases, a set of antigens derived from a library of variants such that the set of antigens broadly represents at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or at least 99.9% of the library of variants can be included. In some cases, such a set of antigens can additionally represent one or more variants not included in a library of variants.


An antigen can be a single peptide or protein of interest or a complex of two, three, four, five, six, or more peptides and/or proteins. In some cases, an antigen can refer to an antigen domain, or a region or section of a larger protein or protein complex. In some cases, an antigen can refer to two or more antigen domains or two or more regions or sections of a larger protein or protein complex. In some cases, an antigen may be a whole protein, a fragment of a whole protein, a functional fragment of a whole protein, a peptide, a multimer polypeptide, a polypeptide sequence, or a combination thereof. In some cases, an antigen can be modified (e.g., artificially modified). Such modification can stabilize or improve the antigenicity. Examples of modifications include alterations in a polypeptide sequence (e.g., truncation, lengthening, or mutating). In some cases, an antigen may comprise two or more whole proteins, fragments of whole proteins, functional fragments of whole proteins, peptides, or multimer polypeptides. Sometimes, an antigen can comprise a lipid, a nucleic acid, a polysaccharide, or a combination thereof


An antigen can be an intact (i.e., an entire or whole) antigen, or a functional portion of an antigen. An antigen can be a peptide functional portion of an antigen. “Intact,” as used herein, generally refers to a full-length antigen as that antigen polypeptide occurs in nature. An intact antigen may adopt the native protein fold as seen in nature, presenting both primary sequence epitopes as well as tertiary sequence conformational epitopes. This can be in direct contrast to delivery of only a small portion or peptide of the antigen. Delivering an intact antigen to a cell can elicit a broad spectrum immune response, rather than an immune response to just a single or select few epitopes. In some cases, one, several, or all of the epitopes on an antigen can be unknown at the time of developing a vaccine or library.


Alternatively, an antigen can be divided into parts, depending on the size of the initial antigen. Typically, where a whole antigen is a multimer polypeptide, the whole protein can be divided into sub-units and/or domains where each individual subunit or domain of the antigen can be associated with the polymer according to the methods as disclosed herein. In some cases, parts of an antigen can be functional fragments or parts.


A vaccine can comprise a homolog of an antigen (e.g., a hemagglutinin homolog, a neuraminidase homolog, a gp160 homolog, a gp120 homolog, or a gp41 homolog). A homolog can be a molecule having a shared ancestry with an antigen. In some cases, a homolog of an antigen can be an antigen.


In some cases, an antigen can be modified. A modification can comprise a truncation, a tag, an addition of amino acids, glycosylation, methylation, ubiquitination, insertion, deletion, mutation, or other modification. For example, HIV env-Glycoprotein Complex (trimer) can be usually artificially mutated e.g. by disulfide bonds that can keep the antigen stable and/or prevent or delay dissociation of HIV env derived proteins. This can include but is not limited to artificial cleaving sites, removal of entire functional regions (e.g. cutting of a transmembrane region)


In some cases, an antigen can be present at a lower concentration than in currently available vaccines. In some cases, an antigen can be present at a concentration which would be insufficient to induce an immune response if the antigen were administered alone. In some cases, each of the antigens in a vaccine can be present at concentrations which would be insufficient for any of the antigens to induce an immune response if the antigens were administered alone. In such cases, the total antigen concentration, rather than the concentrations of the individual antigens, can be a determining factor for whether or not an immune response is induced. When the total concentration of antigens is a determining factor, the required concentrations of individual antigens can be a function of the total concentration or a function of the number of antigens.


In some cases, one or more antigens can provide immunity in a subject against a broad plurality of variants of a microbe. For example, an antigen can provide immunity to variants of a microbe that are variants based on year of origin, species of origin, specific mutation, or any other suitable variant.


Two antigens in a set of antigens can have an edit distance (e.g., pairwise edit distance). An edit distance can describe the differentness of the two antigens. An edit distance can be a Levenshtein distance. A Levenshtein distance between a first antigen and a second antigen can be the fewest number of edits required to change the first antigen into the second antigen. In the calculation of edit distance, allowed edits can comprise an insertion, deletion, and/or substitution. If the first antigen and the second antigen are identical, the Levenshtein distance can be zero. If the first antigen and the second antigen are the same size, the Levenshtein distance can be at most the size of the first antigen. If the first antigen and the second antigen are different in size, the Levenshtein distance can be at least the difference in size between the first antigen and the second antigen.


Another method of calculating edit distance can be the Longest Common Subsequence method, wherein insertion and deletion can be the only two allowed edits for calculation. Yet another method of calculating edit distance can be the Hamming distance, wherein only antigens of same size may be used.


A smallest edit distance between two or more antigens can be determined. In some cases, the smallest edit distance can be the smallest edit distance between any two antigens of the two or more antigens. The smallest edit distance can be a measure of the similarity of the two or more most similar antigens in a set.


A largest edit distance between two or more antigens can be determined. In some cases, the largest edit distance can be the largest edit distance between any two antigens of the two or more antigens. The largest edit distance can be a measure of the similarity of the two or more most different antigens in a set.


Both the smallest pairwise edit distance between two or more antigens in a set and the largest pairwise edit distance between two or more antigens in a set can be measured or described. The combination of both edit distances can describe the most and least similar antigens in a set, and in some cases, can be a measure of the bounds of variety of antigens in a set.


An edit distance can be measured as a percentage of the size of the antigens. Herein, the size of the antigens can be the average size of the antigens, the maximum size of the antigens, the minimum size of the antigens, or another measure of the size of the antigens. For example, for an two antigens that are each 100 amino acids in size and have a 5 edit difference, the pairwise edit distance can be represented as 5%.


A vaccine can comprise a set of antigens of a microbe wherein the smallest pairwise edit distance between the two antigens is less than a particular threshold. A vaccine can comprise a set of antigens wherein the largest pairwise edit distance between two of the antigens is at least a different particular threshold. In some cases, requiring minimum and maximum differences in included antigens can help ensure immunity will be conferred across a variety of antigens with minimal bias.


A library or vaccine can be described by the difference between the largest edit distance and the smallest edit distance. Two antigens of the set with the greatest edit distance can have an edit distance called “S.” Two antigens of the library with the greatest edit distance can have an edit distance called “L.” The difference between “S” and “L” can indicate a range of differentness of the antigens present in a vaccine. A larger difference can indicate a greater range of differentness, or greater variety, of antigens. In some cases, “S” can be at least 60% of “L,” 70% of “L,” 80% of “L,” or 90% of “L.” In other words, the most different antigen by edit distance in the vaccine composition can be in the top 10%, 20%, 30%, or 40% of different antigens by edit distance in the library.


The average number of nodes between antigens can be calculated for a given vaccine. In some cases, the average nodes of nodes between each antigen in a set of antigens in a vaccine can be at least 1, at least 3, at least 5, at least 10, at least 15, or at least 20. In some cases, the average number of nodes between each of the antigens can be a function of the number of antigens in a set. Sometimes, the average number of nodes between each of the antigens can be lower if the number of antigens in the set is higher.


Antigens in a vaccine can be compared using the number of nodes separating them on a phylogenic tree. For example, closely related antigens can be separated by fewer nodes than less closely related antigens. When describing the number of nodes separating two antigens, antigens separated by fewer nodes can be more closely related or more similar in structure or in sequence, while antigens separated by more nodes can be less closely related or less similar in structure or in sequence. In some cases, for example, two antigens in a vaccine can be separated by at least 1, at least 3, at least 5, at least 10, at least 15, or at least 20 nodes.


The smallest number of nodes between any 2 antigens can be 1 in some vaccines. The largest number of nodes between any two antigens can be at least 5, at least 10, at least 15, or at least 20. In some vaccines, the smallest number of nodes between any 2 antigens can be 1 and the largest number of nodes between any two antigens can be at least 5. In some vaccines, the smallest number of nodes between any 2 antigens can be 1 and the largest number of nodes between any two antigens can be at least 10. In some vaccines, the smallest number of nodes between any 2 antigens can be 1 and the largest number of nodes between any two antigens can be at least 15. In some vaccines, the smallest number of nodes between any 2 antigens can be 1 and the largest number of nodes between any two antigens can be at least 20. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 5 and the largest number of nodes between any two antigens can be at least 5. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 5 and the largest number of nodes between any two antigens can be at least 10. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 5 and the largest number of nodes between any two antigens can be at least 15. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 5 and the largest number of nodes between any two antigens can be at least 20. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 10 and the largest number of nodes between any two antigens can be at least 10. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 10 and the largest number of nodes between any two antigens can be at least 15. In some vaccines, the smallest number of nodes between any 2 antigens can be less than 10 and the largest number of nodes between any two antigens can be at least 20.


The average number of branches between antigens can be calculated for a given vaccine. In some cases, the average number of branches between each antigen in a set of antigens in a vaccine can be at least 1, at least 3, at least 5, at least 10, at least 15, or at least 20. In some cases, the average number of branches between each of the antigens can be a function of the number of antigens in a set. Sometimes, the average number of branches between each of the antigens can be lower if the number of antigens in the set is higher.


Antigens in a vaccine can be compared using the number of branches separating them on a phylogenic tree. For example, closely related antigens can be separated by fewer branches than less closely related antigens. When describing the number of branches separating two antigens, antigens separated by fewer branches can be more closely related or more similar in structure or in sequence, while antigens separated by more branches can be less closely related or less similar in structure or in sequence. In some cases, for example, two antigens in a vaccine can be separated by at least 1, at least 3, at least 5, at least 10, at least 15, or at least 20 branches.


The smallest number of branches between any 2 antigens can be 1 in some vaccines. The largest number of branches between any two antigens can be at least 5, at least 10, at least 15, or at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 5. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 5. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 20.


A vaccine can be described by its sequence identity. As used herein, the terms “identical” or percent “identity,” in the context of two or more nucleic acids or polypeptide sequences, generally refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same.


Alternatively, an indication that two nucleic acid sequences or polypeptides are identical is that the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically identical to a second polypeptide, for example, where the two peptides differ only by conservative substitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described herein.


Alternatively, an indication that two nucleic acid sequences or polypeptides are identical is that the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically identical to a second polypeptide, for example, where the two peptides differ only by conservative substitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described herein.


Alternatively, an indication that two nucleic acid sequences or polypeptides are identical is that the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically identical to a second polypeptide, for example, where the two peptides differ only by conservative substitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described herein.


In some cases, each antigen in a set can share at least 90%, at least 95%, or at least 99% sequence identity to at least one antigen in the set. In some cases, sequence identity can be a measure of how similar or different the sequences of the antigens within the vaccine are.


The terms “polypeptide,” “oligopeptide,” “peptide,” and “protein” are used interchangeably herein and generally refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified naturally or by intervention; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation or modification, such as conjugation with a labeling component. Also included within the definition are, for example, polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino acids, etc.), as well as other suitable modifications. It is understood that, because the polypeptides of this disclosure are based upon an antibody, the polypeptides can occur as single chains or associated chains.


“Polynucleotide” or “nucleic acid,” as used interchangeably herein, generally refer to polymers of nucleotides of any length, and include DNA and RNA. The nucleotides can be deoxyribonucleotides, ribonucleotides, modified nucleotides or bases, and/or their analogs, or any substrate that can be incorporated into a polymer by DNA or RNA polymerase. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and their analogs. If present, modification to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component. Other types of modifications include, for example, “caps,” substitution of one or more of the naturally-occurring nucleotides with an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), those containing pendant moieties, such as, for example, proteins (e.g., nucleases, toxins, antibodies, signal peptides, ply-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucleotide(s).


Further, any of the hydroxyl groups ordinarily present in the sugars may be replaced, for example, by phosphonate groups, phosphate groups, protected by standard protecting groups, or activated to prepare additional linkages to additional nucleotides, or may be conjugated to solid supports. The 5′ and 3′ terminal OH can be phosphorylated or substituted with amines or organic capping group moieties of from 1 to 20 carbon atoms. Other hydroxyls may also be derivatized to standard protecting groups. Polynucleotides can also contain analogous forms of ribose or deoxyribose sugars including, for example, 2′-O-methyl-, 2′-O-allyl, 2′-fluoro- or 2′-azido-ribose, carbocyclic sugar analogs, alpha-anomeric sugars, epimeric sugars such as arabinose, xyloses, or lyxoses, pyranose sugars, furanose sugars, sedoheptuloses, acyclic analogs, and abasic nucleoside analogs such as methyl riboside. One or more phosphodiester linkages may be replaced by alternative linking groups. These alternative linking groups include, but are not limited to, embodiments wherein phosphate is replaced by P(O)S (“thioate”), P(S)S (“dithioate”), (O)NR2 (“amidate”), P(O)R, P(O)OR′, CO, or CH2 (“formacetal”), in which each R or R′ is independently H or substituted or unsubstituted alkyl (1-20 C) optionally containing an ether (—O—) linkage, aryl, alkenyl, cycloalkyl, cycloalkenyl, or araldyl. Not all linkages in a polynucleotide need be identical. The preceding description applies to all polynucleotides referred to herein, including RNA and DNA.


“Homology,” “identity,” or “similarity” can refer to sequence similarity between two peptides or between two nucleic acid molecules. Homology, similarity, and identity can each be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When an equivalent position in the compared sequences is occupied by the same base or amino acid, then the molecules are identical at that position; when the equivalent site occupied by the same or a similar amino acid residue (e.g., similar in steric and/or electronic nature), then the molecules can be referred to as homologous (similar) at that position. Expression as a percentage of homology/similarity or identity refers to a function of the number of identical or similar amino acids at positions shared by the compared sequences. A sequence which is “unrelated” or “non-homologous” may share less than 40% identity or less than 25% identity with a sequence of the present disclosure. In comparing two sequences, the absence of residues (amino acids or nucleic acids) or presence of extra residues also decreases the identity and homology/similarity.


The term “homology” describes a mathematically based comparison of sequence similarities which is used to identify genes or proteins with similar functions or motifs. The nucleic acid (nucleotide or oligonucleotide) and amino acid (protein) sequences of the present disclosure may be used as a “query sequence” to perform a search against public databases to, for example, identify other family members, related sequences or homologs. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to nucleic acid molecules of the disclosure. BLAST amino acid searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to protein molecules of the disclosure. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and BLAST) can be used (see, www.ncbi.nlm.nih.gov).


As used herein, “identity” can mean the percentage of identical nucleotide or amino acid residues at corresponding positions in two or more sequences when the sequences are aligned to maximize sequence matching, i.e., taking into account gaps and insertions. Identity can be readily calculated by known methods including, but not limited to, those described in Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48: 1073 (1988). Methods to determine identity are designed to give the largest match between the sequences tested. Moreover, methods to determine identity are codified in publicly available computer programs. Computer program methods to determine identity between two sequences include, but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S. F. et al., J. Molec. Biol. 215: 403-410 (1990) and Altschul et al. Nuc. Acids Res. 25: 3389-3402 (1997)). The BLAST X program is publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, Md. 20894; Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990). The well-known Smith Waterman algorithm may also be used to determine identity.


Ranges of desired degrees of sequence identity are from about 80% to about 100% and integer values therebetween. In general, this disclosure encompasses sequences with about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% sequence identity with any sequence provided herein.


The letter “X” as used in amino acid sequences herein is intended to indicate that any of the twenty standard amino acids may be placed at this position unless specifically noted otherwise.


III. Vaccines

A vaccine can be a composition that can stimulate the immune system. This can involve stimulation of the production of antibodies and can provide immunity against one or more microbes. A vaccine can be prepared from a microbe, which can be a causative agent of a disease, its products, or a synthetic substitute thereof. Examples of microbes are provided above.


A vaccine can be prepared as a pharmaceutical composition. A pharmaceutical composition can be a formulation of a vaccine, including antigens, in a form that can be administered to a subject. In some cases, a pharmaceutical composition can be ready to use. A pharmaceutical composition can be optimized for ease of use, minimization of adverse reactions, ease of storage, a range of storage temperatures, ease of dosing, ease of manufacturing, or any other suitable factor.


A pharmaceutical composition can comprise a vaccine composition as described herein and a pharmaceutically acceptable diluent, adjuvant, excipient, or any combination thereof. In some cases, the pharmaceutical composition can comprise a diluent. A diluent can be a diluting agent. A diluent can serve to make a vaccine less viscous or less dense, to improve its ability to flow, for example, through a needle. In some cases, a diluent can be pre-mixed with the vaccine. In some cases, a diluent can be provided separately from the vaccine and mixed prior to administration.


In some cases, the pharmaceutical composition can comprise an adjuvant. An adjuvant can provide for increased immunogenicity. An adjuvant can provide for slow release of antigen (e.g., the adjuvant can be a liposome) or it can be an adjuvant that is immunogenic in its own right, thereby functioning synergistically with antigens. For example, the adjuvant can be a known adjuvant or other substance that promotes nucleic acid uptake, recruits immune system cells to the site of administration, or facilitates the immune activation of responding lymphoid cells. Adjuvants include, but are not limited to, immunomodulatory molecules (e.g., cytokines), oil and water emulsions, aluminum hydroxide, glucan, dextran sulfate, iron oxide, sodium alginate, Bacto-Adjuvant, synthetic polymers such as poly amino acids and co-polymers of amino acids, saponin, paraffin oil, PS-GAMP and muramyl dipeptide.


In some cases, the adjuvant can be an immunomodulatory molecule. For example, the immunomodulatory molecule can be a recombinant protein cytokine, chemokine, or immunostimulatory agent or nucleic acids encoding cytokines, chemokines, or immunostimulatory agents designed to enhance the immunologic response.


A pharmaceutical composition as provided herein can be packaged in a virus-like particle (VLP). In some cases, a virus-like particle can comprise a vaccine.


In certain cases, the pharmaceutical composition can comprise an excipient. In various cases, an excipient can be an inactive substance that can serve as a vehicle or medium for the vaccine. An excipient can comprise a binder, a coating, a color, a disintegrant, a flavor, a glidant, a lubricant, a preservative, a sorbent, a sweetener, a vehicle, or any combination thereof.


A vaccine composition or pharmaceutical composition herein can be formulated in one of a number of formulations. In some cases, a vaccine can be in the form of an aerosol formulation, an injectable formulation, an oral formulation, or any other suitable formulation.


A vaccine composition as provided herein can be formulated as a unit dose. A unit dose can be the dose of a vaccine which is appropriate for one subject. In some examples, a vaccine can be packaged as a single unit dose. In some other examples, a vaccine can be packaged as multiple unit doses.


An immune response generated in response to a vaccine can depend on the total amount of antigen, rather than the amount of each antigen present. In some cases, each of the antigens can be at a concentration that alone would not provide a significant immune response to the broadly neutralizing antigen in a subject. Such an immune response can comprise recruiting and/or activation if immune cells such as B cells or T cells. In some cases, such an immune response can be a prophylactic immune response. In some cases, the antigens can have a combined concentration that provides an immune response to a broadly neutralizing antigen in the subject.


IV. Methods of Use

Provided herein are methods for treating or reducing adverse events due to infection and/or exposure to a microbe or toxin. Also provided herein are methods for inhibiting or reducing the likelihood of a disease or condition caused by a microbe or toxin in a subject. The methods can comprise administering to the subject a vaccine composition as described herein. In some cases, methods provided herein can confer immunity to a disease or condition caused by a microbe in a subject.


The terms “treatment” or “treating” are used interchangeably herein. These terms, as used herein, generally refer to an approach for obtaining beneficial or desired results including, but not limited to, a therapeutic benefit and/or a prophylactic benefit. A therapeutic benefit can mean eradication or amelioration of the underlying disorder being treated. Also, a therapeutic benefit can be achieved with the eradication or amelioration of one or more of the physiological symptoms associated with the underlying disorder such that an improvement is observed in the subject, notwithstanding that the subject may still be afflicted with the underlying disorder. A prophylactic effect includes delaying, preventing, or eliminating the appearance of a disease or condition, delaying or eliminating the onset of symptoms of a disease or condition, slowing, halting, or reversing the progression of a disease or condition, or any combination thereof. For prophylactic benefit, a subject at risk of developing a particular disease, or to a subject reporting one or more of the physiological symptoms of a disease may undergo treatment, even though a diagnosis of this disease may not have been made.


A subject administered a vaccine herein can be a mammal, a bird, or any other suitable subject. In some instances, the subject can be a human. In certain instances, the subject can be a pig, or another domesticated animal, for example, a cow, a goat, a sheep, a horse, a chicken, a rooster, a turkey, a parrot, a monkey, a hamster, a guinea pig, a rat, a mouse, a dog, or a cat.


Such a subject can receive the vaccine orally, as an aerosol, as an injection, or a combination thereof. In some cases, more than one dose of the vaccine may be given. In some cases, a subject can receive one, two, three, four, five, six, seven, eight, nine, ten or more doses of a vaccine. Repeated doses can be given on the same day, or they can be spaced by at least 1 day, at least 1 week, at least 1 month, at least 1 year, at least 2 years, at least 3 years, at least 4 years, at least 5 years, or at least 10 years. In some cases, repeated doses can be given as a booster dose, and can be the same dose, a greater dose, or a lesser dose than previous doses. A booster dose can be a primed boost.


Individual doses of a vaccine having repeated doses can comprise the same antigens or different antigens. If individual doses comprise different antigens, they can be different by at least one, two, three, four, five, six, seven, eight, nine, ten or more antigens. In some cases, if individual doses comprise different antigens, the difference can be to maximize exposure to as many different antigens as possible. In some cases, if individual doses comprise different antigens, the additional doses can be “update” doses, which can confer immunity to newly evolved, adapted, or resistant strains.


In certain embodiments, a vaccine which is to be injected can be formulated as a subcutaneous injection. In some cases, a vaccine to be injected can be formulated for intramuscular, intraperitoneal, intravenous, intradermal, or any other suitable type of injection. A vaccine can be administered to a subject who is healthy, a subject who is not healthy, or a subject of unknown health. In some cases, a vaccine can be more effective if the subject is healthy. A vaccine can comprise a recombinant expression vector. In some cases, a vaccine can be made by a recombinant expression vector. In various cases, a vaccine can be stored as a recombinant expression vector.


In some instances, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 25 and the largest pairwise edit distance between two of the antigens is at least 300; a set of antigens that are representative of at least 60% of all OTUs of the microbe; or a set of antigens that are representative of at least 60% of all OTUs of the microbe.


In certain instances, a recombinant expression vector can comprise a nucleic acid molecule encoding a recombinant expression vector comprising a nucleic acid molecule encoding: a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens, a set of antigens that are representative of at least 60% of all OTUs of the microbe; or a set of antigens that are representative of at least 60% of all OTUs of the microbe.


In various instances, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the first order clades of a microbe. In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the second order clades of a microbe. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the third order clades of a microbe.


In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 1%, no more than 5%, no more than 10%, no more than 15%, or no more than 20% of the size of the antigens. In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens. In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 1%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens. In various cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 10%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens. In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 15%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 20%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% of the size of the antigens.


In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 1%, at least 5%, at least 10%, at least 15%, or at least 20% of the size of the antigens. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens. In various cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 1%, and the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 5%, and the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens. In various cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 10%, and the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens. In some cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 15%, and the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens. In certain cases, a recombinant expression vector can comprise a nucleic acid molecule encoding a set of antigens wherein the smallest pairwise edit distance between two of the antigens is at least 20%, and the largest pairwise edit distance is no more than 60%, no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 98% of the size of the antigens.


A method for making a vaccine composition can comprise selecting a set of antigens that are representative of at least 60% of each of the first order clade and second order clades of a microbe, wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens, representative of at least 60% of all OTUs of the microbe, or representative of at least 90% of all OTUs of the microbe.


In some cases, a method for making a vaccine composition can comprise selecting a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the first order clades of a microbe. In certain cases, a method for making a vaccine composition can comprise selecting a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the second order clades of a microbe. In various cases, a method for making a vaccine composition can comprise selecting a set of antigens that are representative of at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% of each of the third order clades of a microbe.


In some cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 1%, no more than 5%, no more than 10%, no more than 15%, or no more than 20% of the size of the antigens. In certain cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens. In various cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 1%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens. In some cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens. In certain cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 10%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens. In various cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 15%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens. In some cases, a method for making a vaccine composition can comprise selecting a set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 20%, and the largest pairwise edit distance is at least 60%, at least 70%, at least 80%, or at least 90% of the size of the antigens.


V. Examples

Vaccines were developed that can, for example, differ from (and/or provide advantages relative to) typical commercial vaccines in at least one of several ways: (1) the inclusion of numerous (e.g., six or more) antigens, for example, 30 antigens, (2) a relatively low amount of each individual antigen, for example, 0.05 μg of each antigen, which can allow for a relatively low amount of total antigen content, and (3) the breadth of immunogenic response elicited by the vaccine against shared sites on the antigens, for example, each of 30 antigens is at 0.05 μg, but shared sites are effectively at a 30× higher dose of 1.5 μg.


For the following examples, flu vaccine compositions were administered to pigs. Several flu vaccine compositions were used. The vehicle comprised squalene and phosphate buffered saline for all compositions. A vehicle control composition comprised squalene and phosphate buffered saline. A bivalent control flu vaccine, designed to mimic commercially available flu vaccines, comprised two flu antigen variants: H1N1_2007 (A/Brisbane/2007) (5 μg) and either H3N2_1997 (A/Sydney/5/1997) or H3N2_2007 (A/Brisbane/2007) (5 μg). A first single antigen flu vaccine composition comprised 50 ng of a first hemagglutinin variant (H1N1_2007; A/Brisbane/2007). A second single antigen flu vaccine composition comprised 50 ng of a second hemagglutinin variant (H3N2_2007; A/Brisbane/2007). A 30-antigen flu vaccine composition comprised 30 flu antigens (30 hemagglutinin variants), representing flu antigens from between 1918 and 2015, selected as described in Example 4. A 27-antigen flu vaccine composition comprised 27 flu antigens (27 hemagglutinin variants), representing flu antigens from between 1918 and 2008. The 30-antigen flu vaccine composition was also employed for general vaccine composition testing. The 27-antigen flu vaccine was employed to test the ability of the vaccine to protect against “future” strains, which might be described as the ability of the vaccine to provide protection against antigen variants which do not exist yet, but may exist in the future.


The following illustrative examples are representative of embodiments of the compositions and methods described herein and are not meant to be limiting in any way.


Example 1: Quantifying the Number of Unique Antibodies Elicited after Vaccination

Adult humans can comprise a B cell receptor (BCR) repertoire of about 100 million unique BCRs. After influenza vaccination, about 1,000 unique BCRs can be elicited to begin affinity maturation. Serological analysis indicates that about 10% of these can efficiently convert to productive plasma cells. This can result in a flu shot that can provide serological protection with about 100 unique antibodies. This concept is illustrated in FIG. 4.


Example 2: Estimating the Number of Unique B Cell Epitopes on Hemagglutinin

Human antibodies can contact about 20 to 30 residues on the antigen surface they recognize. In some cases, about 12 to 16 such residues can form an epitope that can be referred to as a “potential critical epitope.” In some cases, such an epitope can be consequential for maintaining recognition of an antigen by an antibody.


Given roughly 100 unique antibodies from about 1000 B cell lineages elicited by vaccination it was asked: (1) how many unique epitopes existed on the surface of the influenza hemagglutinin protein (HA), (2) given the conservation of those epitopes, what was the probability of an individual antibody to be broadly cross-reactive, and (3) across the roughly 100 unique antibodies, the probability and percentage of a subject's vaccine response to be contain broadly protective antibodies.


In order to generate a database of unique potential critical epitopes on the surface of the influenza hemagglutinin protein (HA), deep computational protein-protein docking of 559 crystallized human antibody structures against a representative HA structure (PDB ID: 3FKU) using Zdock was performed. This resulted in 3,718,346 unique contact residue sets, each containing 25+/−12 HA contact residues. Repeat random subsampling of 14 contact residues from the unique contact residue sets on the HA that could form a potential critical epitope was then performed, resulting in a database of 263,022,674 unique 14-residue potential critical epitopes that could define the binding determinant of a single antibody. While not exhaustive, the database provides the basis for modeling epitope conservation probability of human antibodies binding HAs. This concept is illustrated in FIG. 4.


Example 3: Quantifying Amino Acid Conservation of all Hemagglutinin Epitopes

The conservation of the 263 million potential critical epitopes across a panel of 82 HA protein sequences from strains spanning the years 1918-2018 was analyzed. The analysis identified less than one in one million potential critical epitopes that are universally conserved across all HAs. About one in 300,000 potential critical epitopes were observed to be 90% conserved across all strains, while 1% and 3% were observed to be universally conserved within a given HA antigen group, as shown in FIG. 5. Potential critical epitopes that were 90% conserved in H1 or H3 antigen groups were less common (6.9% and 5.8% for H1 and H3, respectively). This can suggest that in some cases, non-broadly conserved epitopes can be immunodominant by virtue of vastly outnumbering broadly conserved epitopes.


The model suggested that truly universally conserved epitope binding antibodies may be too rare to occur in most immune systems, and that broad antibodies recognizing conserved epitopes within a strain can occur, but may represent a minority of responses, and can be unlikely to exist at a concentration sufficient to mediate sufficient protection in following years. For example, given 100 responding antibodies, most subjects are not predicted to elicit antibodies against broadly conserved epitopes. For subjects that can or might be able to elicit antibodies against broadly conserved epitopes given 100 responding antibodies, in most cases that antibody can make up about 1% of their serum response. In such cases, antigen drift between seasons can be enough to render the remaining 99% of the antibody titers of the subject obsolete or nearly obsolete. Combined with natural titer drops that occur over time, such subjects can be eventually left unprotected.


This model was consistent with observations that seasonal variation of about 10%-15% of amino acid positions can be sufficient to disrupt the majority of epitopes against the hemagglutinin, and can lead to the requirement of continual vaccination by current methods.


Further, immunizing with just hemagglutinin stems could provide some benefit, but seasonal variation within stems can continue to render the majority of epitopes obsolete. Variations of the immunogen, including consensus hemagglutinin variants, can provide benefit which can be limited. Enhanced adjuvants would likely not solve the breadth challenge, as they can increase the total number of responding antibodies, and may be able to alter the fundamental critical epitope conservation probability distribution.


While conserved epitopes exist within the stem and head of hemagglutinin, their proportions can be too low to elicit a dominant proportion of an immune response by most subjects, including most humans or most pigs, and there can be a low probability that this can be solved through optimization of the sequence of the immunogen or enhancements of an adjuvant.


Example 4: Generation of Flu Vaccine Compositions

At present, commercial vaccines generally deliver between about 10-50 μg of total antigen. Such commercial vaccines often include only two antigens (a bivalent vaccine), three antigens (a trivalent vaccine), or four antigens (a quadrivalent vaccine).


To identify an optimally dispersed population of hemagglutinin variants, a method was developed to take as input a large database of homologous variants of a target antigen, and output a stochastically generated subset of optimally dispersed representatives based on amino acid percent identity. Herein, the input was a large database of homologous variants of a hemagglutinin antigen for influenza, and the output was a subset of the antigens which were representative of the input group of antigens as described herein. In this example, two distinct vaccine compositions were designed for administration to subjects.


This method was repeatedly applied for 1000 stochastic iterations to 8600 hemagglutinin variants spanning the time period from 1918 to 2015 to output a set of 30 representative flu antigen variants. These variants comprise the 30-antigen vaccine composition, and are detailed in Table 2. Antigens were purchased from Sino Biological, Inc. (Beijing, China).









TABLE 2







30-antigen vaccine composition











Strain
Year
Strain Info







H1N1
1998
A/swine/Belgium/1998



H1N1
1934
A/Puerto Rico/1934



H1N1
2006
A/Solomon Islands/2006



H1N1
1995
A/Beijing/1995



H1N1
1977
A/USSR/1977



H1N1
1976
A/New Jersey/1976



H1N1
2007
A/Brisbane/2007



H1N1
1933
A/WSN/1933



H1N1
2009
A/California/2009



H2N2
1957
A/Guiyang/1957



H2N2
2005
A/Canada/2005



H3N2
2002
A/Fujian/2002



H3N2
1997
A/Sydney/1997



H3N2
2007
A/Brisbane/2007



H3N2
2004
A/California/2004



H3N2
2014
A/Missouri/2014



H3N2
1995
A/Nanchang/1995



H5N1
1997
A/Hong Kong/1997



H5N1
2008
A/chicken/VietNam/2008



H5N1
2010
A/barnswallow/Hong Kong/2010



H5N1
2004
A/Vietnam/2004



H7N7
1975
A/equine/Kentucky/1975



H7N7
2003
A/Netherlands/2003



H7N9
2013
A/Zhejiang/2013



B
1988
B/Yamagata/1988



B
2012
B/Utah/2012



B
2008
B/Brisbane/2008



H1N1
1918
A/New York/1918



H3N2
1968
A/Hong Kong/1968



H5N1
2009
A/Egypt/2009










In some examples, a 27-antigen vaccine composition, designed to be representative of antigen variants spanning the time period from 1918 to 2008, was used. Most of the antigens of this 27-antigen vaccine composition are also included in the 30-antigen vaccine composition. This vaccine composition can be used to determine efficacy of such a composition having antigens representative of an epitope of a microbe against “future antigens,” and includes the antigens in Table 3. Antigens were purchased from Sino Biological, Inc. (Beijing, China).









TABLE 3







27-antigen vaccine composition











Strain
Year
Strain Info







H1N1
1998
A/swine/Belgium/1998



H1N1
1934
A/Puerto Rico/1934



H1N1
2006
A/Solomon Islands/2006



H1N1
1995
A/Beijing/1995



H1N1
1977
A/USSR/1977



H1N1
1976
A/New Jersey/1976



H1N1
2007
A/Brisbane/2007



H1N1
1933
A/WSN/1933



H2N2
1957
A/Guiyang/1957



H2N2
2005
A/Canada/2005



H3N2
2002
A/Fujian/2002



H3N2
1997
A/Sydney/1997



H3N2
2007
A/Brisbane/2007



H3N2
2004
A/California/2004



H3N2
1995
A/Nanchang/1995



H5N1
1997
A/Hong Kong/1997



H5N1
2008
A/chicken/VietNam/2008



H5N1
2004
A/Vietnam/2004



H7N7
1975
A/equine/Kentucky/1975



H7N7
2003
A/Netherlands/2003



H1N1
1918
A/New York/1918



H3N2
1968
A/Hong Kong/1968



H1N1
1991
A/Texas/36/1991



H1N1
1986
A/Tawain/01/1986



H1N3
1976
A/duck/NZL/1976



H3N2
2003
A/Wyoming/03/2003



H6N2
2000
A/duck/Shantou/2000










In some examples, a vaccine composition comprising 73 flu hemagglutinin antigens from human, swine, and unknown hosts from 1918 through 2018 was used. Antigens in this composition are provided in Table 4. Antigens were purchased from Sino Biological, Inc. (Beijing, China). The amino acid sequences for these antigens are SEQ ID NOs:1-87, as listed in Table 6.









TABLE 4







Flu hemagglutinin antigen vaccine composition










Subtype
Host
Year
Strain Info





H1N1
human
1918
A/New.York/1/1918


H1N1
human
1933
A/WSN/1933


H1N1
human
1934
A/Puerto.Rico/8/1934


H1N1
swine
1935
A/swine/Ohio/23/1935


H1N1
swine
1939
A/swine/Cambridge/1939.1939


H1N1
human
1945
A/Huston/1945


H1N1
swine
1946
A/H1N1-1946-ACV49556-/Melbourne/1/





1946


H1N1
human
1957
A/Denver/1957


H1N1
human
1976
A/New.Jersey/8/1976


H1N1
human
1977
A/USSR/90/1977


H1N1
human
1986
A/Tawain/01/1986


H1N1
human
1991
A/Texas/36/1991


H1N1
swine
1995
A/swine/OMS/2112/1995.1995


H1N1
human
1995
A/Beijing/262/1995


H1N1
swine
1996
A/swine/Eire/89/1996.1996


H1N1
human
1999
A/Taiwan/4845/1999.1999


H1N1
swine
2000
H1N1-2000-swine-AAL87868-/





Swine/Minnesota/55551/2000


H1N1
swine
2004
A/swine/Saitama/21/2004.2004


H1N1
swine
2004
A/swine/Kansas/00246/2004


H1N1
swine
2004
A/swine/Denmark/12813-1/2004


H1N1
swine
2014
A/swine/Saskatchewan/SD0056/2014


H1N1
human
2005
A/Iowa/CEID23/2005.2005


H1N1
human
2005
A/Thailand/271/2005.2005/07


H1N1
swine
2005
A/swine/Tainan/46-4/2005


H1N1
swine
2005
A/swine/England/383/2005


H1N1
swine
2006
A/swine/Canada/01093/2006.2006


H1N1
human
2006
A/Solomon.Islands/3/2006


H1N1
swine
2006
A/swine/North Carolina/01169/2006


H1N1
swine
2006
A/swine/Miyazaki/1/2006


H1N1
human
2007
A/Brisbane/59/2007


H1N1
swine
2007
A/swine/England/267/2007


H1N1
swine
2009
A/swine/Korea/VDS3/2009.2009/09


H1N1
human
2009
A/California/07/2009


H1N1
swine
2010
A01049060/2010.2010/11


H1N1
human
2011
A/Wisconsin/28/2011.2011/12


H1N1
swine
2011
A/swine/Hong.Kong/4083/2011.2011/10


H1N1
human
2012
A/Minnesota/14/2012.2012/12


H1N1
swine
2012
A/swine/Belgium/Oostkamp-26/2012.2012/





01


H1N1
swine
2012
A/swine/England/7856/2012.2012/02


H1N1
swine
2012
A/swine/England/038712/2012.2012/12


H1N1
swine
2013
AVX47/2013.2013/01


H1N1
human
2016
A/Pavia/65/2016.2016/10


H1N1
human
2017
A/Florida/04/2017


H3N2
human
1968
A/Hong.Kong/1/1968


H3N2
human
1985
A/Guildford/V728/1985


H3N2
swine
1995
A/swine/England/704563/1995.1995/08


H3N2
human
1995
A/Nanchang/933/1995


H3N2
human
1997
A/Sydney/5/1997


H3N2
swine
1997
A/swine/England/90591/1997


H3N2
swine
2001
A/swine/Spain/33601/2001


H3N2
human
2002
A/Fujian/411/2002


H3N2
human
2003
A/Wyoming/03/2003


H3N2
human
2004
A/California/7/2004


H3N2
swine
2004
A/swine/Taiwan/0408/2004


H3N2
human
2005
A/Mexico/InDRE2250/2005


H3N2
human
2007
A/Brisbane/10/2007


H3N2
human
2008
A/Ohio/14/2008


H3N2
swine
2010
A/swine/Denmark/101501-1/2010.2010/08


H3N2
human
2011
A/Delaware/05/2011


H3N2
swine
2012
A01203748/2012.2012/08


H3N2
swine
2012
A/swine/Thailand/CU-P53/2012.2012/01


H3N2
swine
2012
AVX13/2012.2012/01


H3N2
swine
2013
A/swine/Miyazaki/2/2013.2013


H3N2
swine
2017
A01812268/2017.2017/08


H3N2
human
2018
A/Indiana/11/2018


H5N1
unknown
1997
A/Hong.Kong/483/1997


H5N1
unknown
2001
A/Hong.Kong/378.1/2001


H5N1
unknown
2003
A/Beijing/01/2003


H5N1
unknown
2004
A/Vietnam/1194/2004


H5N1
unknown
2006
A/China/2006


H5N1
unknown
2007
A/Egypt/1394-NAMRU3/2007


H5N1
unknown
2008
A/chicken/VietNam/NCVD-016/2008


H5N1
unknown
2008
A/Bangladesh/207095/2008


H5N1
unknown
2013
A/Cambodia/X0123311/2013


H5N1
unknown
2015
A/Egypt/N0001/2015


HAB
human
1940
B/Lee/40


HAB
human
1959
B/Maryland/1959


HAB
human
1973
B/Hong Kong/8/1973


HAB
human
1986
B/Ann Arbor/1/1986


HAB
human
1992
B/Oita/15/1992


HAB
human
2001
B/Hong Kong/22/2001


HAB
human
2002
B/Sydney/3/2002


HAB
human
2004
B/Hong Kong/CUHK50947/2004


HAB
human
2006
B/Kol/583/2006


HAB
human
2007
B/Kol/1234/2007


HAB
human
2007
B/Kol/1013/2007


HAB
human
2008
B/Kol/1373/2008









In some examples, a vaccine composition comprising 28 flu neuraminidase antigens from human, swine, and unknown hosts from 1930 through 2019 was used. Antigens in this composition are provided in Table 5. Antigens were purchased from Sino Biological, Inc. (Beijing, China). The amino acid sequences for these antigens are SEQ ID NOs:88-127, as listed in Table 7.









TABLE 5







Flu neuraminidase antigen vaccine composition










Strain
Host
Year
Strain Info





H1N1
swine
1930
A/swine/Iowa/15/1930(H1N1)


H1N1
human
1936
A/Henry/1936(H1N1)


H1N1
human
1943
A/Iowa/1943(H1N1)


H1N1
human
1982
A/Baylor/11515/1982(H1N1)


H1N1
swine
1993
A/swine/Denmark/19216B/1993(H1N1)


H1N1
human
1995
A/quail/Nanchang/12-340/2000(H1N1)


H1N1
human
1996
A/Auckland/6/1996(H1N1)


H1N1
human
2001
A/Auckland/606/2001(H1N1)


H1N1
swine
2006
A/swine/Canada/01093/2006(H1N1)


H1N1
human
2008
A/Arizona/13/2008(H1N1)


H1N1
swine
2012
A/swine/Guangxi/NS2176/2012(H1N1)


H1N1
swine
2016
A/swine/Alberta/SD0154/2016(H1N1)


H1N1
human
2017
A/Idaho/01/2017(H1N1)


H1N1
swine
2018
A/Sw/Bulnes/VN1401-P6SP/2018


H3N2
human
1968
A/Albany/18/1968(H3N2)


H3N2
human
1969
A/Bilthoven/17938/1969(H3N2)


H3N2
human
1971
A/Hong Kong/107/1971(H3N2)


H3N2
swine
1985
A/swine/Italy/526/1985(H3N2)


H3N2
human
1992
A/Amsterdam/4112/1992(H3N2)


H3N2
human
1998
A/Hong Kong/CUHK19579/1998(H3N2)


H3N2
swine
2000
A/Swine/Indiana/P12439/00 (H1N2)


H3N2
human
2005
A/Australia/NHRC0005/2005(H3N2)


H3N2
swine
2007
A/Swine/Spain/SF12091/2007(H1N2)


H3N2
swine
2009
A/swine/Italy/191985/2009(H1N2)


H3N2
swine
2010
A/swine/Papenburg/IDT12653/2010(H1N2)


H3N2
swine
2016
A/swine/North Carolina/A01668056/





2016(H1N2)


H3N2
swine
2017
A/swine/Alberta/SD0217/2017


H3N2
human
2019
A/swine/China/JG20/2019


HAB-NA
human
1936
HAB-NA-AHZ37009-/B/Nicaragua/AGB2-





36


HAB-NA
human
1940
HAB-NA-AAA43749-/B/Lee/1940


HAB-NA
human
1959
HAB-NA-AAA43735-/B/Maryland/59


HAB-NA
human
1973
HAB-NA-BAA03412-/B/Kanagawa/73


HAB-NA
human
1994
HAB-NA-AAU94767-/B/Nanchang/560/94


HAB-NA
human
1998
HAB-NA-ABQ53764-/B/Siriraj/09/98


HAB-NA
human
2001
HAB-NA-AAN39781-/B/Brazil/952/2001


HAB-NA
human
2005
HAB-NA-AGZ60070-/B/Christchurch/38/





2005


HAB-NA
human
2006
HAB-NA-AEG20998-/B/Kol/515/2006


HAB-NA
human
2006
HAB-NA-AEA51357-/B/Kol/583/2006


HAB-NA
human
2006
HAB-NA-AEA51355-/B/Kol/542/2006


HAB-NA
human
2007
HAB-NA-ABX71686-/B/Guangzhou/01/





2007









Additional amino acid sequences for antigens that can be included in a vaccine are provided as SEQ ID NOs:128-171, as listed in Table 8.


Example 5: Hemagglutinin Inhibition Assay

A hemagglutinin inhibition assay can be used to measure an immune response conferred by a vaccine. A sample protocol for a hemagglutinin inhibition assay can begin with the preparation of a serial dilution of a virus, and can include the preparation of at least one negative control or positive control sample, or both a negative control sample and a positive control sample. Antibodies which might be against a virus, for example, in a serum sample, can be incubated with the virus. Red blood cells can then be added to each sample and incubated, and hemagglutinin can be observed. In some cases, the absence of a button can be considered a positive response. In some cases, a titer can be calculated and reported.


Example 6: Influenza Neutralization Assay

A sample protocol for an influenza neutralization assay can begin with the preparation of a serial dilution of an antibody or a solution comprising an antibody, for example, serum. Provided next is a sample method for producing such a serial dilution. The antibody dilutions can be prepared using a virus diluent. A virus diluent can be added to wells of a microtiter plate, such as a 96-well plate. In some cases, 50 μL of virus diluent can be added in each well. Then, 50 μL of the antibody or solution comprising an antibody can be added to the first well. In some cases, this can be a 1 mg/mL stock solution. In some cases, this can be serum. In some cases, this can be diluted serum. 2-fold serial dilutions can be accomplished by transferring 50 μL from the first well to each successive well to accomplish dilutions ranging from 1:1 to 1:128 (antibody solution: diluent). In addition, negative control wells comprising virus diluent but no antibody or solution comprising an antibody can be prepared. The last 50 μL may be discarded. In some cases, this can be done in replicates, for example, of 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


The plate can be covered and placed in an incubator. In some cases, the conditions can be about 37° C. with about 5% CO2. In some cases, the conditions can be exactly 37° C. with about 5% CO2. In some cases, the conditions can be about 37° C. with exactly 5% CO2. In some cases, the conditions can be exactly 37° C. with exactly 5% CO2. The plate can be held in the incubator during preparation of the diluted virus. In some cases, pH should be maintained to avoid deleterious pH effects on the virus when it is added.


The virus can be diluted in virus diluent to a working dilution, which can be 100 TCID50/50 μL. TCID50 can be the 50% tissue culture infective dose, and 100 TCID50 can be equal to 100 times the value of TCID50.


50 μL of diluted virus can be added to wells containing antibodies and control wells having no antibodies. In some cases, additional control wells having antibody or solution containing antibody, but no virus, can be prepared. In some cases, a row of wells can be reserved for virus back titration. The plate can be mixed gently, and virus diluent can be added to all wells to bring each well to an equal volume. A back titration can be performed using a standard protocol.


The plate can be covered and again placed in the incubator as described above for 1 hour. After 1 hour, 100 μL of diluted MDCK cells (which should be 70-95% monolayer and low passage<30) can be added to each well of microtiter plates. Each well can contain 1.5×104 cells.


The plate can be covered and again placed in the incubator as described above for 18-20 hours. After incubation, the liquid can be removed from wells, and wells can be washed, for example, using 200 μL phosphate buffered saline (PBS). 100 μL of cold 80% acetone can be added to each well, and the plate can be incubated at room temperature for 10-12 minutes for fixation. Acetone can be then removed from the wells, and the plate can be dried, for example, by air drying for about 10 minutes or until dry.


An influenza enzyme linked immunosorbent assay (ELISA) can then be performed using a standard ELISA protocol to determine the amount of virus which has been neutralized. In some cases, an influenza neutralization assay can be performed with the serum of a subject which has been inoculated with a vaccine composition developed according to this disclosure. In some cases, such an influenza neutralization assay can be performed using pig serum, human serum, or the serum of any mammal. In some cases, the serum can comprise antibodies which are head specific antibodies, stem specific antibodies, broadly neutralizing antibodies, universal antibodies, 90% universal antibodies, broadly strain specific antibodies, and/or 90% strain specific antibodies


In an influenza neutralization assay, control samples from a subject vaccinated with a control vaccine can be included. In such cases, the control vaccine can be a monovalent vaccine, a bivalent vaccine, a trivalent vaccine, or a quadrivalent vaccine. When the control vaccine is a trivalent vaccine, it can comprise antigens from the H1N1 strain, the H3N2 strain, and the HAB strain. In some cases, the minimal neutralizing dilution can be at least 2-fold greater using the developed vaccine than using the control vaccine. In some cases, the minimal neutralizing dilution can be at least 2-fold greater using the developed vaccine than using the control vaccine comprising H1N1, H3N2, and HAB antigens when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 2-fold greater using the developed vaccine than using the control quadrivalent vaccine when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 10-fold greater using the developed vaccine than using the control vaccine. In some cases, the minimal neutralizing dilution can be at least 10-fold greater using the developed vaccine than using the control vaccine comprising H1N1, H3N2, and HAB antigens when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 10-fold greater using the developed vaccine than using the control quadrivalent vaccine when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 100-fold greater using the developed vaccine than using the control vaccine. In some cases, the minimal neutralizing dilution can be at least 100-fold greater using the developed vaccine than using the control vaccine comprising H1N1, H3N2, and HAB antigens when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 100-fold greater using the developed vaccine than using the control quadrivalent vaccine when tested using a hemagglutinin inhibition assay or influenza neutralization. In certain cases, the minimal neutralizing dilution can be at least 1,000-fold greater using the developed vaccine than using the control vaccine. In various cases, the minimal neutralizing dilution can be at least 1,000-fold greater using the developed vaccine than using the control vaccine comprising H1N1, H3N2, and HAB antigens when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 1,000-fold greater using the developed vaccine than using the control quadrivalent vaccine when tested using a hemagglutinin inhibition assay or influenza neutralization. In certain cases, the minimal neutralizing dilution can be at least 10,000-fold greater using the developed vaccine than using the control vaccine. In various cases, the minimal neutralizing dilution can be at least 10,000-fold greater using the developed vaccine than using the control vaccine comprising H1N1, H3N2, and HAB antigens when tested using a hemagglutinin inhibition assay or influenza neutralization. In some cases, the minimal neutralizing dilution can be at least 10,000-fold greater using the developed vaccine than using the control quadrivalent vaccine when tested using a hemagglutinin inhibition assay or influenza neutralization. In certain cases, the improvement in minimal neutralizing dilution can be for one antigen, for more than one antigen, or for all antigens tested. In various cases, the improvement in minimal neutralizing dilution can be for one antigen, for more than one antigen, or for all antigens tested.


In some cases, the developed vaccine can neutralize more strains than the control vaccine. For example, the developed vaccine can neutralize at least one more strain than the control vaccine. In certain cases, the developed vaccine can neutralize at least 2, at least 5, at least 10, at least 25, at least 50, or at least 100 more strains than the control vaccine. In various cases, the developed vaccine can neutralize at least 2 times, at least 3 times, at least 4 times, at least 5 times, at least 10 times, at least 50 times, at least 100 times, at least 500 times, at least 1,000 times, at least 5,000 times, or at least 10,000 times more strains than the control vaccine.


Example 7: ELISA Assay

An ELISA can be used to detect antibodies present in serum directed against an antigen such as a flu antigen. In some embodiments, multiple different antigens (e.g., hemagglutinin variants from various strains) are investigated. Briefly, a hemagglutinin antigen variant can be immobilized in a well of a 96-well plate. In some cases, a dilution series can be prepared. Negative control wells comprising no antigen can be prepared.


A solution comprising serum suspected of containing hemagglutinin can be incubated in the wells comprising a hemagglutinin antigen variant or control wells. After an incubation period, the solution comprising serum can be washed away. In some cases, if an antibody or antibodies are present in the serum which can react with the hemagglutinin, then antibody will remain bound to the plate.


A secondary antibody raised against antibodies of the animal from which the serum originated can be applied to the wells. After an incubation period, the secondary antibody can be washed away. Should there be an interaction between the hemagglutinin and an antibody in the serum, secondary antibody can label the hemagglutinin-antibody complex.


The secondary antibody can be labeled by any acceptable means, and signal can be detected in wells where secondary antibody is present. In some cases, signal can correspond to the amount of secondary antibody present. In some cases, signal can correspond to the amount of antibody of serum-origin present. In some cases, signal can correspond to the amount of hemagglutinin-antibody interaction in a well.


Example 8: Increasing Average Epitope Breadth Through Conservation-Concentration Coupling

It was tested whether a vaccination strategy that shifts the probability distribution of epitope conservation in vivo to enrich vaccine-elicited antibodies towards conserved epitopes can be designed.


Without being bound by any one particular theory, the hypothesis was that it can be possible to selectively reward B cells that can recognize, and can affinity mature towards broadly conserved epitopes by coupling the conservation of these epitopes to their concentration in the vaccine formulation, known as Conservation-Concentration Coupling or C3. By administering a mixture of 30 diverse variants of HA, each individually at a dose which is not sufficient to elicit an effective strain-specific immune response, the dominant population of single variant epitopes can be effectively removed from the distribution. Thus, this would selectively favor B cells that recognize shared epitopes across components, with a broadly reactive B cell receiving up to a 30-fold higher dose than a strain specific B cell.



FIG. 6 schematically illustrates how C3 works. B cells that carry a receptor that targets a conserved epitope receive a higher relative load of antigen, and thus necessary stimulus to differentiate into plasma cells that can produce antibodies. Other B cells with a receptor that can recognize a strain specific epitope can receive a lower relative load of antigens, which can be below the threshold necessary to initiate an immune response.


Example 9: Study Design—Conserved Epitope Focusing with Conservation-Concentration Coupling (C3)

Three in vivo experiments were designed using domesticated outbred pigs (Sus scrofa), a model organism and a veterinary target organism for influenza A infection and vaccination. For the first in vivo study, 35 pigs were split into 5 cohorts with 7 pigs per cohort. Each vaccine (C3) formulation comprised a diverse set of 30 HA variants (H1, H2, H3, H5, and H7 spanning 1918-2014). Cohorts of pigs were inoculated with either a bivalent formulation (BIV, 2 HA antigens H1N1 2007 and H3N2 1997 at 5,000 ng/antigen), C3-500 (30 HA antigens at 500 ng/antigen), C3-100 (30 HA antigens at 100 ng/antigen), C3-50 (30 HA antigens at 50 ng/antigen), or a vehicle control (PBS+squalene). The animals were inoculated 3 times, 3 weeks apart, with sera collection 3-4 weeks after each vaccination (Days 29, 50, and 71). RNA was isolated from PBMCs collected 7 days after the third vaccination (Day 50).


In the second and third in vivo studies, all HA representatives after 2008 were removed in order to create a “pre-2009” C3 formulation for examining serological response to de facto future viral strains and HAs including pandemic H1N1 2009 through H3N2 2018. A 2008 seasonal bivalent control was similarly formulated, using H1N1 2007 and H3N2 2007.


For the second in vivo study, 20 pigs were separated into 5 cohorts with 4 pigs per cohort. Cohorts were each inoculated with a formulation comprising either 2 HA antigens at 5,000 ng/antigen (BIV), 27 HA antigens at 50 ng/antigen (C3-50), H1N1 2007 at 50 ng (single low-dose H1), H3N2 2007 at 50 ng (single low-dose H3), or PBS+squalene (vehicle control). Cohorts for single HAs administered at low dose (50 ng) were included to validate a working Lthresh (stochiometric limit below which a given antigen does not induce an immune response) in pigs. The animals were each inoculated six times, 21 days apart. Additionally, 3 cohorts (BIV, C3-50, and vehicle control) received a seventh inoculation 9 weeks after sixth inoculation. For C3-50, the seventh inoculation comprised a high-dose boost (27 HA at 500 ng/antigen). Sera were collected immediately prior to each inoculation on the same day. Additional sera were collected 4 weeks after the sixth inoculation (Day 135) and 4 weeks after the seventh inoculation (Day 195). RNA was isolated from peripheral blood mononuclear cells collected 7 days after the sixth inoculation (Day 112).


For the third in vivo study, 25 pigs were separated into five cohorts with five pigs per cohort. Cohorts were each inoculated with a formulation comprising either 5,000 ng/antigen+squalene (BIV), C3-50+squalene, C3-50+alum, C3-50+TLR, or a vehicle control (PBS+squalene+alum+MPLA+imiquimod. The animals received three inoculations each 21 days apart, with sera collection immediately prior to each inoculation on the same day. Additional sera were collected four weeks after the third inoculation (Day 70). All 3 C3-50 cohorts received 28 HA antigens at 50 ng/antigen for the first two inoculations, and 500 ng/antigen for the third inoculation.


Example 10: Swarm Effect

To support the theory of concentration conservation coupling, Lthresh was determined in vivo. First, it was determined that a 50 ng dose of HA is under the threshold of immune activation by administering a single antigen, either H1N1 2007 or H3N2 2007, at a dose of 50 ng. Six total doses were administered to pigs 21 days apart (corresponding to the second study in the previous example). In both cases, a sera response was not detected by ELISA to either antigen 28 days after the sixth inoculation, as illustrated in FIG. 7.


However, when the two antigens are administered in combination with 25 other HAs spanning 1918-2008, also each at 50 ng (C3-50), a sera response to both antigens was detected. Additionally, a C3-50 inoculation resulted in sera responses to other HAs in C3 and a panel of “future” post-2008 strains not included in the formulation, including H1N1, H3N2, H5N1, and H7N9, spanning 2009-2013. Thus, a low-dose ensemble can elicit an immune response even though individual components can be below their threshold dose.


In this case, in a pool of 27 antigens at individually sub-physiological doses, shared epitopes can be at up to a 27-fold higher dose, and therefore can be successfully targeted and can result in preferentially broad responses that can cross to heterologous strains.



FIG. 7 illustrates ELISA signals (% max OD450 nm) of individual pig sera to 10 HA antigens (H1, H3, H5, and H7 spanning 1934-2013) across immunization cohorts (each n=4) of C3-50 (27 HAs at 50 ng/HA), single low-dose (50 ng) H1N1 2007 antigen, single low-dose (50 ng) H3N2 2007 antigen, and vehicle control (PBS+squalene). Box and whiskers plots show median and IQR; whiskers show 1.5×IQR of the lower and upper quartiles. Points represent outliers. Antigens shown in the dashed line box indicate HAs present in the 1918-2008 C3 vaccine formulation.


Example 11: Conservation-Concentration Coupling Responses are Primarily Subtype-Specific and Require a Minimum Total Subtype Antigen Dose

The analysis of C3-elicted binding response was separated by HA subtype. The C3-50 formulation the first study described in Example 9 contained 10 H1s, 7 H3s, 5 H5s, 3 H7s, and 2 H2s, each individually at 50 ng. Serum responses were correlated to total subtype dose, with H1 and H3 providing the strongest responses, H5 providing a weaker response, and H2 and H7 appearing poorly responded to by C3-50. These results support predictions that responses in one subtype do not necessarily elevate responses to other subtypes. Thus, in some cases broad intra-subtype responses can be much more common than fully universal antibodies. The data suggests a potential minimum clade dose of 7-10 members (i.e., 350-500 ng of the most conserved epitopes) for best effect.


Data illustrating this experiment is illustrated in FIG. 8. ELISA signals (% max OD450 nm) from pigs of the C3-50 cohort (study #1) are shown separated into the different HA antigen classes present in the C3-50 formulation (H1, H2, H3, H5, and H7). The X-axis shows the number of HAs per class. Sera were collected 28 days after the third vaccination (Day 71). Pearson's correlation coefficient for number of HA antigens versus ELISA signal is shown as R2 with corresponding p-value. Box and whiskers plots show median and IQR; whiskers show 1.5×IQR of the lower and upper quartiles. P values are calculated using Kruskal-Wallis rank sum test with Dunn's correction for multiple comparisons. Selected P values are reported as adjusted and indicated as *P<0.05, **P<0.01, ***P<0.001, and ****P<0.0001.


Example 12: Conservation-Concentration Coupling Broad Reactivity with an Inverse Dose Response

In some cases, lowering the dose for each individual component may be able to increase the pressure to target only conserved epitopes and thus provide greater breadth of serological response. To evaluate the relationship between dose and response of C3, three dosage cohorts of the C3 were tested alongside BIV and vehicle control groups. Sera responses from each of 7 pigs per cohort from the first study described in Example 9 to a panel of 36 antigens indicate that the lowest dose of C3 (50 ng/HA) provided the highest sera response to the most antigens, spanning H1N1, H1N3, H1N3, H2N2, H3N2, H5N1, H6N2, and H17N10 from 1918-2014. This inverse dose-response is in accordance with model predictions, and a unique property of this vaccination strategy. All three C3 formulations produced an improved median serum response compared to our bivalent formulation as measured by ELISA, as shown in FIG. 9B. Additionally, an inverse dose response was observed where the lowest dose of C3 yielded the highest median serum response as shown in FIG. 9A, and highest fold-change in sera titer over time, from 7 days to 28 days post-third inoculation shown in FIG. 9B, consistent with the model's predictions of conservation-concentration coupling leading to a shift towards breadth by diluting out competing epitopes.



FIG. 9A illustrates average ELISA signals (% max OD450 nm) per pig across all tested strains (n=36, spanning H1, H2, H3, H5, H7, and H17 ranging from 1918-2014) per pig (study #1). Sera were collected 28 days after the third vaccination (Day 71). FIG. 9B illustrates fold-change of average ELISA signal (% max OD450 nm) per pig from C) between 7 days and 28 days post-third vaccination. Box and whiskers plots show median and IQR; whiskers show 1.5×IQR of the lower and upper quartiles. P values are calculated using Kruskal-Wallis rank sum test with Dunn's correction for multiple comparisons. Selected P values are reported as adjusted and indicated as *P<0.05, **P<0.01, ***P<0.001, and ****P<0.0001.


When analyzing responses on a per-cohort, per-animal basis, more animals were observed responding more broadly as a function of inverse dose response, with the lowest dose (C3-50 ng/antigen) exhibiting greatest breadth, as shown in the heatmap of FIG. 10.



FIG. 10 provides a heatmap of ELISA signals (% max OD450 nm) from individual pig sera of 5 immunization cohorts binding against 36 HAs (H1, H2, H3, H5, H6, H7, and H17 spanning 1918-2014) as well as broadly neutralizing human antibodies (C05, F10, and CR9114). Immunization cohorts were three different C3 formulations of 30 HAs at 500 ng/HA, 100 ng/HA, and 50 ng/HA, a BIV cohort (H1N1 2007 and H3N2 1997 each at 5,000 ng) and vehicle control cohort (PBS+squalene). Sera were collected 28 days after the third inoculation (Day 71, study #1). Heatmap shading was adapted using quantile breaks to adjust for the ELISA data distribution. Positive strain coverage was determined as percent values above average of the vehicle control+3×SD for each strain.


Collectively, data show that breadth of the response is coupled to the concentration of conserved epitopes and that the C3 formulation at the lowest dose of 50 ng/antigen provided the broadest mean strains bound per pig serum in the most pigs.


Example 13: Conservation-Concentration Coupling Induces Uniform B Cell Expansion

Next, IgM and IgG B cell receptor (BCR) repertoires of pig peripheral blood mononuclear cells (PBMCs) were analyzed of pigs described in the second study (as described in Example 9), 7 days after the sixth inoculation. In contrast with the high variability in clonal expansion observed among pigs receiving either bivalent or vehicle immunization schedules, significantly more uniform BCR repertoire clonotype expansion were observed in the pigs receiving the C3-50 formulation. Thus, C3 appeared to generate a more uniform, and therefore potentially more predictable, immune response.


Data illustrating these observations are provided in FIG. 11, as peripheral BCR repertoire occupied by top clonotypes separated by isotype. The black line shows the average repertoire occupied as percent (%) of total sequences for top n clonotypes across pigs per immunization group. The gray area indicates 95% confidence interval Immunization cohorts were from the second in vivo study described in Example 9; a C3 formulation of 27 HAs at 50 ng/HA (n=4), a bivalent cohort (H1N1 2007 and H3N2 2007 each at 5,000 ng, n=3) and vehicle control (PBS+squalene, n=4). Lymphocytes were collected 7 days after sixth inoculation (Day 112). Isotypes were determined by C region specific priming during library preparation. Clonotypes are defined as sequences with identical CDRH3 amino acid sequence.


Example 14: Conservation-Concentration Coupling Elicits Broadly Neutralizing Antibodies

To isolate monoclonal pig anti-HA antibodies produced in C3 vaccinated animals, libraries were generated of m13 phage displayed pig antibodies from C3 animals of the first in vivo study described in Example 9. From those libraries, we isolated pig anti-HA antibodies, including the bnAb 9C5, from a pig which received low-dose C3 (50 ng/antigen).


Data representing the results of this experiment are provided in FIG. 12. Sequence alignment of broadly neutralizing porcine monoclonal antibody 9C5 isolated through phage display. Heavy and Light chain nucleotide sequences were aligned using IMGT V Quest against pig (Sus scrofa) germline genes. Mutations compared to detected germline genes are underlined and the CDRH3 is marked in bold. CDRH3 was reported by IMGT with 7 added amino acid positions 111.1-111.3 and 112.4-112.1 in the IMGT numbering scheme. Phage libraries were constructed from B cell pools per immunization cohort from lymphocytes 7 days post third vaccination (Day 50, study #1) and panned 4 rounds against H3N2 2007 and H1N1 2007. Monoclonal 9C5 was derived from one pig (C3-50 cohort, 30 HAs at 50 ng/HA).


The data show that C3 antigen formulations can induce broadly neutralizing antibodies, which show improved performance when compared to known human bnAbs. This suggests that the approach provided herein is effectively overcoming the stochastic hurdles of these exceedingly rare epitopes and that antibodies targeting these can be able to broadly neutralize a series of influenza strains, even if they were not part of the original antigen formulation.


Further, an experiment was performed to determine minimum antibody concentration (ug/mL) sufficient for in vitro neutralization of a panel of 5 Influenza A strains (H1N1/California/2009, H1N1/Michigan/2015, H1N1/Michigan/2017, H3N2/Brisbane/2007, and H3N2/Victoria/2011). Results are provided in FIG. 13. Pig monoclonal 9C5, isolated from one pig (C3-50, in-vivo study #1) using phage display library, was tested as scFv-huFc fusion chimera construct in 2-Ml HEK293 supernatant, quantitated by Protein A standard curve on Octet QK at 32 ug/M1. Shown in comparison to control bnAbs CR9114, F10, and C05 tested for neutralizing capability as purified human IgG1.


Example 15: Neutralization of “Future” Influenza Strains

Here it was determined whether C3 formulations elicited neutralizing antibodies through microneutralization assays on sera from pigs inoculated with C3. For sera from the second in vivo study described in Example 9, similar neutralization capacities were observed from sera of both the C3-50 vaccination cohort and the sera of pigs vaccinated with the bivalent control for strains that were part of the bivalent control cohort (H1N1 and H3N2 from 2007). In addition, for tested respective future strains, C3-50 pigs were able to neutralize 5/5 (H1N1 2009, H3N2 2009, H3N2 2011, H3N2 2014, and H1N1 2015) whereas the bivalent control group only showed neutralizing titers against 2/5 (H3N2 2009 and 2011).


For sera from the third in vivo study described in Example 9, pigs from the bivalent control group neutralized 2/6 strains tested with one “future” virus, H3N2 2009, not part of the formulations. The C3-50 cohorts alum and squalene exhibited improved neutralization, with neutralization against 4/6 and 5/6 strains tested. Both neutralizing 3 and 4 future viruses not part of the C3-50 formulation. Pigs that received C3-50+TLR agonists as adjuvants were unable to neutralize any “future” strains and only neutralized H3N2 2007. Collectively, the data show that C3 formulations administered at 50 ng produce sera with broadly neutralizing capacities at a similar level to that of a bivalent formulation while also neutralizing a broader range of “future” viruses not part of the antigen formulation. Thus, a conservation-concentration coupling approach can focus the immune response towards highly conserved epitopes on the HA coat protein. Moreover, these epitopes can elicit a neutralizing serum response at a similar level to a normal bivalent formulation, and additionally covering viral strains emerging almost a decade of viral evolution later.


Data illustrating these results are provided in FIG. 14A-14D. FIG. 14A provides ELISA signals (% max OD450 nm) of pig sera from the first in vivo study to 30 HA antigens (H1, H2, H3, H5, H6, H7, and H17 spanning 1918-2014) across immunization cohorts (each n=7 pigs) of C3-50 (30 HAs at 50 ng/HA), BIV cohort (H1N1 2007 and H3N2 1997 each at 5,000 ng) and vehicle control (PBS+squalene). Sera were collected 28 days after the third vaccination (Day 71). Bar represent mean across all pigs of a cohort, error bars indicate positive standard deviation. Arrows indicate HAs not present in the C3 vaccine formulation (in-vivo study #1). Asterisks indicate statistically significant differences between C3-50 and bivalent formulation with adjusted p values<0.05 or lower. P values were determined using strain-wise Wilcoxon Rank Sum test with alternative hypothesis of BIV signals being less than C3-50 signals. Benjamini Hochberg correction was applied to adjust for multiple testing.



FIG. 14B provides neutralization breadth shown as average number of strains neutralizing per pig by cohort and as number of positive tests per cohort are shown as scatterplot. Sera were collected 28 days after the seventh vaccination (Day 195, study #2). Viruses tested include H1N1/PuertoRico/1934, H1N1/Brisbane/2007, H1N1/California/2009, H1N1/Michigan/2015, H3N2/Brisbane/2007, H3N2/Perth/2009, H3N2/Victoria/2011, and H3N2/HongKong/2014. Sera were not pooled. Cohorts were vehicle control (PBS+squalene), seasonal BIV (H1N1 2007 and H3N2 2007 each at 5,000n g), single low-dose H1 antigen (H1N1 2007, 50 ng) single low-dose H3 antigen (H3N2 2007, 50 ng), and C3-50 (27 HAs at 50 ng/HA), in vivo study #2. Error bars indicate standard deviation.



FIG. 14C provides neutralization strength for study #2 reported as reciprocal neutralizing dilution per strain across cohorts using bar charts. Bars indicate the mean across all pigs (n=4) of a cohort and error bars indicate standard deviation. “Future” strains tested that are not part of the 2008 C3 formulation are indicated by arrows. Sera were collected 28 days after the 7th vaccination (Day 195).



FIG. 14D provides neutralization strength for study #3 reported as reciprocal neutralizing dilution per strain across cohorts using bar charts. Bars indicate the mean across all pigs (n=5) of a cohort and error bars indicate standard deviation. Points show the individual value per pig. “Future” strains tested that are not part of the 2008 C3 formulation are indicated by arrows. Sera were collected 28 days after the third vaccination (Day 70). Cohorts were vehicle control (PBS+squalene+TLR-agonist+alum), seasonal BIV (H1N1 2007 and H3N2 2007 each at 5,000 ng+squalene), and 3 cohorts of C3-50 (28 HAs at 50 ng/HA), with squalene, TLR-agonist, and alum adjuvants, respectively.


Example 16: Selection of 32 Most Diverse HIV Gp160 Sequences for HIV Clade a, B, and C Respectively

HIV sequences were downloaded from www.hiv.lanl.gov/ and separated into clades. Sequences were filtered by length to exclude sequences shorter than 800 amino acids and sequences from Clade A, B or C (of HIV-1 Group M) selected for further processing.


Sequences were aligned using hmmalign (HMMER v. 3.0, hmmer.org/) with a custom made HMM profile for HIV gp160 proteins. Next, a pairwise hamming distance matrix was generated using a custom-made python (v. 3.7.2) script. The sequences were subsequently clustered based on the hamming distance matrix using UPGMA clustering (UPGMA: unweighted pair group method with arithmetic mean) implemented in the hclust function of the R (v. 3.6.1) base stats package. The clustered sequences were divided into 32 subclusters using the cutree function of the R (v 3.6.1) base stats package.


Representative sequences were extracted for each cluster as their medoids using the medoids function of GDAtools (v.1.4). The output for this analysis was 96 amino acid sequences, 32 for each HIV Clade A, B and C. These sequences are SEQ ID NOs:172-267, as listed in Table 9.


VI. Compositions

A composition can be a vaccine or a library of antigens from which antigens of a vaccine are selected. A library of antigens can comprise a plurality of antigens. Such a composition can comprise a minimum number of variants of a microbe. In some cases, such a library can comprise at least 1×101, at least 1×102, at least 1×103, at least 1×104, at least 1×105, at least 1×106, at least 1×107, at least 1×108, at least 1×109, or at least 1×1010 different variants of a microbe.


A library of antigens can comprise a subset of all known sequences of a microbe. A library comprising more sequences can be considered more complete, and in some embodiments, can be considered more desirable. In some cases, a library can comprise at least 80%, 85%, 90%, or 95% of all known sequences of a microbe.


Antigens in a vaccine can comprise a specified number of amino acids. In some cases, an antigen in a vaccine can comprise no more than 10, no more than 15, no more than 20, no more than 25, no more than 30, no more than 35, no more than 40, no more than 45, no more than 50, no more than 55, no more than 60, no more than 65, no more than 70, no more than 75, no more than 80, no more than 85, no more than 90, no more than 95, no more than 100, no more than 110, or no more than 120 amino acids. In some cases, an antigen in a vaccine can comprise at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 110, or at least 120 amino acids. In some cases, an antigen in a vaccine can comprise from 10 to 120 amino acids, from 10 to 100 amino acids, from 10 to 90 amino acids, from 10 to 80 amino acids, from 10 to 70 amino acids, from 10 to 60 amino acids, from 10 to 50 amino acids, from 10 to 40 amino acids, from 10 to 30 amino acids, from 20 to 120 amino acids, from 20 to 110 amino acids, from 20 to 100 amino acids, from 20 to 90 amino acids, from 20 to 80 amino acids, from 20 to 70 amino acids, from 20 to 60 amino acids, from 20 to 50 amino acids, from 20 to 40 amino acids, from 20 to 30 amino acids, from 30 to 120 amino acids, from 30 to 110 amino acids, from 30 to 100 amino acids, from 30 to 90 amino acids, from 30 to 80 amino acids, from 30 to 70 amino acids, from 30 to 60 amino acids, from 30 to 50 amino acids, from 30 to 40 amino acids, from 40 to 120 amino acids, from 40 to 110 amino acids, from 40 to 100 amino acids, from 40 to 90 amino acids, from 40 to 80 amino acids, from 40 to 70 amino acids, from 40 to 60 amino acids, from 40 to 50 amino acids, from 50 to 120 amino acids, from 60 to 110 amino acids, from 50 to 100 amino acids, from 50 to 90 amino acids, from 50 to 80 amino acids, from 50 to 70 amino acids, from 50 to 60 amino acids, from 60 to 120 amino acids, from 60 to 110 amino acids, from 60 to 100 amino acids, from 60 to 90 amino acids, from 60 to 80 amino acids, from 60 to 70 amino acids, from 70 to 120 amino acids, from 70 to 110 amino acids, from 70 to 100 amino acids, from 70 to 90 amino acids, from 70 to 80 amino acids, from 80 to 120 amino acids, from 80 to 110 amino acids, from 80 to 100 amino acids, from 80 to 90 amino acids, from 90 to 120 amino acids, from 90 to 110 amino acids, from 90 to 100 amino acids, from 100 to 120 amino acids, from 100 to 110 amino acids, or from 110 to 120 amino acids.


In some cases, an antigen in a vaccine can comprise at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or at least 1000 amino acids. In some cases, an antigen in a vaccine can comprise no more than 200, no more than 300, no more than 400, no more than 500, no more than 600, no more than 700, or no more than 800 amino acids. In some cases, an antigen in a vaccine can comprise from 10 to 1000 amino acids, from 10 to 900 amino acids, from 10 to 800 amino acids, from 10 to 700 amino acids, from 10 to 600 amino acids, from 10 to 500 amino acids, from 10 to 400 amino acids, from 10 to 300 amino acids, from 10 to 200 amino acids, from 100 to 1000 amino acids, from 100 to 800 amino acids, from 100 to 700 amino acids, from 100 to 600 amino acids, from 100 to 500 amino acids, from 100 to 400 amino acids, from 100 to 300 amino acids, from 100 to 200 amino acids, from 200 to 1000 amino acids, from 200 to 900 amino acids, from 200 to 800 amino acids, from 200 to 700 amino acids, from 200 to 600 amino acids, from 200 to 500 amino acids, from 200 to 400 amino acids, from 200 to 300 amino acids, from 300 to 1000 amino acids, from 300 to 900 amino acids, from 300 to 800 amino acids, from 300 to 700 amino acids, from 300 to 600 amino acids, from 300 to 500 amino acids, from 300 to 400 amino acids, from 400 to 1000 amino acids, from 400 to 900 amino acids, from 400 to 800 amino acids, from 400 to 700 amino acids, from 400 to 600 amino acids, from 400 to 500 amino acids, from 500 to 1000 amino acids, from 500 to 900 amino acids, from 500 to 800 amino acids, from 500 to 700 amino acids, from 500 to 600 amino acids, from 600 to 1000 amino acids, from 600 to 900 amino acids, from 600 to 800 amino acids, from 600 to 700 amino acids, from 700 to 1000 amino acids, from 700 to 900 amino acids, from 700 to 800 amino acids, from 800 to 1000 amino acids, from 800 to 900 amino acids, or from 900 to 1000 amino acids.


A vaccine can comprise a specified number of antigens. The antigens in a vaccine can be selected for inclusion based on the total number of antigens available, similarity to pathogenic strains, similarity to other included antigens, or differences from other included antigens. Vaccines herein can comprise a set of antigens which can comprise at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 15, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or at least 1000 different antigens. In selected cases, a vaccine can comprise at least 30 antigens. In some cases, a vaccine can comprise a set of antigens which can comprise no more than 5, no more than 6, no more than 7, no more than 8, no more than 9, no more than 10, no more than 15, no more than 20, no more than 30, no more than 40, no more than 50, no more than 60, no more than 70, no more than 80, no more than 90, no more than 100, no more than 200, no more than 300, no more than 400, no more than 500, no more than 600, no more than 700, no more than 800, no more than 900, or no more than 1000 different antigens. In selected cases, a vaccine can comprise no more than 30 antigens. In some cases, a vaccine can comprise from 5 to 1000, from 5 to 900, from 5 to 800, from 5 to 700, from 5 to 600, from 5 to 500, from 5 to 400, from 5 to 300, from 5 to 200, from 5 to 100, from 5 to 90, from 5 to 80, from 5 to 70, from 5 to 60, from 5 to 50, from 5 to 40, from 5 to 30, from 5 to 20, from 5 to 10, from 10 to 1000, from 10 to 900, from 10 to 800, from 10 to 700, from 10 to 600, from 10 to 500, from 10 to 400, from 10 to 300, from 10 to 200, from 10 to 100, from 10 to 90, from 10 to 80, from 10 to 70, from 10 to 60, from 10 to 50, from 10 to 40, from 10 to 30, from 10 to 20, from 20 to 1000, from 20 to 900, from 20 to 800, from 20 to 700, from 20 to 600, from 20 to 500, from 20 to 400, from 20 to 300, from 20 to 200, from 20 to 100, from 20 to 90, from 20 to 80, from 20 to 70, from 20 to 60, from 20 to 50, from 20 to 40, from 20 to 30, from 30 to 1000, from 30 to 900, from 30 to 800, from 30 to 700, from 30 to 600, from 30 to 500, from 30 to 400, from 30 to 300, from 30 to 200, from 30 to 100, from 30 to 90, from 30 to 80, from 30 to 70, from 30 to 60, from 30 to 50, from 30 to 40, from 40 to 1000, from 40 to 900, from 40 to 800, from 40 to 700, from 40 to 600, from 40 to 500, from 40 to 400, from 40 to 300, from 40 to 200, from 40 to 100, from 40 to 90, from 40 to 80, from 40 to 70, from 40 to 60, from 40 to 50, from 50 to 1000, from 50 to 900, from 50 to 800, from 50 to 700, from 50 to 600, from 50 to 500, from 50 to 400, from 50 to 300, from 50 to 200, from 50 to 100, from 50 to 90, from 50 to 80, from 50 to 70, from 50 to 60, from 100 to 1000, from 100 to 900, from 100 to 800, from 100 to 700, from 100 to 600, from 100 to 500, from 100 to 400, from 100 to 300, from 100 to 200, from 200 to 1000, from 200 to 900, from 200 to 800, from 200 to 700, from 200 to 600, from 200 to 500, from 200 to 400, from 200 to 300, from 300 to 1000, from 300 to 900, from 300 to 800, from 300 to 700, from 300 to 600, from 300 to 500, from 300 to 400, from 400 to 1000, from 400 to 900, from 400 to 800, from 400 to 700, from 400 to 600, from 400 to 500, from 500 to 1000, from 500 to 900, from 500 to 800, from 500 to 700, from 500 to 600, from 600 to 1000, from 600 to 900, from 600 to 800, from 600 to 700, from 700 to 1000, from 700 to 900, from 700 to 800, from 800 to 1000, from 800 to 900, or from 800 to 1000 different antigens.


In some cases, an intact antigen can be divided into functional fragments, or parts, of the whole antigen, for example, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95 or at least 100 portions (e.g., pieces or fragments), inclusive, and where each individual functional fragment of the antigen can be associated with the polymer according to the methods as disclosed herein. In some cases the antigen may be divided into at most 2, at most 3, at most 4, at most 5, at most 6, at most 7, at most 8, at most 9, at most 10, at most 11, at most 12, at most 13, at most 14, at most 15, at most 16, at most 17, at most 18, at most 19, at most 20, at most 21, at most 22, at most 23, at most 24, at most 25, at most 30, at most 35, at most 40, at most 45, at most 50, at most 55, at most 60, at most 65, at most 70, at most 75, at most 80, at most 85, at most 90, at most 95, or at most 100 portions (e.g., pieces or fragments), inclusive.


In some cases, a broadly conserved sequence can be at least 40% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe). In some cases, a broadly conserved sequence can be a sequence that has limited variation across species or strains of a microbe. A sequence that is conserved across a plurality of microbes. can be the same or substantially the same across a plurality of microbes. For example, a sequence that is 40%, 50%, 60%, 70%, 80%, 90%, or 100% conserved can have structural or sequence identity of 40%, 50%, 60%, 70%, 80%, 90%, or 100% across a set of microbes.


In certain cases, a broadly conserved sequence can be at least 50% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe). In various cases, a broadly conserved sequence can be at least 60% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe). In some cases, a broadly conserved sequence can be at least 70% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe). In certain cases, a broadly conserved sequence can be at least 80% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe). In various cases, a broadly conserved sequence can be at least 90% conserved across a set of strains of a microbe (e.g., at least 10% of strains, at least 25% of strains, at least 50% of strains, at least 75% of strains, at least 90% of strains, at least 95% of strains, or all strains of a microbe).


An antigen can be representative of a clade if its sequence can be up to 10% of the average size of antigens of the represented clade different than other members of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 15%, up to 20%, up to 25%, or up to 30% of the average size of antigens of the represented clade different than other members of the represented clade.


An antigen can be representative of a clade if its sequence is no more than 75% of the average size of antigens of the represented clade different than other members of the represented clade. In some clade, an antigen can be representative of a clade if its sequence is no more than 50%, no more than 55%, no more than 60%, no more than 65%, no more than 70%, no more than 75%, no more than 80%, no more than 85%, no more than 90%, or no more than 95% different than other members of the represented clade.


An antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from a subset of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from at least 50% of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from at least 60% of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from at least 70% of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from at least 80% of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from at least 90% of strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 5, up to 10, up to 15, up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 70, up to 80, up to 90, up to 100, up to 150, up to 200, up to 250, or up to 300 amino acids different from all strains of the represented clade.


An antigen can be representative of a clade if its edit distance is up to 5% of the average size of antigens of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is up to 10%, up to 15%, or up to 20% of the average size of antigens of the represented clade. An antigen can be representative of a clade if its sequence is up to 10% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 15% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is up to 20% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade.


An antigen can be representative of a clade if its edit distance is no more than 75% of the average size of antigens of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is no more than 70%, no more than 80%, no more than 90%, no more than 95%, or no more than 99% of the average size of antigens of the represented clade.


An antigen can be representative of a clade if its sequence is no more than 90% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is no more than 80% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its sequence is no more than 70% of the average size of antigens of the represented clade different than at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade.


An antigen can be representative of a clade if its edit distance is up to 5% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is up to 10% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is up to 15% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade.


An antigen can be representative of a clade if its edit distance is up to no more than 90% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is no more than 80% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade. In some cases, an antigen can be representative of a clade if its edit distance is no more than 70% of the average size of at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of all strains of the represented clade.


Some vaccines can comprise antigens having an average edit distance from each of the other antigens which is at least 5% of the average size of antigens in the clade. Some vaccines can comprise antigens having an average edit distance from at least one of the other antigens which is at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 15%, at least 20%, or at least 25% of the average size of antigens in the clade. In such vaccines, some vaccines can comprise antigens having an average edit distance from each of the other antigens which is at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 15%, at least 20%, or at least 25% of the average size of antigens in the clade. In such vaccines, the actual edit distances can be varied, or can be similar. In some cases, no antigens will have an actual edit distance which is exactly at least 5% of the average size of antigens in the clade.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set may be no more than 1. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set may be no more than 2, no more than 3, no more than 4, no more than 5, no more than 6, no more than 7, no more than 8, no more than 9, no more than 10, no more than 20, no more than 30, no more than 40, or no more than 50. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least 1. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 20, at least 30, at least 40, or at least 50.


In some cases, the smallest pairwise edit distance can be no more than about 1%, no more than about 2%, no more than about 3%, no more than about 4%, no more than about 5%, no more than about 6%, no more than about 7%, no more than about 8%, no more than about 9%, no more than about 10%, no more than about 15%, no more than about 20%, or no more than about 25% of the size of the antigens. In some cases, the smallest pairwise edit distance can be at least about 5% of the size of the antigens. In some cases, the smallest pairwise edit distance can be at least about 1%, at least about 2%, at least about 3%, at least about 4%, at least about 5%, at least about 6%, at least about 7%, at least about 8%, at least about 9%, at least about 10%, at least about 15%, at least about 20%, or at least about 25% of the size of the antigens.


In some cases, the largest pairwise edit distance can be at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least at least 450, at least 500, at least 550, at least 600, at least 700, or at least 800. In some cases, the largest pairwise edit distance can be no more than 5, no more than 10, no more than 20, no more than 30, no more than 40, no more than 50, no more than 60, no more than 70, no more than 80, no more than 90, no more than 100, no more than 150, no more than 200, no more than 250, no more than 300, no more than 350, no more than 400, no more than 450, no more than 500, no more than 550, no more than 600, no more than 700, or no more than 800.


In some cases, the largest pairwise edit distance can be no more than about 75% of the size of the antigens. In some cases, the largest pairwise edit distance can be no more than about 50%, no more than about 55%, no more than about 60%, no more than about 65%, no more than about 70%, no more than about 75%, no more than about 80%, no more than about 85%, no more than about 90%, or no more than about 95% of the size of the antigens. In some cases, the largest pairwise edit distance can be at least about 75% of the size of the antigens. In some cases, the largest pairwise edit distance can be at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90% or at least about 95% of the size of the antigens.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than 1, and the largest pairwise edit distance can be at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, or at least 800. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least 1, and the largest pairwise edit distance can be at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, or at least 800. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least 1, and the largest pairwise edit distance can be no more than 5, no more than 10, no more than 20, no more than 30, no more than 40, no more than 50, no more than 60, no more than 70, no more than 80, no more than 90, no more than 100, no more than 200, no more than 300, no more than 400, no more than 500, no more than 600, no more than 700, or no more than 800. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than 1, and the largest pairwise edit distance can be no more than 5, no more than 10, no more than 20, no more than 30, no more than 40, no more than 50, no more than 60, no more than 70, no more than 80, no more than 90, no more than 100, no more than 200, no more than 300, no more than 400, no more than 500, no more than 600, no more than 700, or no more than 800.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 1%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 1%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 1%, and the largest pairwise edit distance can be no more than 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 1%, and the largest pairwise edit distance can be no more than about 75%.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 5%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 5%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 5%, and the largest pairwise edit distance can be no more than 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 5%, and the largest pairwise edit distance can be no more than about 75%.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 20%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 20%, and the largest pairwise edit distance can be at least about 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 20%, and the largest pairwise edit distance can be no more than 75%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 20%, and the largest pairwise edit distance can be no more than about 75%.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 1%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 1%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 1%, and the largest pairwise edit distance can be no more than 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 1%, and the largest pairwise edit distance can be no more than about 90%.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 5%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 5%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 5%, and the largest pairwise edit distance can be no more than 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 5%, and the largest pairwise edit distance can be no more than about 90%.


In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 20%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 20%, and the largest pairwise edit distance can be at least about 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be at least about 20%, and the largest pairwise edit distance can be no more than 90%. In some cases, the smallest pairwise edit distance between two or more of the antigens in the set can be no more than about 20%, and the largest pairwise edit distance can be no more than about 90%.


A vaccine can comprise a set of antigens wherein the smallest pairwise edit distance between the two of the antigens can be no more than 1%, no more than 2%, no more than 3%, no more than 4%, no more than 5%, no more than 6%, no more than 7%, no more than 8%, no more than 9%, no more than 10%, no more than 11%, no more than 12%, no more than 13%, no more than 14%, no more than 15%, no more than 16%, no more than 17%, no more than 18%, no more than 19%, or no more than 10% inclusive. A vaccine can comprise a set of antigens wherein the smallest pairwise edit distance between the two of the antigens can be at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 11%, at least 12%, at least 13%, at least 14%, at least 15%, at least 16%, at least 17%, at least 18%, at least 19%, or at least 10% inclusive. A vaccine can comprise a set of antigens wherein the largest pairwise edit distance between two of the antigens can be at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%, inclusive. A vaccine can comprise a set of antigens wherein the largest pairwise edit distance between two of the antigens can be no more than 50%, no more than 55%, no more than 60%, no more than 65%, no more than 70%, no more than 75%, no more than 80%, no more than 85%, no more than 90%, or no more than 95%, inclusive.


In some cases, each antigen in a set can share at least 90%, at least 95%, or at least 99% sequence identity to at least one antigen in the set over a length of at least 100 amino acids. In some cases, each antigen in a set can share at least 90%, at least 95%, or at least 99% sequence identity to at least one antigen in the set over a length of at least 300 amino acids. In some cases, each antigen in a set can share at least 90%, at least 95%, or at least 99% sequence identity to at least one antigen in the set over a length of at least 500 amino acids.


A vaccine can comprise antigens in a set which differ from other antigens in the set by at least 1%, at least 5%, or at least 10% sequence identity. In some cases, a vaccine can comprise antigens in a set which differ from all other antigens in the set by at least 1%, at least 5%, or at least 10% sequence identity.


A vaccine can comprise antigens in a set which differ from at least one other antigen in the set by no more than 75% sequence identity. In some cases, a vaccine can comprise antigens in a set which differ from all other antigens in the set by no more than 75% sequence identity.


A vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of fifth order clades of a microbe. In some cases, vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of sixth order clades of a microbe. In further cases, vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of seventh order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of eighth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of ninth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of tenth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of eleventh order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of twelfth order clades a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of thirteenth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of fourteenth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of fifteenth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of twentieth order clades of a microbe In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of twenty-fifth order clades of a microbe. In some cases, a vaccine can comprise a set of antigens that are representative of at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, or at least 90% of thirtieth order clades of a microbe.


A vaccine can comprise antigens that are representative of at least 60% of each of the third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, fifteenth, or other suitable order clades of a microbe. In some cases, a vaccine can comprise antigens that are representative of at least 70% of each of the third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, fifteenth, or other suitable order clades of a microbe. In further cases, a vaccine can comprise antigens that are representative of at least 80% of each of the third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, fifteenth, or other suitable order clades of a microbe. In yet further cases, a vaccine can comprise antigens that are representative of at least 90% of each of the third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, fifteenth, or other suitable order clades of a microbe.


The smallest number of branches between any 2 antigens can be 1 in some vaccines. The largest number of branches between any two antigens can be at least 5, at least 10, at least 15, or at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 5. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be 1 and the largest number of branches between any two antigens can be at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 5. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be less than 5 and the largest number of branches between any two antigens can be at least 20. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 10. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 15. In some vaccines, the smallest number of branches between any 2 antigens can be less than 10 and the largest number of branches between any two antigens can be at least 20.


A vaccine which is a flu vaccine can comprise antigens selected from Table 2, or a fragment or homologue thereof. If a vaccine comprises a fragment or homologue of an antigen, the fragment or homologue can be of at least a certain size. In some cases, a fragment or homologue can be at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 110, at least 120, at least 150, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or at least 1000 amino acids. In some cases, a fragment or homologue can be at most 10, at most 15, at most 20, at most 25, at most 30, at most 35, at most 40, at most 45, at most 50, at most 55, at most 60, at most 65, at most 70, at most 75, at most 80, at most 85, at most 90, at most 95, at most 100, at most 110, or at most 120 amino acids, at most 150 amino acids, at most 200 amino acids, at most 300 amino acids, at most 400 amino acids, at most 500 amino acids, at most 600 amino acids, at most 700 amino acids, at most 800 amino acids, at most 900 amino acids, or at most 1000 amino acids.


In some cases, an antigen in a vaccine can be selected from SEQ ID NOs:1-87. In some cases, an antigen in a vaccine can be at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 90%, or at least about 95% identical to a sequence selected from SEQ ID NOs:1-87.


In some cases, an antigen in a vaccine can be selected from SEQ ID NOs:88-127. In some cases, an antigen in a vaccine can be at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 90%, or at least about 95% identical to a sequence selected from SEQ ID NOs:88-127.


In some cases, an antigen in a vaccine can be selected from SEQ ID NOs:128-171. In some cases, an antigen in a vaccine can be at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 90%, or at least about 95% identical to a sequence selected from SEQ ID NOs:128-171.


In some cases, an antigen in a vaccine can be selected from SEQ ID NOs:172-267. In some cases, an antigen in a vaccine can be at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 90%, or at least about 95% identical to a sequence selected from SEQ ID NOs:172-267.









TABLE 6







Amino acid sequences of hemagglutinin antigens that can be included in


a flu vaccine








SEQ



ID
Sequence





 1
EDTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCKLKGIAPLQL



GKCNIAGWLLGNPECDLLLTASSWSYIVETSNSENGTCYPGDFIDYEELREQL



SSVSSFEKFEIFPKTSSWPNHETTKGVTAACSYAGASSFYRNLLWLTKKGSSY



PKLSKSYVNNKGKEVLVLWGVHHPPTGTDQQSLYQNADAYVSVGSSKYNRR



FTPEIAARPKVRDQAGRMNYYWTLLEPGDTITFEATGNLIAPWYAFALNRGS



GSGIITSDAPVHDCNTKCQTPHGAINSSLPFQNIHPVTIGECPKYVRSTKLRMA



TGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADRKS



TQNAIDGITNKVNSVIEKMNTQFTSVGKEFNHLEKRIENLNRKVDDGFLDVW



TYNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKC



DDSCMESVKNGTYDYPKYSEESKLNREEIDGVKLES





 2
ADTICIGYHANNSTDTVDTIFEKNVAVTHSVNLLEDRHNGKLCKLKGIAPLQL



GKCNIIGWLLGNPECDSLLPARSWSYIVETPNSENGACYPGDFIDYEELREQLS



SVSSLERFEIFPKESSWPNHTFNGVTASCSHRGKSSFYRNLLWLTKKGDSYPK



LTNSYVNNKGKEVLVLWGVHHPSSSDEQQSLYSNGNAYVSVASSNYNRRFT



PEIAARPKVKDQHGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGFESG



IITSNASMHECNTKCQTPQGSINSNLPFQNIHPVTIGECPKYVRSTKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSIIEKMNTQFTAVGKEFNNLEKRMENLNKKVDDGFLDIWTYN



AELLVLLENGRTLDFHDLNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNE



CMESVRNGTYDYPKYSEESKLNREKIDGVKLES





 3
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



GKCNIAGWLLGNPECDPLLPVRSWSYIVETPNSENGICYPGDFIDYEELREQLS



SVSSFERFEIFPKESSWPNHNTNGVTAACSHEGKSSFYRNLLWLTEKEGSYPK



LKNSYVNKKGKEVLVLWGIHHPPNSKEQQNLYQNENAYVSVVTSNYNRRFT



PEIAERPKVRDQAGRMNYYWTLLKPGDTIIFEANGNLIAPMYAFALSRGFGSG



IITSNASMHECNTKCQTPLGAINSSLPYQNIHPVTIGECPKYVRSAKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNTVIEKMNIQFTAVGKEFNKLEKRMENLNKKVDDGFLDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNE



CMESVRNGTYDYPKYSEESKLNREKVDGVKLES





 4
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLGGIAPLQ



LGKCNIAGWLLGNPECDLLLTVSSWSYIVETSNSDNGTCYPGDFIDYEELREQ



LSSVSSFERFEIFPKTSSWPNHETTNGVTAACPYAGANSFYRNLLWLVKKGDS



YPKLSKSYVNNKGKEVLVLWGVHHPPTSTEQQSLYQNADAYVSVGSSKYNR



RFTPEIAARPKVRGQAGRMNYYWTLLEPGDTITFEATGNLVAPRYAFALNKG



SGSGVITSDAPVHDCDTKCQTPHGAINSSLPFQNIHPVTIGECPKYVKSTKLRM



ATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHQQNGQGSGYAADQK



STQNAIDGITNKVNSVIEKMNTQFTAVGKEFNNLERRIENLNKKVDDGFLDV



WTYNAELLVLLENERTLDFHDSNVKNLYEKVRSQLRNNAKEIGNGCFEFYHK



CDNACMESVRNGTYDYPKYSEESKLNREEIDGVKLES





 5
ADTICIGYQANNSTDTVDTLLEKNVTVTHSVNLLEDSHNGKLCKLKGIAPLQL



GKCNIAGWLLGNPECDSLLPARSWSYIVETPNSETGACYPGDFIDYEELREQL



SSVSSLERFEIFPKESSWPNHNTTGVTKSCSHRGESSFYRNLLWLTKKGNSYPE



LNNSYVNNKGKEVLVLWGVHHPSNSNNQQTLYHNANAYVSVVSSNYNRRF



TPEIAARPKVRDQTGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGFGS



GIITSNASMHECNTKCQTPQGAINSSLPFQNIHPVTIGECPKYVRSTKLRMVTG



LRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQ



NAINRITNKVNSVIEKMNTQFTAVGKEFNNLEKRMENLNKEVDDGFLDIWTY



NAELLVLLENEMTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCD



NECMESVRNGTYDYPKYSEDSKLNREKIDGVKLEP





 6
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



RKCNIAGWILGNPECESLLSERSWSYIVETPNSENGTCYPGDFTNYEELREQLS



SVSSFERFEIFPKESSWPKHNTTRGVTAACSHAGKSSFYRNLLWLTEKDGSYP



NLNNSYVNKKGKEVLVLWGVHHPSNIKDQQTLYQKENAYVSVVSSNYNRRF



TPEIAERPKVRGQAGRMNYYWTLLKPGDTIMFEANGNLIAPWYAFALSRGFG



SGIITSNASMHECDTKCQTPQGAINSSLPFQNIHPVTIGECPKYVRSTKLRMVT



GLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKST



QNAINGITNKVNSVIEKMNTQFTAVGKEFNNLEKRMENLNKKVDDGFLDIWT



YNAELLILLENERTLDFHDSNVKNLYEKVKSQLRNNAKEIGNGCFEFYHKCN



NECMESVKNGTYDYPKYSEESKLNREKIDGVKLES





 7
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



GKCNIAGWILGNPECDSLLPASSWSYIVETPNSKNGICYPGDFIDYEELREQLS



SVSSFERFEIFPKESSWPKHSTTKGVTAACSHAGKSSFYRNLLWLTKKEDSYP



KLSNSYVNKKGKEVLVLWGVHHPSSSKEQQTLYQNENAYVSVVSSNYNRRFI



PEIAERPEVKDQAGRINYYWTLLEPGDTIIFEANGNLVAPWYAFALSRGFGSGI



ITSNASMHECNTKCQTPQGAINSSLPFQNIHPVTIGECPKYVKSAKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNNLEKRMENLNKKVDDGFLDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKIQLKNNAKEIGNGCFEFYHKCDNE



CMESVRNGTYDYPKYSKESKLNREKIDGVKLES





 8
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGKAPLQ



LGNCNIAGWVLGNPECESLLSNRSWSYIAETPNSENGTCYPGDFADYEELREQ



LSSVSSFERFEIFPKESSWPNHTTRGVTAACPHARKSSFYKNLVWLTEANGSY



PNLSRSYVNNQEKEVLVLWGVHHPSNIEEQRALYRKDNAYVSVVSSNYNRRF



TPEIAKRPKVRDQSGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGPGS



GIITSNAPLDECDTKCQTPQGAINSSLPFQNIHPVTIGECPKYVRSTKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMMDGWYGYHHQNEQGSGYAADQKSTQ



NAINGITNKVNSVIEKMNTQFTAVGKEFNKLEKRMENLNKKVDDGFMDIWT



YNAELLVLLENERTLDFHDSNVKNLYEKVKNQLRNNAKELGNGCFEFYHKC



DNECMESVKNGTYDYPKYSEESKLNREKIDGVKLES





 9
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDRHNGKLCKLGGIAPLH



LGKCNIAGWLLGNPECELLLTVSSWSYIVETSNSDNGTCYPGDFINYEELREQ



LSSVSSFERFEIFPKTSSWPNHETNRGVTAACPYAGANSFYRNLIWLVEKGNS



YPKLSKSYVNNKGKEVLVLWGIHHPPTSTDQQSLYQNADAYVFVGSSKYNR



KFKPEIAARPKVRGQAGRMNYYWTLIEPGDTITFEATGNLVVPRYAFAMNRG



SGSGIIISDAPVHDCNTKCQTPKGAINTSLPFQNIHPVTIGECPKYVKSTKLRMA



TGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQRS



TQNAIDGITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWT



YNAELLVLLENERTLDFHDSNVKNLYEKVRSQLRNNAKEIGNGCFEFYHKCD



DTCMESVKNGTYDYPKYSEESKLNREEIDGVKLES





10
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



GKCNIAGWILGNPECESLFSKKSWSYIAETPNSENGTCYPGYFADYEELREQL



SSVSSFERFEIFPKESSWPKHNVTRGVTASCSHKGKSSFYRNLLWLTEKNGSY



PNLSKSYVNNKEKEVLVLWGVHHPSNIEDQKTIYRKENAYVSVVSSNYNRRF



TPEIAERPKVRGQAGRINYYWTLLEPGDTIIFEANGNLIAPWHAFALNRGFGSG



ITTSNASMDECDTKCQTPQGAINSSLPFQNIHPVTIGECPKYVRSTKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLEKRMENLNKKVDDGFLDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNE



CMESVKNGTYDYPKYSEESKLNREKIDGVKLES





11
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



GNCSIAGWILGNPECESLFSKKSWSYIAETPNSENGTCYPGYFADYEELREQLS



SVSSFERFEIFPKESSWPNHTVTKGVTASCSHKGKSSFYRNLLWLTEKNGSYP



TLSKSYVNNKEKEVLVLWGVHHPSNIGDQRAIYHTENAYVSVVSSHYNRRFT



PEIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAFALSRGFGSGII



TSNASMDECDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSTKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFLDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNE



CMESVKNGTYDYPKYSEESKLNREKIDGVKLES





12
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQL



GNCSVAGWILGNPKCESLFSKESWSYIAETPNPENGTCYPGYFADYEELREQL



SSVSSFERFEIFPKESSWPNHTVTKGVTTSCSHNGKSSFYRNLLWLTEKNGLYP



NLSKSYVNNKEKEVLVLWGVHHPSNIRDQRAIYHTENAYVSVVSSHYSRRFT



PEIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAFALSRGFGSGII



TSNASMDECDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSTKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFLDIWTYN



AELLVLLENGRTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNE



CMESVKNGTYDYPKYSEESKLNRGKIDGVKLES





13
ADTICVGYHANNSTDTVDTILEKNVTVTHSVNLLENNHNGKLCSLNGIAPLQL



GDCNVAGWILGNPECDSLLTANSWSYIIETSNSENGTCYPGEFADYEELRELL



SSVSSFERFEIFPKATSWPNHETTKGVTAACSHSGARSFYRNLLWIVKKGNSY



PKLSKSYTNNRRKEVLVIWGVHHPPTNNDQQSLYQNADAYVSVGSSKYYRR



FTPEIADRPKVRGQAGRMNYYWTMLDQGDTITFEATGNLIAPWYAFALNKG



LSSGIIMSDAPVHNCTTRCQTPYGALNSNLPFQNVHPITIGKCPKYVKSTQLRM



ATGLRNVPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQ



KSTQNAIDGISNKVNSIIEKMNIQFTSVGKEFNDLEKRVENLNKKVDDGFLDV



WTYNAELLVLLENERTLDFHDFNVRNLYEKVKSQLRNNAKEVGNGCFEFYH



KCDDECMESVRNGTYNYPKYSEESKLNREKIDGVKLES





14
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCLLKGIAPLQL



GNCSVAGWILGNPECESLISKESWSYIVETPNPENGTCYPGYFADYEELREQLS



SVSSFERFEIFPKESSWPNHTVTGVTASCSHNGKSSFYRNLLWLTEKNGLYPN



LSNSYVNNKEKEVLVLWGVHHPSNIRDQRAIYHTENAYVSVVSSHYSRRFTP



EIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAFALSRGFGSGIIT



SNAPMNECDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSTKLRMVTGLR



NIPSIQSRGLFGAIAGFIEGGWTGMMDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFLDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNE



CMESVKNGTYDYPKYSEESKLNREKIDGVKLES





15
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENNHNGKLCNLNGIAPLQ



LGKCNVAGWLLGNPECGLLLNANAWSYIIETSDSKNGTCYPGDFIDYEELRE



QLGSVSSFEKFEIFPKASSWPDHETTKGTTAACPYSGVESFYRNLLWIIKKGNS



YPKISKSYTNNRGKEVLVLWGVHHPPTTSDQQTLYQNIDAYVSVGSSKYNRR



FTPEIATRPKVRGLAGRMNYYWTLLDQGDTIMFEATGNLIAPWYAFALNKGS



DSGIITSDTPVHDCDTKCQTPYGALNSSLPFQNVHPITIGECPKYVKSTKLRMA



TGLRNVPSIQSKGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYSADQKS



TQNAIDGITNKVNSVIEKMNTQFTAVGKEFNNLERRIENLNKKVDDGFLDVW



TYNAELLVLLENERTLDFHDSNVRDLYERVKSQLRNNAKEIGNGCFEFYHKC



DDECMENVKNGTYDYPKYSEEAKLNREKIDGVKIES





16
ADTICIGYHANNSTDTVDTIFEKNVAVTHSVNLLEDSHNGKLCLLKGIAPLQL



GNCSVAGWILGNPECELLISKESWSYIVETPNPENGTCYPGYFADYEELREQLS



SVSSFERFEIFPKESSWPNHTVTGVSASCSHNGKSSFYRNLLWLTGKNGLYPN



LSKSYANNKEKEVLVLWGVHHPPNIGDQQGLYSNGNAYVSVASSNYNRRFT



PEIAARPKVKDQHGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGFESG



IITSNASMHECNTKCQTPQGSINSNLPFQNIHPVTIGECPKYVRSAKLRMVTGL



RNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQKSTQ



NAINGITNKVNSVIEKMNTQFTAVGKEFNRLERRMENLNKKVDDGFLDIWTY



NAELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNN



ECMESVKNGTYDYPKYSEESKLNREKIDGVKLES





17
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLENKHNGKLCKLRGVAPL



HLGKCNIAGWLLGNPECESLFEASSWSYIVETPNSDNGTCYPGDFINYEELRE



HLSSVSSFERFEIFPKANSWPNHDTDSGVTASCPYAGAKSFYRNLIWLVKKGS



SYPKLSKSYINDKKKEVLVIWGIHHPPNSTDQQTLYQNADAYVFVGSSKYSK



RFKPEIAARPKVRDQAGRMNYYWTLIEPGDTITFEATGNLVVPRYAFAMTRG



SGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPVTIGECPKYVKSTKLKMA



TGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKS



TQNAIDGITNKVNSVIEKMNTQFTAVGKEFNQLEKRIESLNNKVDDGFLDIWT



YNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCD



DTCMESVKNGTYDYPKYSEEAKLNREEIDGVKLES





18
ADTLCIGYHANNSTDTIDTVLEKNVTVTHSVNLLENKHNGKLCKLGGMTPLH



LGKCNIAGWLLGNPECELLSTVSSWSYIVETPYSDNGTCYPGDFTNYEELREQ



LSSVSSFERFEIFPKASSWPNHETNRGVTSACPYAGENDFYRNLLWLVKKGNS



YPKLSKSYVNNKGKEVLILWGIHHPPTSADQQSLYQNADAYVFVGTSTYNRR



FTPEVAARPKVRGQAGRMDYYWTLIGPGDTITFEATGNLVVPRYAFAMNRGS



GSGIIISDIPFHNCTTKCQTPKGAINTSLPFQNVHPVTIGECPKYVKSTKLRMVT



GLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKST



QNAIDKITNKVNSVIEKMNTQFTAVGKEFNHLERRIENLNRKVDDGFLDVWT



YNAELLILLENERTLDLHDSNVKNLYEKVKSQLRNNAKEIGNGCFEFYHKCD



DACMESVRNGTYDYPKYSEESRLNREEIDGVKLES





19
ADTLCIGYHANNSTDTVDTILEKNVTVTHSVNLLEDRHNGKLCKLRGVAPLH



LGKCNIAGWLLGNPECESLFTARSWSYIVETSNSDNGTCYPGDFINYEELREQ



LSSVSSFERFEIFPNTSSWPNHDTNRGVTAACPHAGTKSFYRNLIWLVKKGDS



YPKISKSYINNKEKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSKYSKKF



KPEIATRPKVRDQEGRMDYYWTIVEPGDKITFEATGNLVVPRYAFELKRNSGS



GIIISDTSVHDCNTTCQTPKGAINTSLPFQNIHPVTIGECPKYVKSTKLRMATGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADLKSTQN



AIDGITNKVNSIIEKMNTQFTAVGKEFSHLEKRIENLNKKVDDGFLDIWTYNA



ELLVLLENERTLDYHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDDM



CMESVKNGTYDYPKYSEEAKLNREEIDGVKLES





20
ADTICVGYHANNSTDTVDTILEKNVTVTHSVNLLESNHNGKLCSLNGKAPLQ



LGNCNVAGWILGNPECDLLLTANSWSYIIETSNSKNGACYPGEFADYEELREL



LSTVSSFERFEIFPKATSWPHHETTKGTTVACPHSGVNSFYRNLLWIVKKGNS



YPKLSKSYTNSKGKEVLVIWGVHHPPTDSDQQTLYQNNHTYVSVGTSKYYR



RFTPEIVARPKIREQAGRMNYYWTLLDQGDTITLEATGNLIVPWHAFALNKGS



NSGIMISDADFHNCTTKCQTPHGALKSNLPFQNVHPITIGECPKYVKSTQLRM



ATGLRNVPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQ



KSTQIAIDGISNKVNSVIEKMNIQFASVGKEFNDLEKRIENLNKKVDDGFVDV



WTYNAELLILLENERTLDFHDFNVKNLYEKVKSQLRNNAKEIGNGCFEFYHK



CDNECMESVKNGTYNYPKYSEESKLNREEIDGVKLES





21
ADTLCIGYHANNSTDTVDTILEKNVTVTHSVNLLEDKHNGRLCKLGGIAPLNL



GRCNIAGWLLGNPECESLFTVSSWSYIVETSNSYNGTCYPGDFINYEELREQLS



SVSSFERFEIFPMASSWPNHETTKGVTAACSHAGENSFYRNLIWLVKKENTYP



KISKSYINNKGKEVLVLWGIHHPPTNNDQQSLYQNADTYVFVGTSKYNKKFK



PEIATRPKVRDQAGRMNYYWTLVEPGDTITFEATGNLVAPRYAFAMNKGSGS



GIIVSDIPAHDCNTNCQTPKGAINTSLPFQNVHPVTIGECPKYVKSTRLRMATG



LRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNGQGSGYAADQKSTQ



NAIDGITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDVWTY



NAELLILLENERTLDFHDSNVKNLYEKARRQLKNNAKEIGNGCFEFYHKCDN



ACMESVKNGTYDYPKYSEESRLNREEIDGVKLDS





22
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLENKHDGKLCKLRGIAPLH



LGKCNIAGWILGNPECESLFTTSSWSYIVETPNSDNGTCYPGDFINYEELREHL



SSVSSFERFEIFPKANSWPYHDTNKGVTAACPYAGANSFYRNLIWLVKKGNS



YPKLSKSYINNKKKEVLVIVVGIHHPPTSTDQQTLYQNADAYVFVGSSKYSKRF



KPEIAARPKVRDQAGRMDYYWTLIEPGDTITFEATGNLVVPRYAFTMKRGSG



SGIIVSDAPVHDCNTTCQTPKGAINTSLPFQNIHPVTIGECPKYVKSTKLRMAT



GLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKST



QNAIDGITNKVNSIIEKMNTQFTAVGKEFSQLEKRIESLNNKVDDGFLDIWTY



NAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDD



TCMESVKNGTYDYPKYSEEAKLNRQEIDGVKLES





23
ADTLCIGYHANNSTDTVDTILEKNVTVTHSVNLLEDRHNGKLCNLRGEAPLH



LGKCNIAGWLLGNPECELLFAVNSWSYIVETSNSDNGTCYPGDFTSYEELREQ



LSSVSSFERFEIFPKASSWPNHETNRGVTAACPYAGTNSFYRNLIWLVKKGNS



YPKLSKSYVNNKKKEVLVLWGIHHPPTNADQQSLYQNADAYVFVGSSKYNK



KFKPEIAKRPKVRGQAGRMNYYWTLVEPGDTITFEATGNLVAPRYAFAMNR



DPGSGIITSDAPIHDCNATCQTPKGAINTSLPFQNIHPITIGECPKYVKSTRLRM



ATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNGQGSGYAADQK



STQNAIDRITNKVNSVIEKMNIQFTAVGKEFNHLERRIENLNKKVDDGFLDVW



TYNAELLVLLENERTLDFHDSNVKTLYEKVKTQLRNNAKEIGNGCFEFYHKC



DDTCMESIKNGTYDYPKYSKESKLNREEIDGVQLES





24
ADTLCVGYHANNSTDTVDTVLEKNVTVTHSVNLLEDNHNGKLCKLRGIAPL



HLGKCNIAGWLLGNPECELLFTAKSWSYIVETSNPDNGICYPGDFSNYEELRE



QLSSVSSFERFEIFPNASSWPNHETNRGITAACPYAGANSFYRNLIWLVKKGNS



YPKINKSYINNKGKEVLVLWGIHHPSTSNDQQSLYQNADAYVFVGSSKYNRR



FKPEIATRPKVRDQAGRMNYYWTLVEPGDTITFEATGNLIAPRYAFAMTRGS



GSGIMISDTPVYDCNTICQTPKGAINTSLPFQNIHPVTIGKCPKYVKSTKLRLAT



GLRNIPSIQYRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGFAADRKST



QNAIDGITNKVNSVIEKVNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDVWT



YNAELLVLLENERTLDFHDSNVKNLYEKVRSQLRNNAKEIGNGCFEFYHKCD



NTCMESVKNGTYDYPRYSEESKLNREEIDGVKLES





25
ADTICVGYHANNSTDTVDTILEKNVTVTHSVNILETTHNGKLCSLSGIAPLQLG



NCNIAGWILGNPECDLLLTVNSWSYIIETSNSVNGTCYPGEFADYEELREQLSS



VASFERFEIFPKTTSWPNHETTRGTTAACSHSGTRSFYRNLLWIVKKGNSYPKI



SKSYTNKKGKEVLVIWGVHHPPTNNDQQSLYQNAYTYVSVESSKYYRRFTPE



IASRPKVRGQEGRMNYYWTLLDQGDTITFEATGNLIAPWYAFALNKGSDSGII



ISDAQVHNCTTKCQTPHGALNSTLPFQNVHPITIGECPKYVKSTHLRMATGLR



NVPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADKKSTQIA



IDGISNKVNSVIEKMNIQFTSVGKEFNNLEKRIENLNKKVDDGFLDIWTYNAE



LFVLLENERTLDFHDFNVKNLYERVKSQLRNNAKEVGNGCFEFYHKCDNEC



MESVKNGTYNYPKYSEESKLNREEIDGVKLES





26
ADTLCIGYHANNSTDTVDTVLEKNITVTHSVNLLEDKHNGKLCKLRGVAPLH



LGKCNIAGWLLGNPECESLFNVSSWSYIVETSNSDNGTCYPGDFTNYEELKEQ



LSSVSSFERFEIFPKEGSWPNHETNRGVTAACPHAGANSFYKNLIWLVKRGNS



YPKLNKSYVNNKKKEVLVLWGIHHPPTSTDQQSLYQNADAYVFVGSSKYSK



KFKPEIATRPKVRDQAGRMNYYWTLVEPEDTITFEATGNLVVPRYAFAMNRG



SGSGIITSDTPVHDCNTTCQTPKGAINTSLPFQNVHPITIGECPKYVKSTKLRMA



TGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNGQGSGYAADQKS



TQNAIDGVTNKVNSIIEKMNTQFAAVGKEFNHLEKRIENLNKKVDDGFLDVW



TYNAELLVLLENERTLDFHDSNVRNLYDKVRSQLRNNAKEIGNGCFEFYHKC



DDTCMESVKNGTYDYPKYSEESKLNREEIDGVRLDS





27
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCLLKGIAPLQL



GNCSVAGWILGNPECELLISRESWSYIVEKPNPENGTCYPGHFADYEELREQL



SSVSSFERFEIFPKESSWPNHTTTGVSASCSHNGESSFYKNLLWLTGKNGLYPN



LSKSYANNKEKEVLVLWGVHHPPNIGDQRALYHKENAYVSVVSSHYSRKFTP



EIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFALSRGFGSGIIN



SNAPMDECDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLR



NIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFIDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDE



CMESVKNGTYDYPKYSEESKLNREKIDGVKLES





28
ADTLCIGYHANNSTETVDTVLEKNVTVTHSINLLEDKHNGKLCRLRGVTPLH



LGKCNIAGWLLGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELREQL



SSVSSFERFEIFPNASSWPDHETNKGVTAACPHAGAKSFYRNLLWLVKKKNS



YPKLSKSYINNKGKEVLVLWGIHHPSNIQDQETLYQNADTYVFVGSSKYSKK



FKPEIATRPKVRDQTGRMNYYWTLVEPGDTITFEATGNLVIPKYAFAMKRGS



GSGIIISDATVHDCNTTCQTPKGAINTSLPFQNIHPVTIGECPKYVKSTKLRMAT



GLRNVPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADRKST



QNAIDGITNKVNSVIEKMNTQFSAVGKEFNHLEKRIENLNKKVDDGFLDVWT



YNAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCD



DACMESVKNGTYDYPKYSEEAKLNREKIDGVKLES





29
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDTHNGKLCKLGGMAPLH



LGRCNIAGWLLGNPECELISTVSSWSYIVETSNSDNGTCYPGDFTNYEELREQL



SSVSSFEKFEMFPKISSWPNHETNRGVTAACPYAGANSFYRNLIWLVKKGNSY



PKLSKSYVNNKGKEVLVLWGIHHPPTSADQQSLYQNVDTYVFVGSSTYNRKF



KPEIAVRPKVRDQTGRMNYYWTLIKPGDTITFEATGNLVAPRYAFAMSRGSG



SGIITSDMPVHNCTTKCQTPKGALNTSLPFQNVHPVTIGECPKYVKSTKLRMA



TGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKS



TQNAIDGITNKVNSVIEKMNTQFTAVGKEFNHLERRMENLNKKVDDGFLDV



WTYNAELLILLENERTLDLHDSNVKNLYEKVRAQLRNNAKEIGNGCFEFYHK



CDDACMESVKNGTYDYPKYSEESRLNREEIDGVKLES





30
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENSHNGKLCLLKGIAPLQL



GNCSVAGWILGNPECELLISKESWSYIVEKPNPENGTCYPGHFADYEELREQL



SSVSSFERFEIFPKESSWPNHTVTGVSASCSHNGESSFYRNLLWLTGKNGLYPN



LSKSYANNKEKEVLVLWGVHHPPNIGDQKALYHTENAYVSVVSSHYSRKFTP



EIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFALSRGFGSGIIN



SNAPMDKCDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLR



NIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQKSTQN



AINGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFIDIWTYN



AELLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDE



CMESVKNGTYDYPKYSEESKLNREKIDGVKLES





31
ADTICVGYHANNSTDTIDTILEKNVTVTHSVNLLENSHNGKLCSLNGIAPLQL



GSCNVAGWILGNPECDLLLTVNSWSYIVETSNSENGTCYPGEFADYEELREQL



SSVSSFERFEIFPKTTSWPNHDTTKGTTAACSHSGTNSFYRNLLWIVKKGNSYP



KLNSSYTNNKGKEVLIIWGVHHPPTNNDQQSLYQNADTYVSVGSSKYYQRFT



PEIAARPKVRGQAGRMNYYWTLLDQGDTITFEATGNLIAPWYAFALNKNSGS



GIMISDAQVHNCTTKCQTPHGAMNSNLPFQNVHPVTIGECPKYVKSTQLRMA



TGLRNVPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNGQGSGYAADQK



STQLAIDGISNKVNSVIEKMNIQFTSVGKEFNNLEKRIENLNKKVDDGFLDVW



TYNAEILILLENERTLDFHDFNVKNLYEKVKSQLRNNAKEVGNGCFEFYHKC



DNECMESVRNGTYNYSKYSEESKLNREEIDGVKLES





32
ADTLCVGYHANNSTDTVDTILEKNVTVTHSVNLLEDRHNGKLCKLGGIAPLH



LGKCNIAGWLLGNPECDSLLTVRSWSYIVETSNSDNGTCYPGEFINYEELKEQ



LSSVSSFERFEIFPKASSWPNHETNKGVTAACPYAGSNSFYRNLIWLVKKGNS



YPKLNKSYINNKGKEVLVLWGIHHPPTNNDQQSLYQNEDAYVFIGSSKYNKK



FKPEIATRPKVRGQAGRMDYYWLLMEPGETITFEATGNLVVPRYAFAMNRD



ARSGIIISDVSVHDCNTTCQTPKGALNTSLPFQNIHPVTIGECPKYVKSTKLRM



ATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQK



STQNAIDGITNKINSVIEKMNTQFAAVGKEFNHLEKRLENLNKKVDDGFLDV



WTYNAELLVLLENERTLDFHDSNVKNLYEKVRSQLRNNAKEIGNGCFEFYHR



CDDACMESVKNGTYDYPVYSEESRLNREEIDGVKLDS





33
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPL



HLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDNGTCYPGDFIDYEELREQ



LSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNS



YPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSKK



FKPEIAIRPKVRDREGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNA



GSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATG



LRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQ



NAIDEITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYN



AELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNT



CMESVKNGTYDYPKYSEEAKLNREEIDGVKLES





34
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDNHNGKLCSLKGIVPLQL



GNCSVAGWILGNPECELLISKKSWSYIVETPNPENGACYPGDFADYEELREQL



SSVSSFERFEIFPKESSWPNHNVNGVSAACSHNGERSFYRNLLWLTGKNGLYP



NLTNSYVNSKEKEVLILWGVHHPSNIRDQRTLYQTENAYVSVMSSHYSRRFT



PEITKRPKVRDQEGRINYYWTLLEPGDTIIFEASGNLIAPWYGFELSRGFGSGII



TSSAPTDECDAKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLR



NIPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQESTQNA



INGITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFLDIWTYNA



ELLVLLENERTLDFHDSNVRSLYEKVRVQLKNNAKEIGNGCFEFYHKCNNEC



MESVRNGTYDYPKYSEESKLNREKIDGVKLDS





35
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLENRHNGKLCKLRGVAPL



HLGKCNIAGWLLGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELRE



QLSSVSSFERFEIFPKTSSWPNHDTNRGVTAACPHDGTNSFYRNLIWLVKKGN



SYPKINKSYINNKEKEILVLWAIHHPSTSADQQSLYQNADAYVFVGSSRYSRK



FEPEVATRPKVRDQAGRMNYYWTLVEPGDKITFEATGNLVVPRYAFALKRNS



GSGIIISDTSVHDCDTNCQTPNGAINTSLPFQNIHPVTIGECPKYVKSTKLRMAT



GLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADLKST



QNAIDGITNKVNSVIEKMNTQFTAVGKEFSHLERRIENLNKKVDDGFLDIWTY



NAELLVLLENERTLDYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDD



MCMESVKNGTYDYPKYSEEAKLNREEIDGVKLES





36
ADTLCVGYHANNSTDTVDTVLEKNVTVTHSVNLLEDRHNGKLCKLRGVPPL



HLGKCNIAGWLLGNPECEPLLTTSSWSYIVETSISENGTCYPGDFIDYEELREQ



LSSVSSFERFEIFPKTSSWPNHDSNSGVTAACPHAGAKSFYRNLIWLVKKGNS



YPKLSKSYINNKGKEVLVLWGIHHPPTSADQQSLYQNVDTYVFVGSSRYSKK



FKPEIATRPKVRDQAGRMNYYWTLVEPGDKITFEATGNLIVPRYAFAMRRNS



GSGIIISDTPVHDCNATCQTPNGAINTSLPFQNIHPVTIGECPKYIKSTKLRMAT



GLRNVPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHRNEQGSGYAADLKST



QNAIDGITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWT



YNAELLVLLENERTLDYHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKC



DNTCMESVKNGTYDYPKYSEEAKLNREEIDGVRLES





37
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCLLKGIAPLQL



GNCSVAGWILGNPECESLISKKSWSYIVETPNPENGACYPGEFADYEELREQL



SSVSSFERFEIFPKESSWPNHTATGESASCSHNGKKSFYRNLIWLTVKNGLYPN



LSKSYENDKEKEVLILWGVHHPPNIENQRTLYHTENAYVSVVSSHYSGRFTPE



ITKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAFALSRGLGSGIITS



NAPMDECDSKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSAKLRMVTGLRNI



PSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADQESTQNAIN



GITNKVNSVIEKMNTQFTAVGKEFNKLERRMENLNKKVDDGFLDIWTYNAEL



LVLLENERTLDFHDSNVKSLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECME



SVKNGTYDYPKYYEESKLNREKIDGVKLDS





38
ADTICIGYHANNSTDTVDTVLEKNVTVTHSINLLEDSHNGKLCKLKGIAPLQL



GDCSIAGWILGNPNCESLFSKKSWSYIAEIPNPENGICYPGYFSDYEELREQLSS



ISSFERFEIFPKESSWPKHSVNKGVTASCSHKGKSSFYRNLLWLTEKNGSYPNL



SKSYVNSREKEVLVLWGVHHPSNIEDQRAIYRKENAYISVVSSLYNRRFTPEIA



RRPKVRDQEGRINYYWTILEPRDTIIFEATGNLIAPWYAFSLSRGFKSGIIVSNS



SIDNCDTKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSTKLRMATGLRNIQSI



QSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADLKSTQNAIDGIT



NKVNSVIDKMNTQFASVGKEFNKLEKRMENLNKKVDDGFLDIWTYNAELLV



LLENERTLDFHDSNVKSLYNKVKGQLKSNAKEIGNGCFEFYHKCDNECMESV



KNGTYDYKEYSEESKLNREKIDGVELKS





39
ADTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLNGISPLQL



GTCSIAGWVLGNPKCESLFSRRSWSYIAEVPNPENGICYPGHFSDYEELREQLS



SVSSFERFEIFPKESSWPGHNLTKGVTASCSHNGKSSFYRNLLWLTGEKGLYP



KVNKSYVNNKKKEVLVLWGVHHPSNIEDQRAIYHEENAYVSVVSSHYSRRFT



PEIARRPKIRNQEGRINYYWTLLEPDDKIIFEANGNLIAPRYAFALSRGFGSGIIV



SNTSIDECNTKCQTPQGAINSSLPFQNVHPVTIGECPKYVRSTKLRMVTGLRNI



PSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHRNEQGSGYAADQESTQNAIN



GITNKVNSVIEKMNTQFTAVGKEFNKLEKRMENLNKKVDDGFLDIWTYNAE



LLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDDDC



MESVKNGTYDYPRYSEESSLNRGKIDGVKLES





40
ADTICIGYHANNSTDTVDTILEKNVTVTHSVNLLESSHNGKLCRMKGMAPLQ



LGKCSIAGWILGNPECESSFSRKSWSYIAEAPNPENGICYPGYFSDYEELREQL



SSVSSVERFEIFPKESSWPKHNVTRGKTVSCPHNGKSSFYKNLLWLTEKDGSY



PNVSKPYVNNKEKEVLVIWGVHNPSNIEDQRAIYRKETAYVSVVSSHYNRRF



TPEIEKRPKIKNQEGRINYYWTLLEPGETIIFEANGNFIAPWYAFALSRGFRSGII



VSNASMDECDTECQTPQGAINNSLPFQNVHPVAIGECPKYIKSTRLRMVTGLR



NIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHRNEQGSGFAADQESTQNAI



NGITNKVNSIIGKMNTQFTAAGKEFNRLEKRMENLNKKVDDGLLDIWTYNAE



LLVLLENERTLDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECM



ESVKNGTYDYPKYSEESSLNRRKIDGVKLES





41
ADTICVGYHANNSTDTVDTILEKNVTVTHSINLLESSHNGKLCSLNGIAPLQLG



NCNIAGWILGNPECDLSFTANSWSYIIETSNSENGACYPGKFTDYEELREKLSS



VSSFERFEIFPKATSWPNHETNKGTTAACSHSGVKSFYRNLLWIVKKGNSYTK



VNKSYTNNKGKEVLVIWGVHHPPTNNVQQTLYQNASTYVSVGTSKYYRRFT



PEIVARPKVRDQAGRINYYWTLLDQGDTITFEATGNLIAPWYAFALNKGSVSG



IIISDANVHNCTTKCQTPHGALKSRLPFQNVHPITIGECPKYVKSTQLRMATGL



RNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQS



AIDGISNKVNSIIEKMNVQFTSVGKEFNNLEKRLENLNKKVDDGFLDVWTYN



AELLILLENERTLDFHDFNVKNLYEKVKSQLRDNAKEVGNGCFEFYHKCDNE



CMESIKNGTYDYPKYSEESKLNREEIYGVKLES





42
ADTICVGYHANNSTDTVDTILEKNVTVTHSVNLLENSHNGKLCSLNGKAPLQ



LENCNVAGWILGNPECDFLLTAYSWSYIIETSNSKNGTCYPGEFNNYEELREQ



LSTVSSLERFEIFPKATSWPNHDTTKGTTVACSHSGANSFYRNLLWIVKKENS



YPKLSKSYTNNKGKEVLVIWGVHHPPTDSDQQTLYQNNNTYVSVESSKYYQ



RFTPEIVARPKVRKQAGRMNYYWILLDQGDTITFEATGNLIAPWYAFALDKG



SSSGIMMSDAHVQNCTTKCQTPHGALKSNLPFQNVHPITIGECPKYVKSTQLR



MATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADK



KSTQIAIDGISNKVNSIIEKMNIQFTSVGKEFNNLEKRIENLNKKVDDGFLDVW



TYNAELLILLENERTLDFHDFNVRNLYEKVKSQLRNNAREIGNGCFEFYHKCD



NECMESIRNGTYNYPRYSEESKLNREKIDGVKLES





43
ADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDKHNGKLCKLRGVAPL



HLGKCNIAGWILGNPECESLSTASSWSYIVETSNSDNGTCYPGDFINYEELREQ



LSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNS



YPKLNQSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSRYSKK



FKPEIATRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFTMERNA



GSGIIISDTPVHDCNTTCQTPEGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATG



LRNVPSIQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKST



QNAIDKITNKVNSVIEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWT



YNAELLVLLENERTLDYHDSNVKNLYEKVRNQLKNNAKEIGNGCFEFYHKC



DNTCMESVKNGTYDYPKYSEEAKLNREKIDGVKLES





44
TATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGID



CTLIDALLGDPHCDVFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASS



GTLEFITEGTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVTM



PNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQTIIPNIGSRP



WVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFKMRTGKSSIMRSDAPID



TCISECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQT



RGLFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGK



LNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALEN



QHTIDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGTY



DHDVYRDEALNNRFQIKGVELKS





45
TATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKN



CTLIDALLGDPHCDGFQNEKWDLFVERSKAFSNCYPYDVPDYASLRSLVASS



GTLEFINEGNWTGVTQSGGSYACKRGSVNSFFSRLNWLYESEYKYPALNVTM



PNNGKFDKLYIWGVHHPSTDKEQTNLYVRASGRVTVSTKRSQQTVIPNIGSRP



WVRGLSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRTGKSSIMRSDAPIGTC



SSECITPNGSIPNDKPFQNVNKITYGACPRYVKQNTLKLATGMRNVPEKQTRG



IFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLN



RLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFEKTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





46
IATLCLGHHAVPNGTLVKTITSDQIEVTNATELVQSFSTGEICNNPHRILDGMD



CTLVDALLGDPHCDGFQNGTWDLFVERSKAFSNCYPYDVPDYASLRSLIASS



GTLEFTNEGFNWAGVTQNGGSSACRRGSDSSFFSRLNWLYKLGNTYPVLNVT



MPNNDNFDKLYIWGIHHPSTDQEQTRLYARESGRVTVSTKRSQQTVTSNVGP



RPWVRGLSSRISIYWTIVKPGDILLINSNGNLIAPRGYFKMRAGKSSIMRSDAPI



STCISECITPNGNVPNDKPFQNVNKITYGACPKYIRQKTLKLATGMRNVPEKQ



TRGIFGAKAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQTAIDQIN



GKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVAL



ENQHTIDLTDSEMNKLFEKTRRQLRENAEDIGNGCFKIYHRCDNACIGSIRNG



TYNHNVYRDEALNNRFQIRGVELKS





47
TATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKN



CTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFTNEGNWTGVAQDGTSYACKRGSVKSFFSRLNWLHKLEYKYPALNVT



MPNNDKFDKLYIWGVHHPSTDSVQTSLYVQASGRVTVSTKRSQQTVIPNIGS



RPWVRGISSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIG



NCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQT



RGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGK



LNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALEN



QHTIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGT



YDHDVYRDEALNNRFQIKGVELKS





48
TATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGEN



CTLIDALLGDPHCDGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWTGVAQNGTSYACKRSSIKSFFSRLNWLHQLKYKYPALNVTM



PNNDKFDKLYIWGVHHPSTDSDQTSIYAQASGRVTVSTKRSQQTVIPNIGSRP



WVRGISSRISIHWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKC



NSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRG



IFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLN



RLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





49
IAMLCLGHHAVPNGTLVKTITNDQVEVTNATELVQSSSTGKICNHPHRILDGM



DCTLIDSLLGDPHCDSFQNETWDLFVERSRAFSNCYPYDVPDYASLRSLVASS



GTLEFINEDFNWAEVTQNGGSSACKRGSDSSFFSRLNWLYKSGNTYPTLNVTI



PNNGNFDKLYIWGIHHPSTTKEQTRLYVQESGKVTVSTKRSQQTVIPNVGPRP



WVRGLSSRISIYWTIVKPGDLLLISSTGNLIAPRGYFRLRAGKSSIIRSEAPIGTCI



SECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNVPEKQTRGI



FGAKAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAVIDQINGKLN



RIIEKTNEKFHQIEKEFSEVEGRVQDLEKYVEDTKIDLWSYNAELLIALENQHT



IDLADSEMNKLFEKTRRLLRENAEDIGNGCLKIYHRCDNACIGSIRNGTYNHD



VYRDEALNNRFQIKGIELKS





50
TATLCLGHHAVPNGTLVKTITDDQIEVTNATELVQNFSMGKICNNPHRILDGA



NCSLIDALLGDPHCDDLQNEKWDLFIERSKAFSNCYPYDVPEYTSLRSLIASSG



TLEFTNEGFNWTGVTQNGGSSACKRGPNNSFFSRLNWLYKSGNTYPILNVTM



QNSDDFDKLYIWGVHHPSTDREQTNLYVQASGKVTVSTKRSQQTIIPNIGSRP



WIRGLSSRISIYWTIVKPGDILMINSNGNLIAPRGYFKVYTGRSSIMKSDAPLDT



CNSECITPNGSIPNDKPFQNVNKITYGACPKYIKQNTLKLATGMRNIPEKQTRG



IFGAIAGFIENGWEGMVNGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLN



RIIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHT



IDLTDSEMNKLFEKTRKQLRENAEDMGNGCFKIYHKCDNSCMESIRNGTYDH



NEYRNEALSNRFQIKSVELKS





51
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGGICDSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWTGVTQNGTSSACKRRSNKSFFSRLNWLTHLKYKYPALNVT



MPNNEKFDKLYIWGVLHPGTDSDQIRLYAQASGRITVSTKRSQQTVIPNIGSRP



RVRDVSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKC



NSECITPNGSIPNDKPFQNVNRITYGACPRYIKQNTLKLATGMRNVPEKQTRGI



FGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLN



RLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQY



TIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





52
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGGICDSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWAGVTQNGTSSACKRRSNKSFFSRLNWLTHLKYKYPALNVT



MPNNEKFDKLYIWGVHHPGTDSDQISLYAQASGRITVSTKRSQQTVIPNIGSRP



RVRDVSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKC



NSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRG



IFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLN



RLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIESIRNGTYDH



DVYRDEALNNRFQIKGVELKS





53
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGGICDSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWTGVTQNGTSSSCKRRSNNSFFSRLNWLTHLKFKYPALNVTM



PNNEKFDKLYIWGVHHPGTNNDQISLYTQASGRITVSTKRSQQTVIPNIGSRPR



VRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNS



ECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIF



GAIAGFIENGWEGMVDGWYGFRHQNSEGIGQAADLKSTQAAINQINGKLNRL



IGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTID



LTDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDV



YRDEALNNRFQIKGVELKS





54
TATLCLGHHAVPNGTLVQTITDDQVEVTNATELVQSSSVGRICNSPHRILDGG



NCTLIDALLGDPHCDAFQNEKWDLFVERSKAFSNCYPYDMQDYVSLRSLVAS



SGTLEFTNEGFNWTGVTQSGGSYACKRGSGNSFFSRLNWLYQSENKYPALNV



TMPNNGNFDKLYIWGVHHPSTDREQTNLYVRGSGRVTVSTKRSQQTVIPNIG



PGPWVRGLSSRMSIYWTIVKPGDTLLITSNGNLIAPRGYFKIHAGKSSIMRSDA



PIGTCSSECITPNGSIPNDKPFQNVNKITYGACPKYVKQNTLKLATGMRNIPEK



QNRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAIDQIN



GKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVAL



ENQHTVDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNTCIGSIRN



GTYDHDIYRDEALNNRFQIKDVVLKS





55
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGKN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESFNWTGVTQNGTSSACIRRSKNSFFSRLNWLTHLNFKYPALNVT



MPNNEQFDKLYIWGVHHPGTDKDQIFLYAQASGRITVSTKRSQQTAIPNIGYR



PRVRNIPSRISIYWTIVKPGXILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKC



NSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRG



IFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLN



RLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





56
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWTGVTQNGTSSACIRRSNNSFFSRLNWLTHLKFKYPALNVTM



PNNEKFDKLYIWGVHHPGTDNDQIFPYAQASGRITVSTKRSQQTVIPNIGSRPR



VRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNS



ECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIF



GAIAGFIENGWEGMVDGWYGFRHQNSEGIGQAADLKSTQAAIDQINGKLNRL



IGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTID



LTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDV



YRDEALNNRFQIKGVELKS





57
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASS



GTLEFNNESFNWTGVTQNGTSSACIRRSNNSFFSRLNWLTHFKFKYPALNVT



MPNNEQFDKLYIWGVHHPGTDNDQIFLYAQASGRITVSTKRSQQTVIPNIGSR



PRVRIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMKSDAPIGKCN



SECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGI



FGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLN



RLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





58
TATLCLGHHAVPNGTLVKTITGEQIEVTNATELVQSLSMGKICKNPHRILDGA



NCTLMDALLGDPHCDGFQNEKWDLFIERSKAFSNCYPYDVPEYTSLRSIIASS



GTLEFFSENFNWTGVTQNGGSNSCKRGPNSSFFSRLNWLYKSGNNYPILNVT



MPNSEGFDKLYIWGVHHPSTENEQTNLYAQAPGKITVSTRSSQQTVIPNIGPRP



WIRGLSSRISIYWTIVKPGDILMINSNGNLIAPRGHFKVHTGRSSIMRSDAPIDT



CSSECITPNGSIPNDKPFQNVNKITYGVCPKYVKQSTLKLATGMRNIPERQTRG



IFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLN



RVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFEKTRKQLRENAEDMGNGCLKIYHKCDNSCMRSIRNGTYD



HNEYRDEALNNRFQIKSVELKL





59
TATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICNSPHQILDGEN



CTLIDALLGDPQCDGFQNKKWDLFVERSKAHSNCYPYDVPDYASLRSLVASS



GTLEFNNESNWTGVTQNGTSSACIRRSNNSFFSRLNWLTHLNFKYPALNVTM



PNNEQFDKLYIWGVHHPGTDKDQIFLYAQASGRITVSTKRSQQAVIPNIGSRPR



VRDIPSRVSIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCN



SSCITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGI



FGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLN



RLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





60
TATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGKICNSPHQILDGKN



CTLIDALLGDPHCEEFQNKEWDLFVERSTAYSNCYPYYVPDYASLRSLVASSG



TLKFTKEDFNWTGVSQDGSSFSCKRKSVNSFFSRLNWLHKSTNKYPALNVTM



PNIDKFDKLYIWGVHHPSTDSDQISLYAQASGRVMVSTKRSQQAVIPNIGSRP



WVRGVSSIISIHWTIVKPGDILLINSTGNLIAPRGYFKIRGGKSSIMRSDEPIDNC



NFDCITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRG



IFGAIAGFIENGWEGMVNGWYGFRHQNSEGTGQAADLKSTQAAISQINGKLN



RLIEKTNEKFHQIEKEFSEVEGRVQDLEKYVEDTKIDLWSYNAELLVALENQH



TIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HDVYRDEALNNRFQIKGVELKS





61
TATLCLGHHAVPNGTLVKTITDDQVEVTNATELVQNSSAGRICDNPHRILDGK



NCTLIDALLGDPHCDSFKNKEWDLFVERSEAYSDCYPYDVPDYDYIRSIVASS



GTLEFTNESFNWTGVAQDGTSNSCKRGSVKSFFSRLNWLHKLEYKYPALNVT



MPNNDRFDKLYIWGVHHPSTDSDQTNLYVQASGKVVVSTKRSQQTVIPNIGF



RPWIRGISSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDTPIGN



CNSECITPNGSIPNDKPFQNVNRITYGACPRYIKRNTLKLATGMRNVPEKQTR



GIFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKL



NRLVEKTNEKFHQIEKEFSEVEGRIQDLEKYIEDTKIDLWSYNAELLIALENQH



TIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD



HNVYRDEALNNRFQIKGVKLKS





62
TATLCLGHHAVPNGTLVKTVTDDQIEVTNATELVQSLSTGKICNSPHQILDGG



NCTLIDALLGDPHCDRFQNRKWDLFVERSTAYSNCYPYDVPDYASLRSLIASS



GTLEFTKEDFNWTGVAQDGASYSCKRGSVKSFFSRLNWLHKLEYKYPALNV



TMPNNDRFNKLYIWGVHHPSTDSDQTRLYVQASGKVTVSTKRSQQTVIPNIG



PIPWIRGVSSRISIYWTIVNPGDILLINSTGNLIAPRGYFKLRNGRSSVMRSDAPI



DNCNSECITPNGSIPNDKPFQNVNRITYGTCPRYVKQNTLKLATGMRNVPEKQ



TRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQTAINQING



KLNRLIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALE



NQHTVDLTDSEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIGSIING



TYDHDVYRDEALNNRFQIKGVELKS





63
MAMLCLGHHAVPNGTLVKTITNDHIEVTNATELVQSSSMGRICNSPHQILDGE



NCTLIDALLGDPHCDAFQNKEWDLFVERSTAYSNCYPYDVPDYASLRALVAS



SGTLEFNNENFNWTGVAQNGASTACKRRSSNSFFSRLNWLYQLRYKYPALN



VTMPNNEKFDKLYIWGVHHPSTDNDQISLYAQASGRVIVSTKRSQQTVIPNIG



SRPWVRGVSSTISIYWTIVKAGDILLINSTGNLIAPRGYFKIRNGKSSIMRSNAPI



GRCNAGCITPNGSISNDKPFQNVNRITYGACPRYVKQKTLRLATGMRNVPEK



QTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAISQIS



GKLNRLIGKTNEKFHQIEKEFSEVEGRIQDIEKYVEDTKIDLWSYNAELLVALE



NQHTIDLTDSEMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIESIRNGT



YDHDVYRDEALNNRFQIKGIELKS





64
TATLCLGHHAVPNGTLVKTITNDQIEVTNATELVQSSSTGRICDSPHRILDGKN



CTLIDALLGDPHCDGFQNKEWDLFIERSKAYSNCYPYDVPDYSSLRSLVASSG



TLEFTNEDFNWTGVAQDGGSYSCKRGSVKSFFSRLNWLHKLEYKYPALNVT



MPNNDKFDKLYIWGVHHPATDKDQISLYAQAAGRIIVSTKRSQQAVIPNIGSR



PRVRDIPSRISIYWTIVRPGDILLINSPGNLIAPRGYFKIRSGKSSIMRSDAPIGKC



NSACITPNGSIPNDKPFQKVNRITYGACPRYVKQNAVKLATGMRNRPEKQTR



GIFGAIAGFIENGWEGMVHGWDGFTHQPSGGTGQAADLKSTQAAIDQINGKL



NRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQ



HTIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTY



DHDVYRDEALNNRFQIKGVELKS





65
TATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENC



TLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSG



TLEFNNESNWTGVKQNGTSSACIRKSSSSFFSRLNWLTHLNYTYPALNVTMP



NNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRI



RDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIQSGKSSIMRSDAPIGKCKSE



CITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFG



AIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLI



GKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTID



LTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNV



YRDEALNNRFQIKGVELKS





66
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILERTHNGKLCDLNGVKPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGNFNDYEELKHL



LSRISHFEKIQIIPKSSWSNHDASSGVSSACPYLGKSSFFRNVVWLIKKNSTYPT



IKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVPE



IATRPKVNGQSGRIEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSTIMKS



ELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNA



PQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADQEST



QKAIDGVTNKVNSIINKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWT



YNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKC



DNECMESVKNGTYDYPQYSEEARLNREEISGVKLES





67
SDQICIGYHANNSTELVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKANPANDLCYPGDFNDYEELKHL



LSRINHFEKIQIIPKSSWPNHEASSGVSSACPYQGKSSFFRNVVWLIKKNSAYPT



IKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVPK



IATRSKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAIM



KSELEYGNCNTKCQTPLGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLR



NTPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADK



ESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDV



WTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYH



KCDNECMESVKNGTYDYPQYSEEARLNREEISGVKLES





68
SDQICIGSHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGDFNDYEELKHL



LSRINHFEKIQIIPKSSWSNHEASSGVSSACPYLGRPSFFRNVVWLIKKNNTYPT



IKRSYNNTNQEDLLVLWGIHHPNDEAEQIKLYQNPTTYISVGTSTLNQRLVPKI



ATRSKVNGQSGRMEFFWAILKPNDAINFESNGNFIAPEYAYKIVKKGDSAIMK



SELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRN



APQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKE



STQKAIDGVTNKVNSIIDKMNTQFEAIGREFNNLERRIENLNKKMEDGFLDVW



TYNAELLVLMENERTLDFHDSNVKNLYEKVRLQLRDNAKGLGNGCFEFYHK



CDNECMESVKNGTYDYPQYSEEATLNREEISGVKLES





69
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKANPVNDLCYPGDFNDYEELKHL



LSRINHFEKIQIIPKSSWSSHEASLGVSSACPYQGKSSFFRNVVWLIKKNSTYPT



IKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVPR



IATRSKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSTIMK



SELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRN



SPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKES



TQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVW



TYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHK



CDNECMESVRNGTYDYPQYSEEARLKREEISGVKLES





70
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKANPANDLCYPGNFNDYEELKHL



LSRINHFEKIQIIPKSSWSDHEASSGVSSACPYQGTPSFFRNVIWLIKKNNTYPTI



KRSYNNTNQEDLLILWGIHHSNDAAEQTKLYQNPTTYISVGTSTLNQRLVPKI



ATRSKVNGQSGRMDFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAILK



SEVEYGNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNKLVLATGLRNS



PLRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKEST



QKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWT



YNAELLVLMENERTLDFHDSNVKNLYDKVRLQLKDNAKELGNGCFEFYHKC



DNECMESVRNGTYDYPQYSEEARLKREEISGVKLES





71
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCNLDGVKPLIL



RDCSVAGWLLGNPMCDEFLNVPEWSYIVEKINPANDLCYPGNFNDYEELKHL



LSRINHFEKIQIIPKNSWSDHEASGVSSACPYQGRSSFFRNVVWLTKRDNAYPT



IKRSYNNTNQEDLLVLWGIHHPNDAAEQTRLYQNPTTYISVGTSTLNQRLVPK



IATRSKVNGQSGRMEFFWTILKSNDAINFESNGNFIAPENAYKIVKKGDSTIMK



SELEYGNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNS



PQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKEST



QKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWT



YNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHRC



DNECMESVRNGTYDYPQYSEEARLKREEISGVKLES





72
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCNLDGVKPLIL



KDCSVAGWLLGNPMCDEFLNVSEWSYIVEKASPANGLCYPGDFNDYEELKH



LLSRINHLKKIKIIPKSYWSNHEASSGVSAACSYLGEPSFFRNVVWLIKKNNTY



PPIKVNYTNTNQEDLLVLWGIHHPNDEKEQIRIYQNPNTSISVGTSTLNQRLVP



KIATRPKVNGQSGRMEFFWTILKPNDSINFDSNGNFIAPEYAYKIAKKGDSVIM



KSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLR



NAPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADK



ESTQKAIDGITNKVNSIIDKMNTQFEIVGREFNNLERRIENLNKKMEDGFLDV



WTYNAELLVLMENERTLDFHDSNVKNLYEKVRLQLRDNAKELGNGCFEFYH



KCDNECMESVRNGTYDYPQYSEEARLSREEISGVKMES





73
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLIL



RDCSVAGWLLGNPMCDEFLNVPEWSYIVEKINPANDLCYPGNFNDYEELKHL



LSRINHFEKIQIIPKSSWSDHEASSGVSSACPYQGRSSFFRNVVWLIKKNDAYP



TIKISYNNTNQEDLLVLWGIHHPNDAAEQTRLYQNPTTYISVGTSTLNLRLVP



KIATRSKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPENAYKIVKKGDSTIM



KSELEYGNCNTKCQTPVGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLR



NSPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADK



ESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDV



WTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYH



RCDNECMESVRNGTYDYPQYSEESRLKREEISGVKLES





74
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVRPLIL



RDCSVAGWLLGNPMCDEFINVPEWSYIVEKANPVNDLCYPGVFNDYEELKHL



LSRINHFEKIQIIPKSSWPSHEASLGVSAACPYQGQSSFFRNVVWLIKKNNTYP



TIKRSYNNTNQEDLLVMWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLTP



RIATRSKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSTIM



RSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSTRLVLATGLR



NSPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADK



ESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDV



WTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYH



KCDNECMESVRNGTYDYPQYSEEARLKREEISGVKLES





75
SDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCNLDGVKPLIL



RDCSVAGWLLGNPMCDEFLNVPEWSYIVEKINPANDLCYPGNFNDYEELKHL



LSRINHFEKIQIIPKDSWSDHEASGVSSACPYQGRSSFFRNVVWLTKKNDAYPT



IKKSYNNTNQEDLLVLWGIHHPNDAAEQTRLYQNPTTYISVGTSTLNQRLVPK



IATRSKVNGQSGRMEFFWTILKSNDAINFESNGNFIAPENAYKIVKKGDSTIMK



SELEYSNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNS



PQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKEST



QRAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWT



YNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHRC



DNECMESVRNGTYDYPQYSEEARLKREEISGVKLES





76
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTQTRGKLC



PNCFNCTDLDVALGRPKCMGNTPSAKVSILHEVKPATSGCFPIMHDRTKIRQL



PNLLRGYENIRLSTSNVINTETAPRGSYKVGTSGSCPNVANGNGFFNTMAWVI



PKDNNKTAINPVTVEVPYICSEGEDQITVWGFHSDDKTQMERLYGDSNPQKF



TSSANGVTTHYVSQIGGFPNQTEDEGLKQSGRIVVDYMVQKPGKTGTIVYQR



GILLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAI



GNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWH



GYTSHGAHGVAVAADLKSTQEAINKITKNLNYLSELEVKNLQRLSGAMNELH



DEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPS



AVEIGNGCFETKHKCNQTCLDRIAAGTFNAGDFSLPTFDSLNITAASLNDDGL



DNHTILLYY





77
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTQTRGKLC



PNCLNCTDMDVALGRPKCMGTIPSAKVSILHEVKPVTSGCFPIMHDRTKIRQL



PNLLRGYENIRLSTRNVINAETAPGGPYTVGTSGSCPNVTNGKGFFETMAWA



VPKNKNKTATNPLTVEVPYICTKGEDQITVWGFHSDDETLMVILYGDSKPQK



FTSSANGVTTHYVSQIGGFPNQTEDEGLKQSGRIVVDYIVQKPGKTGTIVYQR



GVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKA



IGNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWH



GYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELH



NEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPS



AVEIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL



DNHTILLYY





78
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTQTRGKLC



PNCLNCTDLDVALGRPKCMGNIPSAKASILHEVKPGTSGCFPIMHDRTKIRQL



PNLLRGYENIRLSARNVTNAETAPGGPYIVGTSGSCPNVTNGNGFFATMAWA



VPKNKTATNPLTVEVPYICTKGEDQITVWGFHSDDETQMVKLYGDSKPQKFT



SSANGVTTHYVSQIGGFPNQAEDEGLPQSGRIVVDYMVQKPGKTGTIAYQRG



VLLPQKVWCASGRRKVIEGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAI



GNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWH



GYTSHGAHGVAVAADLKSTQEAINKITKNLFSLSELEVKNLHRLSGAMDELH



NEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPS



AVEIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL



DNHTILLYY





79
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTKTRGKLC



PKCLNCTDLDVALGRPKCMGTIPSAKASILHEVKPVTSGCFPIMHDRTKIRQLP



NLLRGYENIRLSTHNVINAEAAPGGPYIVGTSGSCPNVTNGNGFFATMAWAV



PKNNNNKTATNPLTVEVPFICTEGEDQITVWGFHSDNEIQMVKLYGDSKPQKF



TSSANGVTTHYVSQIGGFPKQAEDGGLPQSGRIVVDYMVQKSGKTGTITYQR



GILLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAI



GNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWH



GYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELH



NEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPS



AVDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL



DNHTILLYY





80
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTKTRGKLC



PNCLNCTDLDVALGRPMCIGNIPSAKASILHEVRPVTSGCFPIMHDRTKIRQLP



NLLRGYEHIRLSTHNVINAERAPGGPYRLGTSGSCPNVTSRSGFFATMAWAVP



RDNKTATNPLTVEVPYICTKGEDQITVWGFHSDNETQMKNLYGDSNPQKFTS



SANGITTHYVSQIGGFPNQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGVL



LPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAIGN



CPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWHGYT



SHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNEIL



ELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVD



IGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGLDNH



TILLYY





81
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTLTKSHFANLKGTKTRGKLC



PKCLNCTDLDVALGRPKCTGNIPSAKVSILHEVRPVTSGCFPIMHDRTKIRQLP



NLLRGYERIRLSNHNVINAEKAPGGPYKIGTSGSCPNVTNGNGFFATMAWAV



PKNENNKTATNSLTIEVPYICTEGEDQITVWGFHSDNEIQMVKLYGDSKPQKF



TSSANGVTTHYVSQIGGFPNQIEDGGLPQSGRIVVDYMVQKSGKTGTITYQRG



ILLPQKVWCASGRSKIIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAIG



NCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWHG



YTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHN



EILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSA



VEIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGLD



NHTILLYY





82
DRICTGITSSNSPHVVKTATQGEVNVTGAIPLTTTPAKSHFANLKGTKTRGKLC



PTCLNCTDLDVALGGPMCVGITPSAKASILHEVRPVTSGCFPIMHDRTKIRQLP



NLLRGYEKIRLSAQNVINAEKAPGGPYKLGTSGSCPNATSKSGFFATMAWAV



PRDDNKTATNPLTVEVPHICANEEDQITVWGFHSDNKTQMKNLYGDSNPQKF



TSSANGITTHYVSQIGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRG



ILLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHAKAIG



NCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWHG



YTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHN



EILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSA



VDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGLD



NHTILLYY





83
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSYFANLKGTRTRGKLC



PDCLNCTDLDVALGRPMCVGTTPSAKASILHEVRPVTSGCFPIMHDRTKIRQL



PNLLRGYENIRLSTQNVIDAENAPGGPYRLGTSGSCPNATSKSGFFATMAWA



VPKDNNKNATNPLTVEVPYVCKEGEDQITVWGFHSDNKTQMKNLYGDSNPQ



KFTSSANGVTTHYVSQIGGFPAQTEDGGLPQSGRIVVDYMVQKPRKTGTIVY



QRGVLLPQRVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHA



KAIGNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAG



WHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMD



ELHNEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKML



GPSAVDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDD



GLDNHTILLYY





84
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSYFANLKGTRTRGKLC



PDCLNCTDLDVALGRPMCVGTTPSAKASILHEVRPVTSGCFPIMHDRTKIRQL



PNLLRGYENIRLSTQNVIDAEKAPGGPYRLGTSGSCPNATSKSGFFATMAWA



VPKDNYKNATNPLTVEVPYICTEGEGQITVWGFHSDNKPQMKNLYGDSNPQ



KFTSSASGVTTHYVSQIGDFPDQTEDGGLPQSGRIVVDYMMQKPGKTGTIVY



QRGVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPYYTGEHA



KAIGNCPIWVITPQKLANGTTFSPPAKLLKERGFFGAIAGFLEGGWEGMIAGW



HGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDEL



HNEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLG



PSAVDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDG



LDNHTILLYY





85
DRICTGITSSISPLVDKTATQGEVNVTGVIPLTTTPTKSYFANVKGTRTRGKLCP



DCLNCTDLDVALGRPMCVGTTPSAKASILHEVRPVTSGCFPIMHDRTKIRQLP



NLLRGYENIRLSTQNVIDAQKAPGGPYRLGTSGSCPNATSKIGFFATMACAVP



KDNYKNATNPLTVEVPYIRTEGEDQITVWGFHSDNKTQMKNLYGDSNPPKFT



SSANGVTTHYVSQTGDFPDQTEDGGLPPSGRIVVDYMMQKPGKTGTIAYQRG



VLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKPPHTGEHTKAIG



NCPIWVKTPLKLANGTKYRPSAKLLKERGFFGAIAGFLEGGWEGMIAGWHG



YTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHN



EILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSA



VDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFFLPTFDSLNITAASLNDDGLD



NHTILLYY





86
DRICTEITSSNSPLVVKTATQGEVNVTGVIPLTTTPTKSYFANLKGTRTRGKLC



PDCLNCTDLDVALGRPMCVGTTPSAKASILHEVRPVTSGCFPIMHDRTKIRQL



PNLLRGYENIRLSTQNVIDAEKAPGGPYRLGTSGSCPNATSKIGFFATMAWAV



PKDNYKNATNPLTVEVPYICTEGEDQITVWGFHSDNKTQMKNLYGDSNPQKF



TSSANGVTTHYVSQIGDFPDQTEDGGLPQSGRIVVDYMMQKPGKTGPIVYQR



GVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNKSKYSHTGDHAKT



LGNCPIWVKTPLKLANGTKYRPPAKLLKERGFFGAIAGFLEGGWEGMIAGWH



GYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELH



NEILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPS



AVDIGNGCFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL



DNHTILLYY





87
DRICTGITSSNSPHVVKTATQGEVNVTGVIPLTTTPTKSHFANLKGTETRGKLC



PKCLNCTDLDVALGRPKCTGTTPSARASILHEVRPVTSGCFPILHDRTKIRQLP



NLLRGYEHIRLSTHNVINAENAPGGPYKIGTSGSCPNITNGNGFFATMAWAVP



KNDKNKTATNPLSVEVPYICTEGEDQITVWGFHSDNETQMAKLYGDSKPQKF



TSSANGVATHYVSQIGGFPNQTEDGGLPQSGRIVVDYMVQKSGKTGTITYQR



GILLPQRGWCASGRSKVIKGSLPLFGEADCLHEKYGGLNKSKPYYTGEHAKAI



GNCPIWVKTPLKLANGTKYRPPAKLFKERGFFGAIAGFLEGGWEGMIAGWHG



YTSHGAHGVPVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHN



EILELDEKVDDLRADTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSA



VEIGNGCFETKHKCNQTCLDRIAAGTFDAGEFSLPTFDSLNITAASLNDDGLD



NHTILLYY
















TABLE 7







Amino acid sequences of neuraminidase antigens that can


be included in a flu vaccine








SEQ



ID
Sequence











88
HSIQTRDQNHPETCNQSIITYENNTWVNQTYVNISNANIVAGQDATSMILAGN



SSLCPISRWAIYSKDNSIRIGSKGDIFVIREPFISCSHLECRTFFLTQGALLNDKH



SNGTVKDRSPYRTLMSCPIGEAPSPYNSRFESVAWSASACHDGMSWLTIGISG



PDNGAVAVLKYNGIITDTIKSWRNKILRTQESECACVNGSCFTIMTDGPSNGQ



ASYKIFKIEKGRIIKSIELNAPNYHYEECSCYPDTSKVMCVCRDNWHGSNRPW



VSFDQNLDYQIGYICSGVFGDNPRSNDGTGSCGPVSSNGANGVKGFSFRYGN



GVWIGRTKSISSRSGFEMIWDPNGWTETDSSFSVKQDIVAITDWSGYSGSFVQ



HPELTGLDCIRPCFWVELIRGQPKENTIWTSGSSISFCGVNSDTVGWSWPDGA



ELPFTIDK





89
HSIQTGSQNHTGICNQNIITYKNSTWVNQTYVNISNTNVVAGQDTTSVILTGNS



SLCPIRGWAIYSKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDRH



SNGTVKDRSPYRALMSCPVGEAPSPYNSRFESVAWSASACHDGMGWLTIGIS



GPDNGAVAVLKYNGIITETIKSWRKKILRTQESECACVNGSCFTIMTDGPSNG



LASYKIFKIEKGKVTKSIELNAPNSHYEECSCYPDTGKVMCVCRDNWHGSNR



PWVSFDQNLDYQIGYICSGVFGDNPRPKDGTGSCGPVYVDGANGVKGFSYR



YGNGVWIGRTKSNSSRHGFEMIWDPNGWTETDSKFSVRQDVVAMTDWSGY



SGSFVQHPELTGLDCMRPCFWVELIRGRPKENTIWTSGSSISFCGVNSDTVDW



SWPDGAELPFTIDK





90
HSIQTGSQNHTGTCNQSIITYKNSTWVNQTYVNISNTNVVAGKDTTSVILAGN



SSLCPIRGWAIYSKDNGVRIGSKGDVFV1REPFISCSHLECRTFFLTQGALLNDK



HSNGTVKDRSPYRALMSCPVGEAPSPYNSRFESVAWSASACHDGMGWLTIGI



SGPDDGAVAVLKYNGIITETIKSWRKEILRTQESECACVNGSCFTIMTDGPSGG



PASYKIFKIEKGKVTKSIELDAPNSHYEECSCYPDTGKVMCVCRDNWHGSNR



PWVSFDQNLDYQMGYICSGVFGDNPRPKDGKGNCGPVYVDGANGVKGFSY



RYGNGVWIGRTKSNSSRQGFEMIWDPNGWTETDSNFFVKQDVVAVTDWSG



YSGSFVQHPELTGLDCMRPCFWVELIRGRPKEKTIWTSGSSISFCGVNSDTVD



WSWPDGAELPFTIDK





91
HSIQTGSQNHTGICNQRIITYENSTWVNQTYVNINNTNVVAGKDTTSVTLAGN



SSLCPVRGWAIYSKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDK



HSNGTVKDRSPYRALMSCPIGEAPSPYNSRFESVAWSASACHDGMGWLTIGIS



GPDDGAVAVLKYNGIITETIKSWRKRILRTQESECVCINGSCFTIMTDGPSNGP



ASYRIFKIEKGKITKSIELDAPNSHYEECSCYPDTGTVMCVCRDNWHGSNRPW



VSFNQNLDYQIGYICSGVFGDNPRPKDGKGSCGPVTVDGADGVKGFSYRYGN



GVWIGRTKSNSSRKGFEMIWDPNGWTDTDSNFLVKQDVVAMTDWSGYSGSF



VQHPELTGLDCMRPCFWVELIRGRPREKTTIWTSGSSISFCGVNSDTVNWSWP



DGAELPFTIDK





92
HSIQIGTQNQPETCNQSVITYENNTWVNQTYVNISNTNFVAEQTVVSVKLAGN



SSLCPVSGWAIYSKDNSVRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLND



KHSNGTIKDRSPYRTLMSCPVGEVPSPYNSRFESVAWSASACHDGTSWLTIGI



SGPDNGAVAVLKYNGIITDTIKSWRNNILRTQESECACVNGSCFTVMTDGPSN



GQASYKIFKIERGKVVKSVELNAPNYHYEECSCYPDSGEITCVCRDNWHGSN



RPWVSFNQNLEYKIGYICSGIFGDNPRPNDRTGSCGPVFSNGANGVKGFSFKY



GNGVWIGRTKSTSSRSGFEMIWDPNGWTGTDNNFSVKQDIVGITDWSGYSGS



FVQHPELTGLDCMRPCFWVELIRGRPKENTIWTSGSSISFCGVTSDTVGWSWP



DGAELPFTIDK





93
HSIQTGSQNHTGICNQRIITYENSTWVNHTYVNINNTNVVAGKDKTSVTLAGN



SSLCSISGWAIYTKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDK



HSNGTVKDRSPYRALMSCPLGEAPSPYNSKFESVAWSASACHDGMGWLTIGI



SGPDNGAVAVLKYNGIITETIKSWKKRILRTQESECVCVNGSCFTIMTDGPSNG



AASYKIFKIEKGKVTKSIELNAPNSHYEECSCYPDTGTVMCVCRDNWHGSNR



PWVSFNQNLDYQIGYICSGVFGDNPRPKDGEGSCNPVTVDGADGVKGFSYRY



GNGVWIGRTKSNRLRKGFEMIWDPNGWTDTDSDFSVKQDVVAMTDWSGYS



GSFVQHPELTGLDCIRPCFWVELVRGRPRENTTIWTSGSSISFCGVNSDTANW



SWPDGAELPFTIDK





94
HSIQTGSQNHTGICNQRIITYENSTWVNQTYVNINNTNVVAGKDKTSMTLAG



NSSLCPIRGWAIYTKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLND



KHSNGTVKNRSPYRALMSCPLGEAPSPYNSRFESVAWSASACHDGLGWLTIG



ISGPDNGAVAVLKYNGIITETIKSWKKRILRTQESECVCMNGSCFTIMTDGPSD



GAASYRIFKIEKGRVIKSIELDAPNYHYEECSCYPDTGTVMCVCRDNWHGSNR



PWVSFNQNLDYQIGYICSGVLGDNPRPKDGEGSCNPVTADGADGVKGFSYRY



GNGVWIGRTKSNRLRKGFEMIWDPNGWTDTDSDFSMKQDIVAMTDWSGYS



GSFVQHPELTGLDCMRPCFWVELVRGLPRENTTIWTSGSSISFCGVNSDTANW



SWPDGAELPFTIDK





95
HSIQTGSQNHTGVCNQRIITYENSTWVNHTYVNINNTNVIVGKDKTSVTLAGN



SSLCSISGWAIYTKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDK



HSNGTVKDRSPYRALMSCPLGEAPSPYNSKFESVAWSASACHDGMGWLTIGI



SGPDNGAVAVLKYNGIITETIKSWKKQILRTQESECVCVNGSCFTIMTDGPSN



GAASYKIFKIEKGKITKSIELNAPNFHYEECSCYPDTGTVMCVCRDNWHGSNR



PWVSFNQNLDYQIGYICSGVFGDNPRPKDGEGSCNPVTVDGANGVKGFSYKY



GNGVWIGRTKSNRLRKGFEMVWDPNGWTDTDSDFSVKQDVVAITDWSGYS



GSFVQHPELTGLNCIRPCFWVELVRGLPKENTTIWTSGSSISFCGVNSDTANWS



WPDGAELPFTIDK





96
HSIQTGWKNHTETCIQNVITYENNTWVNQTYVNISNTKTVTGQGVASTMVAG



NSSLCPINGWAVYSKDNSIRIGSKGDIFVIREPFISCSNLECRTFFLTQGALLND



RHSNGTVKDRSPYRTLMSCPIGEAPSPYNSRFESVAWSASACHDGVGWLTIGI



SGPDNGAVAVLKYNGIITDTIKSWRNKILRTQESECVC1NGSCFTIMTDGPSNE



QASYKVFKIEKGKIIKTIELDAPNYHYEECSCYPDTSKVMCVCRDNWHASNRP



WVSFDQNLNYQIGYICSGVFGDNPRSNDGKGSCGPVPSSGANGVKGFSFRYG



SGVWIGRTKSISSRSGFEMIWDPNGWTETDSSFSMKQDIIALTDWSGYSGSFV



QHPELTGMDCIRPCFWVELIRGQPKESTIWTSGSSISFCGVNSETASWSWPDG



ADLPFTIDK





97
HSIQTGSQNSTGICNQRIITYENSTWVNHTYVNINNTNVVAGEDKTSVTLVGN



SSLCSISGWAIYTKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDK



HSNGTVKDRSPYRALMSCPLGEAPSPYNSKFESVAWSASACHDGMGWLTIGI



SGPDNGAVAVLKYNGIITGTIKSWKKQILRTQESECVCMNGSCFTIMTDGPSN



KAASYKIFKIEKGKVTKSIELNAPNFYYEECSCYPDTGIVMCVCRDNWHGSNR



PWVSFNQNLDYQIGYICSGVFGDNPRPEDGEGSCNPVTVDGANGVKGFSYKY



GNGVWIGRTKSNRLRKGFEMIWDPNGWTNTDSDFSVKQDVVAITDWSGYSG



SFVQHPELTGLDCIRPCFWVELVRGLPRENTTIWTSGSSISFCGVNSDTANWS



WPDGAELPFTIDK





98
HSIQTESQNKPEICNQNVITYENNTWVNQTYVTVSNTNFVAEQAVASVKLVG



NSSLCPVSGWAIYSKDNSVRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLN



DKHSNGTIKDRSPYRTLMSCPIGEVPSPYNSRFESVAWSASACHDGTSWLTIGI



SGPDNGAVAVLKYNGIITDTIKSWRKNILRTQESECACVNGSCFTIMTDGPSN



GQASYKVFKIERGKVVKSVELNAPNYHYEECSCYPESSEITCVCRDNWHGSN



RPWVSFNQNLEYQMGYICSGIFGDNPRPNDKTGSCGPVFSNGANGVKGFSFR



YGNGVWIGRTKSTSSRMGFEMIWDPDGWIRTDDKFSVKQDIIGITDWSGYSG



SFVQHPELTGLDCMRPCFWVELIRGRPKENTIWTSGSSISFCGVNSDTVGWSW



PDGAELPFTIDK





99
HSIQTGGKNYSEMCNQNVITYVNNTWVNRTYVNISNTNVATVQGVNSIILSG



NSSLCPVSGWAIYSKDNGIRIGSKGDIFVIREPFISCSHLECRTFFLTQGALLND



KHSNGTVKDRSPYRTLMSCPIGEAPSPYNSRFESVAWSASACHDGMGWLTIGI



SGPDNGAVAVLKYNGIITDTIKSWRNKILRTQESECVCMNGSCFTVLTDGPSD



GQASYKILKVEKGKIIKSIELNAPNYHYEECSCYPDAGKVMCVCRDNWHASN



RPWVSFDQNLDYQIGYICSGVFGDNPRSNDGKGKCGPVFPNGANGVKGFSFK



YGNGIWIGRTKSINSRRGFEMIWDPNGWTETDSSFSMKQDVIALTDWSGYSG



SFVQHPELTGMNCIRPCFWVELIRGQPKEGTIWTSGSSISFCGVNSETTSWSWP



DGADLPFTIDK





100
HSIQIGSQSQIETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGN



SSLCPVSGWAIYSKDNSVRIGSKGDVFVIREPFISCSPFECRTFFLTQGALLNDK



HSNGTIKDRSPYRTLMSCPIGEVPSPYNSRFESVAWSASACHDGINWLTIGISG



PDSGAVAVLKYNGIITDTIKSWRNNILRTQESECACVNGSCFTIMTDGPSDGQ



ASYKIFRIEKGKIIKSVEMKAPNYHYEECSCYPDSSEITCVCRDNWHGSNRPW



VSFNQNLEYQMGYICSGVFGDNPRPNDKTGSCGPVSSNGANGVKGFSFKYGN



GVWIGRTKSISSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEWSGYSGSFVQ



HPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTAGWSWPDGAE



LPFTIDK





101
HSIQLGSQNYTKTCTQSVITYENNTWVNQTYVNISNTNLAVGQSVVSAKLAG



NSSLCPVSGWAIYSKDNSIRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLND



QHSNGTIKDRSPYRTLMSCPIGEVPSPYNSRFESVAWSASACHDGINWLTIGIS



GPDNGAVAVLKYNGIITDTIKSWRNNILRTQESECVCVNGSCFTVMTDGPSNG



QASYKIFRIEKGKIVKSVEMNAPNYHYEECSCYPDSSEITCVCRDNWHGSNRP



WVSFNQNLEYQIGYICSGIFGDNPRPNDKTGSCGPVSSNGANGVKGFSFKYGN



GVWIGRTKSISSRKGFEMIWDPNGWTGTDNNFSIKQDIVGINEWSGYSGSFVQ



HPELTGLDCIRPCFWVELIRGRPKENTVWTSGSSISFCGVNSDTVGWSWPDGA



ELPFTIDK





102
HFKQYECDSPASNQVMPCEPIIIERNITEIVYLNNTTIEKEICPKVVEYRNWSKP



QCQITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDHGKCYQFALGQGTTLDNK



HSNDTIHDRIPHRTLLMNELGVPFHLGTRQVCIAWSSSSCHDGKAWLHVCITG



DDKNATASFIYDGRLVDSIGSWSQNILRTQESECVCINGTCTVVMTDGSASGR



ADTRILFIEEGKIVHISPLSGSAQHVEECSCYPRYPGVRCICRDNWKGSNRPVV



DINMEDYSIDSSYVCSGLVGDTPRNDDRSSNSNCKNPNNERGNQGVKGWAF



DNGDDVWMGRTISKDLRSGYETFKVIGGWSTPNSKSQINRQVIVDSDNRSGY



SGIFSVEGKSCINRCFYVELIRGRKQETRVWWTSNSIVVFCGTSGTYGTGSWP



DGANINFMPI





103
HFKQHECDSPSSNQVMLCEPIIIERNITEIVYLNNTTIEKETCPKLVEYRNWSKP



QCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPGKCYQFALGQGTTLDNK



HSNDTIHDRIPHRTLLMNELGVPFHLGTRQVCIAWSSSSCHDGKAWLHVCVT



GDDKNATASFIYDGRLVDSIGSWSQNILRTQESECVCINGTCTVVMTDGSASG



RADTRILFIEEGKIVHISPLSGSAQHVEECSCYPRYPDVRCICRDNWKGSNRPIV



DINMKDYSIDSSYVCSGLVGDTPRNDDRSSKSNCRNPNNERGNHGVKGWAF



DNGNDVWMGRTISKDLRSGYETFKVIGGWSTPNSKSQINRQVIVDSDNRSGY



SGIFSVEGKSCINRCFYVELIRGREQETRVWWTSNSIVVFCGTSGTYGTGSWP



DGANINFMPI





104
HFKQYECDSPANNQVMPCEPIIIERNITEIVYLTNTTIEKEICPKLVEYRNWSKP



QCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPGKCYQFALGQGTTLDNK



HSNDTIHDRIPHRTLLMNELGVPFHLGTRQVCIAWSSSSCHDGKAWLHVCVT



GDDENATASFIYDGRLVDSIGSWSQNILRTQESECVCINGTCTVVMTDGSASG



RADTRILFIEEGKIVHISPLSGSAQHVEECSCYPRYPGVRCICRDNWKGSNRPV



VDINVKDYSIDSRYVCSGLVGDTPRNNDKSSNSNCRNPNNDKGNHGVKGWA



FDDGNDVWMGRTISKDSRSGYETFKVIGGWSTPNSKSQINRQVIVDSDNRSG



YSGIFSVEGKSCINRCFYVELIRGREQETKVWWTSNSIVVFCGTSGTYGTGSW



PDGADINLMSI





105
HFKQYECDSPANNQVMPCEPIIIERNITKIMYLTNTTIEKEICPKSVEYRNWSKP



QCKITGFAPFSKDNSIRLSAGGAIWVTREPYVSCDPGKCYQFALGQGTTLDNK



HSNDTIHDRTPHRTLLMNELGVPFHLGTRQVCIAWSSSSCHDGKAWLHVCVT



GYDKNATASFIYDGRLVDSIGSWSQNILRTQESECVCINGTCTVVMTDGSASG



KADTKILFIEEGKIVHVSPLSGSAQHVEECSCYPRYPGVRCICRDNWKGSNRPI



VDINVKDYSIDSSYVCSGLVGDTPRNNDRSSNSYCRNPNNERGNHGVKGWAF



DDGDDVWMGRTISKDSRLGYETFKVIGGWSTPNSKLQINKQVIVDSDNRSGY



SGIFSVEGKSCINRCFYVELIRGRRQETRVWWTSNSIVVFCGTSGTYGTGSWP



DGADINLMPI





106
HFKQYECNSPPNNQVMLCEPTIIERNITEIVYLTNTTIEKEKCPKLVEYRNWSK



PQCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPGKCYQFALGQGTTLNN



RHSNDTVHDRTPYRTLLMNELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVC



VTGHDENATASFIYDGRLVDSIGSWSKNILRTQESECVCINGTCTVVMTDGSA



SERADTKILFIEEGKIVHISPLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PIVDINVKDYSIVSSYVCSGLVGDTPRKNDSSSSSYCRNPNNEKGSHGVKGWA



FDDGNDVWMGRTISEELRSGYETFKVIGGWSTPNSKLQINRQVIVDRGNRSG



YSGIFSVEGKSCINRCFYVELIRGRKQENKVWWTSNSIVVFCGTSGTYGTGSW



PDGADINLMPI





107
HFKQYECNSPPNNQVMLCEPTIIERNITEIVYLTNTTIEKEICPKLAEYRNWSKP



QCNITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNG



HSNDTVHDRTPYRTLLMNELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVCV



TGDDENATASFIYNGRLVDSIGSWSKKILRTQESECVCINGTCTVVMTDGSAS



GKADTKILFIEEGKIVHTSPLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PIVDINVKDYSIVSSYVCSGLVGDTPRKNDSSSSSHCLNPNNEEGGHGVKGWA



FDDGNDVWMGRTISEKLRSGYETFKVIEGWSKPNSKLQINRQVIVDRGNRSG



YSGIFSVEGKSCINRCFYVELIRGRKQETEVLWTSNSIVVFCGTSGTYGTGSWP



DGADINLMPI





108
HFKQYECNSPPNNQVILCEPTIIERNITEIVYLTNTT1EKEICPKLAEYRNWSKPQ



CKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNRH



SNDTVHDRTPYRTLLMNELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVCVT



GHDENATASFIYNGRLVDSIGSWSKKILRTQESECVCVNGTCTVVMTDGSAS



GRADTKILFIEEGQIVHISPLLGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PIVDINVKDYSIVSSYVCSGLVGDTPRKNDRSSSSNCLNPNNEEGGHGVKGW



AFDDGNDVWMGRTISEKLRSGYETFKVIEGWSKPNSKLQINRQVIVDRDDRS



GYSGIFSVEGKSCINRCFYVELIRGRKQETEVWWTSNSIVVFCGTPGTYGTGS



WPDGADINLMPI


109
HFKQYEFNSPPNNQVMLCEPTIIERNITEIVYLTNTTIEKEICPKLAEYRNWSKP



QCDITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDRCYQFALGQGTTLNNV



HSNDTVHDRTPYRTLLMNELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVCV



TGDDKNATASFIYNGRLVDSIVSWSKKILRTQESECVCINGTCTVVMTDGSAS



GKADTKILFIEEGKIVHTSTLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PIVDINIKDYSIVSSYVCSGLVGDTPRKNDSSSSSHCLDPNNEEGGHGVKGWA



FDDGNDVWMGRTISEKLRSGYETFKVIEGWSNPNSKLQINRQVIVDRGNRSG



YSGIFSVEGKSCINRCFYVELIRGRKEETEVLWTSNSIVVFCGTSGTYGTGSWP



DGADINLMPI





110
HFKQYECDPPANNQVMTCKPIIIERNTTEIVYLTNVTIEKETYPKIVEYRNWSM



PQCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCEPGKCYQFALGQGTTLDN



KHSNDTIHDRTPHRTLLMNELGVPFYLGTRQVCIAWSSSSCYDGKAWLHVCI



TGHDKNATASFIYDGRLVDSIGSWSMNILRTQESECVCINGTCTVVMTDGSAS



GKADTRILFIEEGKIIHISPLAGSAQHVEECSCYPRYPGVRCICRDNWKGSNRPI



VDINVKDYSIDSSYVCSGLVGDTPRNNDKSSSSYCRNPNNEKGGHGVKGWAF



DDGNDLWMGRTISEDIRSGYETFKVIGGWSIPNSKLQVNRQVIVDSNNRSGYS



GIFSVEGKSCINRCFYVELIRGRRTEARVGWTSNSIVVFCGTSGTYGTGTWPD



GADINLMPI





111
HFKQYECSSPSTNQVRLCEPTIIERNITEIVYLTNTTIEKEICPKLAEYRDWSKPQ



CKIKGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNRH



SNDTVHDRTPYRTLLMNELGVPFHLGTKQVCTAWSSSSCHDGKAWLHVCVT



GHDENATASFIYNGRLVDSIGSWSKKILRTQESECVCINGTCTVVMTDGSASG



RADTKILFIEEGKIVHISQLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNRPI



VDINVKDRSIASSYVCSGLVGDTPRKNDSSSSSHCLTPNNEEGGHGVKGWAF



DDGNDVWMGRTISEKLRLGYETFKVIEGWSKPNSKLQINRQVIVDKGNRSGY



SGIFSVEGKSCINRCFYVELIRGRKQETEVWWTSNSIVVFCGTSGTYGTGSWP



DGADINLMPI





112
HFRQCECNSSAINQTMPCEPTKIEKNITEIVYLTNTTIKTEVCPKLVKYRNWAK



PQCKITGFAPFSKDNSIRLSAGGAIWVTREPYVSCDLSKCYQFALGQGTTLDN



RHSNDTIHDRTPYRTLLMNELGVPFHLGTRQVCIAWSSSSCHDGKAWLHVCV



TGYDKNATASLIYDGRLVDSIGSWSQNILRTQESECVCINGTCTVVMTDGSAS



GKADTRILFVEEGKIIHISPLTGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PVVDINVKDYKINSSYVCSGLVGNTPRNNDRSSNSNCQNPNNQRGNHGVKG



WAFDDGNDIWMGRTISNDSRLGYETFKVIGGWSKPNSKVQTNRQVIVDSDNR



SGYSGVFSVEGKSCINRCFYVELIRGRRQETRVWWTSNSIVVFCGTSGTYGSG



SWPDGADINLMPI





113
HFNQCDCHYPPNNQVTLCEPTITERNTTKIVYLTNTTIEKENCPKLAEYRDWS



KPQCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLN



NRHSNDTFHDRTPYRTLLMNELGIPFHLGTKQVCIAWSSSSCHDGKAWLHVC



ITGHDENATASITYNGRLVDSIGSWSKKILRTQESECVCINGTCTVVMTDGSAS



GRADTKILFIEEGKIIHISPLIGSAQHVEECSCYPRYPGVRCICRDNWKGSNRPV



VDINVKDYSIVSSYVCSGLVGDTPRKDDKSSSSNCLNPNNEKGEHGVKGWAF



DDGKDVWMGRTINETLRSGYETFKVIEGWSKSNSKLQINRQVIVEKSDRSGY



SGIFSVEGKSCINRCFYVELIRGRKQETAVWWTSNSIVVFCGTSGTYGTGSWP



DGADINLMPV


114
HFEQYDCNPFPNNQVMLCKPTIIERNTTEFVYLTNTTIEKEICPKLAEYRNWAK



PQCTITGFAPFSRDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNN



GHSNDTVHDRTPYRTLLMNELGVPFHLGTRQVCTAWSSSSCHDGKAWLHVCI



TGDDKNATASFIYGGKLVDSIGSWSKNILRTQESECVCINGTCTVVMTDGSAS



GKADTKILFIKEGKIVHISTLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PVVDINVKDYSIASSYVCSGLVGDTPRKNDSFSNSHCLNPNNEEGGHGVKGW



AFDDGSDVWMGRTISEKFRLGYETFKVIEGWSRPNSKLQTNRQVIVERGNRS



GYSGIFSVEGKSCINRCFYVELIRGRKEETKVWWTSNSIVVFCGTSGTYGTGS



WPDGADINIMPI





115
HLRQYEFNSPPKNNQVMLCEPTIIERNITEIVYLTNTTIEKEICPKLAEYRNWSK



PQCDITGFAPFSKDNSIRLSAGGDIWVTREPYVSCGPDKCFQFALGQGTTLNN



VHSNDTVHDRTPYRTLLMSELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVC



VTGDDKNATASIIYNGRLVDSIVSWSKKILRTQESECVCINGICTVVMTDGSAS



GKADTKILFIEEGKIVHTSTLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNR



PIVDINIRDYSIVSNYVCSGLVGDTPRKPDSSSSSNCLDPNDEEGGHGVKGWA



FDDGNDVWMGRTISEKSRLGYETFKVIGGWFNPNSKSQINRQVIVDRGNRSG



YSGIFSVEGKNCINRCFYVELIRGRKDEIEVLWTSNSIVVFCGTSGTYGTGSWP



DGADLKLMSV





116
LYSDILLKFSPTEITAPTMPLGCANASNVQAVNRSATKGVTLLLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPNECKHFALTHYAAQPG



GYYNGTRGDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGKEWTYIG



VDGPDNNALLKVKYGEAYTDTYHSYANKILRTQESACNCIGGNCYLMITDGS



ASGVSECRFLKIREGRIIKEIFPTGRVKHTEECTCGFASNKTIECACRDNRYTAK



RPFVKLNVEADTAEIRLMCTDTYLDTPRPNDGSITGPCESDGDKGSGGIKGGF



VHQRMKSKIGRWYSRTMSKTERMGMGLYVKYDGDPWADSDALAFSGVMV



SMKEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQ



LLWDTVTGVDMAL





117
LYSDVLLKFSSTKTTAPTMSLECTNASNAQTVNHSATKEMTFPPPEPEWTYPR



LSCQGSTFQKALLISPHRFGEIKGNSAPLIIREPFVACGPKECRHFALTHYAAQP



GGYYNGTRKDRNKLRHLVSVKLGKIPTVENSIFHMAAWSGSACHDGREWTY



IGVDGPDNDALVKIKYGEAYTDTYHSYAHNILRTQESACNCIGGDCYLMITD



GSASGISKCRFLKIREGRIIKEILPTGRVEHTEECTCGFASNKTIECACRDNSYT



AKRPFVKLNVETDTAEIRLMCTKTYLDTPRPDDGSIAGPCESNGDKWLGGIKG



GFVHQRMASKIGRWYSRTMSKTNRMGMELYVKYDGDPWTDSDALTLSGVM



VSIEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKDTWHSAATAIYCLMGSG



QLLWDTVTGVDMAL





118
LYSDILLKFSPTKRTAPTMSLECVNVSNAQAVNHSATKEMTFLLPEPEWTYPR



LSCQGSTFQKALLISPHRFGETRGNSAPLIIREPFVACGPKECRHFALTHYAAQ



PGGYYNGTRKDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTY



IGVDGPDSDALIKIKYGEAYTDTYHSYAHNILRTQESACNCIGGDCYLMITDG



SASGISKCRFLKIREGRIIKEIFPAGRVEHTEECTCGFASNKTIECACRDNSYTA



KRPFVKLNVETDTAEIRLMCTETYLDTPRPDDGSITGPCESNGDKGLGGIKGG



FVHQRMASKIGRWYSRTMSKTERMGMELYVKYDGDPWTDSDALAPSGVMV



SIKEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQL



LWDTVTGVDMAL





119
LYSDILLKFSPTKITAPTMSLDCANVSNVQAVNRSVTKEMTFLLPEPEWTYPR



LSCQGSTFQKALLISPHRFGETRGNSAPLIIREPFVACGPKECRHFALTHYAAQ



PGGYYNGTRNDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTY



IGVDGPDSNALIKIKYGEAYTDTYHSYANNILRTQESACNCIGGDCYPMITDGS



ASGISKCRFLKIREGRIIKEIFPTGRVDHTEECTCGFASNKTIECACRDNSYTAK



RPFVKLNVETDTAEIRLMCTETYLDTPRPDDGSITGPCESNGDKGLGGIKGGF



VHQRMASKTGRWYSRTMSKTERMGMELYVRYDGDPWTDSDALAPSGVMV



SMKEPGWYSFGFEIKDKKCDVPCIGIEVVQVGGKETWHSAATAIYCLMGSGQ



LLWDIVTGVAMAL





120
LYSDILLKFSPTEKSAPTMSLDCANASNVQAVNRSATKGMTLLLPEPEWTYPR



LSCQGSTFQKALLISPHRFGETRGNSAPLIIREPFIACGPKECKHFALTHYAAQP



GGYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTYI



GVDGPDSNALIKIKYGEAYTDTYHSYANNILRTQESACNCIGGDCYLMITDGS



ASGISKCRFLKIREGRIIKEIFPTGRVDHTEECTCGFASNKTIECACRDNSYTAK



RPFVKLNVETDTAEIRLMCTETYLDTPRPDDGSKTGPCESNGDKGSGGIKGGF



VHQRMASKIGRWYSRTMSKTERMGMELYVKYDGDPWTDSDALAPSGVMVS



MEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKNTWHSAATAIYCLMGSGQL



LWDTVTGVDMAL





121
LYSDILLKFSPTKVIAPTVSLDCANASNVQAVNHSATKEMTFLLPEPEWTYPR



LSCQGSTFQKALLISPHRFGEAKGNSAPLIIREPFIACGPKECKHFALTHYAAKP



GGYYNGTREDRNKLRHLISVNLGKIPTVENSIFHMAAWSGSACHDGREWTYI



GVDGPDSNALIKIKYGEAYTDTYHSYANNILRTQESACNCIGGDCYLMITDGS



ASGISRCRFLKIREGRIIKEIFPTGRVEHTEECTCGFASNKTIECACRDNSYTAK



RPFVKLNVETDTAEIRLMCTETYLDTPRPDDGSITGPCESNGDKGNGGIKGGF



VHQRMASRIGRWYSRTMSKTKRMGMELYVKYDGDPWTDSDALTPSGVMIS



MEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKKTWHSAATAIYCLMGSGQL



LWDTVTGVDMAL





122
LYSDILLKFSSTEITAPTMPLNCANASNVQAVNHSATKGVTLPLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPKECKHFALTHYAAQPG



GYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTYIG



VDGPDSNALLKIKYGEAYTDTYHSYANNILRTQESACNCIGGDCYLMITDGSA



SGVSECRFLKIREGRIIKEIFPTGRVEHTEECTCGFASNKTIECACRDNSYTAKR



PFVKLNVETDTAEIRLMCTETYLDTPRPDDGSITGPCESNGDKGSGGIKGGFV



HQRMASKIGRWYSRTMSKNKRMGMGLYVKYDGDPWIDSDALTLSGVMVS



MEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQL



LWDTVTGVNMAL





123
LYSDILLKFSTTEITAPTMPLDCANASNVQAVNHSAAKGVTLLLPEPEWTYPR



LSCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPKECKHFALTHYAAQP



GGYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGKEWTYI



GVDGPDSNALLKIKYGEAYTDTYHSYAHNILRTQESACNCIGGNCYLMITDGS



ASGISECRFLKIREGRIIKEIFPTGRVKHTEECTCGFANNKTIECACRDNSYTAK



RPFVKLNVETDTAEIRLMCTETYLDTPRPDDGSITGPCESNGNKGSGGIKGGF



VHQRMASKIGRWYSRTMSKTKRMGMGLYVKYDGDPWTDSDALVFSGVMIS



VEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQL



LWDTVTGVNMTL





124
LYSDILLKFSRTEITAPILRLACANASNVQAVNRSATKGVTLLLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPKECKHFALTHYAAQPG



GYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTYIG



VDGPDSNALLKIKYGGAFTDTYHSYAKNILSPQESACNCIGGDCYLMITDGPA



SGISECRFLKIREGRIIKEIFPTGRVKHTEECTCGFASNKTIECACRDNSYTAKRP



FVKLNVETDTAEIRLMCTETYLDTPRPNDGSITGPCESNGDKGSGGIKGGFVH



QRMASKIGRWYSRTMSKTKRMGMGLYVKYDGDPWTDSEALALSGVMVSM



EEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKTTWHSAATAIYCLMGSGQLL



WDTVTGVNMTL





125
LYSDILLKFSRTEITAPIMPLDCANASNVQAVNRSATKGVTLLLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPKECKHFALTHYAAQPG



GYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTYIG



VDGPDSNALLKIKFGEAYTDTYHSYAKNILGTQESACNCIGGDCNLMITDGPA



SGISESRFLKIGEGRIIKEIFPTGRVKHTEECTCGFASNKTIECACRDNSYTAKRP



FVKLNVETDTAEIRLMCTETYLDTPRPNDGSITGPCESNGDKGSGGIKGGFVH



QRMASKIGRWYSRTMSKTERMGMGLYVKYDGDPWADSDALAFSGVMVSM



KEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQLL



WDTVTGVNMAL





126
LYSDILLKFSPTEITAPTMPLDCANTSNVQAVNRSATKGVTLLLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPNECKHFALTHYAAQPG



GYYNGTRGDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGKEWTYIG



VDGPDNNALLKITYGEAYPGTYHSYAIKNPRTQESAPNCFGGHSFPIITHAPPS



GVWEPRFLKIREGRIINEIFPTGRVKPPEECTCGFASNNTIECACRDNNCTAKRP



FVKFHVESDPAEIRAMCTDTYLDSPRPDDGSITGPCDSNGDKGSGGIKGGFVH



QRMESKIGRWYSRTMSKTERMGMGLYVKYDGDPWADSDALANSGVMVSM



NEPGSSSFGFEIKDKKCDVPCIGIEMVHDGGKETWHSAATAIYCLMGSGQLL



WDTVTGVDMAL





127
LYSDILLKFSPTEITAPTMPLDCANASNVQAVNRSATKGVTLFLPEPEWTYPRL



SCPGSTFQKALLISPHRFGETKGNSAPLIIREPFIACGPTECKHFALTHYAAQPG



GYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGKEWTYIG



VDGPDSNALLKIKYGEAYTDTYHFYAKNILRTQESACSCIGGDCYLMITDGPA



SGVSECRFLKIREGRIIKEIFPTGRVKHTEECTCGFASNKTIECACRDNSYTAKR



PFVKLNVETDTAEIRLMCTETYLDTPRPNDGSITGPCESDGDKGSGGIKGGFV



HQRMASKIGRWYSRTMSKTKRMGMGLYVKYDGDPWTDSEALALSGVMVS



MEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKTTWHSAATAIYCLMGSGQL



LWDTVTGVDMAL
















TABLE 8







Amino Acid sequences of hemagglutinin antigens from Sino 


Biologicals









SEQ
Catalog



ID
#
Sequence





128
40016-
MKAIIVLLMVVTSNADRICTGITSSNSPHVVKTATQGEVNVTGVIP



V08B
LTTTPTKSHFANLKGTETRGKLCPKCLNCTDLDVALGRPKCTGKIP




SARVSILHEVRPVTSGCFPIMHDRTKIRQLPNLLRGYEHIRLSTHNV




INAENAPGGPYKIGTSGSCPNITNGNGFFATMAWAVPKNDKNKTA




TNPLTIEVPYICTEGEDQITVWGFHSDNETQMAKLYGDSKPQKFTS




SANGVTTHYVSQIGGFPNQTEDGGLPQSGRIVVDYMVQKSGKTG




TITYQRGILLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNK




SKPYYTGEHAKAIGNCPIWVKTPLKLANGTKYRPPAKLLKERGFF




GAIAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAIN




KITKNLNSLSELEVKNLQRLSGAMDELHNEILELDEKVDDLRADTI




SSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVEIGNGCFE




TKHKCNQTCLDRIAAGTFDAGEFSLPTFDSLNITAASLNDDGLDN




HTILLYYSTAASSLAVTLMIAIFVVYMVSRDNVSCSICL





129
11716-
MKAIIVLLMVVTSNADRICTGITSSNSPHVVKTATQGEVNVTGVIP



V08B
LTTTPTKSHFANLKGTETRGKLCPKCLNCTDLDVALGRPKCTGNIP




SARVSILHEVRPVTSGCFPIMHDRTKIRQLPNLLRGYEHIRLSTHNV




INAENAPGGPYKIGTSGSCPNVTNGNGFFATMAWAVPKNDNNKT




ATNSLTIEVPYICTEGEDQITVWGFHSDNEIQMAKLYGDSKPQKFT




SSANGVTTHYVSQIGGFPNQTEDGGLPQSGRIVVDYMVQKSGKTG




TITYQRGILLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLNK




SKPYYTGEHAKAIGNCPIWVKTPLKLANGTKYRPPAKLLKERGFF




GAIAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAIN




KITKNLNSLSELEVKNLQRLSGAMDELHNEILELDEKVDDLRADTI




SSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVEIGNGCFE




TKHKCNQTCLDRIAAGTFDAGEFSLPTFDSLNITAASLNDDGLDN




HTILLYYSTAASSLAVTLMIAIFVVYMVSRDNVSCSICL





130
40157-
MKAIIVLLMVVTSNADRICTGITSSNSPHVVKTATQGEVNVTGVIP



V08B
LTTTPTKSHFANLKGTKTRGKLCPNCLNCTDLDVALGRPMCMGTI




PSAKASILHEVRPVTSGCFPIMHDRTKIRQLPNLLRGYENIRLSTHN




VINAERAPGGPYRLGTSGSCPNVTSRNGFFATMAWAVPRDNKTA




TNPLTVEVPYICTKGEDQITVWGFHSDDKTQMKNLYGDSNPQKFT




SSANGVTTHYVSQIGDFPNQTEDGGLPQSGRIVVDYMVQKPGKTG




TIVYQRGVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGGLN




KSKPYYTGEHAKAIGNCPIWVKTPLKLANGTKYRPPAKLLKERGF




FGAIAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAIN




KITKNLNSLSELEVKNLQRLSGAMDELHNEILELDEKVDDLRADTI




SSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVDIGNGCFE




TKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGLDN




HTILLYYSTAASSLAVTLMIAIFIVYMVSRDNVSCSIC





131
40498-
MKAIIILLMVVTSNADRICTGITSSNSPHVVKTATQGEVNVTGVIPL



V08B
TTTPTKSYFANLKGTKTRGKLCPDCLNCTDLDVALGRPMCVGTTP




SAKASILHEVRPVTSGCFPIMHDRTKIRQLANLLRGYENIRLSTQN




VIDAEKAPGGPYRLGTSGSCPNATSKSGFFATMAWAVPKDNNKN




ATNPLTVEVPYICAEGEDQITVWGFHSDNKTQMKNLYGDSNPQK




FTSSANGVTTHYVSQIGGFPDQTEDGGLPQSGRIVVDYMMQKPGK




TGTIVYQRGVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEKYGG




LNKSKPYYTGEHAKAIGNCPIWVKTPLKLANGTKYRPPAKLLKER




GFFGAIAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQE




AINKITKNLNSLSELEVKNLQRLSGAMDELHNEILELDEKVDDLRA




DTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVDIGNG




CFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL




DNHTILLYYSTAASSLAVTLMLAIFIVYMVSRDNVSCSICL





132
40463-
MKAIIVLLMVVTSNADRICTGITSSNSPHVVKTATQGEVNVTGVIP



V08B
LTTTPTKSYFANLKGTRTRGKLCPDCLNCTDLDVALGRPMCVGTT




PSAKASILHEVRPVTSGCFPIMHDRTKIRQLPNLLRGYEKIRLSTQN




VIDAEKAPGGPYRLGTSGSCPNATSKIGFFATMAWAVPKDNYKN




ATNPLTVEVPYICTEGEDQITVWGFHSDNKTQMKSLYGDSNPQKF




TSSANGVTTHYVSQIGDFPDQTEDGGLPQSGRIVVDYMMQKPGK




TGTIVYQRGVLLPQKVWCASGRSKVIKGSLPLIGEADCLHEEYGG




LNKSKPYYTGKHAKAIGNCPIWVKTPLKLANGTKYRPPAKLLKER




GFFGAIAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQE




AINKITKNLNSLSELEVKNLQRLSGAMDELHNEILELDEKVDDLRA




DTISSQIELAVLLSNEGIINSEDEHLLALERKLKKMLGPSAVDIGNG




CFETKHKCNQTCLDRIAAGTFNAGEFSLPTFDSLNITAASLNDDGL




DNHTILLYYSTAASSLAVTLMLAIFIVYMVSRDNVSCSICL





133
40323-
MELIVLLILLNPYTFVLGDRICIGYQANQNNQTVNTLLEQNVPVTG



V08B
AQEILETNHNGKLCSLNGVPPLDLQSCTLAGWLLGNPNCDSLLEA




EEWSYIKINESAPDDLCFPGNFENLQDLLLEMSGVQNFTKVKLFNP




QSMTGVTTNNVDQTCPFEGKPSFYRNLNWIQGNSGLPFNIEIKNPT




SNPLLLLWGIHNTKDAAQQRNLYGNDYSYTIFNFGEKSEEFRPEIG




QRDEVKAHQDRIDYYWGSLPAQSTLRIESTGNLIAPEYGFYYKRK




EGKGGLMKSKLPISDCSTKCQTPLGALNSTLPFQNVHQQTIGNCPK




YVKATSLMLATGLRNNPQMEGRGLFGAIAGFIEGGWQGMIDGW




YGYHHENQEGSGYAADKEATQKAVDAITNKVNSIIDKMNSQFES




NIKEFNRLELRIQHLSDRVDDALLDIWSYNTELLVLLENERTLDFH




DANVKNLFEKVKAQLKDNAIDEGNGCFLLLHKCNNSCMDDIKNG




TYKYMDYREESHIEKQKIDGVKLTDYSRYYIMTLYSTIASSVVLGS




LIIAAFLWGCQKGSIQCKICI





134
40324-
MITILILVLPIVVGDQICIGYHSNNSTQTVNTLLESNVPVTSSHSILE



V08B
KEHNGLLCKLKGKAPLDLIDCSLPAWLMGNPKCDELLTASEWAY




IKEDPEPENGICFPGDFDSLEDLILLVSNTDHFRKEKIIDMTRFSDVT




TNNVDSACPYDTNGASFYRNLNWVQQNKGKQLIFHYQNSENNPL




LIIWGVHQTSNAAEQNTYYGSQTGSTTITIGEETNTYPLVISESSILN




GHSDRINYFWGVVNPNQNFSIVSTGNFIWPEYGYFFQKTTNISGIIK




SSEKISDCDTICQTKIGAINSTLPFQNIHQNAIGDCPKYVKAQELVL




ATGLRNNPIKETRGLFGAIAGFIEGGWQGLIDGWYGYHHQNSEGS




GYAADKEATQKAVDAITTKVNNIIDKMNTQFESTAKEFNKIEMRI




KHLSDRVDDGFLDVWSYNAELLVLLENERTLDFHDANVNNLYQ




KVKVQLKDNAIDMGNGCFKILHKCNNTCMDDIKNGTYNYYEYR




KESHLEKQKIDGVKLSENSSYKIMIIYSTVASSVVLGLIILAAIEWG




CFKGNLQCRICI





135
11085-
MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVT



V08B
HSVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLST




ASSWSYIVETPSSDNGTCYPGDFIDYEELREQLSSVSSFERFEIFPKT




SSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSYPKLSKS




YINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGSSRYSK




KFKPEIAIRPKVRDREGRMNYYWTLVEPGDKITFEATGNLVVPRY




AFAMERNAGSGIIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIG




KCPKYVKSTKLRLATGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDG




WYGYHHQNEQGSGYAADLKSTQNAIDEITNKVNSVIEKMNTQFT




AVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLD




YHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVK




NGTYDYPKYSEEAKLNREEIDGVKLESTRIYQILAIYSTVASSLVLV




VSLGAISFWMCSNGSLQCRICI





136
40132-
MKAKLLVLLCAFTATYADTICIGYHANNSTDTVDTVLEKNVTVT



V08B
HSVNLLEDSHNGKLCRLKGIAPLQLGNCSVAGWILGNPKCESLFS




KESWSYIAETPNPENGTCYPGYFADYEELREQLSSVSSFERFEIFPK




ESSWPNHTVTKGVTTSCSHNGKSSFYRNLLWLTEKNGLYPNLSKS




YVNNKEKEVLVLWGVHHPSNIRDQRAIYHTENAYVSVVSSHYSR




RFTPEIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYA




FALSRGFGSGIITSNASMDECDAKCQTPQGAINSSLPFQNVHPVTIG




ECPKYVRSTKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDG




WYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFT




AVGKEFNKLERRMENLNKKVDDGFLDIWTYNAELLVLLENGRTL




DFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECMESV




KNGTYDYPKYSEESKLNRGKIDGVKLESMGVYQILAIYSTVASSL




VLLVSLGAISFWMCSNGSLQCRICI





137
40392-
MKAILLVLLCTFAATNADTLCIGYHANNSTDTVDTVLEKNVTVTH



V08B
SVNLLEDRHNGKLCKLGGIAPLHLGKCNIAGWLLGNPECELLLTV




SSWSYIVETSNSDNGTCYPGDFINYEELREQLSSVSSFERFEIFPKTS




SWPNHETNRGVTAACPYAGANSFYRNLIWLVEKGNSYPKLSKSY




VNNKGKEVLVLWGIHHPPTSTDQQSLYQNADAYVFVGSSKYNRK




FKPEIAARPKVRGQAGRMNYYWTLIEPGDTITFEATGNLVVPRYA




FAMNRGSGSGIIISDAPVHDCNTKCQTPKGAINTSLPFQNIHPVTIG




ECPKYVKSTKLRMATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDG




WYGYHHQNEQGSGYAADQRSTQNAIDGITNKVNSVIEKMNTQFT




AVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLD




FHDSNVKNLYEKVRSQLRNNAKEIGNGCFEFYHKCDDTCMESVK




NGTYDYPKYSEESKLNREEIDGVKLESTRIYQILAIYSTVASSLVLL




VSLGAISFWMCSNGSLQCRICI





138
40090-
MKATILVVLCIFTASNEDTICIGYHANNSTDTVDTVLEKNVTVTHS



V08B
VNLLEDSHNGKLCKLKGIAPLQLGKCNIAGWLLGNPECDLLLTAS




SWSYIVETSNSENGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKTSS




WPNHETTKGVTAACSYAGASSFYRNLLWLTKKGSSYPKLSKSYV




NNKGKEVLVLWGVHHPPTGTDQQSLYQNADAYVSVGSSKYNRR




FTPEIAARPKVRDQAGRMNYYWTLLEPGDTITFEATGNLIAPWYA




FALNRGSGSGIITSDAPVHDCNTKCQTPHGAINSSLPFQNIHPVTIG




ECPKYVRSTKLRMATGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDG




WYGYHHQNEQGSGYAADRKSTQNAIDGITNKVNSVIEKMNTQFT




SVGKEFNHLEKRIENLNRKVDDGFLDVWTYNAELLVLLENERTL




DYHDSNVKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDDSCMESV




KNGTYDYPKYSEESKLNREEIDGV





139
11684-
MKANLLVLLCALAAADADTICIGYHANNSTDTVDTVLEKNVTVT



V08B
HSVNLLEDSHNGKLCRLKGIAPLQLGKCNIAGWLLGNPECDPLLP




VRSWSYIVETPNSENGICYPGDFIDYEELREQLSSVSSFERFEIFPKE




SSWPNHNTNGVTAACSHEGKSSFYRNLLWLTEKEGSYPKLKNSY




VNKKGKEVLVLWGIHHPPNSKEQQNLYQNENAYVSVVTSNYNR




RFTPEIAERPKVRDQAGRMNYYWTLLKPGDTIIFEANGNLIAPMY




AFALSRGFGSGIITSNASMHECNTKCQTPLGAINSSLPYQNIHPVTI




GECPKYVRSAKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMID




GWYGYHHQNEQGSGYAADQKSTQNAINGITNKVNTVIEKMNIQF




TAVGKEFNKLEKRMENLNKKVDDGFLDIWTYNAELLVLLENERT




LDFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNECMES




VRNGTYDYPKYSEESKLNREKVDGVKLESMGIYQILAIYSTVASSL




VLLVSLGAISFWMCSNGSLQCRICI





140
40133-
MKAKLLVLLCTFTATYADTICIGYHANNSTDTVDTVLEKNVTVTH



V08B
SVNLLEDSHNGKLCLLKGIAPLQLGNCSVAGWILGNPECESLISKE




SWSYIVETPNPENGTCYPGYFADYEELREQLSSVSSFERFEIFPKES




SWPNHTVTGVTASCSHNGKSSFYRNLLWLTEKNGLYPNLSNSYV




NNKEKEVLVLWGVHHPSNIRDQRAIYHTENAYVSVVSSHYSRRFT




PEIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAFAL




SRGFGSGIITSNAPMNECDAKCQTPQGAINSSLPFQNVHPVTIGECP




KYVRSTKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMMDGW




YGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTA




VGKEFNKLERRMENLNKKVDDGFLDIWTYNAELLVLLENERTLD




FHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECMESVK




NGTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVL




LVSLGAISFWMCSNGSLQCRICI





141
11708-
MKVKLLVLLCTFTATYADTICIGYHANNSTDTVDTVLEKNVTVTH



V08B
SVNLLEDSHNGKLCLLKGIAPLQLGNCSVAGWILGNPECELLISRE




SWSYIVEKPNPENGTCYPGHFADYEELREQLSSVSSFERFEIFPKES




SWPNHTTTGVSASCSHNGESSFYKNLLWLTGKNGLYPNLSKSYA




NNKEKEVLVLWGVHHPPNIGDQRALYHKENAYVSVVSSHYSRKF




TPEIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFAL




SRGFGSGIINSNAPMDECDAKCQTPQGAINSSLPFQNVHPVTIGECP




KYVRSAKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGWY




GYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTAV




GKEFNKLERRMENLNKKVDDGFIDIWTYNAELLVLLENERTLDFH




DSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDECMESVKNG




TYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVLLV




SLGAISFWMCSNGSLQCRICI





142
11692-
MKAKLLVLLYAFVATDADTICIGYHANNSTDTVDTIFEKNVAVTH



V08H
SVNLLEDRHNGKLCKLKGIAPLQLGKCNIIGWLLGNPECDSLLPAR




SWSYIVETPNSENGACYPGDFIDYEELREQLSSVSSLERFEIFPKESS




WPNHTFNGVTASCSHRGKSSFYRNLLWLTKKGDSYPKLTNSYVN




NKGKEVLVLWGVHHPSSSDEQQSLYSNGNAYVSVASSNYNRRFT




PEIAARPKVKDQHGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFA




LSRGFESGIITSNASMHECNTKCQTPQGSINSNLPFQNIHPVTIGECP




KYVRSTKLRMVTGLRNIPSIQYRGLFGAIAGFIEGGWTGMIDGWY




GYHHQNEQGSGYAADQKSTQNAINGITNKVNSIIEKMNTQFTAVG




KEFNNLEKRMENLNKKVDDGFLDIWTYNAELLVLLENGRTLDFH




DLNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNECMESVRNG




TYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVLLV




SLGAISFWMCSNGSLQCRICI





143
11052-
MKVKLLVLLCTFTATYADTICIGYHANNSTDTVDTVLEKNVTVTH



V08H
SVNLLENSHNGKLCLLKGIAPLQLGNCSVAGWILGNPECELLISKE




SWSYIVEKPNPENGTCYPGHFADYEELREQLSSVSSFERFEIFPKES




SWPNHTVTGVSASCSHNGESSFYRNLLWLTGKNGLYPNLSKSYA




NNKEKEVLVLWGVHHPPNIGDQKALYHTENAYVSVVSSHYSRKF




TPEIAKRPKVRDQEGRINYYWTLLEPGDTIIFEANGNLIAPRYAFAL




SRGFGSGIINSNAPMDKCDAKCQTPQGAINSSLPFQNVHPVTIGEC




PKYVRSAKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMVDGW




YGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTA




VGKEFNKLERRMENLNKKVDDGFIDIWTYNAELLVLLENERTLDF




HDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNDECMESVKN




GTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVLL




VSLGAISFWMCSNGSLQCRICI





144
40393-
MEAKLFVLFCVFNALKADTICVGYHANNSTDTVDTILEKNVTVTH



V08B
SVNLLENSHNGKLCSLNGKAPLQLGNCNVAGWILGNPECDLLLT




ANSWSYIIETSNSKNGKCYPGEFADYEELREQLSTVSSFERFEIFPK




ATSWPNHETTKGTTTACSHSGASSFYRNLLWIVKKGNSYPKLSKS




YTNNKGKEVLVIWGVHHPPTNSDQQTLYQNAYTYVSVESSKYYR




RFTPEIAARPKVRGQAGRMNYYWTLLDQGDTITFEATGNLIAPWY




AFALNKGSNSGIMMSDAHVHNCTTKCQTPHGALKSNLPFQNVHPI




TIGECPKYVKSTQLRMATGLRNVPSIQSRGLFGAIAGFIEGGWTGM




IDGWYGYHHQNEQGSGYAADQKSTQIAIDGISNKVNSVIEKMNIQ




FTSVGKEFNNLEKRIENLNKKVDDGFLDIWTYNAELLILLENERTL




DFHDFNVKNLYEKVKSQLRNNAKEVGNGCFEFYHKCDNECMES




VKNGTYNYPKYSEESKLNREEIDGVKLESMGVHQILAIYSTVASSL




VLLVSLGAISFWMCSNGSLQCRICI





145
40134-
MKAKLLVLLCALSATDADTICIGYHANNSTDTVDTVLEKNVTVT



V08B
HSVNLLEDSHNGKLCRLKGIAPLQLGKCNIAGWILGNPECESLFSK




KSWSYIAETPNSENGTCYPGYFADYEELREQLSSVSSFERFEIFPKE




RSWPKHNVTRGVTASCSHKGKSSFYRNLLWLTEKNGSYPNLSKS




YVNNKEKEVLVLWGVHHPSNIEDQKTIYRKENAYVSVVSSNYNR




RFTPEIAERPKVRGQAGRINYYWTLLEPGDTIIFEANGNLIAPWHA




FALNRGFGSGIITSNASMDECDTKCQTPQGAINSSLPFQNIHPVTIG




ECPKYVRSTKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDG




WYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFT




AVGKEFNKLEKRMENLNKKVDDGFLDIWTYNAELLVLLENERTL




DFHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECMESV




KNGTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSL




VLLVSLGAISFWMCSNGSLQCRICI





146
40131-
MKAKLLVLLCAFTATDADTICIGYHANNSTDTVDTVLEKNVTVT



V08B
HSVNLLEDSHNGKLCRLKGIAPLQLGNCSIAGWILGNPECESLFSK




KSWSYIAETPNSENGTCYPGYFADYEELREQLSSVSSFERFEIFPKE




SSWPNHTVTKGVTASCSHKGKSSFYRNLLWLTEKNGSYPTLSKSY




VNNKEKEVLVLWGVHHPSNIGDQRAIYHTENAYVSVVSSHYNRR




FTPEIAKRPKVRGQEGRINYYWTLLEPGDTIIFEANGNLIAPWYAF




ALSRGFGSGIITSNASMDECDAKCQTPQGAINSSLPFQNVHPVTIGE




CPKYVRSTKLRMVTGLRNIPSIQSRGLFGAIAGFIEGGWTGMIDGW




YGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTA




VGKEFNKLERRMENLNKKVDDGFLDIWTYNAELLVLLENERTLD




FHDSNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCNNECMESVK




NGTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVL




LVSLGAISFWMCSNGSLQCRICI





147
11685-
MEARLFVLFCTFTVLKADIICVGYHANNSTDTVDTVLEKNVTVTH



V08H
SVNLLESSHNGKLCSLNGIPPLQLGKCNVAGWLLGNPECDQLLTA




NSWSYIIETSNSENGTCYPGEFIDYEELREQLSSVSSFEKFEIFPKAS




SWPNHETTKGVTAACPYLGNSNFYRNLLWITKKGNSYPELSKSYT




NNKGKEVLILWGVHHPPSTNEQQNLYQNANAYVSVGSSKYNRRF




TPEIAARPKVRGQAGRMNYYWTLLDQGDTITFEATGNLIAPWYAF




ALNKNSDSGIMMSDAPVHNCDTKCQTPHGALNSSLPFQNVHPITI




GECPKYIKSTKLRMATGLRNVPSIQSRGLFGAIAGFIEGGWTGMID




GWYGYHHQNEQGSGYAADQKSTQNAIDGITNKVNSVIEKMNTQF




TAVGKEFNNLEKRIENLNKKVDDGFLDVWTYNAELLVLLENERT




LDFHDSNVRNLYEKVKSQLRNNAKEIGNGCFEFYHKCDDECMES




VKNGTYDYPKYSEESKLNREEIDGVKLESMGVYQILAIYSTVASSL




VLLVSLGAISFWMCSNGSLQCRVCI





148
11688-
MAIIYLILLFTAVRGDQICIGYHANNSTEKVDTILERNVTVTHAKDI



V08H
LEKTHNGKLCKLNGIPPLELGDCSIAGWLLGNPECDRLLSVPEWS




YIMEKENPRDGLCYPGSFNDYEELKHLLSSVKHFEKVKILPKDRW




TQHTTTGGSRACAVSGNPSFFRNMVWLTKKGSNYPVAQGSYNNT




SGEQMLIIWGVHHPNDETEQRTLYQNVGTYVSVGTSTLNKRSTPE




IATRPKVNGQGGRMEFSWTLLDMWDTINFESTGNLIAPEYGFKIS




KRGSSGIMKTEGTLENCETKCQTPLGAINTTLPFHNVHPLTIGECP




KYVKSEKLVLATGLRNVPQIESRGLFGAIAGFIEGGWQGMVDGW




YGYHHSNDQGSGYAADKESTQKAFDGITNKVNSVIEKMNTQFEA




VGKEFSNLERRLENLNKKMEDGFLDVWTYNAELLVLMENERTLD




FHDSNVKNLYDKVRMQLRDNVKELGNGCFEFYHKCDDECMNSV




KNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQILAIYATVAGSL




SLAIMMAGISFWMCSNGSLQCRICI





149
40119-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B
NDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQC




DGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEF




NNESFNWAGVTQNGTSSACKRRSNKSFFSRLNWLTHLKYKYPAL




NVTMPNNEKFDKLYIWGVHHPGTDSDQISLYAQASGRITVSTKRS




QQTVIPNIGSRPRVRDVSSRISIYWTIVKPGDILLINSTGNLIAPRGY




FKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGAC




PRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDG




WYGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIGKTNEKFH




QIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDL




TDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIESIRNG




TYDHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCV




ALLGFIMWACQKGNIRCNICI





150
11715-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B
NDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQC




DGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEF




NNESFNWAGVTQNGTSSACKRRSNKSFFSRLNWLTHLKYKYPAL




NVTMPNNEKFDKLYIWGVHHPGTDSDQISLYAQASGRITVSTKRS




QQTVIPNIGSRPRVRDVSSRISIYWTIVKPGDILLINSTGNLIAPRGY




FKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGAC




PRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDG




WYGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIGKTNEKFH




QIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDL




TDSEMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIESIRNG




TYDHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCV




ALLGFIMWACQKGNIRCNICI





151
40120-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B
NDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQC




DGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEF




NNESFNWTGVTQNGTSSACKRRSNKSFFSRLNWLTHLKYKYPAL




NVTMPNNEKFDKLYIWGVLHPGTDSDQIRLYAQASGRITVSTKRS




QQTVIPNIGSRPRVRDVSSRISIYWTIVKPGDILLINSTGNLIAPRGY




FKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGAC




PRYIKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDG




WYGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIGKTNEKFH




QIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQYTIDL




TDSEMNKLFERTKKQLRENAEDMGNGCFKIYHK





152
40145-
MKTIIALSHILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B
NDQIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCD




GFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFN




NESFNWTGVTQNGTSSACIRRSNNSFFSRLNWLTQLNFKYPALNV




TMPNNEQFDKLYIWGVHHPVTDKDQIFLYAQSSGRITVSTKRSQQ




AVIPNIGYRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIR




SGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRY




VKQSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYG




FRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKE




FSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSE




MNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD




HDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVALL




GFIMWACQKGNIRCNICI





153
40494-
MKTIIALSYILCLVFAQKLPENDNSTATLCLGHHAVPNGTIVKTITS



V08B-50
DRIEVTNATELVQNSSIGEICDSPHQILDGKNCTLIDALLGDPQCDG




FQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNDE




SFNWAGVTQNGTSSACIRGSNSSFFSRLNWLTHSNFKYPALNVTM




PNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVI




PNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGK




SSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKQ




STLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMKDGWYGFRH




QNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSE




VEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNK




LFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDV




YRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVALLGFIM




WACQKGNIRCNICI





154
40118-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B
NDQIEVTNATELVQSSSTGGICDSPHQILDGENCTLIDALLGDPQC




DGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEF




NNESFNWTGVTQNGTSSSCKRRSNNSFFSRLNWLTHLKFKYPALN




VTMPNNEKFDKLYIWGVHHPGTNNDQISLYTQASGRITVSTKRSQ




QTVIPNIGSRPRVRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKI




RSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPR




YVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGW




YGFRHQNSEGIGQAADLKSTQAAINQINGKLNRLIGKTNEKFHQIE




KEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS




EMNKLFERTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTY




DHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVAL




LGFIMWACQKGNIRCNICI





155
40149-
MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTLVKTIT



V08B
NDQIEVTNATELVQSSSTGRICDSPHRILDGENCTLIDALLGDPHCD




GFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNN




ESFNWTGVAQNGTSYACKRSSIKSFFSRLNWLHQLKYKYPALNVT




MPNNDKFDKLYIWGVHHPSTDSDQTSIYAQASGRVTVSTKRSQQT




VIPNIGSRPWVRGISSRISIHWTIVKPGDILLINSTGNLIAPRGYFKIR




SGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRY




VKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWY




GFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIEKTNEKFHQIEK




EFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSE




MNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD




HDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVVLL




GFIMWACQKGNIRCNICI





156
40485-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTLVKTIT



V08B-50
NDQIEVTNATELVQSSSTGRICDSPHRILDGKNCTLIDALLGDPHC




DGFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFT




NEGFNWTGVAQDGTSYACKRGSVKSFFSRLNWLHKLEYKYPALN




VTMPNNDKFDKLYIWGVHHPSTDSVQTSLYVQASGRVTVSTKRS




QQTVIPNIGSRPWVRGISSRISIYWTIVKPGDILLINSTGNLIAPRGYF




KIRSGKSSIMRSDAPIGNCNSECITPNGSIPNDKPFQNVNRITYGACP




RYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGW




YGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIEKTNEKFHQIE




KEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS




EMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTY




DHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVVL




LGFIMWACQKGNIRCNICI





157
40146-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08B-50
NDQIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCD




GFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFN




NESFNWTGVTQNGTSSACIRRSNNSFFSRLNWLTHLNFKYPALNV




TMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQ




AVIPNIGSRPRIRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIR




SGKSSIMRSDAPISKCKSECITPNGSIPNDKPFQNVNRITYGACPRY




VKQSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYG




FRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKE




FSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSE




MNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYD




HDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVALL




GFIMWACQKGNIRCNICI





158
40043-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTIT



V08H
NDQIEVTNATELVQSSSTGEICDSPHQILDGKNCTLIDALLGDPQC




DGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEF




NNESFNWTGVTQNGTSSACIRRSKNSFFSRLNWLTHLNFKYPALN




VTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQASGRITVSTKRSQ




QTVSPNIGSRPRVRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFK




IRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPR




YVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGW




YGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQI




EKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTD




SEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTY




DHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVAL




LGFIMWACQKGNIRCNICI





159
11056-
MKTIIALSYILCLVFTQKLPGNDNSTATLCLGHHAVPNGTIVKTITN



V08H
DQIEVTNATELVQSSSTGEICDSPHQILDGENCTLIDALLGDPQCDG




FQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNE




SFNWTGVTQNGTSSACIRRSNNSFFSRLNWLTHLKFKYPALNVTM




PNNEKFDKLYIWGVHHPGTDNDQIFPYAQASGRITVSTKRSQQTVI




PNIGSRPRVRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSG




KSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVK




QNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFR




HQNSEGIGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFS




EVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMN




KLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHD




VYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVALLGFI




MWACQKGNIRCNICI





160
40116-
MKTIIALSYIFCLALGQDLPGNDNSTATLCLGHHAVPNGTLVKTIT



V08B
DDQIEVTNATELVQSSSTGKICNNPHRILDGIDCTLIDALLGDPHCD




VFQNETWDLFVERSKAFSNCYPYDVPDYASLRSLVASSGTLEFITE




GFTWTGVTQNGGSNACKRGPGSGFFSRLNWLTKSGSTYPVLNVT




MPNNDNFDKLYIWGVHHPSTNQEQTSLYVQASGRVTVSTRRSQQ




TIIPNIGSRPWVRGLSSRISIYWTIVKPGDVLVINSNGNLIAPRGYFK




MRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACP




KYVKQNTLKLATGMRNVPEKQTRGLFGAIAGFIENGWEGMIDGW




YGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRVIEKTNEKFHQI




EKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTD




SEMNKLFEKTRRQLRENAEDMGNGCFKIYHKCDNACIESIRNGTY




DHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVVL




LGFIMWACQRGNIRCNICI





161
40154-
MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTLVKTIT



V08B
NDQIEVTNATELVQSSSTGRICDSPHQILDGENCTLIDALLGDPHCD




GFQNKEWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNN




ESFNWTGVAQNGTSSACKRRSIKSFFSRLNWLHQLENRYPALNVT




MPNNDKFDKLYIWGVHHPSTDSVQTSVYVQASGRVTVSTKRSQQ




TVIPNIGSRPWVRGVSSRISIYWTIVKPGDILLINSTGNLIAPRGYFKI




RSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPR




YVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMMDGW




YGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIEKTNEKFHQIE




KEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS




EMNKLFERTRKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTY




DHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVVL




LGFIMWACQKGNIRCNICI





162
11702-
MEKIVLLLAIVSLVKSDQICIGYHANNSTEQVDTIMERNVTVTHAQ



V08H
DILEKTHNGKLCNLDGVKPLILRDCSVAGWLLGNPMCDEFLNVPE




WSYIVEKINPANDLCYPGNFNDYEELKHLLSRINHFEKIQIIPKNSW




SDHEASGVSSACPYQGRSSFFRNVVWLTQKDNAYPTIKRSYNNTN




QEDLLVLWGIHHPNDAAEQTRLYQNPTTYISVGTSTLNQRLIPKIA




TRSKVNGQSGRMEFFWTILKSNDAINFESNGNFIAPENAYKIVKKG




DSTIMKSELEYGNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVK




SNRLVLATGLRNSPQGERRRKKRGLFGAIAGFIEGGWQGMVDGW




YGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNTQFEA




VGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLD




FHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHRCDNECMESVR




NGTYDYPQYSEEARLKREEISGVKLESIGTYQILSIYSTVASSLALAI




MVAGLFLWMCSNGSLQCRICI





163
40158-
MEKIVLLLAMIGLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHA



V08B
QDILEKTHNGKLCNLDGVKPLILKDCSVAGWLLGNPMCDEFLNV




SEWSYIVEKASPANGLCYPGDFNDYEELKHLLSRINHLKKIKIIPKS




YWSNHEASSGVSAACSYLGEPSFFRNVVWLIKKNNTYPPIKVNYT




NTNQEDLLVLWGIHHPNDEKEQIRIYQNPNTSISVGTSTLNQRLVP




KIATRPKVNGQSGRMEFFWTILKPNDSINFDSNGNFIAPEYAYKIA




KKGDSVIMKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECP




KYVKSNRLVLATGLRNAPQREGGRRRKRGLFGAIAGFIEGGWQG




MVDGWYGYHHSNEQGSGYAADKESTQKAIDGITNKVNSIIDKMN




TQFEIVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENE




RTLDFHDSNVKNLYEKVRLQLRDNAKELGNGCFEFYHKCDNECM




ESVRNGTYDYPQYSEEARLSREEISGVKMESMVTYQILSIYSTVAS




SLALAIMVAGLSLWMCSNGSLQCRICI





164
11062-
MEKIVLLFAIVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHAQ



V08H1
DILEKTHNGKLCDLDGVKPLILRDCSVAGWLLGNPMCDEFINVPE




WSYIVEKANPVNDLCYPGDFNDYEELKHLLSRINHFEKIQIIPKSS




WSSHEASLGVSSACPYQGKSSFFRNVVWLIKKNSTYPTIKRSYNN




TNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVP




RIATRSKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIV




KKGDSTIMKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECP




KYVKSNRLVLATGLRNSPQRERRRKKRGLFGAIAGFIEGGWQGM




VDGWYGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNT




QFEAVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENE




RTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNEC




MESVRNGTYDYPQYSEEARLKREEISGVKLESIGIYQILSIYSTVAS




SLALAIMVAGLSLWMCSNGSLQCR





165
40160-
MEKIVLLFAIISLVKSDHICIGYHANNSTEQVDTIMEKNVTVTHAQ



V08B
DILEKTHNGKLCDLNGVKPLILKDCSVAGWLLGNPMCDEFIDVPE




WSYIVEKANPANDLCYPGNFNDYEELKHLLSRINHFEKIRIIPKDS




WPDHEASLGVSAACSYQGNSSFFRNVVWLLKKDNAYPIIKKSYN




NTNKEDLLVLWGIHHPNDEAEQTRLYQNPTTYVSIGTSTLNQRLV




PRIATRSKVNGQSGRIDFFWTILKPNDAINFESNGNFIAPEYAYKIV




KKGDSTIMRSEVEYGNCSTRCQTPMGAINSSMPFHNIHPLTIGECP




KYVKSNKLVLATGLRNSPQIERRRRKRGLFGAIAGFIEGGWQGMV




DGWYGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNTQ




FEAVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENER




TLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECM




ESVRNGTYDYPQYSEEARLKREEISGVKLESIGTYQILSIYSTVASS




LVLAIMVAGLSLWMCSNGSLQCRICI





166
11689-
MEKIVLLLATVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHA



V08H-50
QDILERTHNGKLCDLNGVKPLILRDCSVAGWLLGNPMCDEFINVP




EWSYIVEKASPANDLCYPGNFNDYEELKHLLSRISHFEKIQIIPKSS




WSNHDASSGVSSACPYLGKSSFFRNVVWLIKKNSTYPTIKRSYNN




TNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVP




EIATRPKVNGQSGRIEFFWTILKPNDAINFESNGNFIAPEYAYKIVK




KGDSTIMKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPK




YVKSNRLVLATGLRNAPQRERRRKKRGLFGAIAGFIEGGWQGMV




DGWYGYHHSNEQGSGYAADQESTQKAIDGVTNKVNSIINKMNTQ




FEAVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENER




TLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECM




ESVKNGTYDYPQYSEEARLNREEISGVKLESMGTYQILSLYSTVAS




SLALAIMVAGLSLWMCSNGSLQCRICI





167
40166-
MIAIIVIAILAAAGKSDKICIGYHANNSTTQVDTILEKNVTVTHSVE



V08B
LLENQKEERFCKILNKAPLDLRGCTIEGWILGNPQCDLLLGDQSWS




YIVERPNAQYGICYPGVLNEVEELKALIGAGERVERFEMFPKSTW




AGVDTSSGVTRACPYNSGSSFYRNLLWIIKTKSAAYPVIRGTYNNT




GKQPILYFWGVHHPPDINEQNTLYGSGDRYVRMGTENMNFAKSP




EIAARPAVNGQRGRIDYYWSVLKPGETLNVESNGNLIAPWYAYKF




VSTNNKGAIFKSNLPIENCDATCQTIAGALRTNKTFQNVSPLWIGE




CPKYVKSESLRLATGLRNVPQIETRGLFGAIAGFIEGGWTGMIDG




WYGYHHENSQGSGYAADRESTQKAIDGITNKVNSIIDKMNTQFEA




VDHEFSNLERRIDNLNKRMEDGFLDVWTYNAELLVLLENERTLD




LHDANVKNLYEKVKSQLRDNANDLGNGCFEFWHKCDNECIESVK




NGTYDYPKYQDESKLNRQEIESVKLENLGVYQILAIYSTVSSSLVL




VGLIIAMGLWMCSNGSMQCRICI





168
40399-
MIAFIVIAILVTTGKSDKICIGYHANNSTTKVDTILEKNVTVTHSVE



V08H1-
LLENQKEERFCKISNKAPLDLRDCTLEGWILGNPRCGILLADQSWS



50
YIVERPNARNGICYPGTLNEVEELKALIGSGERVERFEMFPKSTWA




GVDTNSGVSSACPLGNGPSFYRNLLWIIKHTTSGYPVIKGTYTNTG




DKSVLYFWGVHHPPDTTEQNVLYGSGNRYVRMGTESMNFARSPE




VAARPAVNGQRGRIDYFWSILKPGETLNVESNGNLIAPWYAYRFV




NKDSKGAIFRSNLPIENCDATCQTTEGVIRTNKTFQNVSPLWIGEC




PKYVKSKSLRLATGLRNVPQIETRGLFGDIAGFIEGGWTGMIDGW




YGYHHENSQGSGYAADRESTQKAIDGITNKVNSIIDKMNTQFEAV




GHEFSNLERRIDNLNKRMEDGFLDVWTYNAELLVLLENERTLDL




HDANVKNLHERVKSQLRDNANDLGNGCFEFWHKCDNECMESVK




NGTYNYPKYQAESRLNKQKIESVKLEEFGVYQILAIYSTVSSSLVL




VGLIMAMGLWMCSNGSMQCRICI





169
40171-
MNTQILILAISAFLCVRADKICLGHHAVSNGTKVDTLTEKGIEVVN



V08B
ATETVEQKNIPKICSKGKQTIDLGQCGLLGTTIGPPQCDQFLEFSAN




LIIERREGDDICYPGKFDNEETLRQILRKSGGIKKENMGFTYTGVRT




NGETSACRRSRSSFYAEMKWLLSNTDNGVFPQMTKSYKNTKREP




ALIIWGIHHSGSTAEQTRLYGSGNKLVTVWSSKYQQSFAPSPGPKP




QINGQSGRIDFYWLMLDPNDTVTFSFNGAFIAPDRASFLRGKSLGI




QSDAQLDNNCEGECYHIGGTIISNLPFQNINSRAIGKCPRYVKQKS




LMLATGMKNVPENSIHKQLTHHMRKKRGLFGAIAGFIENGWEGLI




DGWYGYRHQNAQGEGTAADYKSTQSAINQITGKLNRLIEKTNQQ




FELIDNEFNEIEKQIGNVINWTRDSIIEVWSYNAEFLVAVENQHTID




LTDSEMNKLYEKVRRQLRENAEEDGNGCFEIFHQCDNDCMASIR




NNTYDHKKYRKEAIQNRIQIDAVKLSSGYKDIILWFSFGASCFLFL




AIAMVLAFICIKNGNMRCTICI





170
11082-
SKSRGYKMNTQILVFALVASIPTNADKICLGHHAVSNGTKVNTLT



V08B
ERGVEVVNATETVERTNVPRICSKGKRTVDLGQCGLLGTITGPPQ




CDQFLEFSADLIIERREGSDVCYPGKFVNEEALRQILRESGGIDKET




MGFTYSGIRTNGTTSACRRSGSSFYAEMKWLLSNTDNAAFPQMT




KSYKNTRKDPALIIWGIHHSGSTTEQTKLYGSGNKLITVGSSNYQQ




SFVPSPGARPQVNGQSGRIDFHWLILNPNDTVTFSFNGAFIAPDRA




SFLRGKSMGIQSEVQVDANCEGDCYHSGGTIISNLPFQNINSRAVG




KCPRYVKQESLLLATGMKNVPEIPKRRRRGLFGAIAGFIENGWEG




LIDGWYGFRHQNAQGEGTAADYKSTQSAIDQITGKLNRLIEKTNQ




QFELIDNEFTEVERQIGNVINWTRDSMTEVWSYNAELLVAMENQ




HTIDLADSEMNKLYERVKRQLRENAEEDGTGCFEIFHKCDDDCM




ASIRNNTYDHSKYREEAIQNRIQIDPVKLSSGYKDVILWFSFGASCF




ILLAIAMGLVFICVKNGNMRCTICI





171
40325-
TQILVFALIAIIPTNADKICLGHHAVSNGTKVNTLTERGVEVVNAT



V08H
ETVERTNIPRICSKGKRTVDLGQCGLLGTITGPPQCDQFLEFSADLII




ERREGSDVCYPGKFVNEEALRQILRESGGIDKEAMGFTYSGIRTNG




ATSACRRSGSSFYAEMKWLLSNTDNAAFPQMTKSYKNTRKSPALI




VWGIHHSVSTAEQTKLYGSGNKLVTVGSSNYQQSFVPSPGARPPV




NGLSGRIDFHWLMLNPNDTVTFSFNGAFIAPDRASFLRGKSMGIQS




GVQVDANCEGDCYHSGGTIISNLPFQNIDSRAVGKCPRYVKQRSL




LLATGMKNVPEIPKGRGLFGAIAGFIENGWEGLIDGWYGFRHQNA




QGEGTAADYKSTQSAIDQITGKLNRLIEKTNQQFELIDNEFNEVEK




QIGNVINWTRDSITEVWSYNAELLVAMENQHTIDLADSEMDKLYE




RVKRQLRENAEEDGTGCFEIFHKCDDDCMASIRNNTYDHSKYREE




AMQNRIQIDPVKLSSGYKDVILWFSFGASCFILLAIVMGLVFICVK




N
















TABLE 9







Sequences of gp160 antigens that can be included in an 


HIV vaccine










ASC

SEQ



number
Clade
ID
Sequence





A.IN.01.
A
172
MRVMETQRSYQHLWRWGTMILGMLIIYSAAENLWVTVY


1579A.D


YGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPT


Q083238


DPNPQEIHLENVTEDFNMWKNNMVEQMHTDIISLWDQSL





KPCVKLTPLCVTLNCSNVNNTQEVKNCTFNMTTELRDKR





QKVSSLFYKLDIVPIEEGQGNSSNSSYEEYRLINCNTSAITQ





ACPKVTFEPIPIHYCTPAGFAILKCKDEEFNGTGPCRNVST





VQCTHGIKPVVSTQLLLNGSLAKNKVIIRSENITNNVKTIIV





QLAEPVIINCTRPNNNTRESVRIGPGQTFYTSNIIGDIRQAH





CNVSKSAWNKTLHVVGEQLEKYFVNKTIKFDRPTGGDLEI





TTHSFNCGGEFFYCNTSELFNSTWNSSTQGTNNTELNDTIT





LPCRIKQVINMWQRVGQAMYAPPIKGIIKCVSNITGLILTR





DGGINSTNGTETFRPGGGDMRDNWRSELYKYKVVQIEPL





GVAPTKARRRTVGREKRAIGLGAVFLGFLGAAGSTMGAA





SITLTVQARQLLSGIVQQQSNLLRAIEAQQQMLRLTVWGI





KQLQARVLAVERYLRDQQLLGIWGCSGKLICTTNVPWNS





SWSNKTQEEIWGNMTWLQWDKEISNYTEIIYNLIEKSQNQ





QEKNEKDLLALDKWATLWSWFDITKWLWYIKIFIMIVGG





LIGLRIVFAVLSVINRVRQGYSPLSFQTHTPNPRGLDRPGGI





EEEGGEQDRTRSIRLVSGFLALAWDDLRSLCLFSYHRLRD





FVWIAKRTVELLGHSSLKGLRLGWEGLKYLWNLLVYWG





LELKKSAINLFDTIAIVVAGWTDRIIEIGQGIGRGILHIPRRIR





QGLENFLL





A.KE.00.
A
173
MRVTGIQKNCQSLWRWGTMILGMLMICSVVGNLWVTVY


00KE


YGVPVWKEADTTLFCASNARAYDTEVHNVWATHACVPT





DPNPQEIDLENVTEEFNMWKNNMVEQMHTDIISLWDQFL





KPCVKLTPLCVTLDCGYNVTNLTSTSNMKEDITNCSYNMT





TEIRDRKQKVYSLFYRLDIVPIDEEKNNNSVTSPYRLINCN





TSAITQACPKVSFEPIPIYYCAPAGFAILKCKDAEFNGTGPC





KNVSTVQCTHGIRPVISTQLLLNGSLAENGTKIRSENITNN





AKTIIVQLNETVQINCTRPNNNTRRSVRIGPGQAFYATGGI





TGDIRRAYCNVSRQKWEQALKGVVIQLRKHFNNKTITFTN





SSGGDVEITTHSFNCGGEFFYCNTSGLFNSTWDNNTDPTN





TTSNDTITLQCRIKQFINMWQRTGQAMYAPPIQGVIRCDSN





ITGLLLTRDGGVGNNTTETFRPGGGDMRDNWRSELYKYK





VVKIEPLGVAPTGAKRRVVEREKRAVGIGAVFIGFLGAAG





STMGAASVTLTVQARQLLSGIVQQQSNLLRAIEAQQHLLK





LTVWGIKQLQARVLAVERYLRDQQLLGIWGCSGKLICTT





NVPWNSSWSNKSQEEIWGNMTWLQWDKEISKYTQTIYDL





FEESQNQQEKNEQDLLALDKWANLWTWFDITNWLWYIKI





FIMIVGGLIGLRIVFSVLSVINRVRQGYSPLSFQTLTPSPRGL





DRLGRIEEEGGERDRDRSIRLVSGFLALAWDDLRNLCLFS





YHRLRDFILIAARTVELLGHNSLKGLRLGWEGLKYLWNLL





IYWGRELKFSAINLIDTIAIAVAGWTDRIIEAGQRLCRAILN





IPRRIRQGAERAWL





A.KENY
A
174
MRVMGTQRNCQHLLRWGTMILGMIIICSAAENLWVTVYY


A


GVPVWKNAETTLFCASDAKAYETEKHNVWATHACVPTD





PNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLK





PCVKLTPLCVTLNCSNVNVTATSGTTHGGRGRMKNCSYN





ITTELRDRRQKVYSLFYRLDVVPINENNSSNNEYRLINCNT





SAITQACPKVTFEPIPIHYCAPAGFAILKCKDKKFNGTGPCT





NVSTVQCTHGIKPVVSTQLLLNGSLAEGEVMIRSENITNNA





KNIIVQFAEPVKINCTRPNNNTRMSIRIGPGQAFYATGDIIG





NIRQAHCNVSRAEWNTTLQKVVTKLREYFGNNKTIKFAN





SSGGDLEITTHSFNCGGEFFYCNSSGLFNSTWTNMQESNST





ESNDTITLPCRIKIINMWQRAGQAMYAPPIQGIIKCVSNITG





LILTRDGGDNNSESETFRPGGGDMRDNWRSELYKYKVVK





IEPLGVAPTKAKRRVVEREKRAVGIGAVFLGFLGAAGSTM





GAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLKLTV





WGIKQLQARVLAVERYLRDQQLLGIWGCSGKIICTTNVP





WNSSWSNKSQSEIWENMTWLQWDKEISNYTDIIYNLLEES





QNQQEKNEQDLLALDKWANLWNWFDISNWLWYIKIFIMI





VGGLIGLRIVFTVLSIINRVRQGYSPLSFQTHTPAPRGLDRP





ERIEEEGGEQDRDRSIRLVTGFLALAWDDLRSLCLFSYHRL





RDFILIATETVELLGHSSLRGLRLGWEGLKYLWNLLLYWG





QELKISASNLVDTIAIVVAGWTDRVIEIAQGIGRAILHIPRRI





RQGFERALL





A.RW.SF
A
175
MRVMGIQMNCQNLLRWGTMILGMLIICSATSKLWVTVYY


1703


GVPVWKDAETTLFCASDAKAYEREVHNVWATHACVPTD





PDPQEIYLENVTEGFNMWKNNMVEQMHTDIISLWDQSLK





PCVKLTPLCVTLNCSHNITTHNITTNNPNITKYMEGEIKNC





SYNMTTELRDKKQKVYSLFYKLDVVPIDKNNGNNNIRTQ





YRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDAE





FNGTGPCKNVSTVQCTHGIRPVISTQLLLNGSLAEGRVKIR





SENITNNAKTIIVQLNKTVEINCTRPNNNTRKSVRIGPGQAF





YATGDIIGDIRQAYCNVSRADWNKTLQGVANQLKSYFSN





KTIIFASSSGGDLEITTHSFNCGGEFFYCDTSGLFNSTWGGN





STDSIQESNSTESNDTIILPCRIKQIINMWQRVGQAMYAPPI





QGVIKCISNITGLLLTRDGGDNNSANETFRPTGGNMRDNW





RSELYKYKVVKIEPLGVAPTPAKRRVVQREKRAVGIGAVF





IGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIE





AQQHLLKLTVWGIKQLQARVLAVERYLKDQQLLGIWGCS





GKLICTTNVPWNSSWSNRTQSEIWNNMTWLQWDKEISNY





TDIIYNLIEESQIQQEKNEQELLALDKWANLWNWFDISKW





LWYIRIFIMIVGGLIGLRIVFAVLSIINRVRQGYSPLSFQIHTP





NPRGPDRPERIEEEGGEQGRDRSIRLVSGFLALAWDDLRSL





CLFSYHRLRDFILIAARTVELLKRSSLRGLRLGWEGLKYL





WNLLLYWGRELKNSAINLLDTIAIAVAGWTDRVIEIGQAI





CRAILNIPRRIRQGFERALL





A.NG.N
A
176
MRARGMQRNWQHWGRWGILFLLIIYSAAENLWVTVYCG


G1935


VPVWKDAKTTLFCASDARAYHPEAHNVWATHACVPTDP





NPQEIHLENVTEKFNMWKNNMVEQMHEDIISLWDQSLKP





CVKLTPLCVTLDCHDLNSSNSSNYTSSNMQGEMTNCSFN





MTTELRDKKQKVNAFFYRLDVVQIKENNSTQYRLINCNTS





AITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCK





NVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSENITNNA





KTIIVQLANPVRINCTRPGNNTRKSVRIGPGQTFYATGEIIG





DIRQAHCNVNRTEWDSSLQKVATQLRQYFKNTTKIIFTKH





SGGDLEITTHSFNCGGEFFYCNTSGLFNSTWDNNNIASSNH





TGSNDTITLQCRIKQIINLWQKVGQAMYAPPIPGIIRCESNI





TGLLLTRDGGNNNSTNETFRPGGGDMRDNWRSELYKYK





VVKIEPLGVAPTKAKRRVVARKKRAVGLGAVFLGFLGAA





GSTMGAASIALTVQARQLLSGIVQQQSNLLRAIEAQQHML





KLTVWGIKQLQARVLAIERYLKDQQLLGIWGCSGKLICTT





NVPWNSGWSNKSQNEIWDKMTWLEWDKEISNYTQLIYTL





IEESQNQQEKNEQELLALDKWASLWSWFDISNWLWYIKIF





IMVVGGLIGLRIVFAVLSVVRRVRQGYSPLSLQTLIPSPRGP





DRPEGIEERGGEQDKDRSIRLVSGFLALAWDDLRSLCLFSY





HRLRDFILIAARIVETLGHRGWQGLKYLGNLVQYWSQEL





KNSAISLLDTIAIAVAEWTDRVIEIGQRICRAILNIPRRIRQG





LERALL





A.GM.00
A
177
MKVRGTQRNYAPLWKWGTIFLGLIIICKATENLWVTVYY


.N9845.H


GVPVWRDAETILFCASDAKAYEKEVHNVRDTHACVPTDP


Q385459


NPQEIELENVTEEFNMGKNNMVEQMHTDIISLWDQSLKPC





VKLTPICVTLNCHNITTNDTGTGNVTGTGNVTDNNIGEMT





NCSFNITTEIRDKRQNVYSLFYKLDVVPINGNGSGNDSEYR





LINCNTSTITQACPKVSFEPIPIHYCAPAGFAILKCRDKNFN





GTGPCRNVSSVQCTHGIKPVVSTQLLLNGSLAEGDVVIRSE





NLTNNAKTIIVQLNESVPINCTRPNNNTRKSVRIGPGQALY





ATDIIGNIRQAHCNISKQNWNTALQKIAAQLRQHFYRTINF





TKSSGGDLEITTHSFNCRGEFFYCDTSGLFNSIWNSTAISQN





VEESNDTITLPCKIRQIVNMWQRVGQAIYARPIQGIINCTSN





ITGLLLTRDGGDNDNKTETFRPGGGNMKDNWRSELYRYK





VVKIEPLGVAPTKAKRRVVEREKRAVGLGAVIFGFLGAAG





STMGAASITLTAQATQLLSGIVQQQSNLLRAIEAQQHLLKL





TVWGIKQLQARVLALERYLRDQQLLGIWGCSGKLICTTTV





PWNSSWSNKSQNEIWENMTWQWDKEISNYTDHYDLIAES





QNQQEKNEKDLLALDKWASLWNWFDISKWLWYIKLFIMI





VGGLIGLRIVFTVLSIINRVRQGYSPLSFQTHTPSPGGLDRP





ERIEEEGGDRDRDRSTRLVSGFLALVWDDLRSLCLFSYHR





LRDFTLIAARTVELLGHSSLKGLRLGWEGLKYLWNLLSY





WGRELKSSAITLLNATAIAVAEGTDRVIEVGQRAGRALLH





IPRRIRQGWKRALL





A.UG.07.
A
178
MRVMGIQKNWQHLWKWGTLILGMIMICSTAGNLWVTVY


191845


YGVPVWKDADTTLFCASDAKAYETEKHNVWATHACVPT





DPNPQETYLENVTEEFNMWKNNMVEQMHTDIISLWDQSL





KPCVKLTPLCVTLHCTDVNVTNTNVTNTNGTSTSSSKVTI





NDREEIKNCSFNVTTELRDKEKKVYSLFYKLDVVKIDKNK





SDYILINCNTSTITQACPKVSFEPIPIHYCAPAGFAILKCRDI





DFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEGGIRI





RFENITNNAKTIIVQLNSSVPINCTRPNNNTRQSVRIGPGQS





FFATGEIIGDIRQAHCNVSKSDWHKTLQQVAEQLEKHFKN





KTIIFDQPSGGDLEITTHSFNCGGEFFYCNTAGLFNRNWTW





SKENGTRMQNKTDSNGTITLQCRIKQIINMWQRTGQAMY





APPIQGVIKCASNITGLLLTRDGGDNKNNSTNSTNQNNTEI





FRPIGGDMRNNWRSELYKYKVVKIEPLGVAPTTARRRVV





GREKRAVLGGLGAVFLGFLGAAGSTMGAASITLTVQARQ





LLSGIVQQQSNLLRAIEAQQHLLKLTVWGIKQLQARVLAV





ERYLRDQQLLGIWGCSGKLICPTNVPWNSSWSNKTMSEI





WDNMTWLQWDKEISNYTNIIYNLIEESQNQQEKNEQDLL





ALDKWASLWNWFDISKWLWYIKIFIMIVGGLIGLRIIFAVF





SIINRVRQGYSPLSFQTHPPNPGGLDRPGRIEEEGGEQDRG





RSIRLVSGFLALAWDDLRSLCIFSYHRLRDFTLIAERTVELL





GHNSLKGLRLGWEGLKYLWNLLVYWGQELKTSAISLFDT





LAIVVAGWTDRVIEIGQRLGRGFLHIPRRIRQGLERVLL





A.GM.09
A
179
MRVREIPRNCPLLWRWGTMIFWILLICNAEELWVTVFYGV


.N05785


PVWRDAETTLFCASDAKAYDTEVHNIWATHACVPTDPNP


6.HQ385


QEIGLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCV


453


QLTPLCVTLNCSHNISTTNSTSTGNISTINSTSADMRNCSFN





MTTELRDKKQNVYSLFYKLDVVQIGNSSSSEYRLINCNIST





ITQACPKVSFEPIPIHYCAPAGYAILKCREKNFNGTGPCTN





VSTVQCTHGIKPVVSTQLLLNGSLAEGDVMIRSENITDNA





KTIIVQFNESVMINCTRPNNNTRRSVRIGPGQTFYATGEIIG





DIRRAHCNVSGEQWRKALQNVKVQLRKYFNTTIEFANSS





GGDLEITTHSFNCRGEFFYCDTTGLFNNTEKNDTIMLPCKI





RQIVRMWQRVGQAMYAPPIPGVINCLSNITGLLLTRDGGD





STNNTEKFRPGGGDIRDNWRSELYKYKVVKIEPLGVAPTK





AKRRVVERRKRAGLAAVFLGFLGAAGSTMGAASITLTVQ





ARQLLSGIVQQQNNLLKAIEAQQQMLRLTVWGIKQLQAR





VLALERYLRDQQLLGIWGCSGKLICTTNVPWNSSWSNKT





YNSIWDNMTWLQWDKEINNYTNTIYTLIEKAQNQQEKNE





QDLLALDKWASLWNWFDITNWLWYIRIFIMIAGGLIGLRI





VFAVLSIINRVRKGYSPLSFQTPTPNPGDLDRPERIEEEGGE





QGRDRSIRLVSGFLALVWDDLRSLCLFSYHRLRDLVLLTT





RAVELLGHNSLKGLRLGWNVLKQLGNLLSYWGQELKNS





AINLLDTIAIAVANWTDRVIEIGQRVGRAICNIPRRIRQGLE





RLLL





A.CD.97.
A
180
MKVRGIQRNSQHLWRWGTLIFWTIIICSAAEKLWVTVYY


97CD


GVPVWKDAETTLFCASHANAYEKEQHIVWATHACVPTDP





NPLEMDLDNVTEEFNMWKNNMVEQIHDDIIRLWDRSLQP





RVKLTPLCVTLDCQPVNSTNNTKVEVPGEMTNCSFNMTT





ELSDKKQKVRSLFYRIDLVQIGNNTNDSSNRSLQYRLINCN





TSTITQACPKVSFEPVPIHYCAPAGFAILKCKDQEFNGTGP





CKNVSTVQCTHGIKPVVSTQLLLNGSLAEGGIKIRSANISY





NAKNIIVQLDIPVKINCSRPNNNTRTSVRIGPGQTFYATGDI





IGNIRQAHCNLSRTAWNDTLYNVSKALREHFPNKTIIFNKS





SGGDLEVTQHMFICGGEFFYCNTSGLFNSSWAPNGSWINN





IGSRENDTMTLPCRIKQIINMWQRVEQAMYAPPIKGNITCV





SNITGLLLTRDGGTNETRENETFRPIGGEMRDNWRSELYK





YKVVKIEPLGVAPTKAKRRVVEGREKRAVGMGAFFLGFL





GAAGSTMGAASITLTVQARQLLAGIVQQQSNLLKAIEAQQ





HLLRLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL





ICTTTVPWNSSWSNKSQDEIWNNMTWMQWDKEINNYTEI





IYGLIEESQKQQEKNEFELLELDKWANLWNWFEISNWLW





YIKIFIMIVGGLIGLRIVFAVLSIVNRVRQGYSPLSFQTPIPSP





EGPDRPGRIEEGGGEQGRTRSIRLVSGFLELAWDDLRSLCL





FVYHRSRDFILIAARTVELLGHSSLKGLRLGWEGLKYLWN





LLVYWSQELKTSAVSAFNALAITVAEWTDRAIEVGQRIGR





AFLNIPRRIRQGFERALL





A.TZ.08.
A
181
MKVKGIQMNSQHLLRWGIMILGLIIICSAAEKLWVNVYYG


CH0175


VPVWKNADTTLFCASDAKAYNPEAHNIWATHACVPTDPS





PQEIYLENVTEDFNMWKNNMVEQMHADIISLWDQSLKPC





VQLTPLCVTLECKKINITNTTGNNVTANNDTILAREEMRN





CSYNMTTELRDKKQKVHSLFYRLDIVEINNNNNGSSNNNS





DNDSRPYRLINCNTSAITQACPKISFEPIPIHYCAPAGFAILK





CRDKEFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAE





KRVMIRTENITNNAKNIIVQFNESVEINCTRPNNNTRQSVRI





GPGQAFYTTEIIGDIRRAYCTVNETKWNQTLYRVAEQLRK





YFNNDTTIIFANSSGGDVEITTHSFNCGGEFFYCNTSGLFNS





RWENDTQPNNTQPNGTITLPCRIKQIINMWQRTGQAIYAPP





IRGVISCQSNITGLLLTRDGGGKNTSDNETFRPGGGDMRD





NWRSELYKYKVVKIEPLGVAPNRARRRVVEREKRAVGIG





AVFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQSNL





LRAIEAQQHLLKLTVWGIKQLQARLLAVERYLRDQQLLGI





WGCSGKLICATNVPWNSSWSNKSYDEIWGNMTWLQWD





KEIDNYTQLIYQLIEDSQNQQEKNEQELLALDKWANLWN





WFEISNWLWYIRLFVMIVGGLIGLRIVLAVLSVINRVRQGY





SPLSFQTHTPSPGGLDRPRGIEEEGGEQDRGRSIRLVSGFLA





LAWDDLRSLCLFSYHRLRDFVLIAARIVELLGHNSLKGLR





LGWEGLKYLWNLLAYWGRELKNSAINLLDTIAIAVAGWT





DRVIELGQRLCRAFLHIPRRIRQGFERALL





A.KE.99.
A
182
MKVKGTQRNFQHLLRWGIMVLGMIIICSATEQLWVTVYY


99KE


GVPVWKDAETTLFCASDAKAHETEVHNVWATHACVPTD





PNPQEIELKNVTEEFNMWKNNMVEQMHTDIISLWDQSLK





PCVKLTPLCVTLNCSNFSVTNNSTNNSTNTTTIDNNMQGEI





KNCSYNVTTVIRDEKQKVYSLFYRTDVVEIENNSSQYRLI





NCNTSAITQACPKVTFEPIPIHYCAPAGFAILKCRNETFNGT





GPCNNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIMIRSENIS





DNAKSILVQLTRPVIINCTRPNNNTRKSIHIGPGQAFYATG





DIIGDIRQAYCTVNRTEWNETLHNVTTQLKTYFNKTIVFTS





HSGGDLEITTHTFNCGGEFFYCNTSGLFNSTWDDNIKNDT





NSTQGSNSTEIILPCKIKQIVRMWQRTGQAMYAPPIPGVIR





CVSNITGLILTRDGGNTSSINETFRPGGGDMRDNWRSELY





KYKVVKIEPLGVAPTKARRRVVEREKRAVGIGAVLFGFLG





AAGSTMGAASITLTVQARQLLAGIVQQQSNLLRAIEAQQQ





MLRLTVWGIKQLQARVLAVERYLRDQQLLGLWGCSGKLI





CTTTVPWNSSWSNKSQNEIWNNMTWLQWDKEISNYTQII





YDLIEESQNQQEKNEQDLLALDKWANLWSWFDISSWLW





YIRIFIMIVGGLIGLRIVFAVLSIIKRVRQGYSPLSFQIHTPNP





GEPDRPGEIEEEGGKQDRSRSIRLVSGFLALAWDDLRSLCL





FCYHRLRDFILIAARTVELLGHSSLKGLRLGWEGLKYLWN





LLVYWSQELKSSAINLFDTLAITVAGWTDRVIEVGQRLCR





ALLHIPRRIRQGLERALL





A.VI191
A
183
MRVKGIQMNWQHLWKWGTLILGLGIICNAENLWVTVFY


A


GVPVWKDAETTLFCASDAKAYSEEKHNVWATHACVPTD





PDPKEINLVNVTEDFNMWKNKMVEQMQEDIISLWDQSLK





PCVKLTPLCVTLECHNSTTPDNNTTQSTIIDETTNCTYNMT





TELRDKVQKVHSLFYRLDVVQINGINGTSNNNSAQYRLIN





CNTSAITQACPKVSFEPIPIHYCAPAGFAILKCRDKEFNGTG





PCRNVSSVQCTHGIKPVVSTQLLLNGSLAEEKIMIRSENFT





DNGKTIIVQLVEPVKINCTRPSNNTRKGIHIGPGRAFYATG





QITGDIRQAHCNISRTEWNKTLHQVAIQLREHFITKNTTIIF





NSSSGGDLEITTHSFNCGGEFFYCSTSGLFNSTWENGTQVA





NDTGSNDTITLPCRIKQIVRMWQRVGQAMYAPPIQGMIRC





ESNITGLILTRDGVGNNSTNETFRPGGGDMRDNWRSELYK





YKVVKIEPLGVAPSRAKRRVVAREKRAVGLGAVFLGFLG





AAGSTMGAASLTLTVQARQLLSGIVQQQSNLLRAIEAQHH





LLQLTVWGIKQLQARVLALERYLRDQQLLGIWGCSGKLIC





PTNVPWNYSWSNKSYDDIWNNMTWLQWDKEIDNYTQLI





YGLIEESQNQQEKNEQDLLALDKWASLWSWFDISNWLW





YIRIFIMIVGGLIGLRIVFAVLSIINRVRQGYSPLSFQTLPHH





QREPDRPEGIEEGGGEQDRGRSIRLVSGFLALAWDDLRSL





CLFSYHRLRDFILIVARIVETLGRRGWEALKYLWNLLLYW





GRELKISAITLLDATAIAVARWTDRIIEIGQRIVRAILNIPRRI





RQGAEKALQ





A.CD.97.
A
184
MRARGIQRNYQHLWKWGTLIFGMLMICNAVDNLWVTVY


97DC.K


YGVPVWREAKTTLFCASDAKTYDQEVHNVWATHACVPT


MST91


DPDPQEIDLRNVTEKFNMWKNDMVEQMHTDIISLWDQSL





QPCVKLTPLCVTLNCTNITNISNNNNTDSRNITHSEDPAEM





RNCSFNVATGIRDRKKKVYSLFYRLDITPINGSDDSKSSGQ





YILINCNTSTITQACPKVSFEPIPIHYCAPAGFAILKCKDKQF





NGTGTCNNVSPVQCTHGIKPVVSTQLLLNGSLPEGGVIIRS





ENITNNAKTIIVQLDEPVRINCTRPNNNTRKGIHIGPGGAFY





ATGEVVGNIRHAWCEVNGTAWKEALKKVVTKLKEHFKN





KTIAFQPSSGGDLEITTHSFNCRGEFFYCNTSGLFNSTWPG





NTSESNVNTTGNITLPCKIKQIINMWQRVGQAMYAPPIPGII





QCISNITGLLITRDGGNNDNNVTEIFSPGGGDMRDNWRSE





LYKYKVVEIEPLGVAPTKAKRRVVQREKRAVGIGAVFLGF





LAAAGSSMGAASMTLTVQARQLLSGIVQQQSNLLKAIEA





QQHLLKLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSG





KLICTTTVPWNTTWSNKTLTDIWDNMTWLEWDREIRQYT





GIIYNLIEESQNQQEKNEQDLLALDKWATLWSWFDISNWL





WYIRIFIIIVGGLIGLRIVFAVFSIVKKARQGYSPLSLQILGPN





PGGPDRPERIKEEGGEQGRDRSIRLASGFLALVWDDLRSL





GICSYHRLRDFVLIATRTVELLGRSSLKGLRLGWEGLKYL





WNLLQYWSQELKNSAISLLGAIAIAVAEGTDRVITVGQRIC





RAILNIPRRIRQGFERALL





A.FI.FIN
A
185
MRARGIQRNYQHLLRWGTIILGMILICSTTENLWVTVYYG


91121


VPVWKDAETTLFCASDAKAYDTEMHNVWATHACVPTDP





NPQEIYLENVTEEFNMWKNNMVEQMHTDIISLWDESLKP





CVQLTPLCVTLNCSNANANSINANATSTNATAENEKGEIK





NCSFNMTTELRDKKKKVYSLFYRLDVIPLNGTENETYTEY





RLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCRDEKF





NGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAKEEVR1RS





ENITNNVKTIIVQLVKPVNITCIRPNNNTRKSIHLGPGRAFY





ATGDIIGNIRKAHCIVNESEWNEALQQVATQLGKYFENKTI





NFTSPSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWVSNN





TGKYYSNSTRENPNITLPCRIKQIINMWQRAGQAIYAPPIE





GVIRCESNITGLLLTRDGGDKNNTSEIFRPGGGNMKDNWR





SELYKYKVVKIEPLGVAPTPARRRVVQREKRAVFGLGAVF





LGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIE





AQQHLLKLTVWGIKQLQARVLAVERYLKDQQLLGIWGCS





GKLICTTNVPWNSTWSNKSYSEIWDNMTWMQWDKEISN





YTQTIYNLIEKSQIQQEKNEQDLLALDKWTNLWTWFDISN





WLWYIKIFIMIIGGLIGLRIVFAVLSAINRVRQGYSPLSFQT





HTPNPEGLDRPGRIEEEGGEQGRGRSIRLVSGFLALAWDD





LRSLCLFSYHRLKDFILIAARTVELLGHSSLKGLRLGWEGL





KYLGNLLLYWGRELRISASDLLDTIAIAVAGWTDRVIEIGQ





RIGRAILNIPRRTRQGLERALV





A.KE.07.
A
186
MRVMGIQRNWQHLLSWGTMVLGMIIICSAVDNLWVTVY


21020


YGVPVWKDAETTLFCASDAKAYETEQHNVWATHACVPT





DPNPQEMPLKNVTEQFNMWKNNMVEQMHTDIISLWDQS





LKPCVELTPLCVTLNCSTITLNNTDTHSTNNTDIDAMRNCS





FNVTTELRDKTKRVYSLFYRLDIVQIDGNNSNGSGEYRLIN





CNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNEKEFNGTG





PCKNVATVQCTHGIKPVVSTQLLLNGSLAEGKVMIRSENI





TDNSKNIIVQFNKTVTINCTRPNNKTKIRSIHIGPGRVFQAT





QTGDMRKAFCTVNKTEWDKTLEEVAKQLRAHFNNKTIIF





KSSSGGDIEITTHSFNCGGEFFYCNTSGLFNSTWEANSTAN





TSTTNITLPCKIKQLVRMWQRTGQALYAPPIEGVIKCVSNI





TGLILTRDGGGNNSATNETFRPGGGDMRDNWRSELYKYK





VVQIEPLGVAPTEARRRVVKREKRAVPGLFFLGFLAAAGS





TMGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLKL





TVWGIKQLQARVLTVERYLRDQQLLGIWGCSGKLICTTN





VPWNSSWSNKSQTEIWDNMTWLQWDKEISNYTQQIYKLL





EESQNQQEQNEQDLLALDKWASLWNWFEISNWLWYIKIF





IMIVGGLIGLRIVFAVLSIINRVRQGYSPLSFQTQFPNPAGP





GRPGRIEEESGEQDRDRSIRLVSGFLALAWDDLRSLCLFSY





HRLRDFILIVARTVELLGHSSLKGLRLGWEGLKYLGNLLIY





WGRELKLSAINLFDTIAIVVAGWTDRVIELGQRIVRGFLHI





PRRIRQGLERALL





A.ZA.07.
A
187
MKVRGIQRNWPQWWIWGILGFWVIMMCKGVGNLWVTV


SO


YYGVPVWREANTTLFCASDAKAYDTEVHNVWATHACVP





TDPNPQEIVLENVTENFNMWKNDMVDQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTEATFDNNVTDSMKKEMKKCSFN





TTTEIRDRKQQAHALFYKLDLVPLNGSSSGYGNYILINCNT





STITQACPKVTFEPIPIHYCAPAGFAILKCKDEAFNGTGPCN





NVSTVQCTHGIKPVVSTQLLLNGSLAKGGIRIRSKNITNNA





QTIIVQLSKPVKINCTRYNNNTRESIRIGPGQTFYTNKVIGD





IRQAHCNISETEWNNTLQEVAKQLVVHFPNKTITFAPAAG





GDLEITTHSFNCGGEFFYCNTSGLFNDTFMPNNTANSTKN





ETQEKNITISCKIKQIIRIWQRVGQAMYAPPIEGLIKCESNIT





GILLTRDGGNNTNSTEVFRPGGGNMKDNWRSELYNYKVV





KIEPLGVAPTGARRRVVEREKRAVVGLGAVFLGFLGAAG





STMGAASLTLTVQARQLLAGIVQQQSNLLQAIEAQQHML





KLTVWGIKQLQARVLALERYLQDQQLLGIWGCSGKLICTT





TVPWNSSWSNKSQEEIWKNMTWLQWDKEISKYTGIIYTL





LQESQSQQEKNEQDLLALDKWASLWNWFDISNWLWYIKI





FIMIVGGLIGLRIVFAVISVVNRVRKGYSPLSFQTLTPNQEG





LDRLGRIEEEGGEQGRGRSIRLVSGFFSLAWDDLRSLCLFS





YHRLRDLLLIAARTVEHLGSSLLKGLRLAREGLNQLWNLL





LYWGAEIKNSAISLLDTTAIAVANWTDRVIELGQGIGRAIL





HIPRRIRQGFETALL





A.TZ.05.
A
188
MRVKGTQMNLQHLLKWGTMILGLIIICSAAENLWVTVYY


6592


GVPVWKDAETTLFCASDAKTYDKEVHNVWATHACVPTD





PNPQEINLENVTEDFNMWKNNMVDQMHADIISLWDQAL





KPCVKLTPLCVTLNCSDSVTTANNSIQNEMKGEIKNCSFN





VTTELRDKRQKVYSLFYRLDIVQIDNNNRNSTQYRLINCN





TSAITQACPKVSFDPIPIHYCAPAGFAILKCKDENFTGVGPC





KNVSTVQCTHGIKPVVSTQLLLNGSLAEKKVMIRSENISN





NAKNILVQLTTPVNITCIRPGNNTRKSVHIAPGQAFYATGD





IIGDIRQAHCNVNRTQWNNTLKEVANQLRTYFGNETAIIFA





SSSGGDLEITTHSFNCRGEFFYCNTSGLFHSTWFNSTWFNS





TWEDNQANSINDTIVLPCRIKQIINMWQRAGQAIYAPPIPG





EIRCESNITGLMLTRDGGNNNNNRTNETKNETFRPGGGDM





RDNWRSELYKYKVVKIEPLGLAPTGAKRRVVEREKRAVG





IGAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSN





LLKAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQQLL





GIWGCSGKLICTTTVPWNSSWSNRSQSEIWGNMTWQQWE





KEISNYTEEIYRLIENSQNQQEKNEQDLLALDKWASLWN





WFDISNWLWYIRLFVIIVGGLIGLRIVLAVFSIINRVRQGYS





PLSFQIHTPNPGGLDRPGRIEEEDGEQGKSRSIRLVSGFLAL





AWDDLRSLCLFSYHRLRDFILIATRTVELLGHSSLKGLRLG





WEGLKYLWNLLACWGRELKIRAINLLDTLAIATANWTDR





VIELLQALGRAFLNIPRRIRQGAERAWL





A.UA.AF
A
189
MKARGMQRNYQHLWRWGTMLFWMIIMCKAAENLWVT


082486


VYYGVPVWRDAETTLFCASDAKAYDKEVHNVWATHAC





VPTDPNPQEISLENVTEKFDMWKNNMVEQMQTDIISLWD





QSLKPCVKLTPLCVTLNCAEPKSTSSNNSSVNSNSSDSLFE





EMKNCSFNMTTELRDKRKTVHSLFYKLDIVSTGNGSGQY





RLINCNTSAMTQACPKVTFEPIPIYYCAPAGFAILKCKDTN





FTGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEKEVMI





RSENITDNSKIIIVQLTEPVNITCIRPGNNTRTSIRIGPGQTFY





ATGDVIGDIRKAYCNVSRAAWNSTLQKISTQLRRYFNNKT





IIFKNSPGGDLEVTTHSFNCGGEFFYCNTTDLFNSTWDEHG





TVTNSTMANGTITLPCRIKQIINMWQRVGQAMYAPPIKGSI





RCESNITGLLLTRDGGSGTNSSNETFRPIGGNMRDNWRSE





LYKYKVVKIEPIGLARTRAKRRVVEREKRAIGLGAAFLGF





LGAAGSTMGAASMTLTVQARQLLSGIVQQQSNLLRAIKA





QQHLLKLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSG





KLICTTNVPWNSNWSNKSQSEIWDNMTWMQWDREVIYY





TDHYDLIEKSQNPQEKNEQDLLALDKWASLWSWFDISNW





LWYIKIFIIIVGGLIGLRIVFAVLSIINRARQGYSPLSLQTLTP





HPEGPDRPGRIKEEGGEQGRDRSIRLVSGFLALAWDDLRS





LCLFSYRRLRDFISITARTVELLGRSSLKGLRLGWEGLKYL





GNLLGYWGQELKSSAINLIDTIAIAVAGWTDRVIEIGQRFC





RAIRNIPRIRQGAEKALQ





A.KE.Q1
A
190
MKVRGIKRNLWKWGTMLLGMLMTYSVAEQLWVTVYYG


68.AF40


VPVWKDAETTLFCASDAKAYSTEKHNIWATHACVPTDPN


7149


PQEIHLESVTEEFNMWKNNMVEQMHTDIISLWDQSLRPCV





KLTPLCVTLNCTNVNNNTTNVNNNTGWDEERKNCSFNVT





TELRDKRQKVYSLFYKLDVVQIDNSSYRLINCNTSAITQAC





PKVTFEPIPIHYCAPAGFAILKCKDEKFNGTGPCKNVSTVQ





CTHGIKPVVSTQLLLNGSLAEKEVMIRSENFTNSAKNILVQ





FKEPVKINCTRPDNNTRTSIRIGPGQAFYATGIIGDIRQAYC





TVNGSEWNKALQKVVEQLRSSFENKTIIFANSSGGDLEITT





HSFNCGGEFFYCNTSGLFDSTWNDTDSRQENGTITLPCRIK





QIINMWQRTGQAIYAPPIQGAIRCVSNITGLILTRDGGNNN





STNETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTKA





RRRVVGREKRAVGIGAVFLGFLGAAGSTMGAASITLTVQ





ARQLLSGIVQQQSNLLKAIEAQQHLLRLTVWGIKQLQARV





LAVERYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQ





SEIWENMTWLQWEKEISNYTQHYTLIEESQNQQEKNEQDL





LALDKWASLWNWFDISKWLWHIRIFIMIVGGLIGLRIVFA





VLSVVNRVRQGYSPLSFQTLLPAPRGPDRPDGIEEEGGEQ





GRGRSRQLVNGFSTLIWDDLRNLCLFSYHRLRDLILIAARI





VELLGRRGWEAIKYLWNLLQYWIQELKNSAISLLNTTAIA





VAEGTDRAIEIIQRAITAVLNIPTRIRQGFERALL





A.SN.01.
A
191
TRVMGIQRNCTPWWRWGIIFLGLIIICKATEKLWVTVYYG


DDJ369.


VPVWKDAETTLFCASDAKAYEREVHNVWATHACVPTDP


AY52163


NPQEIDLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKP


1


CVKLTPLCVTLNCHNVTKTNSNSNSSVKNEMPGEIKNCSY





NVTTAIRDKRQNVYSLFYRVDVVPINENSNSSENSSEYRLI





NCNTSAITQACPKVTFEPIPIHYCAPAGFAILKCKDKDFNG





TGTCRNVSTVQCTHGIKPVVTTQLLLNGSLAEGKVMIRSE





NLTNNAKTIIVQLDEPVYINCTRPNNNTRKSVHIGPGQAFY





ATGDIIGDIRQAHCNVSREDWNKTLYKVVEQLRKHFNNR





TINFTAPSGGDLEITTHSFNCGGEFFYCNTSGLFNSTWYNS





TWNNTDSTQKSNGTEKITLPCRIKQVIKMWQRVGQAMYA





PPIQGVIKCVSNITGILLTRDGGNLNSSFETFRPGGGNMKD





NWRSELYKYKVVKIEPLGVAPTKAKRRVVEREKRAVGLG





AVFLGFLGAAGSTMGAASITLTAQARQLLSGIVQQQSNLL





RAIEAQQQMLKLTVWGIKQLQARVLAVERYLKDQQLLGI





WGCSGKLICTTNVPWNSSWSNKSYSEIWDNMTWLQWDK





EISNYTDLIYNLIEESQNQQEKNEQDLLALDKWASLWNWF





NISNWLWYIRIFIMIVGGLIGLRIVFTVLSIINRVRKGYSPLS





FQTHSPSPGGLDRPRGIEEEGGEQDRDRSIRLVSGFLALAW





NDLRSLCLFSYHRLRDFTLIAARTVELLGHSSLKGLRLGW





EGLQYLGNLLLYWGRELKNSATSLLDAIAITVAGWTDRVI





EVGQRAGRALLHIPRRIRQGLERALL





A.UG.07.
A
192
MRARGTQKNCQDLWWIWGTMILGMLIICSAAGNLWVTV


191955


YYGVPVWKDAETTLFCASDAKAYSTEKHNIWATHACVPT





DPNPQEIDLENVTEKFNMWKNGMVEQMHTDIISIWDQSL





QPCVKLTPLCVTLICIDIANDTSRNVTVEMKREIQNCSYNV





TTEVRDKKQKVYSLFYKVDIVQISNNSSEYRLINCNTSVIK





QACPKVTFEPIPIHYCAPAGYAILKCKDDKFNGTGLCKNV





STVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENINNNAKTII





VQLYQPVKINCTRPGNNTRKSVHIGTGQAYFATGDIIGDIR





QAYCKVNKSEWNNTLQQVVAQLRKHWNKTIIFNSSSGGD





LEITTHRFNCGGEFFYCNTSSLFNSTWWSNNSNNTDSEESN





SMGLNDTITLQCRIKQIINMWQRVGQAMYAPPIQGVIRCE





SNITGILLTRDGGDNNNESETFRPVGGDMRDNWRSELYKY





KVVKIEPLGVAPTEARRRVVGRGKRAVVGMGAMFLGFL





GAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQ





HLLKLTVWGIKQLQARILAVENYLKDQQLLGIWGCSGRLI





CTTNVKWNSTWSNKSKSEIWDKMTWLEWDREISNYTDII





YQLIEESQNQQEKNEQDLLALDKWANLWNWFDISNWLW





YIKIFIMIVGGLIGLRIVFAVLSVINRVRQGYSPLSFQTHTPS





PRDLDRPGRIEEEGGEQGRDRSIRLVSGFLALAWDDLRSL





CIFSYHRLRDLLLIAARTVELLGQSSLKGLRLGREKLKYLG





NLLLYWGRELRISIINLYDTIAIAVAGWTDRIIEIGQRIGRAI





LNIPRRIRQGFERALL





A.GM.00
A
193
MRVMGIQKNYALLWRWGTIIFWIMIICNADNNLWVTVYY


.N00690


GVPVWKDAETTLFCASDAKAYDQEVHNVWATHACVPTD


9.HQ385


PNPQEIDLENVTEEFNMWKNNMVEQMHTDIISLWDQSLK


444


PCVKLTPLCVTLNCNNITGNATYDMQREISNCSFNITTEIR





DKKQKVNSLFYRLDIVPIGDNKSSEYRLINCNTSALTQACP





KVTFEPIPIHYCAPAGFAILKCKDKKFNGTGPCKNVSTVQC





THGIKPVVSTQLLLNGSLAEEGVMIRSENFTNNAKTIIVQL





QEPVKINCTRPNNNTRKGVHIGPGQAFYATGGIIGNISKAR





QYVNTTEWIKALKEVNKTLNQQFVNKTLYFNESSGGDLEI





TTHSFNCRGEFFYCNTSDLFNSTWLNSTVESNETIILPCKIK





QIINMWQRVGQAMYAPPIQGVINCVSNITGILLVRDGGNN





NSTAETFRPGGGDMKDNWRSELYKYKVVKIEPLGVAPTR





AKRRVVEREKRAVGLGAVFFGFLGAAGSTMGAASITLTV





QARQLLSGIVQQQSNLLKAIEAQQHLLKLTVWGIKQLQAR





VLAVERYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKS





QDEIWDNMTWLQWDREISNYTSTIYNLLEEAQNQQEKNE





QDLLALDQWASLWNWFNITKWLWYIKIFIMIVGGLIGLRI





VFAVLTLINRVRQGYSPLSFQTLTHHQREPDRPERIEEGGG





EQDRDRSVRLVSGFLALAWDDLRSLCLFSYHRLRDFVLIV





VRTVELLGHSSLKGLRLGWEALKYLGNLLSYWGQEIKNS





AINLLNTTAIAVANWTDRVIEVGQRAGRALLHIPRRIRQGF





ERALL





A.TZ.06.
A
194
MRVKGIQRNSQHLLRWGTIILGLIIISSAAENLWVTVYYGV


3504


PVWRDADTSLFCASDAKTYKTEVHNVWATHACVPTDPN





PQEVYLKNVTENFNMWKNNMVEQMHADIISLWDQSLQP





CVKLTPLCVTLDCSYNVTVRPNNTAVNYTTFMKEEMKNC





SYNMTTEIRDKKKKVYSLFYRLDVVQIDEIDENNSNNSNN





TNNSLYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILK





CRDKNFNGTGPCRNVSSAQCTHGIRPVVSTQLLLNGSLAE





GKEVMIRSENITSNTKNIIVQLTKPVQINCTRPSNNTRTSVH





IGPGQAFYTTGEIIGDIRRAHCNVSKAKWNETLGKIAIQLE





KHFKNKTIKFNPSSGGDMEITTHMFNCGGEFFYCNTTNLF





NSNWTIWNGSAISNGSATANTESNDTVITLQCRIKQIINMW





QRAGQAIYAPPIKGVIKCNSNITGLILTRDGGDNNSNIETFR





PGGGDMRDNWRSELYKYKVVKIEPLGVAPTRARRRVVQ





REKRAAGLGAVFIGFLGAAGSTMGAASITLTVQARQLLSG





IVQQQSNLLRAIEAQQHLLKLTVWGIKQLQARVLAVERYL





KDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQNEIWDNM





TWLEWDKEISNYTRIIYDLIEDSQNQQEKNEQDLLALDKW





ASLWNWFDISKWLWYIKIFIIIVGGLIGLRIVCTVLSVINRV





RQGYSPLSFQTLAPSPGDRDRPGRIEEEGGEQGRGRSIRLV





SGFLALAWDDLRSLCLFSYHRLRDFILIAARTVELLGHNSL





KGLRLGWEGLKYLWNLLAYWGRELKISAINLLNTIAIAVA





GWTDRVIEIGQRLCRAFINIPRRIRQGFERALL





A.SN.96.
A
195
MRVMGIQRNYLPLWRWGMIFLGMMIICKATENLWVTVY


DDJ360.


YGVPVWKDAETTLFCASDAKAYDQEVHNVWATHACVPT


AY52163


DPNPQELDLENVTEEFNMWKNNMVEQMHTDIISLWDQSL


0


KPCVKLTPLCVTLNCSNINSTRYDTKDKEEMKNCSFNITTE





LRDKRQSVYSLFYKLDVVPINNNSSNNSSNEYRLINCNTSA





ITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCRN





VSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENLTNNA





KTIIVQLVTPVQINCTRPNNNTRTSVRIGPGQAFYATGDIIG





KIRQAHCTVRRAQWNTTLQQVADQLRKHFNKTINFTRPS





GGDLEITTHSFNCGGEFFYCNTSGLFNSTWNSTTIKQNLTN





NDTGVINLPCRIRQIVHMWQRVGQAMYAPPIKGVINCTSN





ITGLLLTRDGGDNNSTDEKFRPGGGDMRDNWRSELYKYK





VVKIEPLGVAPTRAKRRVVQREKRAVGLGAVFFGFLGAA





GSTMGAASITLTAQARQLLSGIVQQQSNLLRAIEAQQHLL





KLTVWGIKQLQARVLAVERYLRDQQLLGLWGCSGKLICT





TAVPWNSSWSNKSQNEIWENMTWLQWDKEISNYTDIIYN





LIEESQNQQEKNEQDLLALDKWANLWNWFDISKWLWYI





KIFIMIVGGLIGLRIVFAVLSIINRVRQGYSPLSFQTHTPNPG





GLDRPGRIEEEGGEQDRDRSIRLVSGFLALVWDDLRSLCLF





SYHRLRDFTLIAARTVELLGQGLKYLWNLLLYWGRELKS





SAISLLNATAIVVAEWTDRVIEVGQRAGRALLHIPRRIRQG





LERALL





A.KE.00.
A
196
MRVMGTQRNWQPLLRWGTMILGMLMICSAADNLWVTV


00KE


YYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVP





TDPSPQEIPLKNVTEKFNMWKNNMVEQMHTDIISLWDQS





LKPCVQLTPLCVTLNCSDVTTNNTKVNTTSAPTTTATVNN





DNDMKNCSYNITTELRDKKQKRYSLFYRLDVELINKNNS





NSTEYVLINCKTSTLTQACPKVSFEPIPKHYCAPAGFAILKC





KDKEFNGTGLCKNVSTVQCTHGIKPVVSTQLLLNGSLVEK





RVMIRSANITDNTKNIIVQLKEPVEINCTRPNNNTRKGVHI





GLGRRFYTTQVVGDIRQAYCNVSKSKWNDTLKKVVYQL





RKYFNKTIIFNSSSGGDVEITTHMFTCGGEFFYCNTSGLFNS





TWPVNASEQESNSTESDGIITLRCRIKQIINMWQRTGQAM





YAPPIQGIIKCVSNITGLLLTRDGGNSTVNETFRPGGGDMR





DNWRSELYKYKVVKIEPLGVAPTKARRRVVGREKRAVGI





GAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQNNL





LRAIEAQQQLLKLTVWGIKQLQARVLAVERYLRDQQLLGI





WGCSEKLICTTNAPWNSSWSNKSQSEIWDNMTWLQWDR





EINNYTDIIYRLLEDSQNQQEKNEQDLLALDKWSSLWTWF





DLSNWLWYIKIFIMIVGGLIGLRIVFAVLAVINRVRQGYSP





LSFQTLTPNPGELDRPGRIEGEGGEQDRGRSIRLVSGFLAL





AWDDLRSLCLFSYHHLRDFILIAARIVGLLGRRGWEGLKY





LWNLLIYWGQELKTSAVSLLDTIAIVVAGWTDRVLEIGQR





IGRAFINIPRRIRQGFERALL





A.U455
A
197
MRVMGIQRNYPCLWRWGTMILGLIIICNAQQLWVTVYYG





VPVWKDAVTTLFCASDAKAYDAEVHNVWATHACVPTDP





NPQEIDLVNVTEEFNMWKNNMVDQMHEDIISLWDQSLKP





CVKLTPLCVTLDCHNITINNTNNNTNITDGVREEMKNCSF





NMTTELRDKKQKVYSLFYRLDIVQINKTDNNSYRLINCNT





STITQACPKVSFEPIPIHYCAPAGFAILKCKDPEFNGKGPCR





NVSTVQCTHGIKPVVSTQLLLNGSLAEREIRIRSENFTNNA





KTIIVQLVNPVKINCSRPYNTRKNIRRYSIGSGQAFYVTGKI





IGDIRQAHCNVSRRDWNRTIQQVAEQLKKKFNNKTIIFASS





SGGDIEITTHSFNCGGEFFYCNTSGLFNSIWNGSMSNDMGP





NGTITLQCRIKQIINMWQRVGQAMYAPPIQGVIRCESNITG





LLLTRDGGTNNTKNETFRPGGGDMRDNWKSELYKYKVV





KIEPLGVAPTRAKRRVVEREKRAVGLGAIFLGFLGAAGST





MGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLKLT





VWGIKQLQARVLAVERYLQDQQLLGIWGCSGKLICTTTV





PWNSSWSNKSQEDIWNNMTWLQWEKEISSYTGIIYQLIEE





SQNQQEKNELDLLALDKWANLNWFNISNWLWYIRLFVII





VGGLIGLRIVFTVLSIINRVRQGYSPLSFQTLAPIPEGLGRPG





RIEEEGGEQGKDRSIRLVSGFLAIAWDDLRNLCLFSYHRLR





DFALIVARAVELLGRSSLKGLRLGWEGLKYLWNLLLYWG





RELKISAITLLDAVAVAVAGWIDRVIEIGQTIGRAILNIPRRI





RQGLERALL





A.SE.SE
A
198
MRVMGTQRNCQHLLNWGIMILGMIIICSTAENLWVTVYY


8131


GVPVWKDAETTLFCASDAKAYEKEVHNVWATHACVPTD





PNPQELYLENVTEDFNMWKNNMVEQMHTDIISLWDQSLK





PCVQLTPLCVTLNCSNNVTANTNSTSANLTDSVKGEMRN





CSFNITTELRDKKKKVYSLFYKLDIVKINKNKSFRGKNSSG





NSSSDRYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILK





CNEDEFNGTGTCRNVSTVQCTHGIRPVVSTQLLLNGSLAK





EEVRIRSENISDNAKTIIVQFTKPVEIICTRPNNNTRKSIRIGP





GQAFYGMGDIIGDIRKAHCNVSRSKWNETLKKVAIQLRK





YWNTTIIFTNSSGGDLEITTHSFNCGGEFFYCNTSGLFNSS





WNENDTKVNYNTESNDTITLQCRIKQIINMWQRTGQATY





APPIPGVIQCRSNITGLLLTRDGGVTNNTNNETFRPGGGDM





RDNWRSELYKYKVVKLEPLGVAPTKAQRRVVKREKRAV





GLAAVFFGFLGAAGSTMGAASITLTVQARQLLSGIVQQQN





NLLRAIEAQQHMLRLTVWGIKQLQARVMAVERYLKDQQ





LLGIWGCSGKIICTTAVPWNSTWSNKSYTQIWDNMTWLQ





WDKEISNYTDIIYQLIEESQYQQEKNEKELLELDKWANLW





NWFDISNWLWYIKIFIMIVGGLIGLRIVFAVLTVIKRVRQG





YSPLSFQIHTPSPRDPDRPGRIEEEGGEQGRDRSIRLVSGFL





ALAWDDLRSLCRFSYHRLRDFISIATRIVELLGQGLKYLGN





LLLYWIRELKISAISLFDTIAIAVAGWTDRVIEIGQRIGRAIL





HIPRRIRQGFERALL





A.CD.97.
A
199
MKVRGIQRNYQHLWRWGTLIFGIIIICSAAENLWVTVYYG


97CD


VPVWKDAETTLFCASDAKAYEREVHNVWATHACVPTDP





NPQVMPLADVTEVFNMWMNDMVEQMHEDIISLWDQSLK





PCVQLTPLCVTLHCHYVINNTAHKVSVSPEMQEKVANCSF





NVTTELRDRKQKVYSLFYREDLVPIDEDKNNSNSASISTSS





NSSNCSISDSNSTCINISTYRLINCNTSTITQACPKVSFEPIPI





HYCVPAGFAILKCNEKEFNGTGPCKNVSSVQCTHGIKPVV





STQLLLNGSLAKETVIIRTENISDNGKNIIVQFVDPVRINCIR





PNNNTRTSVRIGPGQTFYTSDIIGDIRQAYCEVNKTQWNM





TLQKVADQLRKFFPNKTINFNQTSGGDLEVTTHIFNCRGEF





FYCNTSSLFNSSWPINSTTSNNSTASNETIILPCRIIQIINMW





QRLGQAMYAPPIKGLINCVSNITGLLLTRDGGDRNSSYETF





RPIGGNMKDNWRSELYKYKVVKIEPLGVAPTKARRRVVE





REKRAVGIGAFFLGILGAAGSTMGAASITLTVQARQVLSGI





VQQQSNLLKAIEAQQHLLKLTVWGIKQLQARVLAVERYL





KDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQGEIWDNM





TWMQWDREISNYTDIIYKLIEESQEQQEKNELKLLALDQW





ASLWNWFDISNWLWYIRIFIMIVGGLIGLRIVFAVLSVVNR





VRQGYSPLSFQTLTPSPEGLDRPGKIEEGGGEQDRTRSIRL





ASGFLALAWEDLRSLCLFSYHRLRDFILIAARTVELLGHSS





LKGLRLGWEGLKYLWNLLTYWCQELKTSAISLFDTLAIA





VAEWTDRIIEVGHRIGRAFLNIPRRIRQGFERALL





A.SE.UG
A
200
MRVMGTQMNWQNLWRWGTMILGIIIICSAAENLWVTVY


SE8891


YGIPVWRDAETTLFCASDAKAYDTEVHNVWATHACVPT





DPNPQEINLENVTEEFNMWKNNMVEQMHTDIISLWDQSL





KPCVKLTPLCVTLNCSSVTNSSVTNITSDMAGEIKNCSFNM





TTEIRDKRQKVHALFYRLDVVPMDNNNSLYRLINCNTSAI





TQACPKVSFEPIPIHYCAPAGFAILKCNDKEFNGTGPCKNV





SSVQCTHGIRPVVSTQLLLNGSLAETEVMIRAENITNNIKNI





IVQFNKSVEIICIRPNNNTRKSIRIGPGQAFYATGDIIGDIRQ





AYCDVNRTEWNEALQKVVNQLKTHFKNKTIIFNSSSGGD





LEITTHSFNCGGEFFYCNTSGLFNSTWNGTDSMQKLNSTG





NITLPCRIKQIINMWQRAGQAIYAPPIQGVIRCESNITGLILT





RDGGNDNNESEIFRPGGGDMRDNWRSELYKYKVVKIEPL





GVAPTKAKRRVVEREKRAIGIGAVFIGFLGAAGSTMGAAS





ITLTVQARQLLSGIVQQQSNLLMAIEAQQHLLKLTVWGIK





QLQAQVLAVERYLRDQQLLGIWGCSGKLICTTTVPWNSS





WSDVSQSEIWENMTWLQWDKEISNYTQIIYSLIEESQNQQ





EKNEQDLLALDKWANLWNWFDISKWLWYIKIFIMIVGGLI





GLRIVIAVISIINRVRQGYSPLSFQIHTPSPGGLDRPGRIEEE





GGEQDRNRSIRLVNGFLALAWDDLRSLCLFCYRRLRDFILI





VARTVELLGHSSLRGLRLGWEGLKYLKNLLSYWGRELKL





SAINLLDTIAIVIAGWTDRVIEIGQGFCRAIFPRRSKQGLKR





ALQ





A.GM.05
A
201
MRVMGTQKNCPLLWRWTLIFGLIIICSATGKLWVTVYYG


.N33456.


VPVWRDAETTLFCASDAKAYDTEVHNVWATHACVPTDP


HQ38545


SPQEVDLTNVTENFNMWKNNMVEQMHTDIISLWDQSLKP


2


CVKLTPLCVTLNCSNITSTSSNNNSSNITNASSSSSNNSSNS





TVDERNKVMTNCSFNMTTELRDKIQKVHSLFYRSDVVGI





NGNKEYRLINCNTSTITQACPKVSFEPIPIHYCAPAGFAILK





CKDKEFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAE





EGIRIRSENLTNNAKIIIVQLTKPVRINCTRPYNNTRKSIRIG





PGQSWYATGDIIGNIRQAFCNVSKTDWNTALQQVAAQLG





RHFNNKKITFKNSSGGDLEITTHSFNCRGEFFYCNTSGLFS





GSWYNGTWANDSWSAGPGTNDNITLPCKIKQIIRMWQRV





GQAMYAPPIQGEINCVSNITGLLLTRDGGNSSEEIFRPGGG





DIRDNWRSELYKYKVVKIEPLGVAPTRARRRVVGREKRA





VGLGAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQ





SNLLRAIEAQHHLLKLTVWGIKQLQARVLAVERFLKDQQ





LLGIWGCSGKLICTTNVPWNSSWSNKSQEQIWNNMTWLQ





WDREIDNYTQIIYNLIEESQNQQEKNEQDLLALDKWASLW





TWFDISNWLWYIKIFIMIVGGLIGLRIVFAVLSIMNRVRQG





YSPLSFQTHTPVVPTGPDRPGRIEEESGEQDRNRSIRLVSGF





LALAWDDLRSLCLFSYRHLRDFILIAARAVELLGHSSLKGL





RLGWEGLKYLWNLLVYWGRELKNSAISLLDTVAVAVAG





WTDRVIEIGQRVGKAIIHIPRRIRQGLERALL





A.CM.99
A
202
MRVMGTQRNYPPLWKLGIMVLGMIIICCNAADNLWATV


.99CM


YYGVPVWKDAETTLFCASDAKAYKTEMHNVWATHACV





PTDPNPQEIKLDNVTEEFNMWKNNMVEQMHADIISLWDQ





SLKPCVQLTPLCVTLNCSDVKVNTTKNNGTNSSNVLVING





TENGQEELKNCSYNVTTELRDKKQRVYSLFYRLDIVPINE





DNKSSDNSSGEYRLINCNTSAITQACPKVSFEPIPIHFCAPA





GFAILKCRDKNYNGTGLCKNVSTVQCTHGIKPVVSTQLLL





NGSLAEGEIKVRSENFTNNAKTIIVQLDQPVIINCTRPNNNT





RRSVRIGPGQAYYATGEIIGDIRKAYCTVNKTAWNETKHK





VMEKLRGIYHRPIKFNSSSGGDLEITTHMFNCGGEFFYCNT





SGLFAQELKNDTNDNNTIILLCKIKQIINMWQRVGQAMYA





PPRAGVIRCVSNITGLLLTRDGGKNDNGTNGTEIFRPGGGD





MRDNWRSELYKYKVVRIEPVGVAPTRAKRRVVEREKRA





VGLGAVFLGFLGAAGSTMGAASLTLTVQARQLLSGIVQQ





QSNLLRAIEAQQHMLRLTVWGIKQLQARVLAVERYLRDQ





QLLGIWGCSGKLICTTNVPWNSSWSNKSYEDIWDNMTWL





QWDKEISNYTALIYDLIEESQNQQEKNEKDLLELDKWASL





WNWFSISQWLWYIKIFIMVVGGLIGLRIIFAVLSLVNRVRQ





GYSPLSFQTHPPAPVGLDRPGGIEEEGGERGRDRSIRLVSG





FLALAWDDLRNLCLFCYHRLRDFILIAARTVELLGRNSLK





GLRLGWEGLKYLWNLLAYWGRELKISAINLLDTIAIAVGG





WTDSLIELTQRIGRAILHIPRRIRQGFERALL





A.KE.09.
A
203
MRVMGIQMNYQHLLTWGTIILGMILICSPAENLWVTVYY


1215180


GVPVWREADTTLFCASDAKAYETEKHNVWATHACVPTD


2.HQ540


PNPQEIHLANVTENFDMWKNNMVEQMHTDIISLWDQSLQ


689


PCVKLTPLCVTMNCSEPKFNNTNFNSTGSNSTGIQEEMRN





CSFNMTTELKDKKKEVYSLFYRLDIVQIDSKKGNSSDYRLI





NCNTSAVKQACPKVSFNPIPIHYCAPAGFAILKCRDEDFNG





TGPCKNVSTVQCTHGIMPVVSTQLLLNGSLAKENVQIRSE





NISNNAKIILVQLAHPVRINCTRPGNNTRKSIHMGPGQAFY





ARGDVIGDIRQAYCNVSSSEWSNTLYKVAEQLRKHYGNE





TTIKFTNHSGGDLEVTTHSFNCGGEFFYCNTTNLFNSSIPFN





ASERANNTNSTDDIITLQCRIKQIVRMWQRVGQAMYAPPI





PGVIRCESNITGLMLTWDGGSKNITEGNRTETFRPGGGDM





RDNWRSELYKYKVVKIEPLGVAPTKARRRVVGREKRAVG





IGAVFLGFLGAAGSTMGAASVTLTVQARQLLSGIVQQQSN





LLRAIEAQQQLLKLTVWGIKQLQARVLALERYLKDQQLL





GIWGCSGRLICTTNVPWNSSWSNKSYNEIWDNMTWLQW





DREIENYTQIIYGLIEESQNQQEKNEQDLLSLNKWADLWS





WFNITNWLWYIKIFIMIVGGLIGLRIVFAVLSVINRVRQGY





SPLSFQTHLPNPGGLDRPERIEEEDGEQGRTRSIRLVSGFLA





LAWDDLRSLCLFSYHRLRDFILIAARTVELLGHSSLKGLRL





GWEGLKYLWNLLVYWGRELKTSAISLVDTIAIVVAGWTD





RAIEIGQRIGRAILHIPIRIRQGLERALL





B.US.85
B
204
MRVKGIRKNYQHLWRWGTMLLGMLMICSAAEPEQLWVT


SFMHS2


VYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACV


1


PTDPSPQEVALENVTENFNMWKNNMVEQMHEDIISLWEQ





SLKPCVKLTPLCVTLNCTDELNNVTSRIGEKMKGEIKNCSF





NITTSIRDKVKKYALFYKLDVVPIDNDNTTTNTTANYRLIN





CNTSVITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGK





GLCTNVSTVQCTHGIRPVVSTQLLLNGSLAGEEVVIRSDNF





TNNAKTIIVQLKEAVEINCTRPNNNTRKSIHIQPGRAFYTT





GDIIGDIRQAHCNLSKANWNNTLKQIVIKLRQQFNNKTIVF





NHSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWPFNSTL





NDTSTEGNATITLPCRIKQVVNTWQEVGKAMYAPPIRGQI





RCSSNITGLLLTRDGGTSENGTNEIEIFRPGGGDMRDNWRS





ELYKYKVVKIEPLGVAPTKAKRRVVQREKRAVGTIGAMF





LGFLGAAGSTMGAASITLTVQARLLLSGIVQQQNNLLRAI





EAQQHLLQLTVWGIKQLQARVLAVERYLRDQQLLGIWGC





SGKLICTTAVPWNTSWSNKSLDKIWNNMTWMEWEREID





NYTSLIYTLIEESQNQQEKNEQELLELDKWASLWNWFDIT





KWLWYIKIFIMIVGGLVGLRIVSTVLSIANRVRQGYSPLSF





QTRLPAPRGPDRPEGIEEEGGERDRDRSGRLVNGLLALIW





DDLRSLCLFSYHRLRDLLLIAARIVELLGRRGWGILKYWW





SLLQYWSQELKNSAVSLLNATAIVVAEGTDRVIEVVQRIY





RAFLHIPRRIRPGLERALL





B.US.00.
B
205
MKVMGTKKNYQHLWRWGIMLLGMLMMSSAAEQLWVT


WITO


VYYGVPVWREANTTLFCASDAKAYDTEVHNVWATHACV





PTDPNPQEVVMGNVTEDFNMWKNNMVEQMHEDIISLWD





QSLKPCVKLTPLCVTLHCTNVTISSTNGSTANVTMREEMK





NCSFNTTTVIRDKIQKEYALFYKLDIVPIEGKNTNTSYRLIN





CNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGK





GPCRNVSTVQCTHGIKPVVSTQLLLNGSLAEEDIIIRSENFT





NNGKNIIVQLKEPVKINCTRPGNNTRRSINIGPGRAFYATG





AIIGDIRKAHCNISTEQWNNTLTQIVDKLREQFGNKTIIFNQ





SSGGDPEVVMHTFNCGGEFFYCNSTQLFNSTWFNNGTST





WNSTADNITLPCRIKQVINMWQEVGKAMYAPPIRGQIDCS





SNITGLILTRDGGSNSSQNETFRPGGGNMKDNWRSELYKY





KVVKIEPLGIAPTRAKRRVVQREKRAVTLGAVFLGFLGAA





GSTMGAASLTLTVQARLLLSGIVQQQSNLLRAIEAQQHML





QLTVWGIKQLQARVLAIERYLKDQQLLGIWGCSGKLICTT





TVPWNTSWSNKSYDYIWNNMTWMQWEREIDNYTGFIYT





LIEESQNQQEKNELELLELDKWASLWNWFNITNWLWYIK





LFIMIIGGLVGLRIVCAVLSIVNRVRQGYSPLSFQTRLPNPR





GPDRPEETEGEGGERDRDRSARLVNGFLAIIWDDLRSLCLF





SYHRLRDLLLIVARVVEILGRRGWEILKYWWNLLKYWSQ





ELKNSAVSLLNVTAIAVAEGTDRVIEIVQRAVRAILHIPTRI





RQGFERALL





B.CN.RL
B
206
MRVTGIRKNYQHLWRWGTMLLGMLMICNAAENLWVTV


42


YYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVP





TDPNPQEVVLGNVTENFNMWKNNMVEQMHEDIISLWDQ





SLKPCVKLTPLCVTLNCTNLKNATNTSSTMEGGEIKNCSF





NITTSIKTKVKDYALFYKVDVVPIGNDSTSYRLINCNTSVIT





QACPKVSFEPIPIHYCTPAGFAIIKCNNKKFNGTGPCTNVST





VQCTHGIRPVVSTQLLLNGSLAEEEVVIKFSNFTDNARVII





VQLNESVEIKCIRPNNNTRKSIHLGPGKAWYTTGQIIGDIR





QAHCNLSSTKWNNTLKQITKKLREQFGNKTIVFNQSSGGD





PEIVMHSFNCGGEFFYCNTSQLFNSTWNDTGTWNDTTGN





STITLPCRIKQIVNMWQEVGKAMYAPPIEGQIRCSSNITGLL





LTRDGGNNESKPTETFRPGGGDMRDNWRSELYKYKVVKI





EPLGVAPTKARRRVVQREKRAVGTIGAMFLGFLGAAGST





MGAASITLTVQARQLLSGIVQQQRNLLRAIEAQQHLLQLT





VWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLICTTAV





PWNASWSNKSLHEIWNNMTWMEWEREIDNYTREIYTLIE





ESQNQQEKNELELLELDKWASLWNWFDITKWLWYIKIFI





MIVGGLVGLRIVFAVLSIVNRVRQGYSPLSLQTRFPAQRGP





GRPEGIEEEGGERDRDRSERLVTGFLSLIWEDLRSLCLFSY





HRLRDLLLIVARIVELLGRRGWEVLRYWWNLLQYWIQEL





KNSAIGLLNATAIAVAEGTDRVIEVVQRAYRAILHIPTRIR





QGLERALL





B.CN.06.
B
207
MTVMGIRKNCQHWLTWGTMLLGMLIICSAEDLWVTVYY


CNHLJS


GVPVWKEASTTLFCASEAKAYSTEVHNVWATHACVPTDP


M06059.


NPQEVVLGNVTENFNMWKNNMVEQMQEDVISLWDESLK


EU13179


PCAKLTPLCVTLNCTDWKSNTTNTSSTTNATQSSWENMN


9


GEIKNCSFNVTTQIRNKVRKEQALFNSLDVVPIDNNNSMY





MLTNCNTSVITQACPKISFEPIPIHYCTPAGFAIIKCNDKTFN





GTGPCTNVSTVQCVHGIRPVVSTQLLLNGSLAEEDMVIRS





ENFTNNAKTIIVQLNETVKIECIRPNNNTRKSIQLVGGRSLY





ATGDIIGDIRQAHCNVSAQNWTETLKRVAIKLRDQFEHKN





KTIFLRPPSGGDPEIVMLTLNCGGEFFYCNTTKLFNHNLTE





NNTWQENASESTITLPCRIKQIINRWQEVGRAMYAPPISGQ





IKCSSNITGLLLTRDGGKTNGSENGSEIFRPGGGDMRDNW





RSELYKYKVVKIEPLGIAPTKAKRRVVQREKRAVGTIGAM





FLGFLGAAGSTMGAASMTLTVQARLLLSGIVQQQNNLLR





AIEAQQHMLQLTVWGIKQLQARVLAVERYLRDQQLLGIW





GCSGKLICTTAVPWNASWSNKSTDEIWGNMTWMQWERE





IDNYTELIYTLLEKSQNQQEKNEQELLELDKWANLWSWF





DISNWLWYIRIFIMIVGGLKGLRIIFTVFSIVNRVRQGYSPLS





FQTRLPAQKGPDRPEETEEGGGERDRGRSTRLVHGFLALI





WDDLRSLCLFSYHRLIDLLSIVARIVELLGRRGWEALKYW





WNLLQYWSQELKKSAVSLLNATAIAVAEGTDRVIAVIQAI





GRAIIHIPRRIRQGAEIALL





B.SE.11.
B
208
MKVKETRKSCQHLRGILLLGMLMICSAEDQLWVTVYYGV


SE60003


PVWKEATTTLICASDAKAYDTEAHNVWATHACVPTNPNP


4.KP411


HEVVLENVTEKFNMWKNNMVEQMHEDIISLWDQSLKPC


825


VKLTPLCVTLHCTNLTNSTTTSISPTMSIQKIKNCSFNTTTS





MRDKIEKKHALFYDLDVVQIDNDKNKTNTRYRLRHCSTS





TLACAWTNIDSVPIPYCARAAFGIAIRKCKDFDVNGPCPCV





NAAQWTPGLKLKVSIPLLLIVILSEGDIDIASDTISDNAKTII





VQLNETVQISCTKPNNNTRKSIHIGPGRAFYATGDIIGDIRQ





AYCNVSGKAWNKTLGKVVIKLREQFINKTIVFRRSSGGDP





EIVLHNFTCGGEFFYWNTTSLFDSTWNTNATWDRNWNDT





SGENDIITLPCRIKQIIYMGQEVGKAMYAPPITGLISCSSNIT





GLLLIGCGGESYNDEIFTPGGGNMKDYWRSELYKYDVQEI





EPLLVAPTKAKRIVVHREKRAFRFGSMFLGFLEAPGSTMG





AASITLTVQAILLLSGILHQHSNLLKAIEAHQLLLLVTVSLI





SNLQCRIPAEQRQLKYRQLLGICCCSGKIHYAPAAPWNTS





WSNRSRDHILHNKTCRQWEKEIDNYTSLIYTLIEESQNQQE





KNELDFVELDKWASLRNWFDKSNWLWYIKIFIMIVGVLIG





LRIMFAVLSIVNRVRQGSSSLSCQTRFPAPRGPDRPEGIEED





GGEQGRDRSIRLVDGFLALIWDDLRSLCLFSYHRLRDLLLI





AARVVELLGRRGREILKYWWNLLQYWTQELKKSAISLFN





ATAIAVAEGTDRVIELLQRTGRAILHIPVRIRQGLERLLL





B.JP.00.1
B
209
MRVRGIRKNYRHLWRWGTMLLGMLMICSAADKLWVTV


17.AB42


YYGVPVWKEAPTTLFCASDAKAYKTEMHNVWATHACVP


8551


TDPNPQEVLLGNMTENFNMWENNMVEQMQEDVISLWDQ





SLKPCVELTPLCVTLNCTDYNGNNTNSTHTNSSSSEGEKLE





KGEIKNCTFNITTNIGDKQRDYAFFYKLDLEPINKDNASYR





LISCNTSVITQACPKVSFEPIPIHYCTPAGFALLKCNDKKFN





GTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSA





NFSNNGKIIIVQLNETVEINCTRPNNNTRKGIQLGQGRALY





ATGDIIGDIRQAYCTINGTKWNKTLNQIAEKLREQFGENKT





IKFQPSSGGDPEVVMHSFNCGGEFFYCNTSSLFNWSTNGS





NITNGSNITKTIPCRIKQIINMWQKVGKAMYAPPIRGQINCL





SNITGLLLTRDGGNQTEFNNTEIFRPGGGDMRDNWRSELY





KYKVVKIEPLGVAPTKARRRVVQREKRAVGTIGAMFLGF





LGAAGSTMGAASLTLTVQARLLLSGIVQQQNNLLRAIEAQ





QHLLQLTVWGIKQLQARVLAVERYLQDQQLLGIWGCSGK





LICTTNVPWNSSWSNKTLNQIWENMTWMQWEREIDNYT





REIYSLIEEAQNQQEKNEQDLLELDKWDNLWSWFSITKWL





WYIKIFIMIVGGLIGLRIVFAVLSIVNRVRQGYSPLSLQTRL





PAQRGPDRHEETGEEGGERDRDTSGRLVTGFLALIWDDLR





SLFLFSYHHLRDLLLIVARIVELLGRRGWEVLKLWWNLLQ





YWSQEIKNSAVSLLNATAIIVAEGTDRIIEIAQRTYRAILHIP





TRIRQGLERALL





B.CN.12.
B
210
MRVTGIRKNYQHLWRWGTLLLGMLMICSAAEKLWVTVY


2097


YGVPVWKDANTTLFCASDAKAYATEVHNVWATHACVPT





DPNPQEVVLGNVTENFDMWKNDMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTNVRPLNCTDGKPPNCTAVEPSTS





GNNSSATGGGEMKNCSFNVTTSIRTKKTYATFYNLDIVKI





NDDNDSTSTSYRLINCNTSVITQACPKVSFEPIPIHYCTPAG





FAILKCNNKTFNGKGPCTNVSTVQCTHGIRPVVSTQLLLN





GSLAEGEVVIRSSNFTDNAKIIIVQLNEPVVINCTRPNNNTR





KSIPLGPGQAWYTTGDIIGNIRQAHCNLSRTEWNNTLRQIT





KKLREHIGNKTITFNQSAGGDPEIVMHSFNCGGEFFYCNTS





QLFNSTWPWNDTSTGNDTTGNSTITLPCKIRQIVNRWQEV





GKAMYAPPIAGPISCSSNITGLLLTRDGGNGSADNETFRPG





GGDMRDNWRSELYKYKVVQIEPLGVAPTKAKRRVVQRE





KRAVGTIGAMFLGFLGAAGSTMGAASITLTVQARQLLSGI





VQQQRNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYL





KDQQLLGIWGCSGKLICTTAVPWNASWSNKSLSQIWDNM





TWMEWEREINNYTKEIYTLIEQSQNQQEKNELELLELDKW





ASLWNWFDITNWLWYIRIFIMIVGGLVGLRIVFAVLSIVNR





VRQGYSPLSLQTRFPTQRGPGRPEGIEEEGGEQDRDRSERL





ANGFLTLFWEDLRSLCLFSYHHLRDLLLIVARIVELLGRRG





WEALKYWWNLLQYWIQELKNRAVSLFNATAIAVAEGTD





RVIEVVQRAFRAVLNIPRRIRQGLERSLL





B.ES.04.
B
211
MRVKGIRKNYQHLWRWGTMLLGLLIICSAAERLWVTVY


357184


YGVPVWKEATVTLFCASDAKAYDTEVHNVWATHACVPT





DPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTDWRNATNTTSSSGGTIEGGEIKN





CSFNITAGIRDKVQKEYALFYKLDVVPIDNDNSSYRLISCN





TSVITQACPKVSFDPIPIHYCAPAGFAILKCNDKKFDGKGP





CTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFTD





NAKTIIVQLNESIEINCTRPNNNTRKSIHIGPGRAFYTTGEIT





GDIRQAHCNLSRAKWDNTLKQIADKLRKQLKNKTIVFNQ





SSGGDPEIVMHSFTCGGEFFYCNSTKLFNSTWNGTEGSNN





SGTRANNSHILTLPCRIKQIINMWQEVGKAMYAPPIRGQIR





CSSNITGLLLTRDGGNNNSGTEIFRPGGGDMRDNWRSELY





KYKVVKIEPLGVAPTKAKRRVVQREKRAVGIGAVFLGFL





GAAGSTMGAASMTLTVQARLLLSGIVQQQNNLLRAIEAQ





QHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGK





LICTTTVPWNTSWSNKSLAEIWNNMTWMEWEREINNYTS





LIYNLIEESQNQQEKNEQELLELDKWASLWNWFDITKWL





WYIKIFIMIIGGLVGLRIVFTVLSIVNRVRQGYSPLSFQTHLP





APRGPDRPEGIEGEDGERDRDRSGRLVGGFLALFWVDLRS





LCLFSYHRLRDLLLIVTRIVELLGRRGWEVLKYWWNLLQ





YWSQELKNSAVSLLNAAAIAVAEGTDRVIEILQRAGRAIL





HIPTRIRQGLERALL





B.US.06.
B
212
MKVMETRRSWLPLWRGGILLLGILMICSTENLWVTVYYG


7010100


VPVWKEATTTLFCASDAKAYSSEKHNIWATHACVPTDPSP


68


QEIPLENVTENFNMWKNDMADQMHEDIISLWDQSLKPCV





KLTPLCVTLNCTNLNCTSSQNHNTTCRNVSKEMEGEIKNC





SFYTTTNIRDRVTKNYALFYSLDIVPIDDNNNTNISYRLTSC





NTSIITQACPKVSFEPIPIHYCAPAGFAILKCNDQEFNGTGP





CKNVSTVQCTHGIRPVVSTQLLLNGSLAEKDIVIRSSNISD





NTKTIIVQLKEAIIIDCIRPNNNTRRSIHIGPGRAFYGTGDIIG





NIRQAYCSINISKWDETLRQIAGKLGEQLNKTKIITFKNSSG





GDPEITMFNFNCGGEFFYCNSTQLFNSIWTKNDNGIWTKN





ETNGNWTRSGSTNDNDTIILPCRIRQIINMWQEVGKAMYA





PPIRGNISCSSNITGLLLTRDGGNNNDSANETFRPGGGNMK





DNWRSELYKYKVVEIEPLGLAPTRAKRRVVQREKRAALG





ALFLGFLGAAGSTMGAASVTLTVQARLLLTGIVQQQNNL





LKAIEAQQHLLQLTVWGIKQLQARVLSIERYLKDQQLLGI





WGCSGKLICTTAVPWNTSWSNKSVDMIWHNMTWMEWE





REIDNYTGLIYKLLEASQNQQEQNEQELLELDKWASLWN





WFDITNWLWYIKIFIMIVGGLIGLRIVFTVLSIVNRVRKGYS





PLSFQTHRPAPRGPDRPEGIEEEGGERDRDTSGPLVTGFLAI





IWVDLRNLCLFSYHRLRDLLLIVARIVELLGRRGWEALKY





WWNLLQYWIQELKNSAVSLYNTIAIAVAEGTDRIIEVLHRI





WRAILNIPRRIRQGLERALL





B.CU.14.
B
213
MRVKGIRKNYPHWWRWGIMVLGMLMIWSAKEQLWVTV


14CU005


YYGVTVWKGLPVWKNATTTLFCDTELHNVRDKCACVPT


.KR9146


DPNPQEVVMGNVTEYFNMWKNDMVEQMHEDIISLWDRS


76


LKPCVIFTPLCVTLNCNDLKNVTNTNCSSGRGIVEKLDISN





CSVSISTFITGHVVIADEVMDTDDILLVSIDSINNKLTLCNT





SLMTQACRSERCKPIPIHYCAPAGFAILKCNNKTFNGKGPC





KNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSENFTDN





TKTIIVQLNESVVINCTRPSNNTRKSIRIHRGPGQAFFATGK





IGDIRQAHCNLSRAEWNKTLAQIAKKLGKQFGENKTISFQ





PSSGGDIEIVTHSFNCGGEFFYCNTTPLFNSSWYKSNWKSQ





RTKEIGGRETITLQCRIKQIVNMWQEVGRAMYAPPIKGQIR





CSSNITGVLLTRDGGNTTNHSEIFRPGGGNMKDNWRSELY





KYKVVQIEPLGIAPTKAKRRVVQREKRAIGFGAVFLGFLG





AAGSTMGAASVTLTVQARQLLSGIVQQQSNLLNAIQAQQ





HLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL





ICTTAVPWNASWSNKSLEYIWDNMTWMEWEREINNYTR





LIYHLIEESQNQQEKNEKELLELDKWASLWNWFDITKWL





WYIKIFIMIVGGLIGLRIVFVVLSIINRVRQGYSPLSFQTRHP





AQRGPDRPEGIEGEGGERDRGGSERLVDGFLAIIWVDLRS





LCLFSYHRLRDLLSILTRIVELLGRRSWEALKYCWNLLQY





WSQELKNSAVSLLNATAVAVAEGTDRIIEIVQRACRAFLHI





PRRIRQGFERLLL





B.US.06.
B
214
MRAKEMKKHWQRSWKGGILLLGMLMICSATAPDKWVT


CH106


VHYGVPVWREAKTTLFCASDAKAYSTEKHNVWATHACV





PTDPNPQEVVLGNVTENFNMWKNDMVEQMHEDIISLWD





QSLKPCVKLTPLCVTLNCIDEITNRTNNTHKDGQLMDKGE





IKNCSFNTTTSVQDKLQKVSALFYKPDVVRIGDNNNSSNS





SSYRLINCNTSIITQACPKVSFEPIPIHYCTPAGFAILKCNDK





NFNGSGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAENEVVI





KSENFTDNAKTIIVQLKEAVQINCTRPNNNTRKSIHIGPGR





AFYATGNIIGDIRKAYCTINITKWNKTLEQIVDKLRDKFGE





NKTIVFNSSAGGDPEIVTHSFNCGGEFFYCNTTQLFNSTWN





LNSTWNGTKNLNNTTRNDTIILPCRIRQIINLWQKVGKAM





YAPPIRGQINCTSNITGLLLTRDGGNTSETNTTEVFRPGGG





NMRDNWRSELYKYKVVEIEPIGIAPTKAKRRVVQREKRA





VGIGAVFLGFLGAAGSTMGAASVALTVQARQLLSGIVQQ





QNNLLRAIEAQQHMLKLTVWGIKQLQARVLAVERYLRDQ





QLLGIWGCSGKLICTTAVPWNNSWSNKSLEYIWGNMTW





MEWEREIDNYTGLIYNLIEESQLQQEKNEQELLELDKWAS





LWNWFDITNWLWYIKIFIMIVGGLVGLRIVFAVLAIANRV





RQGYSPLSLQIRPPAQRGPDRPEGIEEEGGDRDRDTSTRLV





HGFLALVWDDLRSLCLFSYHRLRDLLLIVTRIVELLGHRG





WEALKYWWNLLQYWIQELKSSTVSLFNYTAIAVAEGTDR





IIEIVGRVFRAFIHIPRRIRQGLERLLL





B.KR.93.
B
215
MRVKGIRKNYQHWWRWGIMLLGMWMICSADEMWVTV


93JHS5


YYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVP





IDPNPQEVELVNVTENFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCSNLRNETNIKNATNSSSGKVMEKG





EMKNCSFNVTTNIGNKVQKEYALFYKLDVVQIDNTSYTLI





NCNSSIITQACPKISFEPIPIHYCTPAGFAILQCNDKKFNGSG





PCSNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSENFT





NNAKTIIVQLNASVEINCTRPNNNTRKSIHLGPGSTIYATG





DIIGDIRQAHCNISETKWNNTLRQIVIKLREQFGNKTTIVFN





QSSGGDPEIVMHSFNCGGEFFYCNTAQLFNSTWIWNDTKG





LNKTAENGTFTLPCRIKQIINRWQEVGKAMYAPPIRGQIRC





SSNITGLLLTRDGGAESRNGTEIFRPGGGDMRDNWRSELY





KYKVVRIEPLGIAPTKARRRVVQREKRAVGTIGAMFLGFL





GAAGSTMGAASMTLTVQARLLLSGIVQQQNNLLRAIEAQ





QHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGK





LICTTNVPWNTSSNKSQNEIWDNMTWMQWDREINNYTNL





IYTLLEESQNQQEKNEQELLELDKWSNLWNWFSISNWLW





YIRIFIMIVGGLIGLRIIFSVLSIVNRVRQGYSPLSFQTHFPSP





RGPDRPEGIEGEGGERNRDGSGPLVNGFLTLIWVDLRSLC





LFIYRRLRDLLLIVARSVELLGLRGWEVLKYWWNLLHYW





SQELKKSAVSLLHATARAVAEGTDRAIEILQRAYRAILHIP





RRIRQGLERALL





B.US.NL
B
216
MRVKEKYQHLWRWGWKWGTMLLGILMICSATEKLWVT


43E9


VYYGVPVWKETTTTLFCASDAPAYDTEVHNVWATHACV





PTDPNPPEVVLVNVTENSHMWTNDMVEQMHEDIISLWDQ





SLKPCVKLTPLCVSLKCTDLKNDTNTNSSSGRMIMEKGEI





KNCSFNIGTSIRDKVQKEYAFFYKLDIVPIDNTSYRLISCNT





SVITQACPKVTFEPIPIHYCAPAGFAILKCNNKTFNGTGPCT





IVSTVQCTHGIRPVVSTQLLLNGSLAGEDVVIRSANFTDNA





KTIIVQLNTSVEINCTRPYNNKRKSIRIQRGPGRAFVTIGKI





GNMRQAHCHISRAKWNATLKQIASKLREQFGNNKTIIFKQ





SSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEE





SNNTEGSDTITLPCRIKQFINMWQKVGKAMYAPPISGQSR





CSSNITGLLLTRDGGNNNNGSQIFRPGGGDMRDNWRSELY





KYKVVKIDPLGVAPTKAKRRVVQREKRAVGIGALFLGFL





GAAGSTMGAASMTLTVQARQSLSGIVHQQNNLLRAIEAQ





QHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSGK





LICTTAVPWNASWSNKSLEQIWNNMTWMEWDREINNYT





SLIHSLIEESQNQQEENEQELLELDKWASLWNWFNITNWL





WYIKLFIIIVGGLVGLRIVFAVVSIVNRVRQGYSLLSFQTHL





PIPRGPDRPEGIEEEGGERDRGRSIRLVNGSLALIWDDLRSL





CLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLLQY





WSQELKNSAVNLLNATAIAVAEGKDRVIEVLQAAYRAIR





HIPRRIRQGLERILL





B.SE.11.
B
217
MIVMGIKKNYQHLWTWGMMLLGMLMICSTAPSLWVTV


SE60002


YYGVPVWKEATTTLFCASDAKAYEKEAHNIWATHACVPT


3.KP411


DPNPQEVVLENVTEDFNMWKNNMVEQMHEDIISLWDQS


824


LKPCVKLTPLCVTLNCTDPWYDNTANRTTERMIMEKGEIK





KCSIEVNTCLIDVWMEHCALLSKLHLVRLENTTFIYMQTS





RHSSKRTKGCPRYSIEPIPLQYSAPAGSAILIRKNDMVHGT





GTCTHVPPVQCSNGIRPVVSSQHPSHDISGAAVLTKDNFK





DTVRTIIVQLNDSVPINCTRPNNNTRKSIHIGPGRAFYATGD





IIGDIRQAHCNISLVKWNNTLEKIVTKLREQFKNKTIVFSPP





SGGDPEIVSHSFNCGGEFFYCNSTPLFNSTWDTNGTCKGS





NSADWNVTVPCISIPITTMGQKLEQAMDVRPIAGKAICSSS





LSKGILLANSHNSIIVPRLVMPKGKDKKFNWNRKMYQCQ





VQCQIEPLGVAPTMAKRRVVQREKRDLGLGALFLGFLGA





AGSTMGAASITLTVQARQLLSGIVQQQNNLLRAIEAQQHL





LQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLIC





TTTVPWNISWSNKTLEQIWDNMTWMQWDKEIHNYTELIY





TLIEESQIQQEKNEQELLELDKWSNLWNWFDITNWLWYIK





IFIMIVGGLIGLRIVFVVLSIINRVRKGYSPLSFQTLLPVPRG





PDRRFPIGEGDGERDRDRDSRSANGLLAIINVDMRILSLCIF





LYRRDLLLIVMMTVELLGRRGWETLIYCLNLNLYLSQNLK





HKAISIINVTATAVAVAADTVIDTVPAAGRGILLILRRIRQG





LERDLL





B.US.09.
B
218
MRVKGIRKNWQHWWWKWGIMLLGMWMICSTEEEKLW


9036


VTVYYGVPVWKEASTTLFCASDAKAHKTEVHNVWATHA





CVPIDPDPQEVRLENVTENFNMWKNDMVEQMHEDIISLW





DQSLKPCVKLTPLCVTLNCSSNVTIITNSTTANSSGKGIRDE





LEMKNCSFNVSTNTNRKGFKEEYALFYKLDLNGTDNTSY





TLINCDSSVIKQACPKVTFEPIPIHYCTPAGFAILQCNDKKF





NGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEEEIVIRSE





NLTDNAKNIIVQLNASVEINCTRPNNNTRKGIGIGPGSIFYA





TGAIIGDIRQAHCNISRAKWNNTLRQLAGELEKRFNKTTIIF





EQPLPGGDQEIVMHSFNCGGEFFYCNTSQLFNNTWNSTDK





TLNNTEGNDTITLPCRIRQIINRWQEVGKAMYAPPIAGIITC





SSNITGLILTRDGGTENGTNTTTETFRPGGGNMKDNWRSE





LYKYKVVRIEPLGIAPTRARRIVVQREKRAVGTIGAMFLGF





LGAAGSTMGAASVTLTVQARLLLSGIVQQQNNLLRAIEAQ





QRMLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSG





KLICTTNVPWNTSWSNKSMNDIWNNMTWMEWDREINNY





TGLIQTLLEESQIQQEKNEQELLELDKWTSLWNWFDITKW





LWYIRIFIMVVGGLVGLRIIFFVLSIVNRVRQGYSPLSLQTH





LPSPRGPDRPGGTEGEGGEEDRGGSGRSVNGFLALIWVDL





RSLLLFSYRHLRDLLLILARSVELLGLRRWEALKYCWNLL





QYWSQELKNSAVSLLNAIAIAVAEGTDRIIEFLQGVFRAFI





HIPRRIRQGLERALL





B.KR.04.
B
219
MRVKGIMKNYQHWWRWGMMLLGMWMICSANKSWVT


04CWS5.


VYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACV


JQ31613


PIDPNPQEVVLGNVTENFNMWKNNMVDQMHEDIISLWDQ


3


SLKPCVKITPLCVTLNCSNLKNETANSTSANSNETANSTTA





SNSTSNSSGEVMEEVDMKNCSFTVTTELKNTVSKQYALFY





KCKPYLKNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLA





EEEVVIRSENFTNNAKNIIVQLNASVEINCTRPDNTIKNIRIG





PGIGLGSTLHATKSITGRIRPHCNISEAKWNNTLKQIAIKLR





EQFRNKTIIFNHSSGGDPEIEMHSFNCGGEFFYCNTTKLFDS





TWNGTKEWNDTKESDKNITLQCRIKQIINRWQEVGQAMY





APPISGLIRCSSNITGLILTRDGGANETHTNTETFRPAGGDM





KDNWRSELYKYKVVRIEPLGIAPTKARRRVVQREKRAVGI





LGAMFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQN





NLLRAIDAQQHLLRLTVWGIKQLQARVLAVERYLKDQQL





LGIWGCSGKLICTTNVPWNTSWSNKTQEDIWTNLTWMQ





WEREIDNYTNVIYSLLEESQNQQEKNEQELLQLDKWASL





WNWFDITSWLWYIKIFIMIVGGLIGLRIIFTVLSIVNRVRQG





YSPLSFQTHFPSPRGPDRPGEIEGEGGEEDRGGSDRLVTGF





LTLVWVDLRNLCLFSYRSLRDLILIVARSVELLGLRGWEV





LKYCWSLLQYWSQELKKSAVSLLQATARAVVEGTDRVIE





TLQRAYRAILHIPRRIRQGLERALL





B.DK.04.
B
220
MRVKGIRRNWQHWWRWGTMLLGILMICSATDLWVTVY


PMVL


YGVPVWKEANTTLFCASDAKAYDTEVHNVWATHACVPT





DPSPQEVELKNVTEDFNMWKNNMVEQMHEDIISLWDQSL





KPCVKLTPLCVTLNCTDWNGTATSPNDNSYRGNKMEKGE





MKNCSFNITTGIRDKIQKEYALFYKLDMVPIDDDKTNSSPT





TTTSSTLTSSSVTNSNSNSSNTTSNSSQYRNYRMISCNTSVI





TQACPKVSFEPIPIHYCAPAGFAILKCNDKNFTGKGPCSNV





STVQCTHGIRPVVSTQLLLNGSLAEEGIVIRSDNFSNNAKNI





LVQLKEAVVINCTRPNNNTRRGIHIGPGRAFYTTGDVIGDI





RQAHCNISGTKWNNTLYQIAKKLKEQYNETNKIIFNHSSG





GDPEIVMHSFNCGGEFFYCNTAKLFNSTLNNTEWSNITAE





NNEDKIILQCRIKQIINMWQEVGKAMYAPPISGPINCSSSIT





GLILTRDGGNINNNSMRDNDTFRPGGGDMRDNWRSELYK





YKVVRIEPLGVAPTKAKRRVVQREKRAVGIGAVLLGFLG





AAGSTMGAASVMLTVQARQLLSGIVQQQNNLLRAIEAQQ





HLLQLTVWGIKQLQARVLAIERYLQDQQLLGIWGCSGRLI





CTTAVPWNASWSNRSLDQIWNNMTWMEWEREIDNYTNII





YSLIQESQNQQDKNEQELLELDKWASLWNWFSITQWLWY





IKIFIMIVGGLIGLRIVFAVLSIVNRVRQGYSPLSLQTLLPAP





RGPDRPEGTEEEGGERDRDRSGQLADGLLTLIWVDLRSLC





LFSYHRLRDLLLIVARIVELLGRRGWEALKYWWNLLQYW





GREIKNSAVSLLNATAIAVAEGTDRVLELLQRIGRGILHIPR





RIRQGLERSLL





B.US.x.C
B
221
MRVKGIRRNYQHLWWKWGIMLLGTLMIYSAAEQWWVT


R0317N.


VYYGVPVWKEANTTLFCASDAKAYDTEVHNVWATHAC


FJ469719


VPTDPSPQEINMTNVTEYFNMWKNNMVDQMHEDIINLW





DQSLKPCVELTPLCVTLECTDNITTSSNTTRGSNSSNRSRW





ENMTIETGEIKKCSFNVTTGIRDRVNTEYAYFNRFDIVQTE





RDNKSYTLRHCNTSVIKQACPKISFEPIPIYYCAPAGFAILK





CNNKTFSGKGECDNVSTVQCTHGIRPVVSTQLLLNGSLAE





EGVVIRSDNFSDNAKTIIVQLTKAVQINCTRPNNNTRKGIH





MGPGKAFFTTGQIIGDIRQAYCNISHTKWNNTLKQVVYQL





RKQYGGNKTINFTQPSGGDLEIVTHSFNCGGEFFYCNTTQ





LFNTTWYNNETVDNKTWTEDDTNLTDKDIIILPCRIKQIIN





MWQKVGKAMYAPPIQGMISCSSNITGLLLTRDGGNDTDT





DPNGTETFRPIGGDMRDNWRSELYKYKVVRIEPLGIAPSK





AKRRVVQREKRAALGAVFLGFLGAAGSTMGAASITLTVQ





ARQLLSGIVQQQSRLLRAIEAQQHMLQLTVWGIKQLQAR





VLALERYLQDQQILGIWGCSGKLICTTSVPWNASWGNTSL





GYDYIWTNWTWMKWEKEIANYTHIIYDLLEESQIQQEQN





ELELMKLDKWASLWDWLGITKWMKYIKIFIIIVVGLVGLR





IIFAVLSIVNRVRQGYSPLSFQIRPPAAREPDRPEEIEEGGGG





RDRGRSNQLVDGFLALIWVDLWSLCLFSYQRLRDLLLIVT





RIVELLGRRGWEILKYWWSLLQYWGQELKKSAVSLLNAT





AIAVAEGTDRHEAAQRVFRAIIHIPRRIRQGLERALL





B.KR.12.
B
222
MRVKGIRKNCQHWWRWGIMLLGMWMICSAATGGLWVT


12KYY1


VYYGVPVWREAITTLFCASDAKAYETEMHNVWATHACV


0


PIDPNPLEVDLPNVTENFNMWKNNMVEQMHEDIISLWDQ





SLKPCVKITPLCVVLNCSDAKNSTSTNNSTEKVMEKGEMK





NCSFNVTTNIENKVKEARAIFDKLDVVQIDDNDTNTMIGIQ





YYKCYSSSITQACPMVSFEPIPIHYCTPAGFAILQCKDKKFN





GTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEIVIRSE





NFTNNAKTIIVQLNASVEINCTRPNNNTRKSIPIGPGRAFYA





TGDIIGDIRRAHCNISKATWNNTLRQLAIKLREQLEDNTTTI





IFKNSSGGDPEIEAHSFNCREEFFYCNTTPLFNSIWYSNNTE





KSYNTAENDTITLQCRIRQIIKRWQEVGKARYAPLITCLIRC





YSNITGLLLAGDGGEAGKDNGNKNETFRPGGGDMRDNW





RSELYKYKVVEIEPLRIAPTRARRIVVQREKRAVGTLGAM





FLGFLGAAGSTMGAASLTLTVQAKQVLSGIVRQQNNLLR





AIEAQQRLLQLTVWGIKQLQARVLAVERYLQDQRLLGIW





GCSGKLICTTSVPWNTSWSNKSHDEIWNNMTWMEWDREI





NNYTQVIYDLLEESQIQQEKNEQELLKLDSWASLWKWLDI





SNWLWYIQIFMMMVGGLVGLRIILCVLSVVNRVRQGYSP





LPFQTHLPSPRGLDRPGGIEGEGGEGDRSGSSGLAGGFLTL





VWVDLRSLCLFSYHSLRDLLLIVARSVELLGLRGWEILKY





WWSLLQYWSQELKKSAVSLLNATAIAVAEGTDRVIEILQR





IYRAILHIPTRIRQGLERALL





B.GB.x.
B
223
VRGTRKNYHHYLWRWGTMLLGMLMICSAGEQLWVTVY


M26864.


YGVPVWKEATTTLFCASDAKSYDTEVHNMWATHACVPT


U36875


DPNPQEVVLGNVTENFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTDNLDNGTISNATHTTNSSSVNNT





TGNSTRKLEIDRGEIKNCSFNITTNIKDKMQKAHALFYKLD





VVPIDNSTTSYRLISCNTSVITQACPKVSFEPIPIHFCAPAGF





AILKCNNKTFNGKGPCKNVSTVQCTHGIRPVVSTQLLLNG





SLAEDEVVIRSENFTDNAKTIIVHLNESVEINCTRPNNKTK





RSFHIGPGRAFLTEIVGDIRQAYCNISRAKWNNTLHQVVN





KLREQFGNRTIVFKNSSGGDPEIVTHMFNCAGEFFYCNTT





KLFNSIWMPNGTWNSTDTSNSTEIRLPCRIKQIVNMWQEV





GKAMYAPPISGRIRCVSNITGLLLTRDGGNNNETTEIFRPG





GGDMRDNWRSELYKYKLIKIEPLGIAPTKAKRRVVQREK





RAVGIGAVLLGFLGAAGSTMGAASLTLTVQARLLLSGIVQ





QQNNLLRAIEAQQHMLQLTVWGIKQLQARILAVERYLRD





QQLLGIWGCSGKLICPTTVPWNASWSNKNLSQIWDNMTW





MEWEREIDNYTSLIYTLIEESQNQQEKNEQDLLELDKWAS





LWNWFSITNWLWYIKIFIIIVGGLIGLRMVFTVLSIVNRVR





QGYSPLSFQTRHPAPRGPDRPEGIEEEGGERDRDTSGFLVD





GFLAIIWVDLRSLCLFSYHRLRDLLLIVARIVELLGRRGWE





ALKYWWNLLQYWGKELKSSAISLFNATAIAVAEGTDRVI





EVIQRACRAILHIPRRIRQGLERALQ





B.US.13.
B
224
MRAKGIRKNCQHWWRGVTLLLGMLMICNAAQQLWVTV


IQA267.


YYGVPVWRDADTILFCASDAKGYDTEVHNVWATHACVP


PBMC.S


TDPNPQEIRLENVTENFNMWKNKMVEQMHEDIISLWDQS


GA1.1.K


LKPCVKLTPLCVTLNCTDLNKNSTQGNATAEEQEEIRNCS


R182196


FNITTSIRDRMKKDYALFYSNDIVKLDNNTFRLTSCNTSVI





TQACPKISFNPIPIHYCAPAGFAILKCKDTTFNGSGLCRNVS





SVQCTHGIKPVVSTHFLLNGSLAEKEVVIRSENFTDNAKTII





VQLKDPVRINCTRPNNNTRKGIHIGPGRAFYATGDIIGDIR





QAHCNISRTEWNKTLNQVVEKLRAQFNNKTIVFKNASGG





DPEIVMHIFNCRGEFFYCNTTQLFNSTWNANSTWNDTSSN





IGESNDTITLPCKIRQIVNMWQEVGKAMYAPPIAGQLNCSS





NITGILLTRDGGESNTTNSNETFRPGGGNMRDNWRSKLYK





YKVIKIEPLGLAPTKAKRRVVQRKARAAGLGAVIFGFLGA





AGSTMGAASVALTVQARQLLSGIVQQQNNLLRAIEAQQH





MLQLTVWGIKQLQARVLAVERYLRDQQLLGIWGCSGKLI





CTTAVPWNDSWSPRWSKENMSKIWDNMTWNQWEREID





NYTGYIYTLLEESQNQQEKNEQDLLSLDKWADLWSWFDI





TNWLWYIKIFIMIVGGLVGLRIIFSVLAIVNRVRQGYSPLSF





QTHLPARRGPEGPEEIEEGGGERDRDRSRPLVDGFLTLIWI





DLRSLCLFSYHRLRDLLLIVTRTLELLGRRGWEALKYCWN





LLQYWGQELRSSAVSLFNATAIAVAEGTDRIIEIIQRIFRAY





SFSAPTLKNDMALALG





B.US.81.
B
225
MRVKEIRKNYQHLWRWGTMLLGILMICSAAEKLWVTVY


81NJ.AY


YGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPT


247221


DPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTDLRNATNTTSSSGGTMERGEIKN





CSFNITTSIRDKVQKEYALFYKLDVVPIDNDNTTTSYRLISC





NTSVITQACPKVSFEPIPIHYCTPAGFAILKCKDKKFNGTGP





CTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFTD





NAKTIIVQLKESVEINCTRPNNNTRKSIHIGPGRAFYTTGEII





GDIRQAHCNLSRAKWDNTLKQIVRKLREQFGNKTIVFNQS





SGGDPEIVTHSFNCGGEFFYCDSTQLFNSTWNVTEGSNNT





EGNITLTLPCRIKQIINMWQEVGKAMYAPPIRGQIRCSSNIT





GLLLTRDGGNNESETEIFRPGGGDMRDNWRSELYKYKVV





KIEPLGVAPTKAKRRVVQREKRAVGIGAVFLGFLGAAGST





MGAASMTLTVQARLLLSGIVQQQNNLLRAIEAQQHLLQL





TVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLICTTT





VPWNASWSNKSLDKIWNNMTWMEWEREINNYTSLIYTLI





EESQNQQEKNEQELLELDKWASLWNWFDITKWLWYIKIFI





MIVGGLVGLRIVFAVLSIVNRVRQGYSPLSFQTRLPAPRGP





DRPEGIEEEGGERDRDRSGRLVDGFLALFWVDLRSLCLFS





YHRLRDLLLIVTRIVELLGRRGWEVLKYWWNLLQYWSQE





LKNSAVSLLNATAIAVAEGTDRVIEVLQRTCRAILHIPTRIR





QGLERALL





B.JP.x.D
B
226
MRVKEMRKHWQHLWTGGILLLGMLMICSTAQDAWVTV


R1777.A


YYGVPVWKEATTTLFCASDAKAYKTEVHNVWATHACVP


B604948


TDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDES





LKPCVKLTPLCVTLNCTDELIVTNSTNGNNTNSHSTRGND





TIGNSTSWKEMKGEIKNCSFNIPTSVKDKMQKQYALFYKL





DVVAINDDNNKNSSNYNSSKLSSSNSNCKSDNNSSCNCSS





SNNNCSSSNHSSNYSSYILISCNTSTLTQACPKVSFEPIPIHY





CTPAGFAILKCNDKRFNGTGPCKNVSTVQCTHGIRPVVST





QLLLNGSLAEEEVVIRSENISNNAKTIIVQLNESVAINCTRP





NNNTRKGIRIGPGRTFYAAEKIIGDIRKAYCIINGTKWNETL





RLIVAKLREQEQIGENTTIIFKPSSGGDPEIENHIFNCRGEFF





YCNTTQLFNSTWYSNGTWIGKNFTGSNITLPCRIKQIVNM





WQEVGKAMYAPPIRGQINCISNITGLLLTSDGGFRKTNETT





NMTETLRPGGGDMRDNWRSELYKYKVVRIEPLGIAPTQA





KRRVVQREKRAVGIIGAVFLGFLGAAGSTMGAAALTLTV





QARQLLSGIVQQQNNLLRAIEAQHQLLQLTVWGIKQLQA





RILAVERYLRDQQLLGIWGCSGKLICTTTVPWNTSWSNKS





LTEIWNNMTWMEWEREIENYTGLIYNLLEKSQNQQEKNE





QELLELDKWANLWNWFDITNWLWYIRIFIMIVGGLIGLRI





VFAVLSIVNRVRQGYSPISLQTHLPVPRGPDRPEGIEGEGG





ERDGDTSRRLVIGLLPLIWDDLRSLCLFSYHRLRDLLLIVA





RIVELLGRRGWEILKYWWNLLQYWSQELKNSAVSLLNAT





AIAVAEGTDRIIEIARTIFRAFYHIPRRIRQGFERALL





B.TH.x.3
B
227
MRVKETQMNWPNLWKWGTLILGLVIICSASDKLWVTVY


141A16.


YGVPVWRDADTTLFCASDAKAHVTEVHNVWATHACVPT


B2.KJ95


DPNPQEIHLENVTEYFNMWKNNMVEQMHEDIISLWDQGL


3175


KPCVKLTPLCVTLTCTNASLTKINTTTSDPKIGNITDEVRN





CSFNVTTEIEDKKQRVSALFYKLDIVPMEDNSNSSEYRLIN





CNTSVIKQACPKVSFDPIPIHYCTPAGYAILKCNDKNFTGT





GQCKNVSSVQCTHGIRPTVSTQLLLNGSLAEEEIMIRTESL





TNNAKTIIVHLNKSVEINCTRPSNNTRTSITIGPGRVFYRTG





DIIGDIRRAYCEINETNWYKVLKQVTGKLKEHFNKTINFGP





PSGGDLEITMHHFTCRGEFFYCNTTKLFINNCTRNETKGGC





NSTINGTIILPCRIKQIINMWQGAGQAMYAPPISGRINCVSN





ITGILLTRDGGITSNGTNGTNGTNGTETFRPGGGDIRDNWR





SELYKYKVVQIEPLGVAPTKARRRVVEREKRAVGIGAMIF





GFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIE





AQQHLLQLTVWGIKQLQARVLAVERYLKDQKFLGLWGC





SGKIICTTAVPWNSSWSNKSYDQIWHNMTWIEWEREISNY





TNQIYEILTESQNQQDRNEKDLLELDKWASLWNWFDITR





WLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTP





TLHQREPDRPERIEEGGGEQDRDRSVRLVSGFLALAWDDL





RSLCLFSYHRLRDFISIAARTVELLGHSSLKGLRLGWEGLK





YLGNLLLYWGQELKISAISLLDATAIAVAGWTDRVIEGAQ





RAWRAFLHIPRRIRQGLERALQ





B.BR.03.
B
228
MKVRVIRMSYQQLWKGGTLLLGMLMICSAVGPLWVTVY


BREPM1


YGVPVWKEANTTLFCASDAKAYDTEAHNVWATHACVPT


023.EF63


DPDPREVVLENLTEDFNMWKNDMVEQMHQDIISLWDQSL


7057


KPCVKLTPLCVTLNCSDFNGTVTNNSSGNATDNSSGSGLE





RETTGEIKNCSFNITTDMKDKVQEKYALFYKLDIVSIDESR





NGSSYRLTSCNTSVLTQACPKVSFEPIPIHFCAPAGFAILKC





NNKRFNGAGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEE





KVIIRSKNFSDNTKIIIVQLNESVVINCTRPNNNTRKSIHIGP





GRAFYGTEIIGDIRKAHCTLNRTQWNNTLQRVATELREQF





KNKTIGFTNSSGGDLEITMHSFNCGGEFFFCNTTKLFGGT





WDGSTWNWSDNENNTITITLPCRKKLIINMQQEEEKATYA





NTIRGPIKSASNITGLLKTRDGGEEDNASETFRPGGGDMRD





NWRSELYKYKVVKVVPLGVAPTRAKRRVVQREKDLGLG





ALFLGFLGAAGSTMGAASLTLTVQARQLLSGIVQQQNNLL





RAIEAQHQLLQLTVWGIKQLQARVLAVERYLRDQQLLGI





WGCSGKLICTTSVPWNASWSNKSLDDIWNNMTWMQWE





KEIENYTVEIYKLIGEAQNQQEKNEKDLLELDNWASLWN





WFSITNWLWYIKIFIMIIGGLIGLRIVFAVLSIVKKVRQGYS





PLSLQTRLPTQRGPDRPEGIEEEGGEQDRGSSIRLVDGFLA





LIWDDLGSLGPFSYPRLEDLPPIVTRIGELLGRRGWELLKL





WWSLLQYWSQELKKSAVSLLNATAIAVAEGTDRILEVVQ





RFCRAILHIPVRIRQGLERALL





B.US.96.
B
229
MRVKGIRKNYQRLWRWGTMLLGMLMICTATGKLWVTV


5155


YYGVPVWKEATTTLFCASDAKSYDTEAHNVWATHACVP





TDPNPQELELKNVTEEFNMWKNNMVEQMQEDIISLWDQS





IKPCVKLTPLCVTLNCTGLRNNTNATSASSTHPPTTSSSSR





NIMEGEMANCSFNITTTMNKVQKEYALFYTHDVVQIVDN





DNNNASNSAKKRISYRLISCNTSVVTQACPKVNFEPIPIHY





CAPAGFAILKCNEKEFSGKGPCKNVSTVQSTHGIRPVVSTQ





LLLNGSLAEKEIVIRSENFTDNTKTIIVQLNEPVKIECTRPN





NNTRESITMGPGKAFYATGDIIGNIRKAYCTLRGTEWTNT





LKRIAEKLGEKFENKTIKFTQSSGGDPEIVAHSFNCGGEFF





YCETTQLFNSTWIRDNGNWTRAENITEGNNITLPCRIRQIV





NLWQQVGKAMYAPPISGPINCLSNITGLLLTWDGGNNSTS





NNNTEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK





AKRRVVQREKRAVGLGALFLGFLGTAGSTMGAASLTLTV





QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQA





RVLAVERYLKDQQLLGIWGCSGKLICTTAVPWNASWSNK





SLNDIWDNMTWMQWEREIINYTGLIYTLLVESQIQQEKNV





QELLELDTWASLWNWFNITNWLWYIKLFIMIVGGLIGLRI





VFTVLSIVNTVTQGYSPLSFQTRLPSQEGTDRPEGIQEGGG





ETDRSGPIRLVDGFLAFIWVDLRSLCLFSYHRLRDLLLIVT





RTVELLGRRGWELLNYLWNLLQYWSQELLNTAQRLFNTT





AIVVAEGTDRVIEVVQRAYRAILHIPTRIRQGLERLLL





B.US.SC
B
230
MRVKGIRKNCQHLWKWGTMLLGMLIICSAAEQLVTVYY


141


GVPVKEATTTLFCASDAKAYNTEVHNVWATHACVPTNPN





PQKVVLKNVTKNFNMKNNMVKQMHKNIISLWDQSLKPC





VKLTPLCVTLNCTDYLKNNTNTTSSSEKEMKGGKIKNCSF





NITTKIENKVQKKYALFYKLNVVPINNTTTSYRLISCNTSVI





TQACPKVSFKPIPIHYCTPAGFALLKCKNKKFNETGPCTNV





STVQCTHGIKPVVSTQLLLNKSLAEKKVVIKSKNFTDNAK





THVQLNKSVKINCTKPNNNTRKSIHIGPGQALYATEMIENI





RQAHCNISKAKNNTLKQIVKKLKVQFRNKTIIFNQSSGGDP





KIVMHSFNCGGEFFYCNTTKLFNSTMFNNTNNTKNTKEN





GTITLPCRIKQIINRWQKVRKAMYAPPIRGQIKCSSNITGIL





LTKNGSTNNSTNKTFRPKEENIRNNKSKLYKYKVVKIKPL





KVAPTKAKRRVVQREKKAVETKALFLGFLRAAGSTMGA





AAVTLTVQARQLLSGIVQQQNNLLRAIKAQQHLLQLTVGI





KQLQAKVLAVKKYLRNQQLLGIGCSEKLICTTTVPNASSN





KSLNKINNITMKKKKINNYTSLIYSLIKKSQNQQVKNRQKL





LKLNKASLSFNITKWLWYIKIFIIIVKGLIGLKIVFLVLSIVN





KVRQKYSPLSFQTRFPAPKEPDRPEGIEEKGKKDKDRSGQ





LVNKFLTLIINLRSLCLFSYHRLKNLLLIVAKIVKLLKRRG





WEALKYWWNLLQYSQKLKNSAVSLLNATAIAVAEGTDR





VIEVVQRTCRAILHIPARIRQGLERALL





B.US.02.
B
231
MRAKEMRKHWQQWWKGGILLLGMLMICSTAENLWVTV


F762P.FJ


YYGVPVWKEATTTLFCASDAKAYDTEAHNVWATHACVP


469735


TDPNPQEVFLKNVTENFNMGKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTDNIKLKNATENATATKANKEEIIN





CSFLVTTDRKDKEQNEYALFYRLDICTNKSWLLLVYTNNT





DYRLDSCNTSVITPSGPRRCPCESISIHPCTLDDCAIFKCNE





KKCNGTGVCKTVSTVHCIHGLSAVGSSPLLLNGSLAEEEV





VIRSENFTNNAKTIIVQLKEAVTINCTRPNNNTRKGIHIGPG





RAFYATESIIGDIRQAHCNLSRAQWNNTLRQIVQKLKEQF





KNATTIKFNSSSGGDPEIVTHSFNCGGEFLYCNTSHLFNST





WMCNSTWNSTNTEGNNTITLPCRIKQIVNMWQEVGKAM





YAPPIKGQIKCLSNITGLLLTRDGGVGDDSSNDTEIFRPGG





GDMRDNWRSELYKYKVVRIEPLGIAPTKAKRRVVQREKR





AVGTLGAMFLGFLGAAGSTMGAASVTLTVQARQLLSGIV





QQQNNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLW





DQQLLGIWGCSGKLICPTTVPWNTSWSNKNYTEIWNKMT





WMQWEREIDNYTSLIYSLIEESQNQQEKNEQELLELDKWD





SLWSWFSITHWLWYIKIFIMIVGGLVGLRIVFTVLSIVNRV





RQGYSPLSFQTHLPAQRGPDRPEGIEEEGGERDRERSTRLV





HGLLALIWDDLRSLCLFSYHRLRDLLLIVARTVELLGRRG





WEALKYLWNLLQYWIQELKKSAISLLNATAIAVAEGTDRI





IEIAQRIFRAFLHIPRRIRQGFERALQ





B.CN.12.
B
232
MKVTGIRRNYQKFWRWGALLLGMLMISSAAEKLWVTVY


2019


YGVPVWKDANTTLFCASDAKGYKTEAHNVWATHACVPT





DPNPQEVVLANLTENFNMWESDMVEQMHEDVINLWDQS





LKPCVKLTPLCVTLNCTNLNITSNNTSSSNTSSSNTSSITGG





GEMKNCSFNITTSINTKVKDYAFFYNLDIVQINNDKTSDNT





SYRLINCNTSVITQACPKISFEPIPIHYCTPAGFAIIKCNNETF





NGTGPCTNVSIVQCTHGIRPVVSTQLLLNGSLAERDVAIRS





KNFSNNAENIIVQLNEAVQINCTRPNNNTRKSINIGPGRAW





YTTGEIIGNIRQAHCNISGSKWNDTLKQVVKKLREQFKNK





TITFEQPPPGGDPEIVMHNFNCGGEFFYCNTSQLFNSRWDD





VDENETSTSNGTTGNDTITLPCRIKQIINMWQEVGKAMYA





PPIAGLIRCSSNITGLLLTRDGGNINATGNNTNKNNNTEIFR





PAGGNMKDNWRSELYRYKVVKVEPLGVAPTRAQRRVVQ





REKRAVGMIGAMFLGFLGTAGSTMGAASLTLTVQARQLL





SGIVQQQRNLLRAIEAQQHLLQLTVWGIKQLQARVLAVER





YLKDQQLLGLWGCSGKLICTTNVPWSGNWSSKSYNDIWY





NMTWMQWEKEIENYTGVIYTLIEESQNQQEKNEQELLEL





DKWASLWNWFDISNWLWYIRIFIMIVGGLIGLRIIFAVLSIV





NRVRQGYSPLSLQTHFPTQRGPGRPEGTEEEGGERDRDRS





ERLVTGFLTLFWEDLRSLCLFSYHRLRDLLLIVARIVELLG





RRGWEVLRYWWNLLQYWIQELKNSAISLLNAIAIAVGEG





TDRVIEVAQRIYRAILNIPRRIRQGLERALL





B.SE.03.
B
233
MRVKGIRKNCQHSWKWGIMLLGMLMICNASDQLWVTV


005SE.M


YYGVPVWKEANTTLFCASDAKAYDTEVHNVWATHACVP


F373127


TDPNPQEIGLGNVTENFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTDCKKDNDTLQCDCNNERLIIILSL





VTYTWERGEIRNCSVTATTDIEDRRKNEDASYHNRGIIQIN





NINNTYTLMLSSCTTSIIAGPCPSVSLDPIPNCYCAAAGFPIR





NGNDNNVAGPGPSDIVTPGPCALGIMPVAPPLLLVSGPLA





EEEVITTSDIFTNNAKTIIVQLNETVKINCTRPSNNTRKSITI





GPGRAFYATGDIIGDIRQAHCNISQAKWNNTLKQVVVKLR





EQFGMNKTIIFNQSSGGDPEIVMHSFNCGGEFFYCNTTRLF





NTTWNNTEKSNPTEGNGTITLQCRIKQIINLWQEVGKAMY





APPIRGQINCSSNITGLLLTTDGGMNTSNNKTFIPAGGDMR





DYWRSGVLKYDVVIVEPLGIAPTTDMRRVVQIQKRAELLR





ALLLVFLGAAGRTMSAASMTLTVQARQLLSGIVQQQNNL





LQAIEAQQHMLKLTVWGIKQLQARILAVERYLQDQQFLG





FWGCSGKRICTTAVPWNASWSNKSLDEIWNNMTWMEWE





KEIDNYTGLIYNLLIESQNQQEKNEQELLALDKWASLWN





WFDISNWLWYIRIFIMIVGGLVGLRIVFTVLSIVRRVRQRY





SPISVQTPRSAAQRGHDRPDGIEEEGGERDRDRFDQSVNGF





LTIIWVDVRSLCLFSYHRLRDLLSIVSRIVELMGRRGWGIL





KYWCNLLQYRSQELKKSAVSLFNATAIVVAEGTDRVIEAI





QRAFRAILHIPRRIRQGAERALI





B.SE.10.
B
234
MRVKGIKEELSSSWWKWGIMLLGMLMMICSALDQLWVT


SE60001


VYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACV


2.KP411


PTDPNPQEVVLENVTENFNMWKNNMVDQMHEDIISLWD


823


QSLKPCVKLTPLCVTLNCTDLRNNTESNDTTSGMILGKDK





IKMILFNCSFNITTSRRDKWQQEYAFFYKLDIMPIDEENNT





NTYTLISCNTSVITQACPKVSFEPIPIHYCTPAGFALLKCND





KKFNGTGLCKNVSTVQCTHGIRPVVSTQLLLNGSLAEEEV





VIRSENFTDNIKTIIVQLNESVEINCTRPNNNTRRSIRNHRGP





GRAFHTTGEIIGNIRQAHCNISRAKWNNTLKQIVAKLREQF





GKNKTIVFNHSSGGDPEIVMHSFNCGGEFFYCNTTQLFNST





WNITGGLNNTEGNGTITLQCKIKQFINMWHEVGKAMYAP





PIRGQITCSSNITGLLLTRDGGENPGNDTDTFTPGGGDMKD





NWSSELDKYRVIGIAPLPVAPTKAKTRLLQRDKRAVGIGS





VFLVFLVAAGSTMSAMSMTMTLQAQELLYVTERMQKNL





LKAIEAQQHLLQLTVWGIKQLQARVLAIEGYLKDQQLLGL





WGCSGKLICTTAVPWNVSWSNKSLDKIWNNMTWREWER





EIDNYTGLIYNLLETSQNQQEKNEQELLELDKWASLWNW





FDITNWLWYIKIFIMIVGGLVGLRIVFTVLSIVNRVRQGYSP





ISLQTPRPAQRGLDRPEAWDEKAGEKCRGHFHRCVNRIM





AIFGAICGAFCSSSFSCDPLGFYWGFWKFLPGGGKPLKNA





WNFLPYLVPELNQGANEVFNCPVNATGESTGRGIETFQRT





FKSIFQILSQITPGQTGAKKGWV





B.SE.10.
B
235
MRVQENWQNLWKGGIWFLGIWMICSAVEQLWVTVYYG


SE60004


VPVWKDATTTLFCASNAKAYDTEVHNVWATHACVPTDP


6.KP411


NPQEVVLQNVTEPFNVWKNGMVEQMHEDIISLWDQSLKP


827


CVKLTPLCVTLHCTDLKETIKEEKGKMKNCSFNTTTSIRD





KTQTAYALFSTLDLVQIKDGKINANSKHSYRIIHCNTSTITQ





ACPKVSFEPIPIHYCAPAGFAILKCNDKQFNGTGECRNVSS





VQCTHGIRPVVSTQLLLNGSLAEEDIVIRSENLTNNAKTIIV





QLNETVRINCTRPNNIVSRRIHIGPGRTFHATKSITGSIKQAF





CNLSRPQWEDTLGKIVKKLRQNFGTNTTIVFDHSSGGDPEI





VKDRINCGGKFFYCYSIKLFKSTWNMEWSTQKASNTENE





NITLPCKIKQVINMWQEVGKAMYAPPIEGQIKCSSNITGLL





LTRDGGKHNSSDTNKTEIFRPGGGDMRDNWRSELYKYKV





VKIEPLGIAPTRAKRRVVQREKRAVGLGAMFLGFLGAAGS





TMGAASLTLTVQARQLMSGIVQQQNNLLRAVEAQQHML





QLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGRLICT





TAVPWNTSWSNRTQDDIWTNLTWMKWEKEIDNYTDVIY





RLLEESQNQQEKNEQELLELDKWASLWSWFDITNWLWYI





KIFIMIVGGLIGLKIVFTVFSIVNRVRQGYSPVLLQTHHPGR





RGRPRPVGMEEEVGEGDRDSSCPLAVELLSIFEVVFRILFV





FNCLRLRDLILIILRMVVLLGRGGWEIFKYWSRVLQFRIQE





LKNSGFILFPSTPITVEEETNGITEPFRSPCNPIVIIPIRKKQGS





ER





C.MW.9
C
236
MRVRGILRNCQQWWIWGILGFWMLMICNVVGNLWVTV


6.C120


YYGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVP





TDPNPQEMVLENVTENFNMWKNDMVDQMHEDIISLWDQ





SLKPCVKLTPLCVTLNCSNITSIVSRNVSNNDNMRNCSFNI





TTEIRDKEKKEYALFYRLDIIPLEKNSSEYRLINCNTSAVTQ





ACPKVSFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVST





VQCTHGIKPVVSTQLLLNGSLAEEEIVIRSKNLTDNAKIIIV





QLKDPVTIVCTRPGNNTRKSVRIGPGQTFYATGEIIGNIRQ





AHCNISKEDWNKTLQQVGKKLAEHFPNKTIEFKEHSGGDL





EITTHSFNCRGEFFYCYTSTLFNSTYNAEYNSNSTNSSSTIT





LQCRIKQIINMWQGVGRAIYAPPIEGNITCESNITGLLLTRD





GGKDGNNTEIFRPGGGDMRDNWRSELYKYKVVEIKPLGI





APTKAKRRVVEREKRAVGIGAVFLGFLGAAGSTMGAASIT





LTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQL





QARVLAIERYLKDQQLLGIVVGCSGKLICTTAVPWNSSWSN





RSQDDIWQNMTWMEWDREINNYTNTIYRLLEDSQNQQE





KNEQDLLALDSWKNLWSWFNITNWLWYIKIFIMIVGGLIG





LRIIFAVLSIVNRVRQGYSPLSFQTLIPNPRGPDRLRRIEEEG





GEQDKDSSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILIV





VRAVELLGHSSLRGLQRGWEALKYLWSLVQYWGLELKK





SAISLLDTIAIAVAEGTDRIIELVQRIFRAICNIPRRIRQGFEA





ALQ





C.CN.06.
C
237
MRVRGTLRNCQQWWMWGVLGFWMLMICNGGKNLWVT


CNE88


VYYGVPVWKEAKTTLSCASDAKAYDTEVHNVWATHAC





VPTDPNPQEMVLENVTENFNMWKNDMVDQMHEDVISLW





DQSLKPCVKLTPLCVTLECRDVTNGTSGNDTLANDTSECK





NGTNNETYDESVKELRNCSFNATTLVRDKKKTAYALFYR





LDIMPLNEKKNSSENSSEYVLINCNSSAITQACPKVTFDPIPI





HYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPV





VSTQLLLNGSLAEGEIIVRSENLTNNVKTIIVHLNQSVEIVC





TRPNNNTRKSIRIGPGQTFYATGEIIGDIRQAHCIINASNWH





KTLQEVSKELAGYFPNKTIIFNSSSGGDLEITTHSFNCRGEF





FYCNTSGLFNSTYMTNGTYMFNGTYRLNNITSNSNITIPCR





IKQIINTWQEVGRAMYANPIAGNITCKSNITGLLLVRDGGT





NNATTETFRPGGGNMRDNWRSELYKYKVVEIKPLGIAPTE





AKRRVVEREKRAVGIGAVFLGFLGAAGSTMGAASITLTV





QARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQT





RVLAIERYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKS





HEEIWNNMTWMQWDREISNYTNTIYRLLEDSQNQQERNE





KDLLALDSWKNLWSWFDITNWLWYIKIFIMIVGGLIGLRII





FAVLSIVNRVRQGYSPLPFQIRTPNPGGPDRLGRIEEEGGE





QDKDRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILVAA





RVVELLGRSSLRGLQKGWEALKYLGSLVQYWGLELKKSA





TSLLDTIAIAVAEGTDRIIELGLSICRAIRHIPRRIRQGFEAAL





Q





C.ZA.04.
C
238
MRVRGIPRNWPQWWIWGILGFWIIMMCRVMGNLWVTVY


04ZASK


YGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPT


182B1.A


DPNPQEMELKNVTENFNMWKNDMVDQMHEDIISLWDQS


Y878054


LKPCVKLTPLCVTLNCTDVKKANNTTNIGNTTSNRNTTYN





ATYNEYMKNCSFNITTEIRDKKRKEYALFYRPDIVPLENSS





SEYILINCNSSTITQACPKVSFDPIPIHYCAPAGYAILKCNNK





TFNGTGPRHNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIR





SENLTDNTKTIIVHLNKSVAINCTRPGNNTRRSIRIGPGQAI





YTNNIIGDIRQAYCSISKEKWNKTLEQVKEKLKEYYKKTIE





FQQPVGGDLEITTHSFNCRGEFFYCDTTKLFSPKNETDNNT





ITLPCRIKQIINMWQEVGRAMYAPPIAGNITCNSNITGILLT





RDGGDTHEKKRKHKRNKVTEIFRPAGGNMKDNWRSELY





KYKVVEIKPLGVAPTGAKRRVVEREKRAVGMATMLFGFL





GAAGSTMGAASITLTVQVRQLLSGIVQQQSNLLRAIEAQQ





HLLQLTVWGIKQLQARVLAIERFLQGQQLLGLWGCSGKLI





CTTNVPWNSSWSNKSEEDIWGNLTWMQWDREISNYTDII





YRLLEDSQTQQEQNEKDLLALDSWKNLWSWFDISNWLW





YIRIFIIIVGGLIGLRIIFGVLSIVNRVRQGYSPLSFQTLTPNPR





ELDRLGRIEEEGGEQGRDRSIRLVSGFLTIAWDDLRSLCLF





LYHQLRDFILIAARAAELLGRSSLRGLQRGWEALKYLGNI





VQYWGLELKKSAISLLDTIAIAVAEGTDRIIRVIQSICRAIR





NAPRRIRQGFEAALL





C.ZA.09.
C
239
MRVRGTLRNYQQWWTWGILGFWMLIIYSVAGNLWVTVY


GU08018


YGVPVWTNAKTTLFCASDAKAYEKEVHNVWATHACVPT


6.GU080


DPHPQELVLANVTENFNMWKNDMVEQMHEDIINLWDES


186


LKPCVKLTPLCVTLRCTKVNGTANATSEGTANATSKDEE





MTMYNCSFNTTTGIRDKTQQEYALFHKHDVVPSNIDNSEY





ILIHCNTSTITQACPKVSFDPIPIHYCAPAGYAILKCNNKTF





NGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEKEIIIRSE





NLTDNSKTIIVHLNESVDIVCTRPGNNTRRGIRIGPGQTFFA





TRTIGNIRQAHCNISESKWNTTLQRVSAKLKEYFNKTIIYN





SSSGGDPEITTHSFNCRGEFFYCNTSRLFNRNSTDLSTNTG





NNSTITLPCRIKQIINMWQGVGRAMYAPPIEGNITCKSIITG





LLLVRDGGPNNSTETRPEPGREGNMKGNKFGRKSEIIIKYK





VIENLSPLGEIHPTKAKRRAVEREKRAVVGIGAMILGFLGA





AGSTMGAASITLTVHARQLLSGIVQQQSNLLRAIEAQQHM





LQLTVWGIKQLQARVLAIERYLKDQRLLGLWGCSGKLICT





TSVPWNSSWSNKSEADIWNNMTWMQWEKEINNYTYTIY





QLLEDSQSQQEQNEKDLLELDKWQNLWSWFSITNWLWYI





KIFIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTLTPSPRE





LDRRERIEEEGGEQDRDRSVRLVSGFLSLAWDDLRSLCLFI





YHRLRDFILIATRVVELLGQRGWEALKYLGNLVLYWGLEI





KKSAINLLDTIAIAVAEGTDRIIEVVHRACRAIRYIPRRIRQ





DFEADLQ





C.IN.94.
C
240
MRVRGIRRNYQQWWIWGVLGFWILMICNGGGNLWVTVY


94IN


YGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPT





DPDPQEMVLENVTENFNMWKNDMVNQMHEDVISLWDQ





SLKPCVKLTPLCVTLECRNVSTCNETYNESVKEIKNCSFNA





TTEIRDRKQTVYALFYRLDIVPLDKKNSSRNSSDYYRLINC





NTSAITQACPKVTFDPIPIHYCTPAGYAILKCNDKTFNGTG





PCHNVSTVQCTHGIKPVVSTQLLLNGSLAEREIIIRSENLTD





NVKTIIVHLNESVDIVCTRPNNNTRKSIRIGPGQTFYATGDI





IGDIRQAHCNISEDKWNETLQRVSKNLAEHFPNKTIKFASS





SGGDLEITTHSFNCRGEFFYCNTSGLFNGTYMSNGTESNSS





SIITIPCRIKQIINMWQEVGRTMYAPPIKGNITCKSNITGLLL





VRDGGTEPNDTETFRPGGGDMRNNWRSELYKYKVVEIKP





LGIAPTAAKRRVVEREKRAVGLGAVFLGFLGAAGSTMGA





ASITLTAQARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGI





KQLQTRVLAIERYLRDQQLLGIWGCSGKLICTTAVPWNSS





WSNRSEKEIWDNMTWMQWDREISNYTNTIYRLLEDSQNQ





QERNEKDLLALDSWKNLWSWFDITNWLWYIKIFIMIVGG





LVGLRIIFAVLSIVNRVRQGYSPLSFQTLTPNSGGPDRLGRI





EEEGGEQDKDRSIRLVNGFLALAWDDLRNLCLFSYHRLK





DCILVTARVVELLGRNSLRGLQKGWEALKYLGSLVQYWG





LELKKSTTSLLDTLTIAVAEGTDRIIELGQSICRAIRNIPRRIR





QGLEAALQ





C.IN.09.
C
241
MRVMEILRNYQQWWIWGILGFWMLMICNVGGNLWVTV


T125


YYGVPVWKEAKTTLFCASDAKAHETEVHNVWATHACVP





TDPKPQEIVLGNVTENFNMWKNDMVDQMHEDVISLWDQ





SLKPCVKLTPLCVTLECGNVSVSNVSVSNTTQGDANVTHN





EGANEIKNCSFKIATELRDKEQKVYALFYRLDIVQLNDSSN





SSSSNNSSKYSKYRLINCDTSTVTQACPKVTFDPIPIHYCAP





AGYAILKCNNETFNGTGPCTNVSTVQCTHGIKPVVSTQLL





LNGSLAKGGVIIRSENLTENAKIIIVHLNESVEINCTRPNNN





TRKSVRIGPGQTFYATGEIIGDIRKAYCEINEQKWNKTLRE





VAKSLAKHFPNREIRFAPSSGGDLEIVTHSFNCRGEFFYCN





TSGLFNRTYTTDLFNKTYNDNDTEDNSGPVIIIPCRIKQIIN





MWQEVGRAMYAPPIAGNITCNSSITGLLLTRDGGNDNKTS





TNETFRPLGGNMRDNWRNELYKYKVVEIKPLGIAPTTAK





RRVVEREKRAVGMMGAMFLGFLGAAGSTMGAASITLTV





QARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQAR





VLAIERYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKSY





DEIWENMTWMQWDKEINNYTDTIYRLLEDSQIQQERNEK





DLLALDNWKNLWNWFNISNWLWYIKIFIMIVGGLIGLRIIF





AVLSIVNRVRQGYSPLSFQTLIPSPGGPDRLERIEEEGGEQD





RDRSIRLVNGFLALAWDDLRNLCLFSYHRLRDFILVTARA





VELLGHHSLRGLRKGWEALKYLGSLVQYWGLELKKGAIS





LFDTIAIAVAEGTDRIIEFLQRLCRAIYNIPRRIRQGLEAALQ





C.ZA.99.
C
242
MRVKGIQRNWPQWWIWGILGFWMIIICRVVGNLWVTVY


Du123.D


YGVPVWTEAKTTLFCASDAKAYEREVHNVWATHACVPT


Q411850


DPNPQEIVLGNVTENFNMWKNDMVDQMHEDIISIWDQSL





KPCVKLTPLCVTLNCTDVKVNATSNGTTTYNNSIDSMNGE





IKNCSFNITTEIRDKKQKVYALFYRPDVVPLNENSSSYILIN





CNTSTTTQACPKVSFDPIPIHYCAPAGYAILKCNNKTFNGT





GPCHNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLT





NNAKTIIVHLNESIEIVCTRPNNNTRKSIRIGPGQTVYATND





IIGDIRQAHCNISKTKWNTTLEKVKEKLKEHFPSKAITFQP





HSGGDLEVTTHSFNCRGEFFYCDTTKLFNESNLNTTNTTT





LTLPCRIKQIVNMWQGVGRAMYAPPVEGNITCNSSITGLL





LVRDGGNTSNSTPEIFRPGGGNMKDNWRSELYKYKVVEI





KPLGVAPTKAKRRVVEREKRAVGIGAVLFGFLGAAGSTM





GAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLTV





WGIKQLQARVLAIERYLKDQQLLGLWGCSGKLICPTTVP





WNSSWSNKSQTDIWDNMTWMQWDREISNYTGTIYKLLE





ESQNQQEKNEKDLLALDSWKNLWSWFDITNWLWYIKIFI





MIVGGLIGLRIIFGVLSIVKRVRQGYSPLSFQTLTPNPRGLD





RLGRIEEEGGEQDKDRSIRLVNGFLALAWDDLRSLCLFSY





HRLRDFILVAARAVELLGRSSLRGLQRGWEALKYLGNLV





QYGGLELKRRAISLFDTIAIAVAEGTDRILEVILRIIRAIRNIP





TRIRQGFEAALL





C.NP.11.
C
243
MKVRGTQRNYPQWWIWGILGFWMLMICNVERNLWVTV


11NP041


YYGVPVWREAKTTLFCASDAKAYDQEVHNVWATHACVP


.KJ15843


TDPNPQERSMENVTENFNMWENDMVDQMHEDVISLWDQ


0


SLKPCVKLTPLCVTLNCTDAKVNITADNSKTYNESMKEIK





NCSFNATTVIKDKKQTVYALFYKLDIVPLDNEEQENNGNE





NYTYRLINCNTSTITQACPKVTFDPIPIHYCAPAGFAILKCN





DKTFNGTGPCSNVSTVQCTHGIKPVVSTQLLLNGSLAEEEI





IIRSKNLSQNHRTIIVHLNESVGIECTRPSNNTRKSIRIGPGQ





TFFATGEIIGNIRQAHCNISKEKWDKTLQMVGKKLAEHFP





NKTIIFAPASGGDLEITMHSFNCRGEFFYCDTKKLFNATYM





NDDTANNSRSNGNDTVTLPCKIKQIINMWQEVGRAMYAP





PIAGNITCKSNITGLLLVRDGGKMNQNIETNKTETFRPEGG





NMKDNWRSELYKYKVVEIKPLGIAPTKAKRRVVEREKRA





VGLGAVFLGFLGVAGSTMGAASITLTAQARQLLSGIVQQQ





SNLLKAIEAQQHLLQLTVWGIKQLQTRILAIERFLQDQQLL





GIWGCSGKLICTTAVPWNNTWSNKSESEIWNNMTWMQW





DREISNYTFTIYQLLEDSQIQQEKNEKDLLALDSWNSLWS





WFNITSWLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVRQGYS





PLSLQTLIPTPREPDRPRGIEEGGGEQDNATSIRLVNGFLAL





AWDDLRNLCLFSYRHLRDLLSVIARVVGLLGRVGGGGLQ





RGGKAPKLGGLGAYWGLEIKKSVFSCLIRLAIAVLKGTVR





FLKLVQDFGKGINNYPRRIPGFETGL





C.SE.09.
C
244
MRVMGTLRSCQRWWIGGILGFWMLFMCSVGEDLWVTV


035ZA.


YYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVP


MF37314


TDPNPQEMILKNVTENFNIWKNDMVDQMHEDIISLWDQG


8


LKPCVKLTPLCVTLNCSNVNSSSVANSTNSTNSGNITNANT





TNVNITDSDMKNCSFNITTELRDKTRKEYALFYRPDIVPLS





NSNSSEYILINCNSSTITQACPKVSFEPIPLHYCAPAGYALL





KCRDTKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLA





EDEIVIRSENITNNAKTIIVQLKEPVTIMCTRPNNNTRKSMR





IGPGQTFYATGAIIGNIRQAHCNISRTNWTKTLQEVGEKLQ





EQELFKNKTIKFTEHSGGDIEITTHSFNCGGEFFYCNTSELF





NGTYSNGTYTPNNNGSDSNLTITIPCRIKQVVNLWQGVGQ





AMDAPPIAGISWGSIRGLLLPGGGPNPSTAEIFRPGGGDMR





DNWRSELYKYKVVEVNPLGIAPTRAKRRVVEREKRAVGI





GAVFLGFLGAAGSTMGAASVTLTVQARQLLSGIVQQQSN





LLRAIEAQQHMLQLTVWGIKQLQARVLALERYLRDQQLL





GIWGCSGKLICTTAVPWNSSWSNKSYDAIWDNMTWMQW





EKEIDNYTDTIYRLLEVSQTQQEQNERDLLALDKWNNLW





SWFNISNWLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVRQGY





SPLSLQTLIPSPRGPDRPGGIEEEGGEQDRSSSIRLVSGFLAL





AWDDLRSLCLFSYHRLRDLVLIAARGVNLLGQRGWEALK





LLWSLVQYWGLELKKSAISLFDTLAITVAEGTDRIIEIVQR





TWRAICNIPRRIRQGFEAALQ





C.SE.03.
C
245
MSVKDILENYQHWWIWGLLAFWMQMVCMGDNLWVMV


004ZM.


YYGAPVWREAKPTLFCASDAKGYNKESQSVWATCACVP


MF37312


TDPPPQELVLGIVTEYFNVWKMDGIHCEGYNLLGYQPKPV


6


CKGNICVLKLYICYCYQQFWLVGEMKNCTLNATTELRDN





RKKESALFYRLDVLPLNSSDNNNNEYRLINCSTSAITQACP





KVSFDPIPIHYCAPAGYAILKCNDRTFNGTGPCKNVSTVQC





THGIKPVVSTQLLLNGSLTKEIIISSENLTKNTKTIIVHLNES





VGIVCIRPNNNTRKSVRIGPGQTFFATGAIIGEIRQAYCTVN





GSAWNNTLQRVRKKLRRYFSKAIRFEPSSGGDLLLTTHSF





TCRGGYFYCNTSNLINNNINIASASTIQRRIKQIINMWQGV





GRAMYAPPIAGNITCKSNITGLLLTRDGGENNRTETFRPGG





GDMKDNWRSELYKYKVVESKPLGIAPTKAKRRVVEREKR





AVGLGAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQ





QSNLLKAIEAQQHMLQLTVWGIKQLQARVLAIERYLEDQ





QLLGMWGCSGRRICTTALSWNSSWSNKTYDDIWNNMTW





MQWDREISNYTDTIFRLLEDSQNQQEKNEKDLLALDSWK





NLWSWFNITKWLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVR





QGYSPVSLQTLIPAPRGPDRPGGIEEGGGEQDKDRSIRLLN





GFLTLVWDDLRSLCLFSNHRLRDFILQSQRAVEVVGRRG





WEAKKYAGGGLQYWGEELKQSGFSSNDTIAIVVEEGTDRI





IQVLQWIGRASNNIPGRIRQGFEVAGQ





C.SE.12.
C
246
MRVMGMQRNCQKWWVWGILGFWMFMIYNVMGNLWV


076SO.M


TVYYGVPVWKEANPTLFCASDAKGYEREVHNVWATHAC


F373179


VPTDPNPQELFLENITENFNMWKNEMVEQMHQDIISLWD





QSLKPCVKLTPLCVTLRCSNVTYNETIREMTNCSFNVTTEL





RDRRQEVYALFYKLDVVSLNKSGDYRLINCNTSTITQACP





KVSFDPIPIHYCAPAGYAILKCNDEEFNGTGPCGNVSTVQC





SHGIKPVVSTQLLLNGSLAENVTIRSENLTNNAKTIIVQLRE





PVYINCTRPNNNTRKSIRIGPGQALYATGEVIGDIRHAHCNI





SKEKWKTALTGVRERLRELFNRTILFNKSSGGDLEVTTHS





SNCRGEFVYCDTSDLLTTSVSVQKVLVITSTDAIILRCQIQH





IISMWQVRRALYAPPMKGMRSSTCTITGTTDLRDGRTDYH





SMYIVRPGGGDMRDNWRSELYKYKVVEVKPIGIAPTGAK





RRVVEREKRAALGAVFLGFLGAAGSTMGAASVTLTVQAR





QLLSGIVQQQNNLLRAIEAQQHMLQLTVWGIKQLQTRVL





ALERYLRDQQLLGIWGCSGKLICTTNVPWNGSWSNRSQQ





EIWENMTWMQWEREIDNYTNIIYELLEVSQNQQETNERD





LLALDKWNNLWSWFNISNWLWYIRLFIMIVGGLIGLRIIFA





VLSIVNRVRQGYSPLSFQTLIPNPRGPDRPGGIEEEGGEQD





RDRSIRLVSGFLALAWDDLRSLCLFSLLHLRDLIEIVARAV





ELVEHSSLRGLQRGWDTLNYLRSLVQYRSKEIKKSATNM





LDTTAIAVDEVTDSIIGFAQRIWRGICNIRARIRQGYEAGG





Q





C.NP.11.
C
247
MRVRGILRNYQQWWIWGILGFWMFMICNGVGNLWVTV


11NP007


YYGVPVWREAKTTLFCASEAKSYEKEVHNVWATHACVP


.KJ15842


TDPNPQEQVLDNVTENFNMWENDMVEQMHEDVISLWDQ


3


SLKPCVKLTPLCVTLNCTDINITGGGAECMASNERIREVRN





CSFNATTEIRDKKQKVYALFYKLDIVSLYNLDIVAIQASSG





HYRLINCNTSAITQACPKVSFDPIPIHYCVPAGYAILKGNN





KTFKGPGHGNKVSTVQTHGIKPVVSTQLLLNGSLAEEEIII





RSKNLSDSTKIIIVQFNQSVKIVCARPNNNTRKSIRIGPGQV





FYATGDILGDIRQAHCNISKNKWKDTLHNVSKILAEQFSIK





TIIFTSSSSSTLNFNRHSFNCRGAFLYCNSCYLFNRLYTAAN





TKSNSWIDINCTVAIPCRIKQIINMWQEVGRAMYANPIEGN





ITCKSDITGMLLERDCGKERDKTISTTKEIFRPIGGNMRDN





WRSELYKYKVVEIKPLGVAPTEAKRRVVEREKRVAGLGA





VFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLR





AIEAQQHLLQLTVWGIKQLQTRVLAIERYLKDQQLLGLW





GCSGKLICTTAVPWNSNWSSKSQTEIWNHTTWMQWDREI





NNYTNTIYQLLEDSQNQQEKNEKDLLALDRWKNLWNWF





DISNWLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLS





FQTLTPNPRELDRLGRIEEEGGEQDRDRSIRLVNGFFSLAW





DDLRSLCLFCYHRLTDLISVTARAVELLGRSSLRGLQRGW





KVLKYLGSLVQYWGLELKKSAIRLLDITAIAVAEGTDRIIE





FFQRLCRAICNIPRRIRQGFEAALQ





C.ZA.09.
C
248
HQWWIWGILGFWMIYNVVGNLWVTVYYGVPVWKEAKT


704MC0


TLFCASDAKAYEKEAHNVWATHACVPTDPNPLEIVMGNV


03N.GU0


TENFNMWDNSMVEQMHEDIISLWDESLKPCVKLTPLCVT


80161


LECDNVTYTGNEDSITEKEMKNCSFNATTEIEDKKRKERV





LFHSLDIVPLENNNSSNNGSSNKKYRLINCDTSTTTQACPK





VSFDPIPIHYCAPAGYAILKCNNETFNGTGPCHNVSTVQCT





HGIKPVVSTQLLLNGSLAEKEIIIRSENLTENAKIIIVHLNESI





EIMCIRPGNNTRKSVRIGPGQTFYANDIIGDIRKAHCKVNR





SKWNETLQKVGIKLKEHFNKTIIFESSSGGDPEITTHSFNCG





GEFFYCNTSGLFNGTFNGTYTNSTQNETTITLPCRIKQIINM





WQGVGRAMYAPPIAGNITCTSNITGLLLLRDGGNTTNSKE





EEIFRPGGGNMRDNWRSELYKYKVVEVKPLGVAPTGAKR





RVVEREKRAVIGAVVLGFLGAAGSTMGAASIALTAQARK





VLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAV





ERYLQDQQLLGIWGCSGKLICTTNVPWNSSWSNKSLDDI





WGNMTWMQWDKEVSNYTHTIYSLLEESQIQQERNEKELL





ALDKWQSLWNWFDITNWLWYIKIFIMIVGGLIGLRIIFAVL





SIVNRVRQGYSPLSFQTLTPTSRGPERLGGIEEEGGEQGKD





RSIRLVSGFLTIVWDDLRSLCLFSYHLLRDFVLIVARGVEL





LGRRGWETLKYLGSLVQYWGLELKKSAISLLDTIEIIVAEG





TDRIINFLQSICRAIRNIPRRIRQGFEAALL





C.ZA.05.
C
249
MRVRGIQRNWPQWWIWGILGFWIILSCRGVGDLWVTVY


05ZAFV


YGVPVWREAKTTLFCASDAKAHVREVHNIWATHACVPT


26.DQ38


DPNPQEMVLENVTENFNMWKNDMVDQMHEDIINLWDQ


2378


GLKPCVKITPLCVTLRCTNATFTANDSNDNNVTMSEWTE





MTTCSFSNTTQIGDALQTAYALLYLTDSKPLFERCYTSIHL





PYLTHGTLGDPKASCYRAPIHYCAPAGYAIRKCNDRTFNG





TGPCNRVSTVQCTHGIKPVVSTQLLLNGSLAEEEIMIRSEN





LTDNTKTIIVQLKEPVEINCTRPGKGKRTRVRIGPGRTFYA





TGAVTGDIRKAHCTVNGSRWNSTLEQVREKLKKYYDNRT





IKFEQPSGGDPEVTTHSFNCRGEFFYCDTTNLFNISNINSTS





NITLQCKIKQIINMWQKVGRAMYAPPIEGTITCISNITGLLL





TRDGGSSNETSSINKTEIFRPGGGNMKDNWRSELYKYKVV





EIKPLGIAPTGAKRRVVEREKRAVGIGAVFLGFLGAAGST





MGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLT





VWGIKQLQARVLALERYLTDQQLLGLWGCSGKLICTTNV





PWNSSWSNKSRADIWDNMTWMQWDREINNYTDTIYRLL





EESQTQQEKNEKDLLALDSWKNLWNWFNISNWLWYIKIF





IMVVGGLIGLRIILGVLSIVKRVRQGYSPLSFQTLIPNPRDP





DRLGGIEEEGGEQDRDRSIRLVSGFLSLAWDDLRSLCLFSY





HQLRSFILVTARAVQLLGRSSLRGLQGGWEALKYLGNLV





QYWGLELKRSAISLLDTIAITVAEGTDRIIELIQGICRAICNI





PRRIRQGFERALL





C.ET.08.
C
250
MRVMGTSRNCQQGWIWGILGFWMFMICNVLGNLWVTV


ET119.K


YYGVPVWKEAKTTLFCASDAKAYDTEVHNVWATHACVP


U319532


TDPNPQELVLENVTENFNMWKNDMVDQMHQDVISLWDE





SLKPCVKLTPLCVTLNCSDATINGTIVEKMKNCSFNVTSTL





KPTKLKAYALIYRTELVPLIENQINGNYMITGYLNNLNITK





SGQTVSFGPFRPHPWPHAGQASLTVKCNNISFKGTGPCNK





VSSVQCTQGIKLVEWTRLMLYGSIAEEQIEIRCENLTIKTG





LIIVQLKESVAIRCERPKNNTRGSMRVGPGQTFYATGDIIG





DIRQDHSNISEKEWNETKQKVVEKLIELLPTTRKLTRSSGG





DPEITTHSFNCRGEFFYCDTTNLFNSTYNGTSTNSTSSGTIT





LSCSIKQIIQMWQGLRKVLNAPVCAPLDGNIKCTSNFTGLL





LPRAGGSGLNDTDAEVFRPAGGDMRDNWRSELYKYKVV





EIKPLGVAPTTAKRRVVEREKRAVGIGAVFLGFLGAAGST





MGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQLT





VWGIKQLQTRVLAVERYLQDQQLLGIWGCSGKLICTTAV





PWNSSWSNRTQDEIWNNMTWMQWEKEINNYTGTIYRLL





EDSQNQQEKNEKDLLALDKWQNLWSWFSISKWLWYIKIF





IMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTLIPVPRTDR





LSMCLFSYRHLRDVLLIAARILNLLGQRRWETIKYLGSIVQ





YWSIELKKSAINLLDAIAIVVAEGTARILEAIQRIWRAIFNIP





RRIRQGLEAALQ





C.ES.x.
C
251
MRVTGTRKNCQQWWIWGILGFWMLMIYNVKGLWVTVY


MH08.E


YGVPVWREANTTLFCASDAKAYDKEVHNVWATHACVPT


F531335


DPKPQEQWLKNVTEDFNMWKNNMVEQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTNATSIVNMTDVASNRSRSNDTVE





NVNGTSVENMTIVNSTDEIKNCSFNITTGIGDKPKKESALF





YSLDIVPLNKSANNSEYILITCKTSKITQACPKVSFEPIPIHY





CAPAGYAILRCDNKTFNGTGPCNNVSSVQCTHGIRPVVTT





QLLLNGSLAEEDIIIRSVNMSNNAKTIIVHLNESVEIICTRPN





NNTRKSVGMGPGQAIYTTGAIIGNIRQAHCNISGRNRSRTL





QRVSEKLQKYFYNKTIKFQPHSGGDPEITTHSFNCGGEFFY





CNTSDLFNSTYFNNTYFNGVYNGTRDNGTITLPCRIKQFIN





MWQRVGRAMYAPPIQGLITCRSNITGLLLTRDGGNTINNT





NNNTEIFRPGGGDMRDNWRSELYKYKVVEIRPLGVAPTD





ARRRVVGREKRAVMGALFLGFLGAAGSTMGAASMALTV





QTRQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQA





GVLALERYLQDQQILGIWGCSGKRICTTDVPWNSSWSNKT





KEYIWGNMTWMQWDKEISNYTGIIYDLLEESQNQQEKNE





QELLALDSWKNLWNWFNITNWLWYIKIFIMIIGGLIGLRIIF





AVLSIVHRVRQGYSPLSFQTLIPNQREPDRPGRIEEEGGGQ





DRDRSIRLVSGFLALAWGDLRSLCLFSYHRLRDCISIAARA





GELLGRSSLKGLQRGWETLKYLGSLVQYWGLELKKSAISL





LDTIAIAVAEGTDRIIEFIQRICRAICNIPRRIRQGLERILL





C.ZA.09.
C
252
MRVTGIPRNWPQWWIWGILGFWIMVMCNKTENLWVTVY


704MC0


YGVPVWTDAKTTLFCASDAKAYVKEVHNVWATHACVPT


19F.GU0


DPNPQELELKNVTEDFNMWKNDMVDQMHEDVISLWDQS


80178


LKPCVKLTPLCVTLNCTEVTPNCTKVNNSSSNNTTVAPTT





ATCNNHTKDNASIIGEREMRNCSFNITTEIRDKKQKAYALF





YRPDIVPLNGNPNTTEYILINCNTSTITQACPKVTFDPIPIHY





CAPAGFAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVST





QLLLNGSLAEEDIIIRSENLTSNTKTIIVHLNESVKIECLRPG





NNTRKSIRIGPGQTFYATGAIIGDIRQAHCNISTLAWNATLE





RVRGKLKEHFPNKTINFTSPAGGDLEVITHSFNCQGEFFYC





NTTNLFDKFNSSGPNITTIPCRIKQIINMWQEVGRAMYAPPI





AGNITCNSNITGLLLTHDGINITYNGTGNTTHFRPGGGNMK





DNWRSELYKYKVVEIKPLGVAPTKAKRRVVEREKRAVGL





GAMILGFLGTAGSTMGAASITLTVQARQLLSGIVQQQSNL





LRAIEAQQHMLQLTVWGIKQLQARVLAIERYLKDQQLLG





LWGCPGKLICTTNVPWNISWSNKSYNKIWDEMTWMQWD





KEISNYTDTIYRLIEDSQSQQEKNEKDLLALDSWNNLWNW





FNITNWLWYIRIFIMIVGGLIGLRIIFCVLSVVRRVRQGYSP





LSFQTLIPNPRGPDRPERIEEGGGEQDRRKSIRLVSGFLALA





WDDLRSLCLFSYHHLRDFILIVLRAVELLGRSSLRGLQRV





WEALKYLGNLVLYWGQELKKSAISLLDTIAIAVAEGTDRI





LEVLQGIWRGIRNIPRRIRQGFEASLL





C.ZM.02.
C
253
MRVMGILRSYQQWWIWGILGFWMVMTCNVGGNLWVTV


ZM211M


YYGVPVWKEAKTTLFCASDAKSYEKEVHNIWATHACVPT





DPDPQELVLENVTENFNMWKNDMVDQMHKDIISLWDQS





LKPCVKLTPLCVTLNCTDVKGNVTYSDKGNLAHSNSTDE





MKNCTFNVTTELRDKSKKESALFYRLDIETLDPKGNSSNN





NSSYKEYILINCNTSTIAQSCPKITFDPIPIHYCAPAGYAILK





CNNKAFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAE





GGIIIRSANLTDNAKTIIVHLNESVGITCVRPGNNTRKSVRI





GPGQTFYATGQIIGDIRQAHCNITGSEWNRTIDRVGKKLQE





HFPNKTINFTQPSGGDLEITMHSFNCGGEFFYCNTSGLFNT





SGLFNDTEGWINPINKTSNITIQCRIKQFINMWQEVGRAMY





APPIAGNITCTSNITGLLLTRDGGNSSSNTEIFRPGGGDMRD





NWRSELYKYKVVEIKPLGIAPTTAKRRVVEREKRAALGA





MFLGFLGAAGSTMGAASVTLTVQARQLLSGIVQQQSNLL





RAIEAQQHLLQLTVWGIKQLQTRVLAIERYLKDQQLLGIW





GCSGKLICTTAVPWNSSWSNKTQNEIWNNMTWMQWDKE





VSNYTDQIYRLLEKSQNQQEKNERDLLSLDSWNSLWSWF





SITKWLWYIKIFIMIVGGLIGLRIIFAVLSVVNRVRQGYSPL





SFQTLIPSPRGPDRPGRIEEEGGEQDRNRSTRLVSGFLALV





WDDLRSLSLFSYHRLRDFILIATRAAELLGRSSLRGLQRG





WEALKYLGNLVQYWGQELKRSAISLFDTIAIIVAEGTDRII





ELAQRICRARQGETALL





C.SE.06.
C
254
MRVMGIQRNCQQWWIWGILGFWMLLICNGMGNLWVTV


018SE.M


YYGVPVWKEASPTLFCASDAKAYESEVHNVWATHACVP


F373138


TDPSPQELVLKNVTENFNMWKNDMVEQMHQDIISLWDQS





LKPCVKLTPLCVTLNCSNVDNSTTINGTIHPNEEGKNCSFY





ITTKLKLKIRQKFALFYDPDNLILIDNDNSMYLLNCNTSTIT





QTCPWVSFDFDPIPYCDRAAYGILIRNNNKFNVTGPCPSVN





AGQCTRGIGTMPSTPLLLNGNITKEDIVIISEYLDNNANVIII





QLNKTVPVNCTRPRHNTRTRVRMRPGQRLYVSADIIGNIR





LAHCHCIGSGRYDTLELVKLNTLEHCLYHKTIMFTHPSGG





ALPTTTHTLNCGGELFYCSTSSLLNFTWRSLFCHAQAPFQE





THNNSTNNIPFPCRIKQFINMWQTVGRAMYAPPIAGVITCN





SNITGLLLTRDGGNPTNSTEGNNETEIFRPGGGDMRDNWR





SELYKYKVVEIKPLGVVPTKAKRRVVEREKRAALGAMFL





GFLGAAGSTMGAASIALTVQARQLLSGIVQQQNNLLRAIE





AQQHMLQLTVWGIKQLQTRVLALERYLRDQQLLGIWGCS





GKLICTTAVPWNSSWSNKSQEEIWDNMTWMQWEREIDN





YTDIIYNLLEVSQIQQETNEKDLLALDKWHNLWSWFDISN





WLWYIKIFIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQTL





IPSPRGPDRPGGIEEEGGENDRGRSIRLVSGFLGLAWDDLR





SLCLFCYHRLRDLLLIAARVVELLGQRGWEILKYLGSLVQ





YWGLELKKSAISLLDTVAIVVAEGTDRIIELIQRAWRTICNI





PVRIRQGFEAALL





C.SE.07.
C
255
MTVMGIQMNCPQWWRWGILSLWMFMIWNVWGKGNLR


SE60011


VTVYYGVPVWVEAKTTLFASDANAHSEEVHNVRATHAC


9.KP411


VCAHPNPQEINVTENFNMCKNDMEKQMHEDIVSLCHQSL


831


KPCVRVTPLCVTCHLTSSVITDTDTISSSLIINDDDSNMNCC





SFITTAVNRDKKKDDAALYYRLVVVPDDEEKKNNCSECR





LINCNSSAVPQACTKVSDEPILIHYCAPAGYAIIKCNDTTFN





ATAPCNNVTALQCPRGITPVVSHQLLLNGSAAERGDSIRSE





NITNNATTIIVHLKNPVEISRNYCTRPHINTRKSMRIGPGQIF





YATGDIIGNIRQAHCNISAKEWNETLLMVSEKLLKHFPNK





TTINFPSSSGEDQTITKRSFCRGEQFFYCNTSILSNTTYSYNS





TSNTTCSTNNTTTSTFTLQCRRIHIIIMCLEPGLAIDASPTAG





EIISKSNTTDLLLTRAGGSNRTEIFRPGGGDMRDNWRSELY





KYKVVEIKPLGIAPTKAKRRVVEREKRAVGIGAVFLGFLG





VAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQH





MLQLTVWGIKQLQTRVLAIERYLKDQQLLGIWGCSGKLIC





TTNVPWNKSWSNKTQKDIWDNMTWMEWDREISNYTDII





YSLLVDSQTQQEKNEKELLALDSWKNLWNWFNITNWLW





YIKIFIMIVGGLIGLRIIFAVLPIINRVRQGYSPLSFQTIIPNPR





GPDRLGRIEEEGGEQDRNRSIRLVNGFLALAWDDLRSLCL





FSYHRLRDFILIVVRAVELLGHSSLRGIQRGWEALKYLGSL





VQYWGLEIKKSAISLFDTLAIVVAEGTDRIIELVQRIFRAIC





NIPRRIRQGSEASLL





C.CY.07.
C
256
MRVRGILRNWQQWGIWGILGFWMLMICNVVGNKDLWV


CY205.J


TVYYGVPVWKEAKTTLFCASNAKAYEREVHNVWATHAC


F683757


VPTDPNPQKIDLENVTENFNMWKNDMVDQMHADVISLW





KESLKPCVKLTPLCVTLNCTEVTVIKNTGVNMTNNSTSTSI





DTSILNDTSNADLKNCSFNTTTKISDNEIFEFAVNNTLDVEP





IDGTYTSITFRLTNSNSSSIPQACPKVSFDPLPIPYCAPAGYT





GLRCSNNTFTGTGPCRNVSTVQCTHGIKPVVSTQLLLNGS





LAEGEIMIRSENLTNNAKTIIVHLNESVNIVCTRPNNNTRK





GVGIGPGQAIYATGAIIGDIRQAHCNITKRGWNGLPQKVK





GKLREHLPGKLIIFQPHSGGDLQITMHTFNWGGEFFYCNTS





VLSNSTHNSGTGWHLQSSDINSYATIAVSYRIGQIINMWRQ





VGRAIYGPPVARKMSAHPSTTGLLLVCDGGEKSLNSTNGT





GITNATEIFRPQGGDMKDNWRSELYKYKVVEIKPLGIAPT





EARRRVVEREKRAVGIGAVILGFLGAAGSTMGAASIALTA





QARTVMAGIVQQQSNLLRAIDAQQHMLQLTVWGIKQLQ





ARVLALERYLKDQQLLGIWGCSGKLICTTTVPWNTSWSN





RSKEEIWNNMTWMQWEREIDNYTEIIYQLLGESQIQQEQN





EKDLLALDSWKNLWSWFSISNWLWYIRIFIMIVGGLIGLRI





IFAVLSIIHRVRQGYSPLSLQTLIPNSREPLDRLGRIEEGGGE





QDRSRSIRLVNGFLALAWDDLRSLCLFCYHLLRDFILIVAR





VGELLGHSSLRGLQKGWEALKYLGSLVQYWGLELKKSAI





SLLDTVAIAVAGRTDRIIEVIQEICRAIYHIPRRIRQGFEAAL





L





C.SE.06.
C
257
MKGRRNTENLSPGWIWGILGFWMLMMHKGEGNLWFTV


SE60051


YYGLPVWTEPKPTLFGASNAKGYKTEVHMVWATHACVP


6.KP411


TNPNQQDLVLENVTENFSMWKNDMLDQMHEHVVIFSDQ


839


GLKPYAKVTSLTVTVTCTDLKNSTFTSTSSNDTSMVTESEE





MKNCSFNITTEIRDKKKKEYALFYRLDIVPLKEKEDNNSD





YRLINCNTSAVTQACPKVSFDPIPIHYCAPAGYAILKCNNK





TFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEKEIVI





RSENLTNNAKIIIVHLNESVEIVCTRPNNNTRKSMRIGPGQT





FYATGDIIGNIRQAYCNISEKRWNETLQKVSKKLREHFPNK





TIKFKPSSGGDLEITTHSFNCGGEFFYCNTSKLFNSTYPPNS





TDTYMNSANTSNIILQCRIKQIINMWQGVGQAMYAPPIAG





NITCESNITGLLLTRDGGNNQTEEETFTPGGGDMRDNWRS





ELYKYKVVEIKPLGIAPTKAKRRVVEREKRAVGIGAVFLG





VLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIEA





QQHMLQLTVWGIKQLQTRVLAVEGYLKDQQLLRIWAAS





GKLICTTNVPWTPSWSIKSQEEIWNNMTWMQCDREITYTE





NLIQCLAPQPRKWKDTTCNGDLDLDKWQNLWKWFNIPK





WLWYIELFIMIVGGLIGLRIIFAVLSIVNRVRQGYSPLSFQT





LLPSPRGPDRLGGTEEESGEQDRDRSVRLVNGFLPLVWND





LRSLCLFCYHQLRTFILVTARAVELLGRSILRGLQTGWEAL





KYLGGIVQYWGLELKKSAISLFDTLAIVVAEGTDRIIEVVR





GFFRAIYNIPRRIRQGLEAALQ





C.ET.08.
C
258
MRVMGIHEELSTMGWIWGILGFWMLLICNVMGNLWVTV


ET124.K


YYGVPVWTEANPTLFCASDAKSYEPEVHNVWATHACVPT


U319534


DPNPQELVLENVTENFNMWKNGMVEQMHQDIISLWDQS





LKPCVKLTPLWVTLNCSDVNVTAIIDADILTMYEPVINRSF





STTPKLKKKTETDLFHYVEVEALDMTATDDGYINCNTSTL





TITCPCVSFAPVPIPYCAPAGYAIPIRNGNNFTGPGPCPSVN





AFQGTRGLEPVASTPLLLIGNVAKNEIVIISETLTHNVKMV





MVQLNETVTINCTRPNNNTRKSIRIGPGQAFPATATIEGDIR





QAHCNINKTEWNNTLQKVREKLRKHFNKTIKFDSSSGGDP





EITTHSFNCRGEFFYWDTTKLFNETYNNTDANGTIIMPCRI





KQIINMWQKVGRAMYAPPIEGPITCISNITGLLLTHDGEEN





DTETFRPWGGDMRDNWRSELYKYKVVEIKPLGIAPTKAK





RRVVQREKRAALGAMFLGFLGAAGSTMGAASITLTVQAR





QLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQTRVLA





IERYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKSQEEI





WDNMTWMQWDREINNYTGIIYNLLEVSQNQQEKNEEDL





LALDKWQNLWNWFDITQWLWYIKLFIMIVGGLIGLRIIFA





VLSIVNRVRQGYSPLSFQTLIPNPRGPDRPGGIEEEGGEQD





RDRSIRFVNGFLALAWDDLRSLCLFIYRQLRDLILIAARAV





EILGQRGWETLKYLGSLVQYWGLELKKRAISMLDTTAIVV





AEGTDRIIEVVQRTWRIICNILTRIRQGFEAALL





C.SE.08.
C
259
MRVMGIQRNYQRWWIWGILGFWMFMICNVRGNLWVTV


SE60021


YYGVPVWKEADPTLFCASDAKAYEREVHNVWATHACVP


3.KP411


TDPSPQELVLENVTENFNMWKNGMVDQMHQDIISLWDQS


834


LKPCVKLTPLCVTLSCNNTIVVNATIRMIQKDCSFNTTTEL





RDKRQTAYALFYKLDIVALSEKSSEYRLINCNTSAITQACP





KVSFDPIPIHYCAPAGYAILKCRDANFNGTGPCKNVSTVQ





CTHGIKPVVSTQLLLNGSLAENETIIRSENLTNNAKIIIVQL





KTPVEINCTRPANNTRKSIRIGPGQTFYATEEIIGDIRQAHC





NISGKTWNATLKDVGDKLKRHFPNGTIKFLPSSGGDIQITT





HSFNCGGEFFYCCTSYLLNNTLDNNTINDTTNNGGTNDSII





LQCRIKQIINVGQEVGRAMYAPPIKGNITCRSNITGILLAED





GGPINETETFRRGGGDMRDNWRSELYKYKVVEIKPLGIAP





TKAKRRVVGREKRAALGAMFLGFLGAAGSTMGAASITLT





VQARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQ





TRVLALERYLRDQQLLGIWGCSGKLICTTAVPWNSSWSN





RSQEEIWENMTWMQWDREINNYTNIIYELLEVSQNQQEK





NEKDLLELDKWQNLWNWFDITHWLWYIKIFIMIVGGLIGL





RIIFAVLSIVNRVRQGYSPLSFQTLIPNRRGPTSPGGLEEEGE





DQDRGRSFRLVRGFLALPWDVLRTLCLFSSHRWRDLFWI





AARAVELLDQRGWKTFNFLGTLVQYGGRKLKKSVISLFN





TPPILVAEETDRFMKLFQRFGRAFSNIPGRLSQGLEAAFQ





C.ZA.99.
C
260
MRVRGILRNYPQWWIWGILGFWMIYNVMGNLWVTVYG


LT25.AY


VPVWKEAKTTLFCASDAKAYDREVHNIWATHACVPTDPN


522727


PQELVLKNVIENFNMWKNDMVDQMHEDIISLWDESLKPC





VKLTPLCVTLNCTKVTVNGTASNSTATNDTATNGTNTVN





STDATKTKIGNSTDDMKKCFFNVTTEIRDKEKKEHALFYK





LDIVPLGNSSRNNYSEYRLINCNTSAITQACPKVNFDPIPIH





YCAPAGYAILKCNDETFNGTGPCNNVSTVQCTHGIKPVVS





TQLLLNGSLAEKETIIRSENLTENTKIIIVQLNRSVEINCTRP





GNNTRTSIRIGPGQSFYATGEIIGDIRQAHCNISAKKWNETL





QEVGKKLQEHFENKTIIFANSSGGDLEITTHSFNCRGEFFY





CNTTSLFNNIYSNKTGSPNEKEIITLPCRIKQFINMWQKVG





RAMYAPPIAGNITCKSNITGLLLLRDGGNFSRDNETFRPGG





GDMRDNWRSELYKYKVVEIKPLGVAPTKAKRRVVEREK





RAALGAVLIGFLGAAGSTMGAASMALTVQARQLLSGIVQ





QQSNLLKAIEAQQHLLQLTVWGIKQLQARVLAIERYLKDQ





QLLGIWGCSGKLICTTAVPWNSSWSNNRTHDYIWENMTW





MQWDREVNDYTDTIYRLLEKSQNQQEENEKDLLALDSW





NSLWNWFSITKWLWYIKIFIMIVGGLIGLRIIFTVLSLVNRV





RQGYSPLSFQTLTPSPGGPDRLERIEEGGGEQDRSRSIRLVS





GFLSLAWEDLRSLCLFSYHRLRDFILVTARAVELLGRSSLK





GLQRGWEALKYLGSIVQYWGLELKKSAISLLDTIAITVAE





GTDRIIEFIQSICRAIRNIPRRIRQGFETALL





C.ET.08.
C
261
MRVMGISRNCQQGWILGILGFWMLLMCNGTGNLWVTVY


ET165.K


YGVPVWKEANPTPPLFCASDAKAYETEVHNVWATHACV


U319549


PTDPDPQELELKNVTEDFNMWKNGMVDQMHQDVISLWE





QSLKPCVKMTPLCVTLNCTYVTSKNTTDNSTSTMLLSVNA





TDSDMKNCSFNVTTELKDKQKKVYALFYLLDIRQLNDSS





NTHTGNSSNTGNYSDYILINCNTSTIAQACPKVSFDPIPIPY





CAPAGYAILKCNDKNFNGTGTCTNVSTVQCTRGIKPVVST





PLLLNGSISEGDIVIISKNLTDNAKTIIVQLNQTVAVNCTRP





TNNTRTGIRIGPGRAFYATGDIIRDIRQAHCNISGGDCNRTL





EEVEKKLNESFSGNETIQFEPTSGGDPEITTRSFNCGGEFFY





CKTSKLFIKTWKSWNHRNDGTIITIPCRIKQIIHMWQGVGR





AMYAPPISGPITCRSNITGLLLTRDGGISDNNSTNTTEIFRR





QGGDMKDNWRSELYKYKVVEIKPLGIAPTKAKRRMVQR





ERGEKRHGAMFLGFLGAAGSTMGAASITLTVQARQLLSGI





VQQQSNLLRAIEAQQHMLQLTVWGIKQLQTRVLALERYL





RDQQLLGIWGCSGKIICTTAVPWNSSWSNKTHDQIWKNM





TWMQWEREIDNYTGIIYNLLEVSQIQQEQNEQDLLALDK





WQNLWSWFDITNWLWYIKIFIMIVGGLIGIRIIFAVLSIVNR





VRQGYSPLSFQTLIPNPRGPDRPGGIEEEGGEQDRDRSVRL





VNGFLALAWDDLRSLCLFCYHRLRDLILIAARVLELLGQR





GWEALKYLKNIVQYWGLELKKSAVSLLDTVAIAVAEGTD





RIIEVIQRAWRAFCNIPTRIRQGFEIALL





C.ZA.02.
C
262
MKVKGTRRNCQQWWIWGILGFWMVYNVVGNLWVTVY


02ZAPS0


YGVPVWREAKATLFCASDAKAYEKEVHNVWATHACVPT


15MB1.


DPNPLERVLENVTENFNMWKNDMVDQMHEDIISLWDESL


DQ36999


KPCVKLIPLCVTLNCTTAKNKSEEMNCSFKVTTELRDKKR


5


KEYALFYRLDVVPLNETNESSNDTNGDFREYRLINCNTST





LTQACPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNN





VSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIKSENITDNAKTI





IVQLNKAIEIECIRPSNNTGRSVRIGPGQTFYTTGEIIGDIRQ





AHCNISGNWDRTLYNISKKLGEHFPNKNISFKPSSGGDPEI





TTHSFNCGGEFFYCDTSQLFNKTYNVTDDSNTTDTNNTIQ





CRIKQIINMWQEVGRAMYAPPIAGSITCTSNITGLLLTRDG





GGGTNSTSNTEEIFRPGGGNMKDNWRSELYKYKVVEVQP





LGIAPTKAKRRVVEREKRAIGIGAVFLGFLGAAGSTMGAA





SITLTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGI





KQLQARVLAIERYLKDQQLLGIWGCSGRQICTTNVPWNTS





WSNKSLNDIWNNMTWMQWDKEISNYTNIIYQLLELSQNQ





QEQNEKDLLALDKWQNLWSWFDITHWLWYIKIFIMIIGGL





IGLRIIFAVLSIVNRVRQGYSPLSFQTLIPSQREPDRLGRIEE





EGGEQDKDRSIRLVHGFLAIAWDDLRSLCLFSYHRLRGFIL





VTTRVVGLLGQRGWEALKYLGSLVQYWGLELKKSAISLL





DTIAIVVAEGTDRIIELILGICRAIRNLPRRIRQGFEAALL





C.ZA.09.
C
263
MRVRGTQMNCQGWWRWGIMIIGLIIICRAKEDLWVTVYY


707PKE0


GVPVWKDAETTLFCASDAKAYDTEKHNVWATHACVPTD


2N3.HM


PNPQEIDLDNVTENFNMWKNGMVDQMHKDIINLWDQSL


623589


KPCVKLTPLCVTLNCTNININNSTDNGTSITNEIGDEIKNCS





YNITTEFRDKRRRVSSLFYRHDIMPINEKNDSRTYILINCNT





TTITQACPKVSFEPIPIHYCAPAGYAILKCKDKDFNGTGPC





KNVSTVQCTHGIKPVVSTQLLLNGSLAEGGIKIRSENITNN





AKTIIVQLDTPVKINCSRPNNNTRDRIGIGPGQAFFASNRIIG





NIRQAHCNISASIWNMILQKVAEQLAKQFPNKNISFTNHSG





GDLEITSHTFNCGGEFFYCNTSSLFNRTFPANYTYNRTADS





TQGNTTITIPCRIKQIINMWQRVGQAMYAPPIEGVIRCESNI





TGILLTRDGGNETNVEVFRPEGGNMRDNWRSELYNYKVV





RIEPLGVAPTRARRRVVEREKRAVGLGAVFLGFLGAAGST





MGAASITLTVQARQLLAGIVQQQSNLLQAIEAQQHLLKLT





VWGIKQLQARVLALERYLRDQQLLGIWGCSGKTICTTTVP





WNSSWSNKSQEEIWGNMTWLQWDKEISNYTDLIYVLLQE





SQNQQDRNEQELLALDKWANLWNWFDISNWLWYIRIFIM





VVGGLVGLRIVFAVISIVNRVRQGYSPLSLQTLTPNQEGLD





RLGRIEEGGGEQGRGRSIRLVSGFFSLAWDDLRSLCLFSYH





RLRDLVLIAARIVELLGSSLLKGLRLGLEGLNYLWNLLLY





WGGELKTSAISLFDTTAIAVANWTDRAIELGQRICRAIRNI





PRRIRQGFETALL





C.ET.08.
C
264
MRVRGISRNCQQGWIWGILGFWMLMICNVWGNLWVTVY


ET154.K


YGVPVWKEAKTTLFCASEAKAYETEVHNVWATHACVPT


U319545


DPNPQEMTLENVTENFNMWKNDMVDQMHEDIISLWDQS





LKPCVKLTPLCVTLNCTNATITGDGNNTDGTNGTARLLIY





HGDIANCSFNETTELKDKVSKEHDLFYRLDVKPLNEEKNN





STHNTSILINCSSSKVTHPCPKVSFDPIPIPYCAPAGYAILEC





RDKAFSGTGPWPYVYTLQCPHGNWPVVSTQLLMYGFLSE





NRRDNTNNITVHLSESVDIMCTRPSNNTRRRIIGIRPGRVS





MMRGDMIIYLRATYCNIRDTRAACNITLRDVVDELHLQFD





ETITIAPSSGGPPEDNTNSSTCGEDLCCCNTSQLIHSSICHNN





PTSEPSKNRAANPTFIIPCRVKQVLSMRLVLREAMYAHPIS





GDVLLTSDVGGLISTRDNANSANEIFRPGGGDMRDNWRS





ELYKYKVVEIKPLGIAPTKAKRRVVEREKRAVGIGAVFLG





FLGTAGSTMGAASITLTVQARQLLSGIVQQQSNLLKAIEA





QQHMLQLTVWGIKQLQARVLAIEKYLQDQQLLGIWGCSG





KLICTTTVPWNSSWSNKSLEEIWDNMTWMQWEREVGNY





TEIIYKLLEDSQSQQEKNEKDLLALDNWNNLWSWFNISN





WLWYIKIFIMIVGGLIGLRIIFAVLSMVRRVRQGYSPLSLQT





LIPNPRGPDRLGGIEEEGGEQDRDRSVRLVSGFFSLAWDDL





RSLILFLYHHLRDLLLIAARTVEILGQRGWEALKYLGSLVQ





YWGLELKKSAISLLDTIAISVAEGTDRIIEIVQRIWRAIYNIP





RRIRQGFEAALL





C.ET.08.
C
265
MRVKGTQQELSTLGWIWGILGFWMLLMNNVGGGKWGV


ET167.K


TYYEGVAGRKEATTTLCGAYDAKYDNDVQHVGWTTAG


U319550


GVPTHPPPQEMLLNVVEDFFMMRNDEVEHMMDDDIILW





QRLLPCCVTMTLCGAHSTCSNINSTATINDSDGGMTMCDF





IRGEIKNCSFNITTDFRYKLDKEPALFYKPDLVPLNNDNSS





YRLTNCNTSAGTQACPKASFDPIPAPYCAPAGYAILECNN





GTFNGTRACNNVSTLECTHGIKPLVSTHLLLNENLANDIVI





ISDNLTIIAVIIIVHLKNDGTIVCTRTSNSTRIRLRIGSGPVCY





ATGEIIQDIRHAHDHISRSDWNDALRNVREHFRKHFPKFNT





MRFTQSVGGNLQITTRRFNCVGEFFHCSTTGLFTDTYGTST





CGTNSTGNTSGIHTITLHCKIKQFINMWQGVGLAMYAPSL





AGNIICKSNFTGLVLTCDGGEGNNVTNLTETFRPGGGDMR





DNWRSELYKYKVVEIKPLGVAPTKAKRRVVEREKRAVGI





GAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQNNL





LRAIEAQQHMLQLTVWGVKQLQARVLALERYLRDQQLL





GIWGCSGKLICTTAVPWNTSWSNKSQEEIWNNMTWMQW





EKEIDNYTDIIYKLLEVSQTQQEKNEKDLLALDSWQNLWN





WFDISKWLWYIKIFIMIVGGLIGLRIIFAVLAIVNRVRQGYS





PLSFQTLTPSPEGPDRLRGIEEEGGEQDRTRSIRLVSGFLAL





AWDDLRSLCLFSYHRLRDFILIVVRAVELLGHRGWWETL





KHLGSLELYKGVEIKKNAISLLKPTATAGAEGTVQIIEIVLR





IIRSIFNLPRRVSQGFKAAQQ





C.ZA.06.
C
266
MRVMAIQRNCQHLLSWGTMILGILMICSAANNLWVTVYY


C.x.06.C


GVPVWRDAETTLFCASDAKAYDTEVHNVWATHACVPTD


F01


PDPQEIDLKNVTEEFNMWKNNMVEQMHTDIISLWDQSLK





PCVKLTHLCVTLNCSTANVNVTDYNITTGDKEEIKNCSFN





MTTELSDKKQKVHSLFYRLDVVPIKPDNSDDNRSNSSYSN





YRLINCNTSAITQACPKVTFDPIPIHYCAPAGFAILKCKDDG





FNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAGKGIRIR





SENITDNTKTIIVQLDKPVRINCTRPNNNTRKSMRIGPGQTF





FATGDIIGDIRKAHCNISISEWNETLYKVAEQLGGIIGNKTV





KFANSSGGDLEITTHSFNCGGEFFYCNTTDLFKGTWHPDN





KTWSDPWNASMKSNNTSNVNITILCKIKQIVRMWQRVEQ





AMYAPPIQGVISCSSNITGLLLTSDGGRDNSNNTGGQKNTE





TFRPGGGNMRDNWRSELYKYKVVKIEPLGVAPTPARRRV





VQREKRAVGLGAVFIGFLGAAGSTMGAASVTLTVQARQL





LTGIVRQQSNLLKAIEAQQHLLRLTVWGIKQLQARVLAVE





RYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKNQSEIW





DNMTWLQWDKEINNYTQIIYDLLEESQNQQEKNEQELLA





LDKWTNLWNWFNITNWLWYIKIFIMIVGGLIGLRIVFAVL





SIINRVRQGYSPLSFQIHNPNPGGPDRLGRIEEEGGEQDRDR





SIRLVSGFLALAWDDLRSLCLFSYHHLRDFILIAARTVEILL





RRAWEGLKYLWNLLLYWSRELKISAISLFDTIAIAVAGWT





DRVLEIGQAICRAILHIPRRIRQGLERLLL





C.ET.08.
C
267
MMVMGTLKNLSWGRIWGIGFFWMLMICNGEDTLWVTV


ET159.K


YYGAAVWKEATTTLCCAYDAIPYETELHVGWATPACVPT


U319547


DPHPQEHVFENVLEPFIMWNDDMEQMMHDSIIYLCHRRL





PPLVPLCVSQLCHEVDIYKNTTTANATSSMDVTVTEELKN





CSFSTTTDIKDKKQKAYALFYKLDLAPLLIGDDIVINMSCT





ASTVCRAGAFVPFPPIHCAPCAPAGFASRECKDNEVRGTR





HGVTVHRGHGTHAMVPSPPLLLVGNGKQEKEDISDKSKN





YANKVIILLLQLRTCEEIKCTKNTTKTRRSVRRGPFFASLEI





RGEIRQAPRTNSASNGSAGITLIGEELLQHLDHNTIRSVPSS





PCERETLTHTTPSCGWEVFYCVSIQICNGVYFTDTWTTYN





SNDADYTDILTLTCRIKLIVNLSQELGRARSAPPIDGSVKCT





SNFTGLVLTRAGGNATGDVTAENNTEIFRPGGGEMRDNW





RSELYKYKVVEIKPLGVAPTEAKRRVVEREKRAVGIGAVF





LGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIE





AQQHMLQLTVWGIKQLQTRVLAIERYLKDQQLLGIWGCS





GKLICTTTVPWNASWSNKSYNEIWDNMTWMQWDREINN





YTDTIYRLLEDSQNQQEKNERDLLALDSWNSLWNWFSISN





WLWYIRIFIMIVGGLIGLRIIFAVLAIVNRVRQGYSPLSFQT





LIPNPRGPDRPGGIEEEGGEQDRGRSIRLVHGFLALIWDDL





RSLCLFCYRRLRDLTLIVARAVELLGQRGWEILKYWGNLV





QYWVLQLKNSAISLLDTVAIPLAEGPDRILEVVQRFCRAIL





NIPRRIRQGFEAALQ









Notwithstanding the appended claims, the disclosure set forth herein is also defined by the following clauses:


Clause 1. A vaccine comprising a set of antigens that are representative of at least 60% of fifth order clades of a microbe.


Clause 2. A vaccine comprising a set of antigens of a microbe wherein the smallest pairwise edit distance between two of the antigens is at least 10% of the average size of the antigens and the largest pairwise edit distance between two of the antigens is no more than 98% of the average size of the antigens.


Clause 3. A vaccine comprising a set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Clause 4. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is up to 40% of the average size of antigens of the represented clade different than other members of the represented clade.


Clause 5. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if the edit distance is up to 5% of the size of another strain of the represented clade.


Clause 6. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is up to 20% of the average size of antigens of the represented clade different than at least 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade.


Clause 7. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is up to 10% of the average size of antigens of the represented clade different than at least 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade.


Clause 8. The vaccine of clause 1 or clause 3, wherein the antigen is representative of a clade if the edit distance is up to 5% of the size of 95%, 96%, 97%, 98%, or 99% of all strains of the represented clade.


Clause 9. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is up to 25 edit distance from all strains of the represented clade.


Clause 10. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is up to 100 edit distance from all strains of the represented clade.


Clause 11. The vaccine of clause 1 or clause 3, wherein an antigen is representative of a clade if its sequence is present in the clade.


Clause 12. A vaccine composition comprising a set of antigens, wherein the antigens are derived from a library of variants of a microbe wherein: a. two antigens of the set with the greatest edit distance have an edit distance S; b. two antigens of the library with the greatest edit distance have an edit distance L; and c. S is at least 60% of L.


Clause 13. A vaccine composition comprising at least 4 influenza virus hemagglutinin antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus hemagglutinin antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus hemagglutinin antigen polypeptides.


Clause 14. A vaccine composition comprising at least 4 influenza virus neuraminidase antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus neuraminidase antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus neuraminidase antigen polypeptides.


Clause 15. A vaccine composition comprising at least 4 HIV gp160 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp160 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp160 antigen polypeptides.


Clause 16. A vaccine composition comprising at least 4 HIV gp120 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp120 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp120 antigen polypeptides.


Clause 17. A vaccine composition comprising at least 4 HIV gp41 antigen polypeptides, each represented by a sequence that is at least 20% identical, but not more than 95% identical, to the other HIV gp41 antigen polypeptides, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 HIV gp41 antigen polypeptides.


Clause 18. A vaccine composition comprising a set of antigens that activate an immune response in a subject to at least 6 strains identified in Table 1.


Clause 19. The vaccine composition of clause 13 or clause 18, wherein the immune response is detectable using head-specific antibodies in a hemagglutinin inhibition assay.


Clause 20. The vaccine composition of clause 19, wherein the immune response is at least 2-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay.


Clause 21. The vaccine composition of clause 14, wherein the immune response is at least 10-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay.


Clause 22. The vaccine composition of clause 19, wherein the immune response is at least 100-fold greater using the antigens than H1N1+H3N2+HAB, when tested using the hemagglutinin inhibition assay.


Clause 23. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the microbe is a bacterium.


Clause 24. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the microbe is a virus.


Clause 25. The vaccine composition of clause 24, wherein the virus is influenza.


Clause 26. The vaccine composition of clause 25, wherein the influenza is type A.


Clause 27. The vaccine composition of clause 25, wherein the influenza is type B.


Clause 28. The vaccine of clause 26, wherein the type A influenza is H1N1, H1N2, H3N1, H3N2, or H2N3.


Clause 29. The vaccine of clause 28, wherein the type A influenza is H1N1.


Clause 30. The vaccine of clause 28, wherein the type A influenza is H3N2.


Clause 31. The vaccine composition of clause 24, wherein the virus is human immunodeficiency virus (HIV).


Clause 32. The vaccine composition of clause 32, wherein the HIV is HIV-1.


Clause 33. The vaccine composition of clause 32, wherein the HIV-1 is of subclade A, subclade B, or subclade C.


Clause 34. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the antigen is a broadly neutralizing antigen of surface exposed residues adjacent in tertiary space.


Clause 35. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a stem of hemagglutinin.


Clause 36. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a head of hemagglutinin.


Clause 37. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a neuraminidase.


Clause 38. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a gp160.


Clause 39. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a gp120.


Clause 40. The vaccine composition of clause 34, wherein the broadly neutralizing antigen is in a gp41.


Clause 41. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the antigen is a broadly conserved fragment of hemagglutinin.


Clause 42. The vaccine of clause 41, wherein the antigen is a broadly conserved fragment of a head of hemagglutinin.


Clause 43. The vaccine of clause 41, wherein the antigen is a broadly conserved fragment of a stem of hemagglutinin.


Clause 44. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the antigen is a broadly conserved fragment of neuraminidase.


Clause 45. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the antigen is a broadly conserved fragment of gp160.


Clause 46. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen selected from SEQ ID NOs:1-87.


Clause 47. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 60% identical to any one of SEQ ID NOs:1-87.


Clause 48. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 70% identical to any one of SEQ ID NOs:1-87.


Clause 49. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-87.


Clause 50. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 90% identical to any one of SEQ ID NOs:1-87.


Clause 51. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen selected from SEQ ID NOs:88-127.


Clause 52. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 60% identical to any one of SEQ ID NOs:88-127.


Clause 53. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 70% identical to any one of SEQ ID NOs:88-127.


Clause 54. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 80% identical to any one of SEQ ID NOs:88-127.


Clause 55. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 90% identical to any one of SEQ ID NOs:88-127.


Clause 56. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen selected from SEQ ID NOs:128-171.


Clause 57. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 60% identical to any one of SEQ ID NOs:128-171.


Clause 58. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 70% identical to any one of SEQ ID NOs:128-171.


Clause 59. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 80% identical to any one of SEQ ID NOs:128-171.


Clause 60. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 90% identical to any one of SEQ ID NOs:128-171.


Clause 61. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen selected from SEQ ID NOs:172-267.


Clause 62. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 60% identical to any one of SEQ ID NOs:172-267.


Clause 63. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 70% identical to any one of SEQ ID NOs:172-267.


Clause 64. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 80% identical to any one of SEQ ID NOs:172-267.


Clause 65. The vaccine of clause 1, clause 2, clause 3, or clause 12, comprising at least one antigen that is at least 90% identical to any one of SEQ ID NOs:172-267.


Clause 66. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the set of antigens comprise at least 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900 or 1000 different antigens.


Clause 67. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein the set of antigens comprises at least 30 antigens.


Clause 68. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein the set of antigens comprises at least 50 antigens.


Clause 69. The vaccine of clause 1, clause 2, clause 3, or clause 12, wherein the antigens have an average edit distance from each of the other antigens that is at least 5% of the average size of antigens in the clade.


Clause 70. The vaccine of clause 1, clause 3, or clause 12, wherein a smallest pairwise edit distance between two or more of the antigens in the set is no more than 1, and the largest pairwise edit distance is at least 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100.


Clause 71. The vaccine of clause 1, clause 3, or clause 4, wherein a smallest pairwise edit distance between two or more antigens in the set is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens.


Clause 72. The vaccine of clause 1, wherein the clades are clades of a phylogenetic tree is a neighbor joining clustering tree or maximum parsimony tree.


Clause 73. The vaccine of clause 1, wherein a first order clade is a clade that is not subsumed in whole or in part within another higher-level clade.


Clause 74. The vaccine of clause 1, wherein each X-order clade is phylogenetically below X−1 number branch nodes in a phylogenetic tree of the microbe.


Clause 75. The vaccine of clause 1, further comprising antigens that are representative of at least 60% of each of third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth, or fifteenth order clades of the microbe.


Clause 76. The vaccine of clause 75, wherein a represented clade is a clade Y that is subsumed by a clade Y−1.


Clause 77. The vaccine of clause 1, wherein a representative clade is a node-based clade.


Clause 78. The vaccine of clause 1, wherein a representative clade is a stem-based clade.


Clause 79. The vaccine of clause 1, wherein a representative clade is apomorphy-based clade.


Clause 80. The vaccine of clause 1, wherein an average number of branches between each of the antigens in the set is at least 1, 3, 5, or 10.


Clause 81. The vaccine of clause 1, wherein a smallest number of branches between any two antigens in the set is no more than 1 and a largest number of branches between any two antigens in the set is at least 5, 10, or 15.


Clause 82. The vaccine of clause 3, wherein each OTU comprises sequences that are at least 95% homologous of one another.


Clause 83. The vaccine of clause 3, wherein each OTU comprises of at least 3 different sequences.


Clause 84. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein each of the antigens in the set shares at least 90%, 95, or 99% sequence identity to at least one other antigen in the set.


Clause 85. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein each of the antigens in the set shares at least 90%, 95, or 99% % sequence identity to at least one other antigen in the set over a length of at least 100 amino acids.


Clause 86. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein each of the antigens in the set differs from each of the other antigens in the set by at least 5% sequence identity.


Clause 87. The vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein each of the antigens in the set differs from each of the other antigens in the set by no more than 75% sequence identity.


Clause 88. The microbial vaccine composition of clause 1, clause 2, clause 3, or clause 12, wherein each antigen is a peptide comprising at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 85, 100, 110, or 120 amino acids.


Clause 89. The vaccine composition of clause 1, clause 2, or clause 3, wherein the antigens are selected from Table 2, Table 3, Table 4, Table 5, or a fragment or homologue thereof.


Clause 90. The vaccine composition of clause 89, comprising 2 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof.


Clause 91. The vaccine composition of clause 90, comprising 3 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof.


Clause 92. The vaccine composition of clause 91, comprising 5 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof.


Clause 93. The vaccine composition of clause 92, comprising 10 or more antigens from Table 2, Table 3, Table 4, Table 5, or fragments or homologues thereof.


Clause 94. The vaccine composition of clause 89, wherein the fragment comprises at most 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids.


Clause 95. The vaccine composition of clause 89, wherein the fragment comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids.


Clause 96. The vaccine composition of clause 89, wherein the homologue comprises a sequence having at least 90% sequence identity to the antigen of Table 1.


Clause 97. The vaccine composition of clause 12, wherein the library comprises at least 1×104, 1×105, 1×106 different variants of the microbe.


Clause 98. The vaccine composition of clause 12, wherein the library comprises at least 90% of all known sequences of the microbe.


Clause 99. A pharmaceutical composition comprising the vaccine composition of any of the preceding clauses and a pharmaceutically acceptable diluent, adjuvant, excipient, or any combinations thereof.


Clause 100. The vaccine composition of clause 99, wherein the vaccine is in a form of an aerosol formulation.


Clause 101. The vaccine composition of clause 99, wherein the vaccine is in a form of an injectable formulation.


Clause 102. The vaccine composition of any of the preceding clauses, wherein each of the antigens is at a concentration that alone does not provide a significant prophylactic immune response to the broadly neutralizing antigen in a subject; and collectively, the antigens have a combined concentration that provides an immune response to the broadly neutralizing antigen in the subject.


Clause 103. The vaccine composition of clause 102, wherein the subject is a bird.


Clause 104. The vaccine composition of clause 102, wherein the subject is a mammal.


Clause 105. The vaccine composition of clause 103, wherein the subject is a human.


Clause 106. The vaccine composition of clause 103, wherein the subject is a pig.


Clause 107. A virus like particle (VLP) comprising the vaccine composition of any of the preceding clauses.


Clause 108. A recombinant expression vector comprising a nucleic acid molecule encoding: (a) a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; (b) set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 25 and the largest pairwise edit distance between two of the antigens is at least 300; (c) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or (d) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Clause 109. A recombinant expression vector comprising a nucleic acid molecule encoding: (a) a set of antigens that are representative of at least 60% of each of first order clade and second order clades of a microbe; (b) set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigen; (c) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or (d) set of antigens that are representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Clause 110. A method for treating or reducing the likelihood of an infection in a subject, comprising administrating to the subject a vaccine composition of any of the preceding clauses.


Clause 111. A method for treating or reducing the likelihood of a flu infection in a subject, comprising administrating to the subject a vaccine composition that results in an immune activation effective against seasonal flu for at least 3, 4, 5, 6, 7, 8, 9, or 10 years.


Clause 112. The method of clause 110 or clause 111, wherein the subject is a human.


Clause 113. The method of clause 110 or clause 111, wherein the subject is a domesticated animal.


Clause 114. A method for making a vaccine composition, comprising: selecting a set of antigens that are (a) representative of at least 60% of each of first order clade and second order clades of a microbe; (b) set of antigens wherein the smallest pairwise edit distance between two of the antigens is no more than 5% of the size of the antigens, and the largest pairwise edit distance is at least 75% of the size of the antigens; (c) representative of at least 60% of all operational taxonomic units (OTUs) of the microbe; or (d) representative of at least 60% of all operational taxonomic units (OTUs) of the microbe.


Clause 115. The method of clause 114, further comprising: obtaining a plurality of antigen sequences from a library of strains of the microbe; and aligning the plurality of antigen sequences to create a phylogenetic tree of the antigen.


The term “about,” as used herein, generally refers to a range that is 2%, 5%, 10%, or 15% greater than or less than (±) a stated numerical value within the context of the particular usage. For example, “about 10” would include a range from 8.5 to 11.5. As used herein, the terms “about” and “approximately,” when used to modify a numeric value or numeric range, indicate that deviations of up to about 0.2%, about 0.5%, about 1%, about 2%, about 5%, about 7.5%, or about 10% (or any integer between about 1% and 10%) above or below the value or range remain within the intended meaning of the recited value or range.


As used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. Thus, for example, references to “a method” include one or more methods, and/or steps of the type described herein and/or which will become apparent to those persons skilled in the art upon reading this disclosure.


While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Claims
  • 1-12. (canceled)
  • 13. A vaccine composition comprising at least 4 influenza virus hemagglutinin antigen polypeptides, each polypeptide represented by an amino acid sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus hemagglutinin antigen polypeptides in the vaccine composition, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus hemagglutinin antigen polypeptides.
  • 14. A vaccine composition comprising at least 4 influenza virus neuraminidase antigen polypeptides, each polypeptide represented by an amino acid sequence that is at least 20% identical, but not more than 95% identical, to the other influenza virus neuraminidase antigen polypeptides in the vaccine composition, wherein each polypeptide comprises an antigen that is at least 90% identical among the 4 influenza virus neuraminidase antigen polypeptides.
  • 15-25. (canceled)
  • 26. The vaccine composition of claim 13, wherein the influenza is type A.
  • 27. The vaccine composition of claim 13, wherein the influenza is type B.
  • 28. The vaccine of claim 26, wherein the type A influenza is H1N1, H1N2, H3N1, H3N2, or H2N3.
  • 29. The vaccine of claim 28, wherein the type A influenza is H1N1 or H3N2.
  • 30-45. (canceled)
  • 46. The vaccine composition of claim 13, comprising at least one antigen selected from SEQ ID NOs:1-87.
  • 47. (canceled)
  • 48. (canceled)
  • 49. The vaccine composition of claim 13, comprising at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-87.
  • 50-98. (canceled)
  • 99. A pharmaceutical composition comprising (i) the vaccine composition of claim 13 and (ii) a pharmaceutically acceptable diluent, adjuvant, excipient, or any combinations thereof.
  • 100-106. (canceled)
  • 107. A virus like particle (VLP) comprising the vaccine composition of claim 13.
  • 108-115. (canceled)
  • 116. The vaccine composition of claim 14, wherein the antigen comprises at least one antigen selected from SEQ ID NOs:1-87.
  • 117. The vaccine composition of claim 14, wherein the antigen at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-87.
  • 118. A virus like particle (VLP) comprising the vaccine composition of claim 14.
  • 119. A pharmaceutical composition comprising (i) the vaccine composition of claim 14 and (ii) a pharmaceutically acceptable diluent, adjuvant, excipient, or any combinations thereof.
  • 120. The vaccine composition of claim 14, wherein the antigen is in a stem of hemagglutinin or in a head of hemagglutinin.
  • 121. The vaccine composition of claim 13, wherein the antigen is in a stem of hemagglutinin or in a head of hemagglutinin.
  • 122. The vaccine composition of claim 13, wherein the influenza is type A.
  • 123. The vaccine composition of claim 13, wherein the influenza is type B.
  • 124. The vaccine of claim 122, wherein the type A influenza is H1N1, H1N2, H3N1, H3N2, or H2N3.
  • 125. A recombinant expression vector comprising a nucleic acid molecule encoding at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-267.
  • 126. A virus like particle (VLP) comprising a vaccine composition, wherein the vaccine composition comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs:1-267.
  • 127. A method for treating or reducing the likelihood of an infection in a subject, comprising administrating to the subject a vaccine composition, wherein the vaccine composition comprises at least one antigen that is at least 80% identical to any one of SEQ ID NOs: 1-267.
CROSS-REFERENCE

This application is a continuation application of International Patent Application No. PCT/US2020/019367, filed Feb. 21, 2021, which claims the benefit of U.S. Provisional Application No. 62/808,760, filed Feb. 21, 2019, and U.S. Provisional Application No. 62/817,902, filed Mar. 13, 2019, each of which is incorporated herein by reference in its entirety.

Provisional Applications (2)
Number Date Country
62808760 Feb 2019 US
62817902 Mar 2019 US
Continuations (1)
Number Date Country
Parent PCT/US2020/019367 Feb 2020 US
Child 17399041 US