USE OF CONSENSUS SEQUENCE AS VACCINE ANTIGEN TO ENHANCE RECOGNITION OF VIRULENT VIRAL VARIANTS

Abstract
The invention provides consensus sequences for hepatitis C virus 1a and 1b. Also provided are non-synonymous changes for each residue of the consensus sequences. These sequences are useful as compositions or vaccines for prophylactic use or treating HCV-infected individuals. Also provided are methods for lessening the chances for a HCV-infected individual to enter a chronic phase of infection and methods of diagnosing an individual with HCV 1a or HCV 1b infection.
Description
FIELD OF THE INVENTION

The invention relates to consensus sequences for hepatitis C virus and uses thereof. The invention also relates to immunogenic epitopes of hepatitis C virus and uses thereof. The invention also relates to methods of prophylaxis, treating, and diagnosing individuals infected with or exposed to hepatitis C virus.


BACKGROUND OF THE INVENTION

Currently, it is estimated there are about 270 to 300 million people worldwide who are infected with hepatitis C virus (HCV), 2.7 to 4 million of those are in the United States. In industrialized countries, HCV accounts for 20% of cases of acute hepatitis, 70% of cases of chronic hepatitis, 40% of cases of end-stage cirrhosis, 60% of cases of hepatocellular carcinoma and 30-40% of liver transplants. The incidence of new symptomatic infections of HCV has been estimated to be 13 cases/100,000 persons annually. For every one person that is infected with the AIDS virus, there are more than four infected with HCV. Currently, 8,000 to 10,000 deaths each year are a result of Hepatitis C (<http://www.hepatitis-central.com/>, updated Jan. 31, 2006). The CDC (Center For Disease Control) estimates that there are up to 30,000 new HCV infections in the U.S. every year. (<http://www.cdc.gov/ncidod/diseases/hepatitis/c/fact.htm>).


About 80% of HCV-infected individuals have no signs or symptoms. In others, the symptom include: jaundice, fatigue, dark urine, abdominal pain, loss of appetite, and nausea. For 55-85% of infected individuals, over the long term, HCV infection persists and become a chronic infection. Of the chronically infected individuals, about 30% develop liver disease.


Hepatitis C virus has six different genotypes. The most prevalent types circulating in the Western countries are sub-genotypes 1a and 1b. In a person with chronic infection, hepatitis C virus reproduces up to 1012 virion each day. Neumann, et al. Science 282:103-107 (1998). This rate of reproduction exceeds the rate of reproduction for human immunodeficiency virus (HIV) by an order of magnitude. The reproduction rate of HCV coupled with the lack of proofreading function by the HCV RNA polymerase results many mutations in HCV sequences.


Treatments for hepatitis C include interferon and ribavirin, both of which are licensed for the treatment of persons with chronic hepatitis C. Interferon can be taken alone or in combination with ribavirin. Combination therapy, using pegylated interferon and ribavirin, is currently the treatment of choice. Combination therapy can get rid of the virus in up to 5 out of 10 persons for genotype 1 and in up to 8 out of 10 persons for genotype 2 and 3. (CDC website: http://www.cdc.gov/ncidod/diseases/hepatitis/c/fact.htm)


There are no known vaccines for the prevention of hepatitis C virus infection. Until the present inventors made their discovery herein, the reason as to why individuals with chronic stage HCV infection had weak and ineffective immune responses was poorly understood. The inventors and others have demonstrated that most, if not all of the HCV sequences obtained from persons with HCV infection contain epitope escape variants. As such, the inventors have made a novel and useful discovery that describes, inter alia, the reasons for the observed genetic changes taking place during infection of a human host. These include the tendency of the virus to mutate towards a consensus sequence that is optimal for viral replication and fitness, and the competing tendency of the virus to mutate away from sequences that induce effective immunological responses in particular human hosts. This discovery has important implications for vaccine design.


Several computational alternatives to isolate-based vaccine design exist. One approach is reconstruction of the most recent common ancestor (MRCA) sequence. In this type of analysis, the ancestral state is an estimate of the actual sequence that existed in the past (i.e., it comes directly from the reconstructed history). See Science, 299:1515-1518 (2003). Another type of computational analysis is a center of the tree (COT) approach. The COT approach identifies a point on the unrooted phylogeny, where the average evolutionary distance from that point to each tip on the phylogeny is minimized. Advocates of this approach state that because the COT is a point on the phylogeny, the estimated COT sequence will have the same advantages as the estimated ancestral sequence. See, for example, U.S. Application 2005/0137387 A1. However, this COT approach is sufficiently complex that reducing it to practice for a large and heterologous data set is not practical with technology; specifically, the phylogenetic methods cannot address a sparse data set like the one for HCV, wherein most of the data for any individual sequence are missing. In addition, the premise of the COT approach is that when the phylogenetic tree is unbalanced (dominated by a particular lineage), the COT approach proposed therein provides a more representative sequence than the ancestral sequence. However, the HCV tree has been shown, by the inventors and others, to be balanced and star-like (see Ray S C et al, J Exp Med 2005 Jun. 6; 201(11):1753-9 and Salemi M and Vandamme A, J Mol Evol 2002; 54:62-70). Overall, the MRCA and COT approaches are impractical for application to the HCV sequence database, and their primary justification does not apply.


A third type of computational analysis is the consensus sequence approach. Because the consensus sequence is composed of the amino acid most commonly observed at each position, it likely represents the most fit state of the virus. Thus, effective evasion of the immune response by selection of a sequence divergent from consensus may result in a less fit virus from a replicative standpoint. The consensus sequence approach favors heavily sampled sublineages and deemphasizes outliers. The consideration of an unbalanced phylogenetic tree is not important for HCV, because the phylogeny is balanced (star-like). As such, the approaches disclosed herein are far more straightforward than the other types of computational analysis. Furthermore, these approaches can use the entire data set for HCV. One advantage of the consensus sequence is that it minimizes the genetic differences between vaccine strains and contemporary isolates, effectively reducing the extent of diversity by half, and thus it may have enhanced potential for eliciting cross-reactive responses.


A computational method is therefore needed to generate a sequence for use in vaccines that more broadly represents circulating strains, and also restores the immunogenic forms of HCV epitopes. Currently, there is a need for a method to effectively treat individuals who are infected with HCV or exposed to HCV. With the decline in an infected individual's immune system as he/she enters chronic phase, it would be highly desirable to lessen that individual's chances of progressing to the chronic phase by administering a form of treatment during the acute phase of infection. Most chronically infected people are ineligible for the currently available therapies. A vaccine could be used to enhance responses to currently available or future therapies. Further, there is a need for a prophylaxis of HCV infection. The invention disclosed herein meets all these needs and provides even more beneficial uses.


All references, patent, and patent applications cited in this patent application are herein incorporated by reference, each in its respective entirety for all purposes.


BRIEF SUMMARY OF THE INVENTION

The invention provides for an isolated nucleic acid encoding an HCV polyprotein or a fragment thereof wherein the HCV polyprotein comprises the consensus sequence 1a (SEQ ID NO: 1). In one embodiment, the invention is a nucleic acid wherein the encoded HCV polyprotein or fragment thereof comprises one or more of the non-synonymous changes shown in Table 5.


In one aspect, the invention is an isolated nucleic acid encoding an HCV polyprotein or a fragment thereof wherein the HCV polyprotein comprises the consensus sequence 1b (SEQ ID NO: 2). In one embodiment, the invention is a nucleic acid wherein the encoded HCV polyprotein or fragment thereof comprises one or more of the non-synonymous changes shown in Table 6.


In another aspect, the invention is an isolated HCV protein having the amino acid sequence comprising the consensus sequence 1a (SEQ ID NO:1) or a fragment thereof. In another aspect, the invention is an isolated HCV protein having the amino acid sequence comprising the consensus sequence 1b (SEQ ID NO:2) or a fragment thereof. In another aspect, the invention is an isolated HCV 1a core protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a E1 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a E2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a p7 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS3 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS4a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS4b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS5a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5. In another aspect, the invention is an isolated HCV 1a NS5b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.


In yet another aspect, the invention is an isolated HCV 1b core protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b E1 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b E2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b p7 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS3 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS4a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS4b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS5a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an HCV 1b NS5b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6. In another aspect, the invention is an consensus protein which comprises at least 5 contiguous amino acids of a hepatitis C 1a virus.


The invention also provides for an isolated expression construct comprising the following operably linked elements: transcription promoter and a nucleic acid encoding an HCV consensus protein or a fragment thereof. In another aspect, the invention is an isolated consensus protein or fragment thereof from Hepatitis C 1a virus which is immunogenic. In another aspect, the invention is an isolated consensus protein which comprises at least 5 contiguous amino acids of a hepatitis C 1b virus. In another aspect, the invention is an isolated expression construct comprising the following operably linked elements: transcription promoter and a nucleic acid encoding the consensus protein of claim 30 or a fragment thereof. In another aspect, the invention is an isolated consensus protein or fragment thereof from Hepatitis C 1b virus which is immunogenic. In another aspect, the invention is an isolated nucleic acid encoding an HCV epitope capable of eliciting an immunogenic response in an individual wherein the sequence of the epitope is selected from any of the epitopes listed in Table 7. In another aspect, the invention is an isolated HCV epitope capable of eliciting an immunogenic response in an individual wherein the sequence of the epitope is selected from any of the epitopes listed in Table 7.


The invention also provides for composition comprising at least one HCV protein or a fragment thereof encoded by a polynucleotide. In another aspect, the invention is a composition comprising at least one HCV protein or a fragment thereof encoded by a polynucleotide. In another aspect, the invention is a composition comprising at least one HCV protein or a fragment thereof. In another aspect, the invention is a composition comprising at least one nucleic acid sequence for HCV consensus sequence. In another aspect, the invention is a composition comprising at least one nucleic acid sequence for HCV consensus sequence. In another aspect, the invention is a composition comprising at least one nucleic acid sequence which codes for an HCV protein or a fragment thereof.


The invention also provides for a vaccine comprising all or a portion of consensus sequence 1a (SEQ ID NO:1). In one embodiment, the vaccine comprises a non-synonymous change at a modal consensus sequence. In another aspect, the invention is a vaccine comprising all or a portion of consensus sequence 1b (SEQ ID NO:2). In one embodiment, the vaccine comprises wherein there is a non-synonymous change at a modal consensus sequence.


The invention also provides for a method of identifying an immunogen for use as a vaccine comprising: a) Obtaining a sequence of HCV that is derived from a subject and is longer than 500 nucleotides; b) Obtaining the primary open reading frame of the sequence; c) Removing sequences that contain more than 1% ambiguous sites or more than 1 frameshift; d) Converting terminal “gap” characters to “missing”; e) Removing sequences that are redundant by identifying identical sequences and checking related publications and removing linked sequences; f) Generating predicted polyprotein sequences by using standard eukaryotic genetic code; and g) Identifying majority-rule consensus sequence for each subtype to identify modal amino acid residue at each site.


The invention also provides for a method of inducing or augmenting an immune response against hepatitis C virus comprising administering an effective amount of any one of the vaccines recited disclosed herein. The invention also provides for a method for protecting an individual from hepatitis C virus infection comprising administering an effective amount of any one of the vaccines disclosed herein. The invention also provides for a method of lessening the probability that a HCV-infected individual will enter a chronic phase of hepatitis C infection comprising administering an effective amount of any one of the vaccine disclosed herein.


The invention also provides for a method for diagnosing an individual infected with hepatitis C virus 1a comprising: a. obtaining a biological sample from the individual and b. using PCR primers to consensus sequences of HCV 1a to amplify nucleic acids in the biological sample to determine if the individual has been infected with hepatitis C virus 1a. The invention also provides for a method for diagnosing an individual infected with hepatitis C virus 1b comprising: a. obtaining a biological sample from the individual and b. using PCR primers to consensus sequences of HCV 1b to amplify nucleic acids in the biological sample to determine if the individual has been infected with hepatitis C virus 1b. The invention also provides for a kit comprising a HCV vaccine and instructions for the administration thereof.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows fluctuating HCV RNA level during acute infection. Spontaneous clearance (SC) occurred in one subject, and persistence developed in seven others. A solid triangle (▴) indicates detectable HCV RNA. An open triangle (Δ) indicates an HCV RNA level of less than 50 IU/ml. The number in the bottom right corner of each panel is the subject number. A gray inverted triangle (▾) indicates that IFN-γ ELISPOT analysis of T cell responses was performed at that time point. Subject 28 entered the study antibody negative and HCV RNA positive so that the time of infection is estimated using the average time from infection to seroconversion.



FIG. 2 shows amino acid substitutions and epitopes recognized during the first 6 months of HCV infection for five subjects. The map at the top of the figure indicates the region of the HCV polyprotein sequenced. For each study subject, horizontal lines represent the sequences obtained at initial viremia (t0), six months after viremia was first detected (t6), and 12 months after viremia was first detected (t12). Thin vertical lines represent amino acid substitutions. Shorter, thicker vertical lines indicate recognized epitopes, below which the t0 and t6 sequences of the epitope are shown.



FIG. 3 shows amino acid substitutions in epitopes and the effect on T cell responses. Peptide sequences observed to vary between to (circle, initial viremia) and t6 (triangle, 6 months after onset of viremia) were used as antigens in IFN-γ ELISPOT, using PBMC (filled symbols) or T cell lines generated from PBMC (open symbols) obtained at t6 as effectors. (A) Loss of recognition: For 4 of 10 peptide pairs, recognition of the t6 variant peptide was dramatically reduced. (B) Decreased recognition: For 4 of 10 peptide pairs, recognition of the t6 variant peptide was reduced more than 2-fold but less than 20-fold at least two concentrations of peptide. (C) Comparable recognition: For 2 of 10 pairs of peptides, recognition of the t6 variant was not reduced more than 2-fold relative to the to peptide at more than one concentration tested.



FIG. 4 shows similar recognition patterns using lines and PBMC. IFN-γ ELISPOT responses for both a T cell line (open symbols) and for PBMC (closed symbols) from which the line was generated using the peptide KLVALGINAV are shown using the same antigens. The peptide representing the to sequence is KLVALGINAV (circles) and the peptide representing the t6 sequence is KLVAMGINAV (triangles). As shown for this peptide pair, responses for PBMC and T cell lines using five distinct peptide t0/t6 pairs were consistently similar aside from the expected differences in the proportion of responding cells (note difference in axes).



FIG. 5 shows results that show recognition patterns persist with time. To rule out the subsequent development of T cell responses to the t6 HCV sequence, IFN-γ ELISPOT testing for recognition of the t6 peptides demonstrating escape was also performed for subjects 17 and 28 using PBMC obtained approximately 12, 18, 24, and, for subject 17, 36 months following initial infection. The number in the bottom right corner of each panel is the subject number. Patterns of recognition persisted over time and recognition of the t6 peptides (▴) declined in parallel with the decline in recognition of the to peptides () that occurred with prolonged infection.



FIG. 6 shows a phylogenetic analysis of HCV 18-22 years after common-source outbreak. (A) Phylogenetic tree of 5.2 kb sequence alignment placing outbreak clade (*) in context with reference sequences for all major subtypes. (B) Detailed analysis of the outbreak clade, using 10 cDNA clones from each study subject to obtain the sequence of a 698 nt region spanning the E1/E2 junction. The label “inoculum” indicates twenty clones from inoculum source plasma (10 each from 2 specimens), and a full-length clone (Genbank Accession No. AF313916) obtained in an independent study of this material using smaller amplicons. For both trees, numbers at nodes are bootstrap values, indicating the percentage of 1000 permuted trees that supported the presence of that node. Bootstrap support was 100% for each of the major clades in panel A; in panel B, only bootstrap values greater than 80% are shown, and values for nodes within a study subject's clade were omitted for clarity. Boxes highlight 2 subjects whose sequences were segregated into two separate clades.



FIG. 7 shows HCV divergence and convergence following a common-source outbreak. The upper panel shows a sliding-window analysis of nonsynonymous (lighter curve, with lower mean variability) and synonymous (darker curve, with higher mean variability) variation, calculated by comparing the mean pairwise distance between the inoculum (2 cDNA clones obtained from two inoculum source plasma specimens obtained one week apart in 1977) to 44 clones (2 per study subject) obtained from chronically-infected women 18-22 years after exposure, in a sliding window 20 codons wide, moving in 1-codon increments, generated using VarPlot. Horizontal bars indicate average distance for each gene region. The lower panel shows detail of variability in HVR1 (the region indicated by dashed lines in the upper panel), indicating position relative to the H77 polyprotein. The first three rows show the subtype 1b reference sequences, inoculum sequences (20 clones from 2 plasma specimens obtained one week apart), and 220 recipient sequences (10 per study subject) respectively, with the height of each single-letter amino acid code proportional to its frequency. The fourth row shows differences between the amino acid frequencies in the recipient sequences versus the inoculum sequences as a type 2 logo, in which the height of each amino acid is determined by the log2 relative risk of observing it, with the scale indicated. Empty spaces indicate a distribution highly similar to the inoculum distribution, because the logarithm of a relative risk of one is zero.



FIG. 8 shows escape versus reversion in the presence versus absence of the restricting HLA allele. (A) Amino acid alignment of a region in NS3 (positions 1388 to 1431 relative to strain H, GenBank entry AF009606) showing sites with polymorphism. Study subjects are listed in arbitrary order. Identity to the inoculum sequence (“inoc”) is indicated by “.”. (B) Sorting the subjects by the presence of HLA A*02, G1409S substitution in the 4th position of a frequently-recognized HLA A*02-restricted epitope at 1406-1415 is limited to subjects having the HLA A*02 allele. The subtype 1b consensus sequence for this epitope is shown above the alignment, and has been shown to be recognized as readily as the prototype (subtype 1a) KLVALGINAV sequence (23). Subject AD17 (HLA A*01, A*11) had G1409D substitution, the impact of which on recognition is unknown. (C) Variation resulting in reversion to a HLA B*08-restricted epitope. Sorting the subjects by presence of HLA B*08, R1397K substitution in the 3rd position of a frequently-recognized HLA B*08-restricted epitope at 1395-1403 is limited to subjects lacking the HLA B*08 allele. The inoculum sequence differs from the prototypical epitope (HSKKKCDEL) at the 3rd position. Reversion to consensus (and the prototype epitope) occurred only in study subjects lacking the HLA B*08 allele.



FIG. 9 shows HCV consensus sequence 1a (SEQ ID NO:1).



FIG. 10 shows HCV consensus sequence 1b (SEQ ID NO:2).





DETAILED DESCRIPTION OF THE INVENTION

The invention provides consensus sequences for HCV 1a sub-genotype (or subtype) and HCV 1b subtype. The invention further provides details on which residues of the consensus sequence have known amino acids substitutions. In addition, the invention further provides epitopes useful for inducing immune responses to HCV. The consensus sequence, its variants, and epitopes provides for vaccines for prophylaxis against and treatment of chronic HCV infection. The vaccines further provide a method for inducing an immune response against HCV. Further, the invention provides methods for preventing an individual's entrance into a chronic phase of HCV. The invention also provides methods of diagnosis of HCV 1a and 1b. The invention further provides for kits for use in prophylaxis and treatment of HCV.


DEFINITIONS

All scientific and technical terms used in this application have meanings commonly used in the art unless otherwise specified. As used in this application, the following words or phrases have the meanings specified.


An “B cell epitope” is a term well-understood in the art and means any chemical moiety which exhibits specific binding to an antibody. An “epitope” can also comprise an antigen, which is a moiety or molecule that contains an epitope, and, as such, also specifically binds to antibody.


A “T cell epitope” means a component or portion thereof for which a T cell has an antigen-specific specific binding site, the result of binding to which activates the T cell.


The terms “polynucleotide” and “nucleic acid”, used interchangeably herein, refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. These terms include a single-, double- or triple-stranded DNA, genomic DNA, cDNA, RNA, DNA-RNA hybrid, or a polymer comprising purine and pyrimidine bases, or other natural, chemically, biochemically modified, non-natural or derivatized nucleotide bases.


An “individual” is a vertebrate, preferably a mammal, more preferably a human. Mammals include, but are not limited to, farm animals, sport animals, pets, primates, mice and rats.


A “biological sample” encompasses a variety of sample types obtained from an individual and can be used in a diagnostic or monitoring assay. The definition encompasses blood and other liquid samples of biological origin, solid tissue samples such as a biopsy specimen or tissue cultures or cells derived therefrom, and the progeny thereof. The definition also includes samples that have been manipulated in any way after their procurement, such as by treatment with reagents, solubilization, or enrichment for certain components, such as proteins or polynucleotides. The term “biological sample” encompasses a clinical sample, and also includes cells in culture, cell supernatants, cell lysates, serum, plasma, biological fluid, and tissue samples.


As used herein, the singular form “a”, “an”, and “the” includes plural references unless indicated otherwise. For example, “an” epitope includes one or more epitopes.


Consensus Sequences


Hepatitis C virus mutates rapidly and has two to three times more genetic variability than HIV. Hepatitis C reproduces more than 100 billion times a day, about 100 times faster than HIV. The inventors have discovered that instead of random changes, as conventional wisdom surmised, the HCV changes in a “Darwinian” manner. That is, the viral genome changes in a way to make the virus more reproductively fit in the face of each individual immune system it encounters. The inventors have discovered that the HCV genome changed in a manner to evade the individual's immune system in regions subject to effective T cell immune responses; and in regions where there was no effective T cell response due to lack of HLA epitopes, the HCV genome naturally mutated toward a consensus sequence which was the viruses' most replicatively fit state.


The HCV polyprotein is expressed as a single open reading frame, and is then processed by host (structural genes through the p7/NS2 junction) and viral (remainder) proteases. The structure and numbering of the positions in the HCV genome and polyprotein are based by convention on the subtype 1a genome, as described at the Los Alamos National Laboratory HCV sequence database web site (URL:http://hcv.lanl.gov/content/hcv-db/MAP/landmark.html) and in a recent publication (see Simmonds P, et al. Hepatology 2005; 42:962-73). HCV 1a core protein spans residues 1 to 191. E1 protein spans residues 192 to 383. p7 protein spans residues 384 to 746. NS2 protein spans residues 810 to 1026. NS3 protein spans residues 1027 to 1657. NS4a protein spans residues 1658 to 1711. NS4b protein spans residues 1712 to 1972. NS5a spans residues 1973 to 2420. NS5b protein spans residues 2421 to 3011. For HCV 1b subtype, there is an insertion at residue 2413 of an additional residue relative to the consensus 1a sequence.


Determination of a Consensus Sequence


To determine a consensus sequence from any genotype of HCV, or other viruses, one method that can be used is as follows: a) Obtain a sequence of HCV or the virus of interest that is derived from a subject (e.g., a human) and is longer than 500 nucleotides; b) Obtain the primary open reading frame of the sequence; c) Remove sequences that contain more than 1% ambiguous sites or more than 1 frameshift; d) Convert terminal “gap” characters to “missing”; e) Remove sequences that are redundant by identifying identical sequences and checking related publications and removing linked sequences; f) Generate predicted polyprotein sequences by using standard eukaryotic genetic code; and g) Identify majority-rule consensus sequence for each subtype to identify modal amino acid residue at each site.


The methodology disclosed herein is for obtaining consensus sequences and is not the same methodology used for obtaining ancestral sequences or center of tree (COT) sequences. However, for balanced (i.e., star-like) phylogeny like that seen for HCV, all three methodologies could yield the same end result. For commentary on general consensus and ancestral states as applied to HIV, see Science Vol. 29, pp. 1515-1518 (2003). However, the methodology disclosed herein provides advantages in that it is far more straightforward to use than previously disclosed methods and can be used with the entire data set for HCV, which has never been done before.


As detailed in the Examples, by using the methodology disclosed herein consensus sequence 1a (SEQ ID NO: 1) (FIG. 9) and consensus sequence 1b (SEQ ID NO:2) (FIG. 10) were obtained. Furthermore, the invention not only discloses the consensus sequence, but also details precisely which residues have non-synonmyous (amino acid changing) changes and precisely what those changes are at a certain frequency. The consensus sequence and its amino acid substitutions (or variants) are displayed in Tables 5 and 6. The first column indicates the residue position of each consensus sequence and the remaining columns indicate the residue-frequency displayed as a pair. The frequency is displayed as a subscript of the residue in this table. Residues indicated with an “X” means that that one or more sequences in the database have errors or ambiguity at that site, resulting in a codon that does not map to the genetic code. Common reasons for this include single base deletions and multistate characters (e.g., IUPAC R for A or G; Y for C or T, etc). BioEdit translates these as “X” to indicate that an amino acid could not be assigned. Using this information, one of skill in the art can predict what substitutions, if any, can be expected at a particular residue position and if a substitution is expected, then the information provided herein allows one to know which amino acids will be expected and at what frequency. For each of the 3011 residues for HCV 1a and 3010 residues for HCV1b, there are 19 other amino acids possible for substitution. It creates an overwhelming number of possible sequences for one of skill in the art to experiment with if there were not the kind of explicit teaching provided herein. It would be impossible for one of skill in the art to predict, a priori, what the consensus sequence would be for HCV 1a and HCV 1b and which amino acids are found as possible substitutes at each of the 6021 positions possible and at what frequency these non-synonymous changes are found.


Polypeptides of HCV Consensus Sequences


The invention also provides for consensus sequences for the entire HCV polyprotein as well as for each HCV protein that has been processed by host or viral proteases. These HCV proteins include: core, E1, E2, p7, NS2, NS3, NS4a, NS4b, NS5a, and NS5b. One aspect of the invention contemplates the entire HCV polyprotein while other aspects of the invention contemplate the HCV proteins post-processing. The invention encompasses HCV proteins with non-synonymous changes as well as heterologous polypeptides.


The invention further provides a number of epitopes that have been found to be immunogenic (i.e., capable of inducing an immune response). These epitopes are disclosed in Table 7. As detailed in the Examples, some of these epitopes have been mapped systemically and found to be immunogenic. HCV has been found to mutate more frequently in the areas which comprise T cell epitopes than in areas which are not within T cell epitopes. Individuals who self-recovered from HCV infection had no changes in the sequences in the T cell epitope areas while those who progressed to chronic infection had one or more changes within the T cell epitopes. Further, in those chronically infected individuals with changes outside the T cell epitopes, many of the substitutions resulted in an amino acid that matched the consensus sequence. B cell epitopes are contained within the consensus sequences disclosed herein. Accordingly, the invention encompasses T cell epitopes as well as B cells epitopes. Accordingly, the invention encompasses polypeptides comprising these epitopes and heterologous polypeptides comprising these epitopes.


The invention also encompasses polypeptides which are fusion proteins. A fusion protein is a single polypeptide comprising regions from two or more different proteins. The regions normally exist in separate proteins and are brought together in the fusion protein. They may be linked together so that the amino acid sequence of one begins where the amino acid sequence of the other ends, or they may be linked via linker amino acids which are not normally a part of the constituent proteins. A fusion protein may contain more than one copy of any of its constituent proteins. The constituent proteins may include the entire amino acid sequences of the proteins or portions of the amino acid sequences. In one embodiment, the fusion protein of this invention are two or more HCV proteins (e.g., core, E1, E2, p7, NS2, NS3, NS4a, NS4b, NS5a, and NS5b) or fragments thereof brought together as a fusion protein. The HCV proteins can contain substitutions at various residue positions as disclosed in Tables 5 and 6. In another embodiment, a fusion protein is made from two or more epitopes disclosed herein. The epitopes can be a combination of different T cell epitopes or a combination of B cell epitopes or a combination of both T cell and B cell epitopes.


A vector can include nucleic acid coding for the fusion protein of the invention in a form suitable for expression of the nucleic acid in a host cell. Preferably the recombinant expression vector includes one or more regulatory sequences operatively linked to the nucleic acid sequence to be expressed. The term “regulatory sequence” includes promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence, as well as tissue-specific regulatory and/or inducible sequences. The design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, and the like. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or polypeptides, including fusion proteins or polypeptides, encoded by nucleic acids as described herein.


The recombinant expression vectors of the invention can be designed for expression of the fusion proteins of the invention in prokaryotic or eukaryotic cells. For example, polypeptides of the invention can be expressed in E. coli, insect cells (e.g., using baculovirus expression vectors), yeast cells, mammalian cells in culture, or in transgenic animals. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.


Expression of proteins in prokaryotes is most often carried out in E. coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein.


A way to maximize recombinant protein expression in E. coli is to express the protein in host bacteria with an impaired capacity to proteolytically cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 119-128). Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques.


The fusion protein expression vector can also be a yeast expression vector, a vector for expression in insect cells, e.g., a baculovirus expression vector or a vector suitable for expression in mammalian cells in culture, or in transgenic animals. When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.


In another embodiment, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immuno. 43:235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 8:729-733) and iimmunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, for example, the murine hox promoters (Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3:537-546).


A “host cell” includes an individual cell or cell culture which can be or has been a recipient for vector(s) or for incorporation of polynucleotide molecules and/or proteins. A host cell can be any prokaryotic or eukaryotic cell. For example, fusion proteins of the invention can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art. Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in genomic of total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation. A host cell includes cells transfected in vivo with a polynucleotide(s) of this invention.


Polynucleotides Encoding the HCV Consensus Sequences


The invention also encompasses polynucleotides encoding the HCV consensus sequences or fragments thereof. Polynucleotides coding for the epitopes disclosed herein are also encompassed by the invention. Polynucleotides coding for heterologous polypeptides and for fusion proteins are also encompassed by this invention.


The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are within the skill of the art. Such techniques are explained fully in the literature, such as, Molecular Cloning: A Laboratory Manual, second edition (Sambrook et al., 1989) Cold Spring Harbor Press; Oligonucleotide Synthesis (M. J. Gait, ed., 1984); Animal Cell Culture (R. I. Freshney), ed., 1987); Methods in Enzymology (Academic Press, Inc.); Handbook of Experimental Immunology (D. M. Weir & C. C. Blackwell, eds.); Gene Transfer Vectors for Mammalian Cells (J. M. Miller & M. P. Calos, eds., 1987); Current Protocols in Molecular Biology (F. M. Ausubel et al., eds., 1987); PCR: The Polymerase Chain Reaction, (Mullis et al., eds., 1994); Current Protocols in Immunology (J. E. Coligan et al., eds., 1991) and Short Protocols in Molecular Biology (Wiley and Sons, 1999).


The invention further provides isolated polynucleotides that encode a HCV consensus sequence or fragments thereof, as well as vectors comprising the polynucleotide and a host cell containing the vector. Such expression systems can be used in a method of producing an HCV consensus sequence polypeptide or fragments thereof, wherein the host cell is cultured and the polypeptide produced by the cultured host cell is recovered. Polynucleotides encoding consensus sequences of the invention can also be delivered to a host subject for expression of the consensus sequence by cells of the host subject.


Polynucleotides complementary to any such sequences are also encompassed by the present invention. Polynucleotides may be single-stranded (coding or antisense) or double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules include HnRNA molecules, which contain introns and correspond to a DNA molecule in a one-to-one manner, and mRNA molecules, which do not contain introns. Additional coding or non-coding sequences may, but need not, be present within a polynucleotide of the present invention, and a polynucleotide may, but need not, be linked to other molecules and/or support materials.


Polynucleotides may comprise the consensus sequence (or a fragment thereof) or may comprise a variant of such a sequence. Polynucleotide variants contain one or more substitutions, additions, deletions and/or insertions such that the immunoreactivity of the encoded polypeptide is not diminished, relative to the original immunoreactive molecule. The effect on the immunoreactivity of the encoded polypeptide may generally be assessed as described herein. Polynucleotide variants preferably exhibit at least about 70% identity, more preferably at least about 80% identity and most preferably at least about 90% identity to a polynucleotide sequence that encodes the HCV consensus sequences disclosed herein or a fragment thereof.


Two polynucleotide or polypeptide sequences are said to be “identical” if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence as described below. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A “comparison window” as used herein, refers to a segment of at least about 5 contiguous positions, at least about 10 contiguous positions, at least about 15 contiguous positions, or at least about 20 contiguous positions in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.


Optimal alignment of sequences for comparison may be conducted using the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., Madison, Wis.), using default parameters. This program embodies several alignment schemes described in the following references: Dayhoff, M. O. (1978) A model of evolutionary change inproteins—Matrices for detecting distant relationships. In Dayhoff, M. O. (ed.) Atlas of protein Sequence and Structure, National Biomedical Research Foundation, Washington D.C. Vol. 5, Suppl. 3, pp. 345-358; Hein J., 1990, Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, Calif.; Higgins, D. G. and Sharp, P. M., 1989, CABIOS 5:151-153; Myers, E. W. and Muller W., 1988, CABIOS 4:11-17; Robinson, E. D., 1971, Comb. Theor. 11:105; Santou, N., Nes, M., 1987, Mol. Biol. Evol. 4:406-425; Sneath, P. H. A. and Sokal, R. R., 1973, Numerical Taxonomy the Principles and Practice of Numerical Taxonomy, Freeman Press, San Francisco, Calif.; Wilbur, W. J. and Lipman, D. J., 1983, Proc. Natl. Acad. Sci. USA 80:726-730.


Preferably, the “percentage of sequence identity” is determined by comparing two optimally aligned sequences over a window of comparison of at least 5 positions, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid bases or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the reference sequence (i.e., the window size) and multiplying the results by 100 to yield the percentage of sequence identity.


Polynucleotides may be prepared using any of a variety of techniques known in the art. The nucleic acid sequence that encodes for the HCV consensus sequences disclosed herein may be obtained from publicly available databases (e.g., GenBank) or from reverse translation from the amino acid sequence of the HCV consensus sequence. Although hepatitis C virus is a RNA virus, DNA sequences (including cDNA) that code for the HCV consensus sequences are also encompassed by this invention. Such nucleic acid sequences can also be obtained as part of a genomic library or a cDNA library. Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Laboratory Press, 1989.


Polynucleotides, polynucleotide variants, and whole viruses may generally be prepared by any method known in the art, including chemical synthesis by, for example, solid phase phosphoramidite chemical synthesis. Modifications in a polynucleotide sequence may also be introduced using standard mutagenesis techniques, such as oligonucleotide-directed site-specific mutagenesis (see Adelman et al., DNA 2:183, 1983). Alternatively, RNA molecules may be generated by in vitro or in vivo transcription of DNA sequences encoding an antibody, or portion thereof, provided that the DNA is incorporated into a vector with a suitable RNA polymerase promoter (such as T7 or SP6). Certain portions may be used to prepare an encoded polypeptide, as described herein. In addition, or alternatively, a portion may be administered to an individual such that the encoded polypeptide is generated in vivo (e.g., by transfecting antigen-presenting cells, such as dendritic cells, with a cDNA construct encoding the polypeptide, and administering the transfected cells to the individual).


Any polynucleotide may be further modified to increase stability in vivo. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5′ and/or 3′ ends; the use of phosphorothioate or 2′ O-methyl rather than phosphodiesterase linkages in the backbone; and/or the inclusion of nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl-, methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and uridine.


Nucleotide sequences can be joined to a variety of other nucleotide sequences using established recombinant DNA techniques. For example, a polynucleotide may be cloned into any of a variety of cloning vectors, including plasmids, phagemids, lambda phage derivatives and cosmids. Vectors of particular interest include expression vectors, replication vectors, probe generation vectors and sequencing vectors. In general, a vector will contain an origin of replication functional in at least one organism, convenient restriction endonuclease sites and one or more selectable markers. Other elements will depend upon the desired use, and will be apparent to those of ordinary skill in the art.


Within certain embodiments, polynucleotides may be formulated so as to permit entry into a cell of a mammal, and to permit expression therein. Such formulations can be particularly useful for therapeutic purposes, for example for treating HCV infection or as a prophylaxis. In one embodiment, the invention encompasses an expression construct for expressing any portion of the consensus sequence will contain the following operably linked elements: a transcription promoter, a nucleic acid encoding all or some fragment of the consensus sequences and its possible variants disclosed herein, and a transcription terminator. Those of ordinary skill in the art will appreciate that there are many ways to achieve expression of a polynucleotide in a target cell, and any suitable method may be employed. For example, a polynucleotide may be incorporated into a viral vector such as, but not limited to, adenovirus, adeno-associated virus, retrovirus, or vaccinia or other pox virus (e.g., avian pox virus). Techniques for incorporating DNA into such vectors are well known to those of ordinary skill in the art. A retroviral vector may additionally transfer or incorporate a gene for a selectable marker (to aid in the identification or selection of transduced cells) and/or a targeting moiety, such as a gene that encodes a ligand for a receptor on a specific target cell, to render the vector target specific. Targeting may also be accomplished using an antibody, by methods known to those of ordinary skill in the art.


Additional methods of expression for vaccine purposes are disclosed below. Other formulations for therapeutic purposes include colloidal dispersion systems, such as macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. A preferred colloidal system for use as a delivery vehicle in vitro and in vivo is a liposome (i.e., an artificial membrane vesicle). The preparation and use of such systems is well known in the art.


Uses for Consensus Sequences


The invention provides for many uses for the consensus sequences disclosed herein. In one aspect, an isolated consensus protein with the sequences disclosed herein is used as a vaccine for prophylactic and treatment purposes. In another aspect, the invention provides for the uses of HCV epitopes disclosed herein to induce or augment an immune response in an individual.


In another aspect, the invention is a composition comprising at least one HCV 1a or 1b consensus protein or a fragment thereof. In yet another aspect, the invention is a pharmaceutical composition comprising at least one HCV 1a or 1b consensus protein or a fragment thereof and a suitable carrier. It is understood that this invention not only encompasses polypeptides comprising the consensus sequences but also polynucleotides coding for the polypeptides.


The consensus sequences disclosed herein can be used in its entirety or as fragments. In one embodiment, the consensus sequence comprises at least one HCV protein (e.g., core, E1, E2, p7, NS2, NS3, NS4a, NS4b, NS5a, or NS5b) or a fragment thereof. In another embodiment, the consensus protein fragment has at least 5 contiguous amino acids of either HCV 1a or HCV1b virus. In another embodiment, the consensus protein fragment has least 8 contiguous amino acids of either HCV 1a or HCV1b virus. In other embodiments, the consensus protein fragment has least 11, at least 14, at least 17, at least 20, at least 23, at least 27, at least 30, at least 33, at least 36, at least 39, at least 42, at least 45, at least 48, or at least 51 contiguous amino acids of either HCV 1a or HCV1b virus.


As the HCV mutates, the invention provided herein teaches one of skill in the art how to predict changes in the sequence. Since some of the residues have several variants, the teachings herein on the precise amino acid and the frequency with which was observed allows one of skill in the art to practice the invention without undue experimentation.


Pharmaceutical Compositions


The invention encompasses pharmaceutical compositions comprising HCV consensus sequence proteins or fragments thereof and polynucleotides encoding the same. In addition to the HCV consensus protein (or polynucleotide), the pharmaceutical composition includes a pharmaceutically acceptable excipient. Pharmaceutically acceptable excipients are known in the art, and are relatively inert substances that facilitate administration of a pharmacologically effective substance. For example, an excipient can give form or consistency, or act as a diluent. Suitable excipients include but are not limited to stabilizing agents, wetting and emulsifying agents, salts for varying osmolarity, encapsulating agents, buffers, and skin penetration enhancers. Excipients as well as formulations for parenteral and nonparenteral drug delivery are set forth in Remington's Pharmaceutical Sciences 19th Ed. Mack Publishing (1995). Generally, these compositions are formulated for administration by injection (e.g., intraperitoneally, intravenously, subcutaneously, intramuscularly, etc.). Accordingly, these compositions are preferably combined with pharmaceutically acceptable vehicles such as saline, Ringer's solution, dextrose solution, and the like.


Vaccines Using HCV Consensus Sequences


Several general approaches exist for making a vaccine against a virus that mutates rapidly and has multiple subtypes around the world. From an efficiency perspective, it would be ideal to be able to make one vaccine that could be effective against the same virus, even if it has mutated as it is transmitted from person to person. One general approach is to use isolates of a particular subtype to make the vaccine. However, because it is likely that each geographical area will have mutations that are clustered and make the virus phylogenetically a different clade, the isolate approach is not economically viable because country-specific vaccine efforts would be needed. Another approach, using the consensus sequence instead of the actual sequence isolated from within the infected population, has the advantages of being central and having the potential to induce or augment cross-reactive responses in infected individuals. Other considerations when making a vaccine also include whether the protein should be polyvalent or to target specific types of valencies, for example, in HIV, designing modified envelopes to enhance exposure of epitopes known to be capable of inducing broadly neutralizing antibodies. See, for example, Gaschen at el. Science 296: 2354-2360 (2002).


Use of a consensus sequence as a vaccine for HCV should take into account considerations that would stimulate responses that drive the virus to mutate to a less fit state, thereby rendering it less able to replicate, and easier to eradicate. Such considerations include co-receptor usage (if any), protein folding, and exposure of antigenic or immunogenic domains. Selection of a consensus sequence for use as a vaccine against HCV would elicit both T cell and/or B cell responses in such way to engage the immune system to eradicate the existing HCV in the infected individual.


The vaccines of the invention comprise HCV consensus sequence or a fragment thereof or a polynucleotide encoding HCV consensus sequence or a fragment thereof. For instance, the vaccine may comprise or encode the HCV polyprotein, the primary translation product, or the full-length translation product of the HCV consensus sequence. In another embodiment, the vaccine comprises the processed HCV proteins or fragments thereof. In addition to the use of consensus sequence proteins (or polynucleotides encoding those proteins), polypeptides comprising fragments of HCV consensus sequence, or polynucleotides encoding fragments of HCV consensus sequence may be used in the vaccines. The polypeptides in the vaccines or encoded by polynucleotides of the vaccines are optionally at least about 99%, at least about 95%, at least about 90%, at least about 85%, at least about 80%, at least about 75%, at least about 70%, at least about 65%, at least about 60%, at least about 55%, or at least about 50% identical to the HCV consensus sequence disclosed herein.


In addition, the polynucleotides of the vaccines are optionally at least about 99%, at least about 95%, at least about 90%, at least about 85%, at least about 80%, at least about 75%, at least about 70%, at least about 65%, at least about 60%, at least about 55%, or at least about 50% identical to the polynucleotides encoding the HCV consensus sequence disclosed herein.


Delivery/Expression Systems


The HCV consensus sequence vaccines can be delivered to the individual using various expression systems. A few of the possible systems are described below.


1. Mammalian Cell-Based Delivery Systems


In one embodiment of the invention, the immunogenic composition comprises a cell. The cell of the immunogenic composition comprises a consensus sequence or epitope-containing polypeptide described herein or a polynucleotide encoding an epitope-containing polypeptide described herein. Prior to delivery of the cell to a host, the cell optionally comprises an epitope from the consensus sequence on its surface (for instance, as an HLA-bound peptide). In an alternative embodiment, the cell comprises an epitope-containing polypeptide described herein and/or a polynucleotide encoding a consensus sequence or epitope-containing polypeptide described herein in the interior of the cell. In one embodiment, the immunogenic composition comprising the consensus sequence or polynucleotide encoding the consensus sequence is an eukaryotic cell, such as a mammalian cell.


In another embodiment, the immunogenic composition comprises a mammalian cell which is an antigen-presenting cell. Antigen presenting cells, as referred to herein, express at least one class I or class II MHC determinant and may include those cells which are known as professional antigen-presenting cells such as macrophages, dendritic cells and B cells. Other professional antigen-presenting cells include monocytes, marginal zone Kupffer cells, microglia, Langerhans' cells, interdigitating dendritic cells, follicular dendritic cells, and T cells. Facultative antigen-presenting cells can also be used. Examples of facultative antigen-presenting cells include astrocytes, follicular cells, endothelium and fibroblasts. In one embodiment, the immunogenic composition comprises an antigen-presenting cell that affords MHC class I antigen presentation.


The polypeptides or polynucleotides of the invention may be delivered to cells in vivo or ex vivo. The loading of antigen onto antigen-presenting cells such as dendritic cells has been achieved by different methods, including peptide pulsing (see Melero et al., Gene Therapy 7:1167 (2000). This method for administering polypeptide vaccine is accomplished by pulsing the polypeptide onto an APC or dendritic cell in vitro. The polypeptide binds to MHC molecules on the surface of the APC or dendritic cell. Prior treatment of the APCs or dendritic cells with interferon-γ can be used to increase the number of MHC molecules on the APCs or dendritic cells. The pulsed cells can then be administered as a carrier for the polypeptide.


2. Yeast-Based Delivery Systems


In another embodiment of the invention, a consensus sequence or epitope-containing polypeptide described herein, or a polynucleotide encoding the consensus sequence or epitope-containing polypeptide, is delivered to a host organism in a vaccine comprising yeast. The use of live yeast DNA vaccine vectors for antigen delivery has been reviewed recently and reported to be efficacious in a mouse model using whole recombinant Saccharomyces cerevisiae yeast expressing tumor or HIV-1 antigens (see Stubbs et al. (2001) Nature Medicine 7: 625-29).


The use of live yeast vaccine vectors is known in the art. Furthermore, U.S. Pat. No. 5,830,463 describes particularly useful vectors and systems for use in the instant invention. The use of yeast delivery systems may be particularly effective for use in the viral vaccine methods and formulations of the invention as yeast appear to trigger cell-mediated immunity without the need for an adjuvant. Possible yeast vaccine delivery systems are nonpathogenic yeast carrying at least one recombinant expression system capable of modulating an immune response. In an alternative, heat killed yeast may also be used to express the protein antigen of interest as part of a yeast-based delivery system (e.g., GlobeImmune products).


3. Bacterial Systems


In another embodiment, the cell that comprises the consensus sequence or epitope-containing polypeptide, or polynucleotide encoding the consensus sequence or epitope-containing polypeptide, is a microbial cell. In one embodiment, the microbial cell is a bacterium, typically a mutant or recombinant bacterium. The bacterium is optionally attenuated. For instance, a number of bacterial species have been studied for their potential use as vaccines and can be used in the present invention, including, but not limited to, Shigella flexneri, E. Coli, Listeria monocytogenes, Yersinia enterocolitica, Salmonella typhimurium, Salmonella typhi or mycobacterium. In one embodiment, the bacterial vector used in the immunogenic composition is a facultative, intracellular bacteria vector. In one embodiment, the bacterium is used to deliver an epitope-containing polypeptide to antigen-presenting cells in the host organism.


The use of live bacterial vaccine vectors for antigen delivery has been reviewed (Medina and Guzman (2001) Vaccine 19: 1573-1580; and Darji et al. (2000) FEMS Immunol and Medical Microbiology 27: 341-9). Furthermore, U.S. Pat. Nos. 6,261,568 and 6,488,926 describe particularly useful systems for use in this HCV vaccine invention.


The use of live bacterial vaccine vectors can be particularly advantageous. Bacteria-mediated gene transfer finds particular advantage in genetic vaccination by intramuscular, intradermal or oral administration of plasmids which leads to antigen expression in the mammalian host-thereby offering the possibility of both antigen modification as well as immune modulation. Furthermore, the bacterial-mediated DNA vaccine provides adjuvant effects and the ability to target inductive sites of the immune system. In preferred embodiments, S. typhimurium, S. typhi, S. flexneri or L. monocytogenes are used as vehicles for trans-kingdom DNA vaccine delivery.


Furthermore, many live bacterial vaccine vectors make use of the almost unlimited coding capacity of bacterial plasmids, and broad availability of bacterial expression vectors, to express virtually any target tumor antigen of interest. The use of bacterial carriers is often associated with still other significant benefits, such as the availability of convenient direct mucosal or oral delivery. Other direct mucosal delivery systems (besides live viral or bacterial vaccine carriers) include mucosal adjuvants, viral particles, ISCOMs, liposomes, microparticles and transgenic plants. Other advantages of this technology are: low batch preparation costs, facilitated technology transfer following development of the prototype, increased shelf-life and stability in the field respect to other formulations (e.g., subunit vaccines), easy administration and low delivery costs. Taken together, these advantages make this strategy particularly suitable for vaccine programs including HCV vaccines. The carrier operationally becomes an equivalent of a subunit recombinant vaccine.


Both attenuated and commensal microorganisms have been successfully used as carriers for vaccine antigens. Attenuated mucosal pathogens which may be used in the invention include: L. monocyotgenes, Salmonella spp., V. cholorae, Shigella spp., mycobacterium, Y. enterocolitica, and B. anthracis. Commensal strains for use in the invention include: S. gordonii, Lactobacillus spp., and Staphylococcus ssp. The background of the carrier strain used in the formulation, the type of mutation selected to achieve attenuation, and the intrinsic properties of the immunogen can be used in optimizing the extent and quality of the immune response elicited. The general factors to be considered to optimize the immune response stimulated by the bacterial carrier include carrier-related factors including: selection of the carrier; the specific background strain, the attenuating mutation and the level of attenuation; the stabilization of the attenuated phenotype and the establishment of the optimal dosage. Other considerations include antigen-related factors such as: instrinsic properties of the antigen; the expression system, antigen-display form and stabilization of the recombinant phenotype; co-expression of modulating molecules and vaccination schedules.


Descriptions of exemplary bacterial vaccine vectors follows, including references which demonstrate the availability in the art of the knowledge required to generate recombinant bacterial vaccine vectors that express the antigen of choice and can be used in immunogenic compositions. Each of the following references in incorporated herein in its entirety.


For instance, Salmonella typhimurium could be used as a bacterial vector in the immunogenic compositions of the invention. Use of this type of bacteria as an effective vector for a vaccine has been demonstrated in the art. For instance, the use of S. typhimurium as an attenuated vector for oral somatic transgene vaccination has been described (see Darji et al. (1997) Cell 91: 765-775; and Darji et al. (2000) FEMS Immun and Medical Microbiology 27:341-9). Indeed most knowledge on bacteria-mediated gene transfer has been acquired using attenuated S. typhimurium as carrier. Two metabolically attenuated strains that have been used include S. typhimurium aroA, which is unable to synthesize aromatic amino acids, and S. typhimurium 22-11, which is defective in purine metabolism. Several antigens have been expressed using these carriers: originally, listeriolysin and actA (two virulence factors of L. monocytogenes) and beta-galactosidase (beta-gal) of E. coli were successfully tested. Cytotoxic and helper T cells as well as specific antibodies could be detected against these antigens following oral application of a single dose of the recombinant salmonella. In addition, immunization with salmonella carrying a listeriolysin-encoding expression plasmid elicited a protective response against a lethal challenge with L. monocytogenes. Interestingly, this protection was observed in the lung although the vaccine was administered orally. Oral transgene vaccination methodology has now been extended to include protective responses in herpes simplex virus 2 and hepatitis B infection models, with cell-mediated immune responses detected at the mucosal level.


In another embodiment, the immunogenic compositions of the present invention optionally comprise Shigella flexneri as a delivery vehicle. S. flexneri represents the prototype of a bacterial DNA transfer vehicle as it escapes from the vacuole into the cytosol of the host cell. Several attenuated mutants have been used successfully to transfer DNA to cell lines in vitro. Auxotrophic strains were defective in cell-wall synthesis (see Sizemore et al. (1995) Science 270:299-302; and dapB (see Courvalin et al. (1995) C R Acad Sci Ser III, 318:1207-12), synthesis of aromatic amino acids (aroA (see Powell et al. (1996) Vaccines 96: Molecular Approaches to the Control of Infectious Disease; Cold Spring Harbor Laboratory Press) or synthesis of guanine nucleotides (guaBA (see Anderson et al. (2000) Vaccine 18: 2193-2202).


In still another embodiment, the immunogenic compositions of the present invention comprise Listeria monocytogenes (Portnoy et al, Journal of Cell Biology, 158:409-414 (2002); Glomski et al., Journal of Cell Biology, 156:1029-1038 (2002)). Strains of Listeria monocytogenes have recently been developed as effective intracellular delivery vehicles of heterologous proteins providing delivery of antigens to the immune system to induce an immune response to clinical conditions that do not permit injection of the disease-causing agent, such as cancer (U.S. Pat. No. 6,051,237 Paterson; Gunn et al., J. Immun. 167:6471-6479 (2001); Liau, et al., Cancer Research, 62: 2287-2293 (2002)) and HIV (U.S. Pat. No. 5,830,702 Portnoy & Paterson). A recombinant L. monocytogenes vaccine expressing a lymphocytic choriomeningitis virus (LCMV) antigen has also been shown to induce protective cell-mediated immunity to the antigen (Shen et al., PNAS, 92: 3987-3991 (1995). The ability of L. monocytogenes to serve as a vaccine vector has been reviewed (Wesikirch, et al., Immunol. Rev. 158:159-169 (1997)).


As a facultative intracellular bacterium, L. monocytogenes elicits both humoral and cell-mediated immune responses. Following entry of the Listeria into a cell of the host organism, the Listeria produces Listeria-specific proteins that enable it to escape from the phagolysosome of the engulfing host cell into the cytosol of that cell. Here, L. monocytogenes proliferates, expressing proteins necessary for survival, but also expressing heterologous genes operably linked to Listeria promoters. Presentation of peptides of these heterologous proteins on the surface of the engulfing cell by MHC proteins permits the development of a T cell response. Two integration vectors which are particularly useful for introducing heterologous genes into the bacteria for use as vaccines include pPL1 and pPL2 as described in Lauer et al., Journal of Bacteriology, 184: 4177-4186 (2002).


Attenuated forms of L. monocytogenes have been produced. The ActA protein of L. monocytogenes is sufficient to promote the actin recruitment and polymerization events responsible for intracellular movement. A human safety study has reported that administration of an actA/plcB-deleted attenuated form of Listeria monocytogenes caused no serious sequelae in adults (Angelakopoulos et al., Infection and Immunity, 70:3592-3601 (2002)).


Another possible delivery system is based on killed but metabolically active (KBMA) bacteria, that simultaneously takes advantage of the potency of live vaccines and the safety of killed vaccines. In these microbes, for example L. monocytogenes, genes required for nucleotide excision repair (uvrAB) are removed, rendering microbial-based vaccines exquisitely sensitive to photochemical inactivation with psoralen and long-wavelength ultraviolet light. Colony formation of the nucleotide excision repair mutants can be blocked by infrequent, randomly distributed psoralen crosslinks, but the bacterial population is still able to express its genes, synthesize and secrete proteins. See Brockstedt et al Nat. Med. 2005 August; 11(8):853-60 and US 2004/0197343 A1. Other systems that can be used include those taught in US 2004/0228877 A1, US 2005/0281783 A1, and US 2005/0249748 A1.


4. Viral-Based Delivery Systems


In another embodiment of the invention, the immunogenic composition comprising the consensus sequence or epitope-containing polypeptide and/or the polynucleotide encoding the consensus sequence or epitope-containing further comprises a viral vector. The viral vector will typically comprise a highly attenuated, non-replicative virus. Vaccinia variants, avipoxviruses, adenoviruses, polio viruses, influenza viruses, and herpes viruses can all be used as delivery vectors in conjunction with the present invention.


Formulations


Compositions comprising any of the consensus sequences described herein are also provided. In some embodiments, the compositions are pharmaceutical compositions. In some embodiments, the compositions are immunogenic compositions. The pharmaceutical compositions optionally comprise a pharmaceutically acceptable carrier or adjuvant. In some embodiments, the compositions are vaccine compositions (i.e., vaccines). The vaccine compositions optionally comprise a pharmaceutically acceptable carrier or adjuvant.


The vaccine compositions of the present invention can be used to stimulate an immune response in an individual. The formulations can be administered to an individual by a variety of administration routes. Methods of administration of such a vaccine composition are known in the art, and include oral, nasal, intravenous, intradermal, intraperitoneal, intramuscular, intralymphatic, percutaneous, scarification, and subcutaneous routes of administration, as well as intradermally by gene gun wherein gold particles coated with DNA may be used in the gene gun and any other route that is relevant for an infectious disease.


The vaccine compositions may further comprise additional components known in the art to improve the immune response to a vaccine, such as adjuvants, T cell co-stimulatory molecules, or antibodies, such as anti-CTLA4. The invention also includes medicaments comprising the pharmaceutical compositions of the invention. An individual to be treated with such vaccines, is any vertebrate, preferably a mammal, including domestic animals, sport animals, and primates, including humans. The vaccine can be administered as a prophylactic or for treatment purposes.


Vaccine formulations are known in the art. Known vaccine formulations can include one or more possible additives, such as carriers, preservatives, stabilizers, adjuvants, antibiotics, and other substances. Preservatives, such as thimerosal or 2-phenoxy ethanol, can be added to slow or stop the growth of bacteria or fungi resulting from inadvertent contamination, especially as might occur with vaccine vials intended for multiple uses or doses. Stabilizers, such as lactose or monosodium glutamate (MSG), can be added to stabilize the vaccine formulation against a variety of conditions, such as temperature variations or a freeze-drying process. Adjuvants, such as aluminum hydroxide or aluminum phosphate, are optionally added to increase the ability of the vaccine to trigger, enhance, or prolong an immune response. Additional materials, such as cytokines, chemokines, and bacterial nucleic acid sequences, like CpG, are also potential vaccine adjuvants. Antibiotics, such as neomycin and streptomycin, are optionally added to prevent the potentially harmful growth of germs. Vaccines may also include a suspending fluid such as sterile water or saline. Vaccines may also contain small amounts of residual materials from the manufacturing process, such as cell or bacterial proteins, egg proteins (from vaccines that are produced in eggs), DNA or RNA, or formaldehyde from a toxoiding process. Formulations may be re-suspended or diluted in a suitable diluent such as sterile water, saline, isotonic buffered saline (e.g. phosphate buffered to physiological pH), or other suitable diluent.


The consensus sequence vaccine is optionally administered to a host in a physiologically acceptable carrier. Optionally, the vaccine formulation further comprises an adjuvant. Useful carriers known to those of ordinary skill in the art include, e.g., citrate-bicarbonate buffer, buffered water, 0.4% saline, and the like. In some embodiments, the vaccine compositions are prepared as liquid suspensions. In other embodiments, the vaccine compositions comprising the consensus sequence strains are lyophilized (i.e., freeze-dried). The lyophilized preparation can then be combined with a sterile solution (e.g., citrate-bicarbonate buffer, buffered water, 0.4% saline, or the like) prior to administration.


Viral vectors can be used to administer polynucleotides encoding a polypeptide comprising one or more HCV consensus sequences or polynucleotides encoding a HCV epitope or any fragment thereof. Such viral vectors include vaccinia virus and avian viruses, such as Newcastle disease virus. Others may be used as are known in the art.


Naked DNA can be injected directly into the host to produce an immune response. Such naked DNA vaccines may be injected intramuscularly into human muscle tissue, or through transdermal or intradermal delivery of the vaccine DNA, typically using biolistic-mediate gene transfer (i.e., gene gun). Recent reviews describing the gene gun and muscle injection delivery strategies for DNA immunization include Tuting, Curr. Opin. Mol. Ther. (1999) 1: 216-25, Robinson, Int. J. Mol. Med. (1999) 4: 549-55, and Mumper and Ledbur, Mol. Biotechnol. (2001) 19: 79-95. Other possible methods for delivering plasmid DNA include electroporation and iontophoreses.


Another possible gene delivery system comprises ionic complexes formed between DNA and polycationic liposomes (see, e.g., Caplen et al. (1995) Nature Med. 1: 39). Held together by electrostatic interaction, these complexes may dissociate because of the charge screening effect of the polyelectrolytes in the biological fluid. A strongly basic lipid composition can stabilize the complex, but such lipids may be cytotoxic. Other possible methods for delivering DNA include electroporation and iontophoreses.


The use of intracellular and intercellular targeting strategies in DNA vaccines may further enhance the effect of HCV vaccine compositions. Previously, intracellular targeting strategies and intercellular spreading strategies have been used to enhance MHC class I or MHC class II presentation of antigen, resulting in potent CD8+ or CD4+ T cell-mediated antitumor immunity, respectively. For example, MHC class I presentation of a model antigen, HPV-16 E7, was enhanced using linkage of Mycobacterium tuberculosis heat shock protein 70 (HSP70) (Chen, et al., (2000), Cancer Research, 60: 1035-1042), calreticulin (Cheng, et al., (2001) J Clin Invest, 108:669-678) or the translocation domain (domain II) of Pseudomonas aeruginosa exotoxin A (ETA(dII)) (Hung, et al., (2001) Cancer Research, 61: 3698-3703) to E7 in the context of a DNA vaccine. To enhance MHC class II antigen processing, the sorting signals of the lysosome associated membrane protein (LAMP-1) have been linked to the E7 antigen, creating the Sig/E7/LAMP-1 chimera (Ji, et al, (1999), Human Gene Therapy, 10: 2727-2740).


To enhance further the potency of naked DNA vaccines, an intercellular strategy that facilitates the spread of antigen between cells can be used. This improves the potency of DNA vaccines as has been shown using herpes simplex virus (HSV-1) VP22, an HSV-1 tegument protein that has demonstrated the remarkable property of intercellular transport and is capable of distributing protein to many surrounding cells (Elliot, et al., (1997) Cell, 88: 223-233). Such enhanced intercellular spreading of linked protein, results in enhancement of HCV-specific CD8+ T cell-mediated immune responses and anti-HCV effect. Any such methods can be used to enhance DNA vaccine potency against HCV in an infected individual.


Prophylactic Vaccines Using HCV Consensus Sequences


The HCV consensus sequence taught herein can also be used as a vaccine prophylactically, i.e., lessening the symptoms associated with HCV in an individual who is infected with HCV. In one embodiment, an individual is preventing from contracting HCV. The vaccine compositions of the invention are administered to an individual who is not infected with HCV. By its ability to induce or augment an effective immune response, the vaccine mollifies the dangers of becoming infected with HCV. The individual should be given an amount effective to induce or augment effective immune responses, which can be measured using the assays detailed in the Examples section. The invention discloses epitopes which are immunogenic in individuals with certain MHC haplotypes. Thus, depending on the MHC haplotype of the individual, a customized vaccine may also be made if desired. Even if the individual does not have matching MHC haplotype, the advantage of using consensus sequence as a vaccine is that it has the potential for cross reactivity. In another embodiment, the individual has been exposed to ACV experiences a lessening of the symptoms associated with HCV. In this case, the individual is monitored by a physician or another individual suitably trained to monitor the progression of HCV infection. As disclosed above, symptoms associated with HCV include jaundice, fatigue, dark urine, abdominal pain, loss of appetite, and nausea, liver disease, and hepatocellular carcinoma. Physicians can monitor the individuals for the presence and level of HCV (e.g., by RNA levels) to assess the individual's responses to vaccination.


Methods of Treating HCV-Infected Individuals


The invention provides methods for treating individuals infected with HCV. Using the compositions comprising HCV consensus sequences disclosed herein, individuals who are infected with HCV can be treated by administering an effective amount of the composition. An effective amount is an amount that is sufficient to stimulate an immune response. The immune response can be monitored by using the assays disclosed in the Examples or any other standard measure of immune response (e.g., measuring antibodies, cytokine levels, activation markers on immune cells, etc.) Optionally, the levels of HCV can be measured by standard assays known to one of ordinary skill in the art. In one embodiment, the administration of a composition comprising HCV consensus sequences (e.g., a vaccine) will stimulate T cell and/or B cell response. The stimulation of an immune response will serve to eradicate circulating HCV in the individual.


In some embodiments, the administration of one or more HCV consensus sequence proteins or a fragment thereof is capable of inducing an immune response in a host animal. In one embodiment, the immune response is a cell-mediated immune response. In one embodiment, the effective immune response induced or augmented by the one or more HCV consensus sequence proteins or a fragment thereof comprises a T cell response, such as a CD4+ T cell response or a CD8+ T cell response, or both.


These immune cell responses can be measured by both in vitro and in vivo methods to determine if the immune response involved in the present invention is effective. Efficacy can be determined by comparing these measurements for those to whom have received treatment to those who have received no treatment. Another alternative to compare the measurements of an individual before treatment with his/her measurements after treatment. One possibility is to measure the presentation of the HCV consensus protein or consensus epitope of interest by an antigen-presenting cell. The HCV consensus protein or consensus epitope of interest may be mixed with a suitable antigen presenting cell or cell line, for example a dendritic cell, and the antigen presentation by the dendritic cell to a T cell that recognizes the protein or antigen can be measured. If the HCV consensus protein or consensus epitope of interest is being recognized by the immune system at a sufficient level, it will be processed into peptide fragments by the dendritic cells and presented in the context of MHC class I or class II to T cells. For the purpose of detecting the presented protein or antigen, a T cell clone or T cell line responsive to the particular protein or antigen may be used. The T cell may also be a T cell hybridoma, where the T cell is immortalized by fusion with a cancer cell line. Such T cell hybridomas, T cell clones, or T cell lines can comprise either CD8+ or CD4+ T cells. The dendritic cell can present to either CD8+ or CD4+ T cells, depending on the pathway by which the antigens are processed. CD8+ T cells recognize antigens in the context of MHC class I molecules while CD4+ recognize antigens in the context of MHC class II molecules. The T cell will be stimulated by the presented antigen through specific recognition by its T cell receptor, resulting in the production of certain proteins, such as IL-2, tumor necrosis factor-α (TNF-α), or interferon-γ (IFN-γ), that can be quantitatively measured (for example, using an ELISA assay, ELISPOT assay, or Intracellular Cytokine Staining (ICS)).


In another aspect, the invention provides methods for preventing an already infected individual from entering a chronic phase of infection. As the infection moves into the chronic phase, an infected individual's immune system is weakened and less effective against the HCV. By utilizing the precise genetic changes disclosed herein, methods of treatment are possible to prevent the chronic phase of infection. An effective amount of a composition comprising HCV consensus sequences (e.g., a vaccine) is administered to the infected individual. The progress of the infection is monitored afterwards. Optionally, prior to the administration of a vaccine, the full or partial sequence of the HCV infecting the individual is determined and the consensus sequence for administration can be customized if desired.


Methods of Diagnosis


The invention also provides for methods of diagnosing an individual with HCV. By using the consensus sequences disclosed herein, one of skill in the art can obtain a biological sample from an individual to be tested and then use PCR primers (or any comparable molecular biological technique) to the consensus sequence to amplify nucleic acids in the biological sample to determine if the individual has been infected with a particular strain of HCV (e.g., HCV 1a or HCV 1b).


In another embodiment, the nucleic acid sequence of portions of the HCV consensus sequence can be used as probes for purposes of diagnosis. The oligonucleotide sequences selected as probes should be sufficiently long and sufficiently unambiguous that false positives are minimized. The oligonucleotide can be labeled such that it can be detected upon hybridization to the nucleic acid (e.g., DNA) being screened. Methods of labeling are well known in the art, and include the use of radiolabels, such as 32P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high stringency, are provided in Sambrook et al., supra.


Kits


The invention further provides kits (or articles of manufacture) comprising the HCV consensus sequences of the present invention.


In one aspect, the invention provides a kit comprising both (a) a composition comprising a HCV consensus sequence described herein, and (b) instructions for the use of the composition in the prevention or treatment of HCV in a host. In some embodiments, the instructions are on a label. In other embodiments, the instructions are on an insert contained within the kit.


In another aspect, the invention provides a kit comprising both (a) a composition comprising a HCV consensus sequence described herein; and (b) instructions for the administration of the composition to a host. In some embodiments, the instructions are on a label. In other embodiments, the instructions are on an insert contained within the kit.


In another aspect, the invention provides a kit comprising both (a) a composition comprising a HCV consensus sequence described herein; and (b) instructions for selecting a host to which the composition is to be administered. In some embodiments, the instructions are on a label. In other embodiments, the instructions are on an insert contained within the kit.


In another aspect, the invention provides a kit comprising both (a) a composition comprising a HCV consensus sequence described herein; and (b) instructions for selecting one or more consensus sequence(s) to administer to an individual. In some embodiments, the instructions are on a label. In other embodiments, the instructions are on an insert contained within the kit


In some embodiments of each of the aforementioned aspects, the composition is a vaccine. In some embodiments of each of the aforementioned aspects, the vaccine comprises the entire consensus sequence 1a or a fragment thereof. In other embodiments, the vaccine comprises the entire consensus sequence 1b or a fragment thereof. In other embodiments, the vaccine comprises one or more HCV proteins or a fragment thereof. It is understood that these composition include non-synonymous changes as well.


The following are provided to illustrate, but not to limit, the invention.


EXAMPLES
Example 1
Obtaining HCV Consensus Sequence 1a and 1b

Using HCV Sequence Database at Los Alamos National Laboratories at the website: http://hcv.lanl.gov, all available non-redundant human sequences greater than 500 nucleotides were obtained. Then, the options were set as default except for: “Genotype” was set to “1”, “Subtype” to “a or b”, “Include recombinants” set to “off”, “Include fragments longer than” was set to “on, 500” and “Excude Non-human hosts” was set to “on.” Once the preliminary query result was displayed, “Exclude related” was chosen. Then the sequences were downloaded with default options. The rationale behind these series of steps is that size restriction reduces the number of unidirectional (poor quality) sequences.


Next, the alignments were neatened by performing any number of commands, such as restoring codon boundaries, closing gaps and removing non-translated termini. The rationale behind this step is that only the primary open reading frame is desired. Next, sequences that contain >1% ambiguous sites or >1 frameshift were removed. The rationale for this step is to remove low-quality sequences. Next, the terminal “gap” characters were converted to “missing.” The rationale behind this step is to distinguish true gaps from missing data for subsequent analysis. Using the CleanCollapse program, which was written by Stuart Ray and publicly available at <http://sray.med.som.jhmi.edu/SCRoftware/CleanCollapse>, identical sequences were identified. The CleanCollapse program has also been disclosed in Kieffer T L, et al. J Infect Dis. 2004; 189(8):1452-65. For identical sequences, related publications were checked and any linked sequences were removed. The rationale behind this approach is that oversampling individuals could skew sequence distributions. Despite use of “Exclude related” option above, the returned data set can still include many redundant/related sequences. Thus, these were removed to prevent skewing of sequence distributions.


Predicted polyprotein sequences were generated by automated translation in BioEdit (Tom Hall, URL: <http://www.mbio.ncsu.edu/RNaseP/info/programs/BIOEDIT/bioedit.html>) using standard eukaryotic genetic code. The majority-rule consensus sequence were identified for each subtype by using the MargFreq program written by Stuart Ray and publicly available at <http://sray.med.som.jhmi.edu/SCRoftware/MargFreq> to identify modal aa residue at each site. The MargFreq program has also been disclosed in Ray S C, et al. J. Virol. 1999 April; 73(4):2938-46. Where there is a near-tie at a site, the nearest neighbor (the other subtype) distribution for likely ancestor/shared state was examined and then used. The rationale behind this is that it reduces likelihood of escape mutant for common HLA allele being used. The result was consensus sequence1a (SEQ ID NO:1) and consensus sequence1b (SEQ ID NO:2).


Example 2
Cellular Immune Selection with Hepatitis C Persistence in Humans

Hepatitis C virus (HCV) infection frequently persists despite substantial virus-specific cellular immune responses. To determine if immunologically-driven sequence variation occurs with HCV persistence, we coordinately analyzed sequence evolution and CD8+ T cell responses to epitopes covering the entire HCV polyprotein in subjects followed prospectively from prior to infection to beyond the first year. There were no substitutions in T-cell epitopes for a year following infection in a subject who cleared viremia. In contrast, in subjects with persistent viremia and detectable T cell responses, we observed substitutions in 69% of T cell epitopes, and every subject had a substitution in at least one epitope. In addition, amino acid substitutions occurred 13-fold more often within than outside T cell epitopes (p<0.001, range 5-38). T lymphocyte recognition of eight of ten mutant peptides was markedly reduced compared to the initial sequence, indicating viral escape. Of 16 non-envelope substitutions that occurred outside of known T cell epitopes, eight represented conversion to consensus (p=0.015). These findings reveal two distinct mechanisms of sequence evolution involved in HCV persistence: viral escape from CD8+ T cell responses and optimization of replicative capacity.


The World Health Organization estimates there are 170 million persons with HCV infection worldwide, and an estimated 4 million persons are infected with hepatitis C virus (HCV) in the United States.(1,2) In most countries, HCV infection is found in 1-2% of the general population and may cause cirrhosis or hepatocellular cancer, but only when infection persists.(3-7)


Patients in the acute phase of HCV infection are much more likely to respond to therapy designed to eradicate the virus than are patients after progression to chronicity.(8-10) The features unique to acute infection that allow increased responsiveness to interferon therapy remain unknown. Spontaneous clearance of HCV infection occurs in about 20% of acutely infected individuals and is associated with a broadly specific and vigorous cellular immune response.(11-14) Although the cellular immune response is often less vigorous and more narrowly directed in those who fail to clear the infection, a cellular immune response is nonetheless often present in early infection and may even persist into chronic infection. Why the early response fails to control viremia in those who progress to chronic infection is not clear, but responses generated in acute infection have been shown to decline in some subjects who remained persistently infected and chronic infection is characterized by low frequencies of CD8+ T cells in peripheral blood.(13, 15-20) Although the liver has the potential to delete antigen-specific T cell responses, HCV-specific CD8+ CTL lines have been generated from the liver of both chronically infected humans and chimpanzees, suggesting that elimination of HCV specific lymphocytes from the liver is neither universal nor necessary for HCV persistence.(21-24) Survival of HCV despite virus-specific CD8+ CTL might be explained by impaired cellular effector functions (proliferation, cytokine secretion, or cytolytic activity), T cell exhaustion, or dendritic cell dysfunction.(16, 25-27) A final possibility is that persistence is facilitated by viral evolution over the course of infection, enabling escape by mutation of key epitopes targeted by T-lymphocytes. Mutational escape from T cell responses has indeed been noted in HIV, which uses an error prone RNA polymerase similar to HCV.(28)


Mathematical models of viral kinetics suggest that up to 1012 virions are produced each day in a human with chronic hepatitis C.(29) This rate exceeds comparable estimates of the production of HIV by more than an order of magnitude, and, coupled with the absence of proofreading by the HCV NS5B RNA polymerase, results in frequent mutations within the HCV genome. Mutation of class I or II MHC restricted T cell epitopes could alter the outcome of infection by preventing or delaying clearance of infected hepatocytes.(30) In the face of a vigorous multispecific CTL response, mutation of several epitopes, perhaps simultaneously, would be required for survival of the virus. In HCV infection, a strong association between viral persistence and the development of escape mutations has been demonstrated in the chimpanzee model (31) and one group examined viral evolution in a single human histocompatibility leukocyte antigen (HLA)-B8-restricted NS3 epitope,(32) but evidence of evasion of a multispecific CTL response in humans is lacking. Although mutations in class I MHC restricted HCV epitopes have been observed in humans with chronic HCV infection, it is uncertain that these mutations result from CD8+ T cell selection pressure or that they occur during the acute phase of infection, when clearance or persistence is determined.(33-35)


Studies of the cellular immune response to acute HCV infection have been challenging because acute hepatitis C is usually clinically silent, making early virus isolates and CTL difficult to obtain. In addition, no consistent pattern of HCV epitope dominance has emerged in humans so large numbers of PBMC are needed for the broad screening required to identify CTL responses. We have overcome these challenges to test the hypothesis that cytotoxic T lymphocyte (CTL)-driven sequence variation occurs with progression to persistent HCV in humans. We prospectively studied HCV antibody negative injection drug users at risk for HCV infection and compared the viral sequences at initial viremia to sequences obtained at multiple time points in acute HCV infection. In parallel, a genome wide analysis of T cell responsiveness was performed. As evidence of immune selection pressure, we determined the percentage of T cell epitopes that underwent substitution, assessed the likelihood of amino acid changing substitutions to occur within versus outside T cell epitopes, and examined the effects of observed amino acid sequence changes in epitopes on T cell recognition and MHC class I binding. For amino acid substitutions outside T cell epitopes, we investigated explanations other than T cell pressure. Our results provide strong evidence in humans that both immune and fitness selection occur during the acute phase of HCV infection.


Materials And Methods


Participants. The Risk Evaluation Assessment of Community Health (REACH) prospective study of young IDUs in Baltimore, Md. examined the incidence and risk factors for HCV infection, as described previously. (47) Participants eligible for the study were anti-HCV antibody negative, between 15 to 30 years of age, and acknowledged use of injection drugs. Participants were invited to co-enroll in a substudy of acute hepatitis C and those who consented had blood drawn for isolation of serum, plasma, and peripheral blood mononuclear cells in a protocol designed for monthly follow up. At each visit, participants were provided counseling to reduce the risks of drug use. The REACH protocol and the HCV substudy protocols were approved by the institutional review boards of the Johns Hopkins Schools of Medicine and Hygiene and Public Health.


From 1997 to 2002, 179 participants were enrolled and 62 (34.6%) developed anti-HCV antibodies (seroconverted). Beginning in November 2001, acutely infected persons were assessed for donation of ˜108-1010 PBMC by blood donation or apheresis. HCV-specific lymphocyte responses and sequences were studied in detail in 5 acutely infected persons who were assessed frequently during the six-month period following infection and from whom large volumes of PBMC were obtained. HCV sequencing and limited analysis of the cellular response were performed using an additional 3 subjects for whom smaller numbers of cells were available, as described below.


HCV testing protocol. Serum or plasma, stored at −80° C., from monthly follow-up visits was tested for the presence of HCV specific antibodies using the commercially available Ortho version 3.0 enzyme-linked immunosorbent assay (ELISA) (Ortho Clinical Diagnostics, Raritan, N.J.) according to manufacturer's instructions. Specimens in which antibodies were detected were retested in duplicate along with the participant's previous seronegative sample to identify seroconverters. HCV RNA testing was performed on sera or plasma separated from blood within two hours of collection and stored at −80° C. For HCV seroconverters, HCV RNA testing was done on samples collected before seroconversion until a negative result was obtained and after seroconversion to evaluate the outcome of infection (viral persistence versus clearance) by using quantitative, and if undetectable, qualitative HCV RNA tests that are described below.


HCV RNA Assays—Qualitative. For detecting HCV RNA, we used the COBAS AMPLICOR™ Hepatitis C Virus Test version 2.0 (Roche Molecular Systems, Branchburg, N.J.). A limit of detection of 1.7 log10International Units (IU)/mL at >95% detection is reported for this assay. Quantitative. To determine concentration of HCV RNA in serum, we used a quantitative RT-PCR assay (COBAS AMPLICOR™ HCV Monitor version 2.0, Roche Molecular Systems). This assay has a lower limit of quantitation of 2.8 log10IU/mL. When HCV RNA was not detected by using this assay, the sample was retested using the Roche qualitative test.


HCV Genotyping. Genotype was determined by performing phylogenetic analysis on Core-E1 region sequences of HCV obtained from the first viremic specimen. For most specimens, sequences were obtained from cDNA clones generated with a long amplicon RT-PCR method that has been described previously.(48) For other specimens, genotype was determined by direct sequencing of RT-PCR products from the same Core-E1 region as previously described.(49) Sequences were aligned using ClustalX,(50) trimmed to equal length using BioEdit.(51) The GTR+I+G analytical model (parameters available on request from the authors) was selected using the AIC criterion as implemented in ModelTest version 3.06(52) and PAUP* version 4b10 (Sinauer Associates, Sunderland, Mass.). Phylogenetic trees were estimated using the neighbor-joining algorithm implemented in PAUP*, and robustness of clustering was tested using by bootstrap analysis.(53)


Viral recovery. HCV clearance was defined as the presence of anti-HCV with HCV RNA undetectable by the COBAS AMPLICOR™ qualitative assay in serum or plasma specimens from ≧2 consecutive visits obtained at least 300 days after initial detection of viremia. Persistence was defined as the persistent presence of anti-HCV with HCV RNA detectable by the qualitative or quantitative COBAS AMPLICOR assay in serum or plasma specimens obtained at least 300 days after initial viremia.(54)


Hemigenomic HCV sequencing and analysis. From 140-280 uL of serum or plasma, the 5.2 kb region from the 5′UTR to the NS3/NS4A junction was cloned as previously described.(48) For each specimen, thirty-three clones were assigned to clonotypes by using a previously-described gel shift assay,(55) and 2 clones representing the modal clonotype were sequenced, with a third clone used as needed to resolve discrepancies. Sequences were assembled into contigs using Aligner (CodonCode). Sequence data were obtained at the point of initial viremia and approximately six months later.


Reference sequence analysis. Reference sequence data were obtained from the HCV Sequence Database (http://hcv.lanl.gov). For table 1, amino acid sequence was inferred from cDNA sequences, which were obtained using the HCV Sequence Database's Search Interface. Default search parameters were used, except that (i) only subtype 1a sequences were included, (ii) recombinant sequences were excluded, (iii) non-human sequences were excluded, and (iv) the search was performed for the codon of interest based on position in the HCV polyprotein. When multiple sequences with the same “patient ID” were obtained, only the first occurrence was retained.


Cellular immunology. IFN-γ ELISPOT assay to assess HCV-specific T-cell responses. HCV-specific CD8+ T-cell responses were quantified by ELISPOT assay as previously described(56) with the following modifications. For Subjects 17, 18, 21, 28, and 29, enough PBMC were acquired to test for T cell recognition of 523 overlapping peptides (16-22mer peptides overlapping by 10 amino acids) spanning the entire expressed HCV-H77 genome (genotype 1a) as well as 83 peptides corresponding to optimal described CTL epitopes(57) in a matrix format. The peptides recognized in the matrix were subsequently tested individually and in at least duplicate to confirm recognition and to measure the number of spot forming colonies (SFC) produced. For three additional subjects where the number of cells was limited, responses to peptides spanning regions of sequence variation detected during sequencing, but not to the entire collection of 523 overlapping peptides, were assessed. Ninety-six well polyvinylidene plates (Millipore, Billerica, Mass.) were coated with 2.5 μg/ml recombinant human anti-IFN-gamma antibody (Endogen, Pierce Biotechnology, Rockford, Ill.) in PBS at 4° C. overnight. Fresh or previously frozen PBMC were added at 200,000 cells/well in 140 μl R10 media (RPMI 1640 (Sigma-Aldrich Corp., St. Louis, Mo.), 10% FCS (Sigma-Aldrich), and 10 mM Hepes buffer(Sigma-Aldrich) with 2 mM glutamine and antibiotics (50 U/ml penicillin-streptomycin)). Peptides were added directly to the wells at a final concentration of 10 μg/ml. The plates were incubated for 20 hours at 37° C., 5% CO2. Plates were then washed, labelled with 0.25 μg/ml biotin-labelled anti-IFN-γ (Endogen), and developed by incubation with streptavidin-alkaline phophatase (Bio-Rad Lab., Hercules, Calif.) followed by incubation with BCIP/NBT (Bio-Rad) in Tris-buffer (pH 9.5). The reaction was stopped by washing with tap water and the plates were dried, prior to counting on an ELISPOT reader (Cellular Technology Ltd, Cleveland, Ohio). For quantitation of ex-vivo responses, the assay was performed at least in duplicate and background was not more than 15 SFC per million PBMC. Responses were considered positive if the number of spots per well minus the background was at least 25 SFC per million PBMC.(56) A control of pooled cytomegalovirus, Epstein-Barr virus, and influenza antigens (CEF control peptide pool) and phytohemmaglutinin (PHA) were used as positive controls.(58) Responses to the CEF control peptide pool were quantifiable and remained relatively constant over time. Responses to PHA were uniformly positive.


Intracellular cytokine staining (ICS). To determine whether the lines generated for each epitope were CD4+ or CD8+ T cell lines, intracellular cytokine staining (ICS) for IFN-γ was performed as previously described.(56) Briefly, 1×106 PBMC were incubated with 4 μg/ml peptide at 37° C. and 5% CO2 for 1 h before the addition of Monensin (1 μl/ml; Sigma-Aldrich). The cells were incubated for an additional 5 h at 37° C. and 5% CO2. PBMC were then washed and stained with surface antibodies, fluorescein isothiocyanate (FITC)-conjugated anti-CD8 or FITC-conjugated anti-CD4 (Becton Dickinson, BD, Franklin Lakes, N.J.) at 4° C. for 20 min. Following the washing, the PBMC were fixed and permeabilized (Caltag, Burlingame, Calif.), and the phycoerythrin (PE)-conjugated anti-IFN-γ MAb (Becton Dickinson) was added. Cells were then washed and analyzed on a FACS-Calibur flow cytometer using CellQuest software (Becton Dickinson).


Magnetic bead separation of CD8+ and CD4+ T cells. To determine if the responding T cells in PBMC were CD4+ or CD8+ T cells, between 3 and 10×107 PBMC were labeled with magnetic beads bearing anti-CD8 or anti-CD4 antibodies (Miltenyi Biotec, Auburn, Calif.) according to the manufacturer's instructions and cells were positively selected using an Auto MACS (Miltenyi Biotec) to isolate either CD8+ or CD4+ cells. The ELISPOT assay was repeated using the isolated CD8+ or CD4+ T cells to determine if recognition of an epitope were mediated by CD8+ or CD4+ T cells.


Bulk stimulation of peripheral blood mononuclear cells. To establish CD8+ T cell lines, cryopreserved or fresh PBMC (4−10×106) were stimulated with 10 μg/ml of synthetic HCV peptide and 0.5 μg/ml of the costimulatory antibodies anti-CD28 and anti-CD49d (Becton Dickinson) in R10 media. Recombinant interleukin-2 (IL-2, 25 IU/ml) was added on day 2 and every other day thereafter. Cells were restimulated with 25×106 irradiated allogeneic PBMC and 10 μg/ml of synthetic HCV peptide after ten days.


Testing impact of amino acid substitutions on T cell recognition. To assess the impact of amino acid substitutions on T cell recognition, HCV peptide-specific T cell lines generated from PBMC obtained six months after initial viremia were tested for IFN-γ production in response to serial dilutions of synthetic peptides representing the viral sequences present at initial viremia (t0) or at six months following initial viremia (t6). In five cases where the frequency of T cells specific for the HCV epitope was high, bulk PBMC obtained six months following initial viremia were also tested in this way. Comparison of initial and variant epitopes was performed using log10 dilutions of the t0 and t6 peptides from 10 μM to 0.001 μM in the IFN-γ ELSIPOT assay described above for PBMC, but using 30,000 T cells when T cell lines were tested. Ten peptide pairs from subjects with chronic viremia were tested, and three patterns were observed. Loss of recognition was defined as either no recognition at the highest concentration of the t6 peptide or at least 20 fold greater recognition of the to peptide than the t6 peptide at all concentrations. Decreased recognition was defined as greater than 2 and less than 20 fold fewer spot forming colonies (SFC) produced at two or more concentrations of the t6 peptide. Comparable recognition was defined as no more than a two fold difference in SFC between the t0 and t6 peptides at two or more concentrations tested.


MHC-peptide binding assays. EBV transformed cell lines were used as the primary sources of HLA molecules. Cells were maintained in vitro and HLA molecules purified by affinity chromatography as previously described.(59) Quantitative assays to measure the binding of peptides to purified class I molecules are based on the inhibition of binding of a radiolabeled standard peptide.(59) Briefly, 1-10 nM of radiolabeled peptide was co-incubated at room temperature with 1 μM to 1 nM of purified MHC in the presence of 1 μM human, β2-microglubulin (Scripps Laboratories, San Diego, Calif.) and a cocktail of protease inhibitors. After a two-day incubation, binding of the radiolabeled peptide to the corresponding MHC class I molecule was determined by capturing MHC/peptide complexes on Greiner Lumitrac 600 microplates (Greiner Bio-one, Longwood, Fla.) coated with the W6/32 antibody, and measuring bound cpm using the TopCount microscintillation counter (Packard Instrument Co. Meriden, Conn.).


Statistical analysis. Statistical analysis was done with the aid of SigmaStat software version 3.10 (Systat software, Inc.). For comparing proportions, Fisher's exact (small sample size) and Chi squared (large sample size) tests were used. Differences were considered significant if p-values were <0.05.


Results—We assessed T cell responses and sequenced half the HCV genome in eight subjects, seven of whom progressed to chronic infection (FIG. 1). T cell and viral analyses were done for all eight at initial detection of viremia and then six months later. Additional assessment of T cell responses was done at the time points designated by arrows in FIG. 1. No T cell responses were detectable at initial viremia, but all of the subsequently detected T cell responses were present by 6 months following infection.


Persistence versus Loss of T cell Epitopes with Sequence Evolution—The locations of amino acid substitutions and recognized CD8+ T cell epitopes are shown for subjects 17, 18, 21, 28, and 29 in FIG. 2. Subject 21 was the only subject with no detectable T cells responses. The only subject who cleared HCV spontaneously (18) was also the only individual whose HCV genome had no substitutions within any recognized T cell epitope at 6 or 12 months following initial viremia. Subjects 17, 28, and 29 remain persistently infected and had substitutions at six months in ⅗, 6/8, and ⅔ of recognized CD8+ T cell epitopes, respectively. In summary, the subject who cleared infection had no substitutions in 3 recognized T-cell epitopes at 6 or at 12 months after infection, whereas the three subjects with chronic viremia and T cell responses had substitutions in 60-75% of CD8+ T cell epitopes by six months following infection.


Impact of Amino Acid Substitutions on T cell Recognition—T cell lines were generated from PBMC using synthetic peptides representing the viral sequences present at initial viremia (t0). To assess the impact of amino acid substitutions on T cell recognition, those T cell lines and bulk PBMC obtained approximately 6 months after initial viremia were tested for IFN-γ production in response to serial dilutions of the to peptide or a synthetic peptide representing the viral sequence present at six months following initial viremia (t6). Ten t0/t6 peptide pairs from subjects with persistent viremia were tested using T cell lines, and three patterns were observed (FIG. 3). For the ten t6 peptides tested, we noted loss of recognition of four (FIG. 3a), decreased recognition of four others, (FIG. 3b), and comparable recognition of two (FIG. 3c). Therefore, for 8 of the 10 mutations in recognized epitopes tested, recognition by T lymphocytes was lost or significantly reduced compared to recognition of the sequence present at initial viremia, indicating escape. In no case was the t6 variant peptide recognized better than the to peptide. For five t0/t6 peptide pairs also tested with bulk PBMC, the patterns of recognition were the same as those observed using T cell lines, as shown for one peptide pair in FIG. 4. That PBMC as well as T-cell lines generated against the to peptides failed to recognize the t6 peptides suggests that not only did the substitution allow escape from the T-cells specific for the original sequence, but also that no new T-cell responses were generated against the t6 sequence. To rule out transient suppression or problems with specimen handling and the subsequent development of T cell responses to the t6 HCV sequence, IFN-γ ELISPOT testing for recognition of the t6 peptides demonstrating escape was also performed in subjects 17 and 28 using PBMC obtained approximately 12, 18, 24, and in subject 17, 36 months following initial infection. The patterns of recognition persisted over time and recognition of the t6 peptides declined in parallel with the decline in recognition of the to peptides that occurred with prolonged infection (FIG. 5). Despite months of persistent exposure to the t6 peptides that escaped recognition at six months following infection, in no case did a new T cell response specific for a t6 peptide arise in the following 6 to 36 months.


Mechanisms of Loss of T cell Recognition with Amino Acid Substitutions—Amino acid substitutions may result in decreased recognition through reduced HLA binding capacity, abrogation of T cell recognition, or altered processing with failure to generate the correct sequence for presentation on the surface. We did observe marked reduction in HLA biding capacity as one mechanism for reduced recognition in our subjects. For example, the HLA A*0101-restricted ATDALMTGY epitope recognized at to by Subject 28 had an A*0101 binding capacity (IC50) of 0.24 nM while the ATDALMTGF peptide recognized at t6 had a binding capacity of 64 nM. A five fold difference in binding capacity is considered significant, thus the variant peptide is less well bound to the HLA. However, the HLA A*0201-restricted KLVALGINAV epitope recognized at to by Subject 28 had an A0201 binding capacity of 5.0 nM while the KLVAMGINAV peptide recognized at t6 had a binding capacity of 2.3 nM, an insignificant difference that if anything favors the less well recognized peptide, t6. This t6 peptide may stimulate less IFN-γ production in the ELISPOT assay because of decreased T cell recognition rather than reduced HLA binding capacity. Although two of the ten t0/t6 peptide pairs were recognized comparably, we can not rule out that they represent escape mutations as well since substitutions have been shown by others to result in altered processing such that the peptide is no longer presented on the cell surface.(32,36,37) This mode of escape is circumvented when peptides are loaded onto the surface of the cell, as is the case in an ELISPOT assay.


Driving Forces for Sequence Evolution—We next evaluated the proportion of substitutions occurring in observed T cell epitopes and investigated explanations for substitutions outside of T cell epitopes. Of 69 substitution observed in the eight subjects, 17 (25%) occurred within detected CD8+ T-cell epitopes, 36 (52%) in envelope proteins (likely antibody targets), and 16 (23%) outside envelope regions and T cell epitopes. Of the eight subjects, five subjects had persistent viremia and T cell responses, and amino acid substitutions were a median of 13 fold more likely to occur within T cell epitopes than outside epitopes (p<0.001, range 5-38) in those subjects. Even though at a population level the envelope proteins are highly diverse, amino acid substitutions in T cell epitopes exceeded those in E1 and E2 by 7 fold (p<0.001, range 4-14). Of the 16 amino acid substitutions observed outside of the envelope proteins or recognized T cell epitopes (Table 1), 8 represented conversion to the most commonly-observed residue in reference sequences (i.e., convergence to the consensus sequence). Because each change has 19 alternative amino acids, and each site has only one modal (most frequently-observed) state, the convergence on modal residues at 50% of sites is highly unexpected if substitution is random (p=0.015, Table 2). In addition, the convergent changes were not consistently conservative changes, arguing against strict structural or functional constraint. These data are consistent with an accompanying study of chronically infected individuals for whom the inoculum sequence was known. That study showed that substitutions occurring in known epitopes for which the infected individual lacked the presenting HLA allele were consistently reversions toward the worldwide consensus sequence for subtype 1b.


Discussion—In this investigation of sequence variation in T cell epitopes as a potential mechanism for viral persistence, we show that amino acid substitutions during acute HCV infection are nonrandom and may be explained in part by escape from CD8+ T cell recognition and convergence, possibly because of replicatively unfit substitutions selected in a previous host. Significantly, there is early fixation of the T cell repertoire since we observed no instances of de novo development of T cells that recognized mutant t6 epitopes better than the original t0 epitope using lines or PBMC.


Using the chimpanzee model of HCV infection, mutation of multiple class I MHC restricted epitopes early in the course of chronic HCV infection has been demonstrated.(31) The role of CD8+ CTL in control of HCV replication was further reinforced by a statistically significant increase in the number of mutations resulting in amino acid changes in class I MHC restricted epitopes but not unrestricted epitopes or flanking sequences of the viral genome. These data indicate that in chimpanzees, amino acid substitutions in class I MHC restricted epitopes are selected and possibly maintained by HCV-specific CD8+ CTL populations that exert positive Darwinian selection pressure.


We also observed a statistically significant increase in the number of mutations resulting in amino acid changes in class I MHC restricted epitopes versus portions of the viral genome outside T cell epitopes and that amino acid substitutions in class I MHC restricted epitopes resulted in escape from CD8+ T cell recognition in acutely HCV-infected humans who progressed to chronic infection. New T cells specific for the sequences detected six months after initial viremia were not detected despite follow-up for as long as three years following infection. Thus, selection of HCV variants that evade CD8+ T cell recognition may represent a mechanism for persistence of HCV infection in humans. Supporting this, we observed no substitutions within recognized CD8+ T cell epitopes in the subject who cleared infection and substitution in 60-75% of CD8+ T cell epitopes in subjects with persistent infection. The number of CTL epitopes with substitutions has previously been shown to correlate with control of HIV viremia,(28) but a relationship between HCV control and maintenance of T cell epitopes had not been shown previously.


Although the observed substitutions were disproportionately contained within the portion of the HCV polyprotein in T-cell epitopes, many were found outside detectable T cell epitopes. It is possible that we missed CD8+ T cell epitopes and that some of the substitutions observed outside of T cell epitopes actually represented substitutions within T cell epitopes. We took several steps to minimize the chances of this occurring. Where there were substitutions and the H77 sequence used to make overlapping peptides as potential antigens differed from that of the subject, we tested for recognition of overlapping peptides representing autologous sequence. Where the subject's sequence matched H77 in areas of amino acid substitution, but there were no responses detected, we tested for recognition of additional overlapping peptides with different termini to minimize the possibility that we cut within a region containing an epitope. We detected no additional epitopes via ELISPOT analysis with either method of antigen modification (data not shown).


Since there is no clinical indication for liver biopsy in acute infection, we could only assess responses in the periphery. It is therefore possible that some of the substitutions may represent pressure for T cell responses present in the liver but not detectable in the periphery, We do not favor this explanation since previous studies have shown that the majority of T cell responses in the liver are also detectable in the peripheral blood.(38,39) There are several possible alternative mechanisms for selection of these substitutions occurring outside of observed CD8+ T cell epitopes. The first is that they represent substitutions in CD4+ T cell epitopes. Although we detect a few CD4+ T cell epitopes using our ELISPOT assay, the conditions of the assay preferentially detect CD8+ T cell epitopes and we may fail to detect all the possible CD4+ T cell epitopes. The second possible explanation is that they represent substitutions in B cell epitopes. Mutations in dominant antibody epitope(s) located in the hypervariable region 1 (HVR-1) of envelope glycoprotein 2 (E2) have been linked to persistence of HCV infection.(40) B cell epitopes are predicted to occur within the envelope regions of the polyprotein and the majority of mutations outside T cell epitopes were found in the envelope proteins for most subjects. Another possible explanation is that the mutations outside of T cell epitopes are selected because they confer a viral replication advantage. Some mutations may represent epitopes recognized by the previous host that revert to a more stable sequence when the new host fails to mount similar immune pressure. This may occur when the new host lacks the MHC allele required to present that epitope. Loss of escape mutations upon passage of SIV to new animals (41), and HIV to humans (42), that do not exert immune pressure on that region has been described recently, with the inference that escape from CTL responses may reduce viral fitness.


The relevance of those studies to human infection with HCV is supported by a recent study of one epitope in an acutely HCV infected patient (32), and our accompanying study of chronically infected individuals. The latter study shows that HCV amino acid sequence tends to revert to consensus in areas outside of T cell epitopes in subjects who are persistently infected. The consensus sequence likely represents a more replicatively fit state than the initial infecting strains, which have presumably adapted to the immune response of the previous host. Taken together, these results reveal at least two types of sequence variation occurring simultaneously in progression of acute HCV to persistence: immune pressure that selects T cell escape variants, and reversion to consensus sequence that is likely to result in enhanced replicative fitness.


Despite viral mutation resulting in the production of new potential antigens, no new T cell responses developed in response to mutant peptides that escaped initial recognition over months and in some cases years following the appearance of the mutation. This phenomenon has also been observed with HIV sequence evolution and there the failure to prime new responses may be due to impaired CD4+ T cell function. Although overall CD4+ T cell function is intact in HCV infection, chronic HCV infection has been linked to loss of HCV specific CD4+ T cell responses.(43) In addition, HCV has been linked to impaired DC function, decreased IRF3 signaling, and PKR inhibition, which may inhibit priming of an immune response to the mutated peptides.(26,27,44,45) However, the failure to prime responses to the newly generated sequences is observed even in HIV infected individuals with relatively high CD4 counts and there is no evidence in those with HCV infection of impaired priming of immune responses to other antigens, as would be evident by global immunosuppression. An alternative explanation is that original antigenic sin (the higher threshold required for stimulation of an immune response to an epitope resembling a previously recognized epitope (46)) may be responsible for the lack of response to mutant sequences seen in HIV and HCV, though this phenomenon has not been demonstrated in humans. Lastly, since the selective pressures of the immune system favor the emergence of a viral sequence that fails to elicit a productive response, we may be observing sequences that cannot be processed effectively for presentation or that resemble self antigens and are therefore incapable of stimulating an immune response.


Although the features of acute infection responsible for increased responsiveness to interferon therapy are unknown, this study suggests a mechanistic linkage between viral sequence variation and progression to chronicity. The arrested development of new T cell responses despite ongoing viremia with sequence evolution distinguishes the acute and chronic phases of HCV infection. Enhanced understanding of cellular immune failure leading to chronic HCV infection could accelerate development of vaccines to prevent the development of chronic infection, and agents that could increase responsiveness to interferon-based therapy for chronic HCV infection.


REFERENCES



  • 1. World Health Organization. 1997. Hepatitis C: global prevalence. Weekly Epidemiological Record 341-348.

  • 2. Alter, M. J. 1997. Epidemiology of hepatitis C. Hepatology 26:62 S-65S.

  • 3. Alter, M. J. 1995. Epidemiology of hepatitis C in the West. Semin Liver Dis 15:5-14.

  • 4. Centers for Disease Control and Prevention. 1998. Recommendations for prevention and control of hepatitis C virus (HCV) infection and HCV-related chronic disease. MMWR 47(No. RR-19):1-39.

  • 5. Villano, S. A., D. Vlahov, K. E. Nelson, S. Cohn, and D. L. Thomas. 1999. Persistence of viremia and the importance of long-term follow-up after acute hepatitis C infection. Hepatology 29:908-914.

  • 6. Cao, J., N. Sullivan, E. Desjardin, C. Parolin, J. Robinson, R. Wyatt, and J. Sodroski. 1997. Replication and neutralization of human immunodeficiency virus type 1 lacking the V1 and V2 variable loops of the gp120 envelope glycoprotein. J. Virol. 71:9808-9812.

  • 7. Tong, M. J., N. S. El-Farra, A. R. Reikes, and R. L. Co. 1995. Clinical outcomes after transfusion-associated hepatitis C. N. Engl. J. Med. 332:1463-1466.

  • 8. Jaeckel, E., M. Comberg, J. Mayer, J. N. Koerbel, H. Wedemeyer, A. Z. M. Schueler, C. Trautwein, and M. P. Manns. 2000. Early treatment of acute hepatitis C infection with interferon-alfa 2B monotherapy prevents development of chronic HCV infection. Hepatology Abstract 634:318A. (Abstr.)

  • 9. Jaeckel, E., M. Cornberg, H. Wedemeyer, T. Santantonio, J. Mayer, M. Zankel, G. Pastore, M. Dietrich, C. Trautwein, and M. P. Manns. 2001. Treatment of acute hepatitis C with interferon alfa-2b. N. Engl. J. Med. 345:1452-1457.

  • 10. Nomura, H., S. Sou, H. Tanimoto, T. Nagahama, Y. Kimura, J. Hayashi, H. Ishibashi, and S. Kashiwagi. 2004. Short-term interferon-alfa therapy for acute hepatitis C: a randomized controlled trial. Hepatology 39:1213-1219.

  • 11. Cooper, S., A. L. Erickson, E. J. Adams, J. Kansopon, A. J. Weiner, D. Y. Chien, M. Houghton, P. Parham, and C. M. Walker. 1999. Analysis of a successful immune response against hepatitis C virus. Immunity. 10:439-449.

  • 12. Gruner, N. H., T. J. Gerlach, M. C. Jung, H. M. Diepolder, C. A. Schirren, W. W. Schraut, R. Hoffmann, R. Zachoval, T. Santantonio, M. Cucchiarini, A. Cerny, and G. R. Pape. 2000. Association of hepatitis C virus-specific CD8+ T cells with viral clearance in acute hepatitis C. J. Infect. Dis. 181:1528-1536.

  • 13. Lechner, F., D. K. Wong, P. R. Dunbar, R. Chapman, R. T. Chung, P. Dohrenwend, G. Robbins, R. Phillips, P. Klenerman, and B. D. Walker. 2000. Analysis of successful immune responses in persons infected with hepatitis C virus. J. Exp. Med 191:1499-1512.

  • 14. Takaki, A., M. Wiese, G. Maertens, E. Depla, U. Seifert, A. Liebetrau, J. L. Miller, M. P. Manns, and B. Rehermann. 2000. Cellular immune responses persist and humoral responses decrease two decades after recovery from a single-source outbreak of hepatitis C. Nat. Med. 6:578-582.

  • 15. Gruener, N. H., F. Lechner, M. C. Jung, H. Diepolder, T. Gerlach, G. Lauer, B. Walker, J. Sullivan, R. Phillips, G. R. Pape, and P. Klenerman. 2001. Sustained dysfunction of antiviral CD8+ T lymphocytes after infection with hepatitis C virus. J. Virol. 75:5550-5558.

  • 16. Thimme, R., D. Oldach, K. M. Chang, C. Steiger, S. C. Ray, and F. V. Chisari. 2001. Determinants of viral clearance and persistence during acute hepatitis C virus infection. J. Exp. Med. 194:1395-1406.

  • 17. Battegay, M., J. Fikes, A. M. Di Bisceglie, P. A. Wentworth, A. Sette, E. Celis, W.-M. Ching, A. Grakoui, C. M. Rice, K. Kurokohchi, J. A. Berzofsky, J. H. Hoofnagle, S. M. Feinstone, and T. Akatsuka. 1995. Patients with chronic hepatitis C have circulating cytotoxic T cells which recognize hepatitis C virus-encoded peptides binding to HLA-A2.1 molecules. J. Virol. 69:2462-2470.

  • 18. Hiroishi, K., H. Kita, M. Kojima, H. Okamoto, T. Moriyama, T. Kaneko, T. Ishikawa, S. Ohnishi, T. Aikawa, N. Tanaka, Y. Yazaki, K. Mitamura, and M. Imawari. 1997. Cytotoxic T lymphocyte response and viral load in hepatitis C virus infection. Hepatology 25:705-712.

  • 19. Shirai, M., H. Okada, M. Nishioka, T. Akatsuka, C. Wychowski, R. Houghten, C. D. Pendleton, S. M. Feinstone, and J. A. Berzofsky. 1994. An epitope in hepatitis C virus core region recognized by cytotoxic T cells in mice and humans. J. Virol. 68:3334-3342.

  • 20. Rehermann, B., K. M. Chang, J. G. McHutchison, R. Kokka, M. Houghton, and F. V. Chisari. 1996. Quantitative analysis of the peripheral blood cytotoxic T lymphocyte response in patients with chronic hepatitis C virus infection. J. Clin. Invest. 98:1432-1440.

  • 21. Crispe, I. N. 2003. Hepatic T cells and liver tolerance. Nat. Rev. Immunol. 3:51-62.

  • 22. Erickson, A. L., M. Houghton, Q. L. Choo, A. J. Weiner, R. Ralston, E. Muchmore, and C. M. Walker. 1993. Hepatitis C virus-specific CTL responses in the liver of chimpanzees with acute and chronic hepatitis C. J. Immunol. 151:4189-4199.

  • 23. Koziel, M. J., D. Dudley, J. T. Wong, J. Dienstag, M. Houghton, R. Ralston, and B. D. Walker. 1992. Intrahepatic cytotoxic T lymphocytes specific for hepatitis C virus in persons with chronic hepatitis [published erratum appears in J Immunol 1993 Mar. 15; 150(6):2563]. J. Immunol. 149:3339-3344.

  • 24. Wong, D. K., D. D. Dudley, N. H. Afdhal, J. Dienstag, C. M. Rice, L. P. Wang, M. Houghton, B. D. Walker, and M. J. Koziel. 1998. Liver-derived CTL in hepatitis C virus infection: Breadth and specificity of responses in a cohort of persons with chronic infection. J. Immunol. 160:1479-1488.

  • 25. Wedemeyer, H., X. S. He, M. Nascimbeni, A. R. Davis, H. B. Greenberg, J. H. Hoofnagle, T. J. Liang, H. Alter, and B. Rehermann. 2002. Impaired effector function of hepatitis C virus-specific CD8+ T cells in chronic hepatitis C virus infection. J. Immunol. 169:3447-3458.

  • 26. Bain, C., A. Fatmi, F. Zoulim, J. P. Zarski, C. Trepo, and G. Inchauspe. 2001. Impaired allostimulatory function of dendritic cells in chronic hepatitis C infection. Gastroenterology 120:512-524.

  • 27. Kanto, T., N. Hayashi, T. Takehara, T. Tatsumi, N. Kuzushita, A. Ito, Y. Sasaki, A. Kasahara, and M. Hori. 1999. Impaired allostimulatory capacity of peripheral blood dendritic cells recovered from hepatitis C virus-infected individuals. J. Immunol. 162:5584-5591.

  • 28. Jones, N. A., X. Wei, D. R. Flower, M. Wong, F. Michor, M. S. Saag, B. H. Hahn, M. A. Nowak, G. M. Shaw, and P. Borrow. 2004. Determinants of human immunodeficiency virus type 1 escape from the primary CD8+ cytotoxic T lymphocyte response. J. Exp. Med. 200:1243-1256.

  • 29. Neumann, A. U., N. P. Lam, H. Dahari, D. R. Gretch, T. E. Wiley, T. J. Layden, and A. S. Perelson. 1998. Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science 282:103-107.

  • 30. Eckels, D. D., H. Wang, T. H. Bian, N. Tabatabai, and J. C. Gill. 2000. Immunobiology of hepatitis C virus (HCV) infection: the role of CD4 T cells in HCV infection. Immunol. Rev. 174:90-97.

  • 31. Erickson, A. L., Y. Kimura, S. Igarashi, J. Eichelberger, M. Houghton, J. Sidney, D. McKinney, A. Sette, A. L. Hughes, and C. M. Walker. 2001. The outcome of hepatitis C virus infection is predicted by escape mutations in epitopes targeted by cytotoxic T lymphocytes. Immunity. 15:883-895.

  • 32. Timm, J., G. M. Lauer, D. G. Kavanagh, I. Sheridan, A. Y. Kim, M. Lucas, T. Pillay, K. Ouchi, L. L. Reyor, J. S. Zur Wiesch, R. T. Gandhi, R. T. Chung, N. Bhardwaj, P. Klenerman, B. D. Walker, and T. M. Allen. 2004. CD8 epitope escape and reversion in acute HCV infection. J. Exp. Med. 200:1593-1604.

  • 33. Chang, K. M., B. Rehermann, J. G. McHutchison, C. Pasquinelli, S. Southwood, A. Sette, and F. V. Chisari. 1997. Immunological significance of cytotoxic T lymphocyte epitope variants in patients chronically infected by the hepatitis C virus. J. Clin. Invest. 100:2376-2385.

  • 34. Kaneko, T., I. Nakamura, H. Kita, K. Hiroishi, T. Moriyama, and M. Imawari. 1996. Three new cytotoxic T cell epitopes identified within the hepatitis C virus nucleoprotein. J. Gen. Virol. 77:1305-1309.

  • 35. Tsai, S. L., Y. M. Chen, M. H. Chen, C. Y. Huang, I. S. Sheen, C. T. Yeh, J. H. Huang, G. C. Kuo, and Y. F. Liaw. 1998. Hepatitis c virus variants circumventing cytotoxic T lymphocyte activity as a mechanism of chronicity. Gastroenterology 115:954-966.

  • 36. Mizukoshi, E., C. Eisenbach, B. Edlin, C. Weiler, M. Carrington, T. O'Brien, and B. Rehermann. 2003. HCV-specific cellular immune responses in subjects who are anti-HCV-negative, HCV RNA-negative despite long term (>15 years) injection drug use (AASLD Annual Meeting 2003 abstract #111). Hepatology 38:210 A.

  • 37. Walker, C. M., D. J. Moody, D. P. Stites, and J. A. Levy. 1986. CD8+ lymphocytes can control HIV infection in vitro by suppressing virus replication. Science 234:1563-1566.

  • 38. He, X. S., B. Rehermann, F. X. Lopez-Labrador, J. Boisvert, R. Cheung, J. Mumm, H. Wedemeyer, M. Berenguer, T. L. Wright, M. M. Davis, and H. B. Greenberg. 1999. Quantitative analysis of hepatitis C virus-specific CD8(+) T cells in peripheral blood and liver using peptide-MHC tetrainers. Proc. Natl. Acad. Sci. USA 96:5692-5697.

  • 39. Wong, D. K., D. D. Dudley, P. B. Dohrenwend, G. M. Lauer, R. T. Chung, D. L. Thomas, and B. D. Walker. 2001. Detection of diverse hepatitis C virus (HCV)-specific cytotoxic T lymphocytes in peripheral blood of infected persons by screening for responses to all translated proteins of HCV. J. Virol. 75:1229-1235.

  • 40. Farci, P., A. Shimoda, A. Coiana, G. Diaz, G. Peddis, J. C. Melpolder, A. Strazzera, D. Y. Chien, S. J. Munoz, A. Balestrieri, R. H. Purcell, and H. J. Alter. 2000. The outcome of acute hepatitis C predicted by the evolution of the viral quasispecies. Science 288:339-344,

  • 41. Friedrich, T. C., E. J. Dodds, L. J. Yant, L. Vojnov, R. Rudersdorf, C. Cullen, D. T. Evans, R. C. Desrosiers, B. R. Mothe, J. Sidney, A. Sette, K. Kunstman, S. Wolinsky, M. Piatak, J. Lifson, A. L. Hughes, N. Wilson, D. H. O'Connor, and D. I. Watkins. 2004. Reversion of CTL escape-variant immunodeficiency viruses in vivo. Nat. Med. 10:275-281.

  • 42. Allen, T. M., M. Altfeld, X. G. Yu, K. M. O'Sullivan, M. Lichterfeld, S. Le Gall, M. John, B. R. Mothe, P. K. Lee, E. T. Kalife, D. E. Cohen, K. A. Freedberg, D. A. Strick, M. N. Johnston, A. Sette, E. S. Rosenberg, S. A. Mallal, P. J. Goulder, C. Brander, and B. D. Walker. 2004. Selection, transmission, and reversion of an antigen-processing cytotoxic T-lymphocyte escape mutation in human immunodeficiency virus type 1 infection. J. Virol. 78:7069-7078.

  • 43. Grakoui, A., N. H. Shoukry, D. J. Woollard, J. H. Han, H. L. Hanson, J. Ghrayeb, K. K. Murthy, C. M. Rice, and C. M. Walker. 2003. HCV persistence and immune evasion in the absence of memory T cell help. Science 302:659-662.

  • 44. Foy, E., K. Li, C. Wang, R. Sumpter, Jr., M. Ikeda, S. M. Lemon, and M. Gale, Jr. 2003; Regulation of interferon regulatory factor-3 by the hepatitis C virus serine protease. Science 300:1145-1148.

  • 45. Gale, M. J., C. M. Blakely, B. Kwieciszewski, S. L. Tan, M. Dossett, N. M. Tang, M. J. Korth, S. J. Polyak, D. R. Gretch, and M. G. Katze. 1998. Control of PKR protein kinase by hepatitis C virus nonstructural 5A protein: molecular mechanisms of kinase regulation. Mol Cell Biol 18:5208-5218.

  • 46. Klenerman, P. and R. M. Zinkernagel. 1998. Original antigenic sin impairs cytotoxic T lymphocyte responses to viruses bearing variant epitopes. Nature 394:482-485.

  • 47. Garfein, R. S., M. C. Doherty, E. R. Monterroso, D. L. Thomas, K. E. Nelson, and D. Vlahov. 1998. Prevalence and incidence of hepatitis C virus infection among young adult injection drug users. J. Acquir. Immune. Defic. Syndr. Hum. Retrovirol. 18 Suppl 1: S11-S19.

  • 48. Liu, Z., D. M. Netski, Q. Mao, O. Laeyendecker, J. R. Ticehurst, X. H. Wang, D. L. Thomas, and S. C. Ray. 2004. Accurate representation of the hepatitis C virus quasispecies in 5.2-kilobase amplicons. J. Clin. Microbiol. 42:4223-4229.

  • 49. Ray, S. C., R. R. Arthur, A. Carella, J. Bukh, and D. L. Thomas. 2000. Genetic Epidemiology of Hepatitis C Virus throughout Egypt. J. Infect. Dis. 182:698-707.

  • 50. Jeanmougin, F., J. D. Thompson, M. Gouy, D. G. Higgins, and T. J. Gibson. 1998. Multiple sequence alignment with Clustal X. Trends Biochem. Sci. 23:403-405.

  • 51. Hall, T. A. 2001. BioEdit: Biological sequence alignment editor for Windows 95/98/NT version 5.0.7. software. Distributed by author: http://www.mbio.ncsu.edu/RNaseP/info/programs/BIOEDIT/bioedit.html.

  • 52. Posada, D. and K. A. Crandall. 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics. 14:817-818.

  • 53. Felsenstein, J. 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39:783-791.

  • 54. Cox, A. L., D. M. Netski, T. Mosbruger, S. G. Sherman, S. Strathdee, D. C. Ompad, D. Vlahov, D. Chien, V. Shyamala, S. C. Ray, and D. L. Thomas. 2004. Prospective evaluation of community-acquired acute Hepatitis C. Clin. Infect. Dis. (in press).

  • 55. Wang, Y. M., S. C. Ray, O. Laeyendecker, J. R. Ticehurst, and D. L. Thomas. 1998. Assessment of hepatitis C virus sequence complexity by electrophoretic mobilities of both single- and double-stranded DNAs. J. Clin. Microbiol. 36:2982-2989.

  • 56. Lauer, G. M., K. Ouchi, R. T. Chung, T. N. Nguyen, C. L. Day, D. R. Purkis, M. Reiser, A. Y. Kim, M. Lucas, P. Klenerman, and B. D. Walker. 2002. Comprehensive analysis of CD8(+)-T-cell responses against hepatitis C virus reveals multiple unpredicted specificities. J. Virol. 76:6104-6113.

  • 57. Ward, S., G. Lauer, R. Isba, B. Walker, and P. Klenerman. 2002. Cellular immune responses against hepatitis C virus: the evidence base 2002. Clin. Exp. Immunol. 128:195-203.

  • 58. Currier, J. R., E. G. Kuta, E. Turk, L. B. Earhart, L. Loomis-Price, S. Janetzki, G. Ferrari, D. L. Birx, and J. H. Cox. 2002. A panel of MHC class I restricted viral peptides for use as a quality control for vaccine trial ELISPOT assays. J. Immunot. Methods 260:157-172.

  • 59. Sidney, J., S. Southwood, C. Oseroff, J. F. Del Guercio, A. Sette, and H. Grey. 1998. Measurement of MHC/Peptide Interactions by Gel Filtration. In Current Protocols in Immunology. B. Bierer, J. E. Coligan, D. H. Margulies, E. M. Shevach, W. Strober, and A. Kruisbeek, editors. John Wiley & Sons, Inc., 18.3.1-18.3.19.



Example 3
Divergent and Convergent Evolution Following a Common-Source Outbreak of Hepatitis C

The genomic sequences of viruses that are highly mutable and cause chronic infection tend to diverge over time. We report that these changes represent both immune driven selection and, in the absence of immune pressure, reversion toward a more “fit” ancestral consensus. Sequence changes in hepatitis C virus (HCV) structural and nonstructural genes were studied in a cohort of women accidentally infected with HCV in a rare common-source outbreak. We compared sequences present in serum obtained 18-22 years after infection to sequences present in the shared inoculum and found that HCV had evolved along a distinct path in each woman. Amino acid substitutions in known epitopes were directed away from consensus in persons having the HLA allele associated with that epitope (immune selection), and toward consensus in those lacking the allele (reversion). Vaccines for genetically-diverse viruses may be more effective if they represent consensus sequence, rather than a human isolate.


A virus capable of genetic variation and of causing chronic infection will evolve to optimize its fitness in each host, which is the net sum of immune recognition (positive selection) and functional constraint on replication (negative selection). Because an estimated 1012 virions are produced each day through an error-prone, non-proofreading NS5B RNA polymerase, hepatitis C virus (HCV) is especially capable of viral evolution (1,2)(These numbers in Example 3 refer to the list of references listed after Example 3). However, we previously showed that evolution is not driven by replication alone. In the acute phase of infection before adaptive immune responses (but after weeks of replication supporting a viral RNA level of more than 105 IU/mL), the same major viral variant was detected in each of a serial passage of eight chimpanzees (3). In contrast, the sequence of envelope genes, particularly HVR1, changes in virtually all humans who have been persistently infected (including the source of the inoculum passaged through this chimpanzee lineage (4)), a notable exception being persons with agammaglobulinemia, who have been shown to have reduced variability in HVR1 (5). Longitudinal studies of chimpanzees experimentally infected with HCV have revealed that amino acid replacements in immunodominant CD8+ T cell epitopes presented on MHC class I in an allele-restricted manner contribute to viral persistence (6). Thus, we hypothesized that the net evolution of HCV would demonstrate functional constraint (reversion of sequences toward consensus) as well as positive pressure (and thus reveal immunodominant epitopes).


Although it required that persons be infected with the same inoculum, it was possible to test this hypothesis because between May 1977 and November 1978 over 500 women were inadvertently infected with HCV from a single acutely-infected source, as a result of treatment with contaminated anti-D immune globulin (7). In a single amplicon, a 5.2 kb cDNA spanning 5′UTR through the NS3/NS4A junction was cloned from serum collected from 22 women 18-23 years after infection, as well as in 2 specimens of frozen plasma from the inoculum donor.


Methods and Materials


Study subjects. Twenty-two women from this outbreak were studied because they provided consent and had at least one of the 3 most common alleles at the HLA A locus (A*01, A*02, or A*03) (16).


Hemigenomic cDNA cloning. The region encoding Core, E1, E2, p7, NS2, and NS3 was amplified and cloned as previously described (17), and 40 clones per specimen were stored. For each specimen, envelope sequences from 10 random randomly-selected clones were determined using primer H77-1868a21 (17) on a PRISM version 3100 sequencer (ABI, Foster City, Calif.).


Estimation of Consensus Sequence. An alignment of full-length HCV subtype 1b sequences was obtained from the Los Alamos National Laboratories HCV database (http://hcv.lanl.gov). The alignment was edited by hand to remove gaps introduced for alignment to other genotypes, and to remove duplicate sequences from the same human source and those obtained from non-human sources. The resulting alignment included 83 sequences. A majority-rule consensus sequence was formed, with residues occurring in less than 42 sequences flagged as non-consensus. Changes in the anti-D recipients were then classified as “toward” (change results in a residue matching the consensus) or “away” (change results in a residue not matching the consensus, or residue is non-consensus).


Estimation of the likelihood of convergence. The expected frequency of co-variation assuming independence was calculated as the product of the marginal frequencies, and compared to the observed value using the Chi-squared distribution with 3 degrees of freedom. If we assume that all amino acid replacements are equally likely over a time period that is very long with respect to the rate of mutation, then sharing of amino acids at 4 variable sites in just 2 study subjects would be expected to occur at a frequency of 1/204 or 0.00000625. Of course, all amino acid replacements are not equally likely, even in the highly variable HVR1 (8); therefore, the likelihood of sharing of 4 variable sites by 2 subjects would be higher, e.g. 1/34 or 0.012 if each site is equally likely to have one of 3 residues. Because the likelihood of shared residues at variable sites in multiple study subjects is the product of such probabilities, the observed findings in this study are clearly incompatible with random substitution and most consistent with convergent evolution.


Phylogenetic Analysis. Sequences were aligned using ClustalX (18), codon boundaries were restored by hand in BioEdit (19), and phylogenetic analysis was performed using PAUP* version 4b10 (Sinauer Associates, Sunderland, Mass.) using a HKY85+G model and parameters (Ti/Tv 2.78, gamma—0.37) selected with the aid of ModelTest (20). Initial results from one specimen were consistent with subtype 1a, and that specimen was not examined further. Reference sequences included 3 from subtype 1a, 83 from subtype 1b (including AF313916), 2 from subtype 1c, 6 from subtype 2a, 8 from subtype 2b, 1 each from subtypes 2c and 2k, 4 from subtype 3a, and 1 each from subtypes 3b, 3k, 4a, 5a, 6a, 6b, 6d, 6g, 6h, and 6k (obtained from http://hcv.lanl.gov). VarPlot was used to calculate nonsynonymous and synonymous distances using the method of Nei and Gojobori, in a sliding window 20 codons wide, moving in 1 codon steps, as previously described (21).


Sequence logos. The sequence logos in FIG. 7 were generated using a novel software program, V is SPA (Visual Sequence Pattern Analysis, available on request from the author S. C. R.). The algorithm is identical to that described for type 2 logos by Gorodkin et al (22), except that the a priori distributions for the logo are calculated empirically from input sequences, and missing values in the a priori distribution are assigned the lowest frequency of residues at that site (if more than one state is represented) or 1/20 if the a priori distribution has only one residue at that site.


Results and Discussion


HCV envelope sequence from the inoculum clustered near the base of the clade formed by sequences from the chronically-infected women (FIG. 6), and the entire anti-D cohort clade was clearly distinct from all other sequences in available databases (excluding those from this outbreak), consistent with the previously-reported clinical history of common-source infection from an acutely infected donor (7). Nonetheless, HCV sequences in each woman diverged along distinct paths.


There was strong evidence of negative selection in both the regional differences in genetic divergence, as well as by comparison of the rates of nonsynonymous (amino acid-changing) and synonymous (silent) change (FIG. 7). Overall, the highest rate of non-synonymous change was observed in the E2 gene, followed by NS2, p7, E1, NS3, and Core. Synonymous substitution rates were consistently higher than nonsynonymous rates for all genes, suggesting strong negative selection is a consistent feature of chronic HCV infection. HVR1 sequences were highly divergent at many sites, but constrained at others (8,9).


The HVR1 sequences illustrated two distinct patterns of sequence change. When the amino acid of the inoculum matched the consensus for that viral subtype, residues either did not change or changed in an apparently ‘sporadic’ fashion (risk of finding the residue in recipients was not different than finding the residue in the inoculum, indicated by near-zero height of the sequence logo in FIG. 7. In contrast, when the amino acid in the inoculum differed from the consensus, there was convergent evolution toward consensus (residue was found >2 times more often in the recipients than in the inoculum, FIG. 7). For example, 16 of the 22 women had replaced H in the inoculum with R at position 394, 12 replaced A with T at 396, 11 replaced L with F at 399, and 16 replaced T with S at 401. Four women had all 4 of these changes, not significantly different from the expected frequency of 3.2, indicating that these changes occurred independently. Since there is an infinitesimal likelihood that these same amino acid substitutions occurred by chance in each of the women, these data indicate that the sequences converged to a more fit state, implying that prior evolution of the inoculum sequence to optimize its fitness in the original host resulted in changes that diminished its fitness in the subsequent hosts.


Rather than convergent evolution, these results might have been due to shared selection and then divergence (at other sites) from a rare variant that we did not detect in the inoculum. We did detect one clone (clone #5) among 20 in the inoculum material that carried the RxTxxFxS motif at positions 394-401, but it was highly divergent from all other sequences described here, as; evident from its position in the phylogenetic tree (FIG. 6B), and therefore less likely than the other 19 sequences to represent the founder strain for these women. While it is possible that a less divergent RxTxxFxS clone was present in the inoculum at very low frequency, shared selection of such a rare variant would support the same conclusion.


Because HVR1 is a potential target of both humoral and cellular immunity and the precise recognition motifs remain difficult to identify due to the extreme variability, further examination of immune selection and convergence was focused on other genes, and in particular, on known MHC class I-restricted epitopes. Consistent with immune selection hypothesis, the number of changes in epitopes associated with specific class I alleles was significantly greater than the number of changes in other sites and greater than what was found in that same site for persons who did not possess the allele (Table 3). For HLA B*35 and B*37, sequence changes were 7.0 and 8.5 times as likely to occur in an epitope associated with an allele in women having the allele as compared with those that lacked it (P<0.001). An example of such an epitope is shown in a 38 amino acid region that spans an HLA A2 motif (FIG. 7). Mutations from R to K were noted outside the A2 epitope, and mutations from G to S were noted within the epitope in 8 and 6 of 22 women, respectively. However, while R to K mutation was noted in a similar percentage of A2 positive and A2 negative women (41.7% versus 30:0%, P>0.10), all G to S mutations were observed in A2 positive women (P=0.015), consistent with immune escape as has been observed in the SIV macaque model (10) and chimpanzees infected with HCV (6).


As seen with envelope sequences, the opposite effect was observed in other alleles. For alleles A*01 and B*08, sequence changes were 0.2 and 0.4 times as likely to occur in an epitope restricted by an allele in women having the allele as compared with those who lack it (Table 3). In fact, the R to K mutation that was described above in both A2 positive and negative women, only occurred in women who were not HLA B*08 positive, while the apparently A2-restricted G1409S substitution occurred in both B*08 positive and negative women (FIG. 7).


Collectively, these findings indicate that HCV sequence change is a non-random process that reflects negative selection (change is disadvantageous) as well as positive selection. Moreover, we find evidence that positive selection represents both the direct effect of pressure applied by immune responses in the current host (in this case, HLA class I restricted CD8+ cytotoxic T lymphocytes) as well as reversion of sequence to a more fit consensus, as we saw with envelope sequences.


To independently evaluate this paradigm, we compared the amino acid sequences of these women with a HCV 1b consensus sequence. For the epitopes that showed evidence of HLA class I restricted positive selection (a significantly increased risk of mutations from the inoculum occurred when the restricting allele was present), there was also an increased number of changes away from the 1b prototype consensus in women with one of these alleles, but not those without (Table 4). In addition, for epitopes that showed the converse effect, i.e., evidence of positive selection when the allele was absent (a significantly lower risk of mutations from the inoculum when the restricting allele was present), there was also an increased number of changes toward consensus in those who lacked the allele versus those who had the allele, suggesting reversion (Table 4). These findings are supported in an accompanying report (Cox, et al. J Exp Med. 2005 Jun. 6; 201(11):1741-52 and Example 2), which shows that amino acid substitutions in CD8+ T cell epitopes are associated with a loss of T cell recognition during acute infection, whereas non-epitope changes revert toward consensus at a rate much higher than expected by chance.


Prior studies have demonstrated reversion of CTL-escape variant sequences in macaques experimentally infected with SIV (11), reversion of an HIV-1 epitope in humans (12), and evidence of HIV-1 adaptation to common HLA alleles (13). This is the first report of viral adaptation to multiple HLA alleles across multiple genes, and provides additional support for the suggestion, based on minimizing differences between vaccine and circulating strains, that vaccine effectiveness may be enhanced by using a consensus (14) or ancestral (15) sequence. The ability of viruses to restrict adaptive immune responses and evade those that are formed contributes to persistence and is a major barrier to vaccine development. These data demonstrate that although immune responses diminish the fitness of viral variants, viral divergence occurs in persistently infected hosts. Nonetheless, this divergence actually reduces the fitness of the virus in the population (that is, in other hosts). From an evolutionary perspective, these forces maintain the virus as a distinct pathogen. However, the data also suggest that immune responses to consensus sequences (rather than a product based on the sequence in a given host) would establish the highest barrier to viral escape and consequently the most effective protection against chronic infection.


REFERENCES



  • 1. Neumann, A. U., N. P. Lam, H. Dahari, D. R. Gretch, T. E. Wiley, T. J. Layden, and A. S. Perelson. 1998. Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science 282:103-107.

  • 2. Martell, M., J. I. Esteban, J. Quer, J. Genesca, A. Weiner, R. Esteban, Guardia, and J. Gomez. 1992. Hepatitis C virus (HCV) circulates as a population of different but closely related genomes: quasispecies nature of HCV genome distribution. J. Virol. 66:3225-3229.

  • 3. Ray, S. C., Q. Mao, R. E. Lanford, S. Bassett, O. Laeyendecker, Y. M. Wang, and D. L. Thomas. 2000. Hypervariable region 1 sequence stability during hepatitis C virus replication in chimpanzees. J. Virol. 74:3058-3066.

  • 4. Ogata, N., H. J. Alter, R. H. Miller, and R. H. Purcell. 1991. Nucleotide sequence and mutation rate of the H strain of hepatitis C virus. Proc. Natl. Acad. Sci. USA 88:3392-3396.

  • 5. Gaud, U., B. Langer, T. Petropoulou, H. C. Thomas, and P. Karayiannis. 2003. Changes in hypervariable region 1 of the envelope 2 glycoprotein of hepatitis C virus in children and adults with humoral immune defects. J. Med. Virol. 69:350-356.

  • 6. Erickson, A. L., Y. Kimura, S. Igarashi, J. Eichelberger, M. Houghton, J. Sidney, D. McKinney, A. Sette, A. L. Hughes, and C. M. Walker. 2001. The outcome of hepatitis C virus infection is predicted by escape mutations in epitopes targeted by cytotoxic T lymphocytes. Immunity. 15:883-895.

  • 7. Kenny-Walsh, E. 1999. Clinical outcomes after hepatitis C infection from contaminated anti-D immune globulin. Irish Hepatology Research Group. N. Engl. J. Med. 340:1228-1233.

  • 8. McAllister, J., C. Casino, F. Davidson, J. Power, E. Lawlor, P. L. Yap, P. Simmonds, and D. B. Smith. 1998. Long-term evolution of the hypervariable region of hepatitis C virus in a common-source-infected cohort. J. Virol. 72:4893-4905.

  • 9. Penin, F., C. Combet, G. Germanidis, P. O. Frainais, G. Deleage, and J. M. Pawlotsky. 2001. Conservation of the conformation and positive charges of hepatitis C virus E2 envelope glycoprotein hypervariable region 1 points to a role in cell attachment. J. Virol. 75:5703-5710.

  • 10. Allen, T. M., D. H. O'Connor, P. Jing, J. L. Dzuris, B; R. Mothe, T. U. Vogel, E. Dunphy, M. E. Liebl, C. Emerson, N. Wilson, K. J. Kunstman, X. Wang, D. B. Allison, A. L. Hughes, R. C. Desrosiers, J. D. Altman, S. M. Wolinsky, A. Sette, and D. I. Watkins. 2000. Tat-specific cytotoxic T lymphocytes select for SIV escape variants during resolution of primary viraemia. Nature 407:386-390.

  • 11. Leslie, A. J., K. J. Pfafferott, P. Chetty, R. Draenert, M. M. Addo, M. Feeney, Y. Tang, E. C. Holmes, T. Allen, J. G. Prado, M. Altfeld, C. Brander, C. Dixon, D. Ramduth, P. Jeena, S. A. Thomas, A. St John, T. A. Roach, B. Kupfer, G. Luzzi, A. Edwards, G. Taylor, H. Lyall, G. Tudor-Williams, V. Novelli, J. Martinez-Picado, P. Kiepiela, B. D. Walker, and P. J. Goulder. 2004. HIV evolution: CTL escape mutation and reversion after transmission. Nat. Med. 10:282-289.

  • 12. Friedrich, T. C., E. J. Dodds, L. J. Yant, L. Vojnov, R. Rudersdorf, C. Cullen, D. T. Evans, R. C. Desrosiers, B. R. Mothe, J. Sidney, A. Sette, K. Kunstman, S. Wolinsky, M. Piatak, J. Lifson, A. L. Hughes, N. Wilson, D. H. O'Connor, and D. I. Watkins. 2004. Reversion of CTL escape-variant immunodeficiency viruses in vivo. Nat. Med. 10:275-281.

  • 13. Moore, C. B., M. John, I. R. James, F. T. Christiansen, C. S. Witt, and S. A. Mallal. 2002. Evidence of HIV-1 adaptation to HLA-restricted immune responses at a population level. Science 296:1439-1443.

  • 14. Gaschen, B., J. Taylor, K. Yusim, B. Foley, F. Gao, D. Lang, V. Novitsky, B. Haynes, B. H. Hahn, T. Bhattacharya, and B. Korber. 2002. Diversity considerations in HIV-1 vaccine selection. Science 296:2354-2360.

  • 15. Nickle, D. C., M. A. Jensen, G. S. Gottlieb, D. Shriner, G, H. Learn, A. G. Rodrigo, and J. I. Mullins. 2003. Consensus and ancestral state HIV vaccines. Science 299:1515-1518.

  • 16. Fanning, L. J., E. Kenny-Walsh, and F. Shanahan. 2004. Persistence of hepatitis C virus in a white population: associations with human leukocyte antigen class 1. Hum. Immunol. 65:745-751.

  • 17. Liu, Z., D. M. Netski, Q. Mao, O. Laeyendecker, J. R. Ticehurst, X. H. Wang, D. L. Thomas, and S. C. Ray. 2004. Accurate representation of the hepatitis C virus quasispecies in 5.2-kilobase amplicons. J. Clin. Microbiol. 42:4223-4229.

  • 18. Jearmougin, F., J. D. Thompson, M. Gouy, D. G. Higgins, and T. J. Gibson. 1998. Multiple sequence alignment with Clustal X. Trends Biochem. Sci. 23:403-405.

  • 19. Hall, T. A. 2001. BioEdit: Biological sequence alignment editor for Windows 95/98/NT version 5.0.7. software. Distributed by author: http://www.mbio.ncsu.edu/RNaseP/info/programs/BIOEDIT/bioedit.html.

  • 20. Posada, D. and K. A. Crandall. 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics. 14:817-818.

  • 21. Ray, S. C., Y. M. Wang, O. Laeyendecker, J. Ticehurst, S. A. Villano, and D. L. Thomas. 1998. Acute hepatitis C virus structural gene sequences as predictors of persistent viremia: hypervariable region 1 as decoy. J. Virol. 73:2938-2946.

  • 22. Gorodkin, J., L. J. Heyer, S. Brunak, and G. D. Stormo. 1997. Displaying the information contents of structural RNA alignments: the structure logos. Comput. Appl. Biosci. 13:583-586.

  • 23. Chang, K. M., B. Rehermann, J. G. McHutchison, C. Pasquinelli, S. Southwood, A. Sette, and F. V. Chisari. 1997. Immunological significance of cytotoxic T lymphocyte epitope variants in patients chronically infected by the hepatitis C virus. J. Clin. Invest. 100:2376-2385.

  • 24. Ward, S., G. Lauer, R. Isba, B. Walker, and P. Klenerman. 2002. Cellular immune responses against hepatitis C virus: the evidence base 2002. Clin. Exp. Immunol. 128:195-203.










TABLE 1







Non-epitope, non-envelope changesa frequently result in modal residue















Change

Change relative to


Siteb
Region
Subject
t0 → t6c
Subtype 1ad
modal residuee















6
Core
11
N → K
K102Y2
Toward


88
Core
17
S → N
N128
Toward


840
NS2
21

S → G

S10A1
Away


856
NS2
17
L → Q
Q6H2L1
Toward


859
NS2
21

V → E

V9
Away


861
NS2
21

V → I

V7I1F1
Away


899
NS2
17
F → L
L9
Toward


908
NS2
13

K → R

K9
Away


945
NS2
12

T → A

T9
Away


1021
NS2
12

K → E

K9
Away


1113
NS3
17
T → A
A73T1P1V1
Toward


1278
NS3
18

I → L

I78S1
Away


1314
NS3
12
V → I
I49F1
Toward


1338
NS3
18
S → T
T46
Toward


1600
NS3
21

P→ L

P50L1
Away


1648
NS3
28
G → C
C54Y1
Toward






aChanges that were not located in demonstrated epitopes and not located in the envelope region (FIG. 2)




bPosition in H77 polyprotein (Genbank accession AF009606)




camino acids inferred from cDNA clones obtained at t0 and t6, respectively. Changes



were only inferred when independent cDNA clone sequences were in agreement.













TABLE 2







Observed versus expected frequencies of change to modal residue at 16


sites not located in epitopes nor in envelope (E1 or E2) genes










Expected
Observed













Change to modal residue
16 * (1/19) = 0.8 ≈ 1
8*


Change to non-modal residue
16 * (18/19) = 15.2 ≈ 15
8 





*p = 0.015













TABLE 3







Proportion of amino acid substitutions in known epitopes located in HCV Core,


p7, NS2, and NS3, according to the presence or absence of the associated HLA allele


(24). Epitopes were considered in the region sequenced (corresponding to amino acid


residues 1-1651 of the H77 polyprotein, GenBank AF009606). Two sequences per


subject were examined. P values were calculated by comparison of proportions as


implemented in SigmaStat (Systat Software, Inc., Richmond, Calif. USA). NA means


not applicable due to zero cell. EpiChanges, a software program created by S.C.R. to


automate this analysis, is available from the author.


















Relative Risk of




Subjects

Allele present
Allele absent
change (allele



with allele
Epitopes
Epitope changes/
Epitope
present versus


Allele
(n)
(n)
sites (%)
changes/sites
absent)
P value


















A*01
11
1
2/198
(1.0)
10/198
(5.1)
0.2
0.019


A*02
13
13
39/3198
(1.2)
18/2214
(0.8)
1.5
>0.10


A*03
9
4
8/648
(1.2)
18/936
(1.9)
0.6
>0.10


A*11
3
3
6/162
(3.7)
45/1026
(4.4)
0.8
>0.10


A*24
1
2
0/36
(0.0)
3/756
(0.4)
0
>0.10


B*07
7
3
4/378
(1.1)
0/810
(0.0)
NA
0.003


B*08
6
3
4/312
(1.3)
28/832
(3.4)
0.4
0.057


B*35
3
3
11/162
(6.8)
10/1026
(1.0)
7.0
<0.001


B*37
5
1
20/80
(25.0)
8/272
(2.9)
8.5
<0.001


B*44
7
1
0/126
(0.0)
2/270
(0.7)
0
>0.10
















TABLE 4







Classification of amino acid substitutions in MHC class I restricted


epitopes as being toward or away from a 1b consensus sequence in


22 women exposed to a common HCV subtype 1b inoculum. Results


are stratified (in columns) by the presence or the absence of the


relevant allele, and grouped (rows) according to relative risk for


change shown in Table 1. Changes are per 1000 epitope sites,


based on whether the resulting residue matches HCV 1b consensus.









Changes in epitopes relative to


Effect of
1b consensus, per 1000 sites









allele presence on
Allele present
Allele absent











likelihood of amino
Toward
Away from
Toward
Away from


acid replacement
consensus
consensus
consensus
consensus













620 sites
2108 sites











Increased relative
1.6
54.8*
3.9
0.0


risk (B*07, B*35,


B*37)










510 sites
1030 sites











Decreased relative
3.8
4.7
22.3*
48


risk (A*01, B*08)





*P < 0.001 for comparison of Toward versus Away.













TABLE 5







HCV 1a Consensus Sequences































A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W

































1
M40














1505
G49








2
S40














1506
E49


3
T39
R1













1507
R49


4
N36
I4













1508
P49


5
P39
L1













1509
S47
X2


6
K40














1510
G49


7
P39
L1













1511
M48
X1


8
Q40














1512
F48
X1


9
R40














1513
D49


10
K37
Q2
R1












1514
S49


11
T40














1515
S48
A1


12
K46














1516
V49


13
R46














1517
L49


14
N46














1518
C49


15
T46














1519
E48
X1


16
N45
S1













1520
C49


17
R46














1521
Y49


18
R46














1522
D49


19
P46














1523
A48
T1


20
Q46














1524
G49


21
D46














1525
C48
X1


22
V46














1526
A48
X1


23
K46














1527
W49


24
F46














1528
Y48
X1


25
P46














1529
E49


26
G46














1530
L49


27
G46














1531
T47
X1
M1


28
G46














1532
P49


29
Q46














1533
A47
T1
X1


30
I46














1534
E49


31
V46














1535
T48
X1


32
G46














1536
T49


33
G46














1537
V48
A1


34
V45
I1













1538
R49


35
Y46














1539
L49


36
L46














1540
R49


37
L46














1541
A48
X1


38
P46














1542
Y49


39
R46














1543
M49


40
R45
W1













1544
N49


41
G46














1545
T49


42
P46














1546
P49


43
R45
K1













1547
G49


44
L46














1548
L49


45
G46














1549
P49


46
V46














1550
V49


47
R46














1551
C48
X1


48
A45
T1













1552
Q48


49
T46














1553
D48


50
R46














1554
H48


51
K46














1555
L48


52
T46














1556
E48


53
S46














1557
F48


54
E46














1558
W48


55
R46














1559
E48


56
S46














1560
G48


57
Q46














1561
V48


58
P46














1562
F48


59
R46














1563
T48


60
G46














1564
G47
X1


61
R45
S1













1565
L48


62
R46














1566
T48


63
Q46














1567
H47
Q1


64
P45
L1













1568
I48


65
I45
F1













1569
D48


66
P45
T1













1570
A47
X1


67
K45
Q1













1571
H47
X1


68
A43
V3













1572
F48


69
R46














1573
L47
I1


70
R46














1574
S45
X3


71
P43
S2
A1












1575
Q48


72
E46














1576
T48


73
G46














1577
K48


74
R46














1578
Q48


75
T46














1579
S45
X2
G1


76
W46














1580
G47
X1


77
A46














1581
E46
X2


78
Q46














1582
N48


79
P46














1583
F24
L23
P1


80
G46














1584
P48


81
Y46














1585
Y48


82
P46














1586
L48


83
W46














1587
V48


84
P46














1588
A48


85
L46














1589
Y48


86
Y46














1590
Q48


87
G46














1591
A48


88
N46














1592
T47
X1


89
E46














1593
V48


90
G46














1594
C48


91
C46














1595
A48


92
G46














1596
R47
X1


93
W46














1597
A47
S1


94
A45
M1













1598
Q47
R1


95
G46














1599
A48


96
W46














1600
P46
L2


97
L46














1601
P48


98
L46














1602
P47
X1


99
S46














1603
S46
X2


100
P46














1604
W48


101
R46














1605
D48


102
G46














1606
Q48


103
S46














1607
M48


104
R46














1608
W48


105
P46














1609
K48


106
S44
N2













1610
C48


107
W46














1611
L48


108
G46














1612
I36
T8
V2
X2


109
P45
X1













1613
R48


110
T46














1614
L47
X1


111
D46














1615
K47
X1


112
P45
S1













1616
P47
X1


113
R44
P2













1617
T47


114
R46














1618
L46
X2


115
R44
K2













1619
H47
Y1


116
S46














1620
G48


117
R49














1621
P46
S1
X1


118
N49














1622
T47
K1


119
L49














1623
P47
X1


120
G49














1624
L47
X1


121
K49














1625
L47
X1


122
V49














1626
Y48


123
I49














1627
R48


124
D49














1628
L47
X1


125
T49














1629
G46
X2


126
L48
F1













1630
A48


127
T49














1631
V48


128
C49














1632
Q48


129
G51














1633
N47
H1


130
F50
L1













1634
E48


131
A51














1635
V40
I7
X1


132
D51














1636
T47
X1


133
L50
H1













1637
L45
M1


134
M50
T1













1638
T46


135
G51














1639
H46


136
Y51














1640
P45
X1


137
I51














1641
V24
I21
X1


138
P51














1642
T46


139
L51














1643
K45
X1


140
V51














1644
Y44
X1


141
G51














1645
I45


142
A48
V2
P1












1646
M45


143
P51














1647
T45


144
L51














1648
C45


145
G50
X1













1649
M44
X1


146
G51














1650
S43
A1


147
A50
R1













1651
A44


148
A51














1652
D44


149
R49
K2













1653
L44


150
A51














1654
E42
X2


151
L51














1655
V41
I3


152
A51














1656
V43
T1


153
H51














1657
T43
X1


154
G50
X1













1658
S44


155
V51














1659
T44


156
R51














1660
W44


157
V51














1661
V44


158
L51














1662
L43
X1


159
E51














1663
V44


160
D51














1664
G44


161
G50
S1













1665
G44


162
V51














1666
V44


163
N51














1667
L44


164
Y51














1668
A44


165
A51














1669
A44


166
T51














1670
L44


167
G51














1671
A43
X1


168
N51














1672
A44


169
L51














1673
Y44


170
P51














1674
C44


171
G51














1675
L44


172
C52














1676
S44


173
S52














1677
T43
X1


174
F51
X1













1678
G44


175
S51














1679
C43
S1


176
I49
X12
M1
L1











1680
V44


177
F72














1681
V44


178
L72
X1













1682
I44


179
L73
X1













1683
V43
I1


180
A74
X5













1684
G44


181
L79














1685
R44


182
L78
F1













1686
I25
V18
X1


183
S80














1687
V40
I2
D1
X1


184
C80














1688
L44


185
L80














1689
S44


186
T78
A1
I1












1690
G44


187
V79
I1
X1












1691
K44


188
P81














1692
P44


189
A80
T1













1693
A43
P1


190
S80
W1













1694
I34
V7
X3


191
A80
P1













1695
I43
V1


192
Y88
H6
F1












1696
P44


193
Q89
H3
E3












1697
D44


194
V95














1698
R42
Q1
X1


195
R95














1699
E43


196
N93














1700
V43
A1


197
S91
T1
A1












1701
L44


198
S48
T44
L1












1702
Y43
X1


199
G93














1703
R30
Q14


200
L87
I4
F1
S1











1704
E42
G2


201
Y93














1705
F44


202
H93














1706
D44


203
V93
X1













1707
E44


204
T94














1708
M44


205
N93
X1













1709
E44


206
D92
H2













1710
E44


207
C94














1711
C44


208
P92
S1
L1












1712
S44


209
N94














1713
Q44


210
S94














1714
H44


211
S93
G1













1715
L44


212
I93
V1













1716
P43
X1


213
V94














1717
Y44


214
Y92
F2













1718
I44


215
E94














1719
E44


216
A52
T41
V1












1720
Q43
P1


217
A67
D14
T5
V2
S2
H1
P1
N1
G1






1721
G44


218
D90
N3
G1












1722
M44


219
A65
T27
V1
S1











1723
M40
A2
V1
X1


220
I94














1724
L43
X1


221
L93
M1













1725
A43
X1


222
H94














1726
E43
X1


223
T52
S31
A9
I1
V1










1727
Q44


224
P94














1728
F44


225
G94














1729
K44


226
C94














1730
Q44


227
V89
I3
F1
L1











1731
K43
X1


228
P94














1732
A44


229
C94














1733
L41
X3


230
V92
X1
A1












1734
G42
A1
X1


231
R86
H5
C1
G1
Y1










1735
L44


232
E91
K2
G1












1736
L44


233
G90
D3
S1












1737
Q44


234
N90
D3
S1












1738
T43
X1


235
A55
T15
V13
I4
S2
D2
L1
X1
G1






1739
A43
R1


236
S91
P2
A1












1740
S44


237
R64
K29
T1












1741
R39
H3
X2


238
C94














1742
Q42
H2


239
W94














1743
A43
X1


240
V93
A1













1744
E44


241
A87
P5
T1
S1











1745
V37
A3
X3
T1


242
V54
M28
L8
I4











1746
I41
V1
X1


243
T62
A32













1747
A25
T15
G2
X1


244
P93
S1













1748
P42
X1


245
T92
S2













1749
A38
V3
T2


246
V93
L1













1750
V43


247
A94














1751
Q43


248
T93
A1













1752
T43


249
R86
K8













1753
N41
S2


250
D92
N2













1754
W42
R1


251
G93
A1













1755
Q42
X1


252
K75
R13
S2
N2
T1
X1









1756
K34
R7
N1
X1


253
L92
I1
V1












1757
L42
I1


254
P93
H1













1758
E39
V1
X1


255
T77
A15
S1
I1











1759
A21
T11
V6
X2
F1
S1
L1


256
T86
A5
K2
M1











1760
F37
G2
V1
L1
I1
X1


257
Q93
T1













1761
W37
G6
X1


258
L92
I1
M1












1762
A41
G2
K1


259
R94














1763
K43
X2
M1


260
R93
Q1













1764
H42
N2
K2


261
H92
X1
Y1












1765
X27
M19


262
I93
V1













1766
W18
X1


263
D94














1767
N18


264
L93
X1













1768
F17
X1


265
L93
X1













1769
I16
X1


266
V93
G1













1770
S17


267
G93
R1













1771
G17


268
S75
G16
A2
X1











1772
I17


269
A94














1773
Q17


270
T92
A2













1774
Y17


271
L91
F2
I1












1775
L17


272
C94














1776
A17


273
S94














1777
G17


274
A92
T2













1778
L17


275
L93
M1













1779
S17


276
Y97














1780
T17


277
V97














1781
L17


278
G97














1782
P16
A1


279
D96
E1













1783
G17


280
L97














1784
N17


281
C97














1785
P17


282
G97














1786
A17


283
S97














1787
I14
X3


284
V92
I5













1788
A14


285
F95
L2













1789
S14


286
L97














1790
L14


287
V93
I3
L1












1791
M14


288
G91
S5
D1












1792
A14


289
Q97














1793
F14


290
L95
M2













1794
T14


291
F97














1795
A14


292
T96
V1













1796
A14


293
F92
L4
I1












1797
V13
I1


294
S97














1798
T14


295
P97














1799
S14


296
R95
G1
K1












1800
P14


297
R87
H8
L1
D1











1801
L14


298
H92
Y3
R1
L1











1802
T14


299
W95
R1
E1












1803
T14


300
T95
Q1
A1












1804
S10
G3
N1


301
T94
V2
M1












1805
Q14


302
Q97














1806
T14


303
D65
G23
E7
S1
R1










1807
L14


304
C97














1808
L14


305
N97














1809
F14


306
C97














1810
N14


307
S96
T1













1811
I14


308
I83
M10
L2
V2











1812
L14


309
Y96














1813
G14


310
P96














1814
G13
S1


311
G95
S1













1815
W14


312
H95
D1













1816
V14


313
I91
V4
L1












1817
A14


314
T94
G1
S1












1818
A14


315
G96














1819
Q14


316
H96














1820
L14


317
R95
G1













1821
A14


318
M109














1822
A12
N1
G1


319
A109














1823
P14


320
W108














1824
G13
R1


321
D108
X19
N1












1825
A14


322
M128














1826
A14


323
M133














1827
T14


324
M130
V2
T1












1828
A14


325
N132
I1













1829
F14


326
W134














1830
V14


327
S134














1831
G14


328
P134
L2
X1












1832
A14


329
T136
K1













1833
G14


330
T91
A42
I2
X1
V1










1834
L14


331
A138














1835
A13
T1


332
L136
F1
P1












1836
G14


333
V123
I10
L5












1837
A14


334
V100
M29
T3
L3
A2
I1









1838
A13
G1


335
A132
S4
V1
G1











1839
I13
V1


336
Q138














1840
G14


337
L132
V4
M2












1841
S13
R1


338
L135
I1
F1
M1











1842
V13
D1


339
R138














1843
G14


340
I100
V38













1844
L14


341
P138














1845
G14


342
Q137
L1













1846
K14


343
A136
T1
S1












1847
V14


344
I134
V3
T1












1848
L14


345
L104
M26
V8












1849
V13
I1


346
D136
N2













1850
D14


347
M138
I1













1851
I14


348
I134
V3
F2












1852
L14


349
A139














1853
A13
V1


350
G138
S1













1854
G14


351
A138
S1













1855
Y14


352
H139














1856
G14


353
W139














1857
A14


354
G139














1858
G14


355
V136
I3













1859
V14


356
L139














1860
A14


357
A139














1861
G14


358
G139














1862
A14


359
I123
V7
M6
L3











1863
L14


360
A134
T4
G1












1864
V14


361
Y139














1865
A14


362
F136
Y3













1866
F14


363
S139














1867
K14


364
M139














1868
I14


365
V133
A5
L1












1869
M14


366
G139














1870
S14


367
N140














1871
G14


368
W140














1872
E14


369
A139
G1













1873
V13
P1


370
K140














1874
P14


371
V139
A1













1875
S13
T1


372
L129
V10
S1












1876
T14


373
V127
L7
I3
A3
X1










1877
E14


374
V141
X2
M1












1878
D14


375
L144
M1













1879
L13
M1


376
L145
M1













1880
V14


377
L146














1881
N14


378
F145
V1













1882
L14


379
A142
T4













1883
L13
X1


380
G135
S9
A1
X1











1884
P14


381
V145
I1













1885
A14


382
D146
Q1
X1












1886
I14


383
A145
G2
R1












1887
L14


384
E68
G20
T11
S10
D6
K5
N5
Q4
A3
H3
V2
R2
W1
I1

1888
S14


385
T139
P1
X1












1889
P14


386
H65
Y48
R9
T8
V2
I2
S1
M1
D1
Q1
L1
G1
X1


1890
G14


387
V97
T36
I4
L2
A1
R1









1891
A14


388
T112
S24
I4
V1











1892
L14


389
G141














1893
V14


390
G133
A6
R1
D1











1894
V14


391
S75
A23
T19
N9
V6
Q5
G1
K1
H1
E1





1895
G14


392
A109
V20
I4
T3
P2
L1
S1
X1







1896
V14


393
A88
G49
S3
V1











1897
V14


394
R75
H37
Q8
K6
Y6
S3
F2
E2
V1
N1





1898
C13
Y1


395
T63
A43
S12
G12
D5
V2
N2
I2







1899
A14


396
T69
A44
V20
M4
S2
X1
I1








1900
A13
T1


397
S50
A48
L10
Y7
H6
Q5
F4
R3
V2
T2
N2
K1
G1


1901
I14


398
G97
S11
T9
R9
A7
V3
I3
L1
M1






1902
L13
Q1


399
L72
F47
I15
V7











1903
R14


400
A70
T28
V26
S13
I2
X1
N1








1904
R14


401
G56
S54
R15
N6
T5
D3
A1
X1







1905
H14


402
L91
F34
I10
M3
X2
P1









1906
V14


403
F110
L30
S1












1907
G14


404
S54
T42
A18
N13
Q4
D3
K3
P1
E1
R1
H1




1908
P14


405
P84
Q22
R11
L7
S7
T4
A3
I2
V1






1909
G14


406
G141














1910
E14


407
A94
P41
S6












1911
G14


408
K87
R27
S14
Q11
N2










1912
A14


409
Q141














1913
V14


410
N106
D23
K9
H2
X1










1914
Q14


411
I107
V32
L2












1915
W14


412
Q129
R8
K1
H1











1916
M14


413
L140














1917
N14


414
I107
V29
T2
M2











1918
R14


415
N136
K3
Y1












1919
L13
M1


416
T113
S23
A4












1920
I14


417
N136
D2
E1
S1











1921
A14


418
G140














1922
F14


419
S140














1923
A13
T1


420
W140














1924
S14


421
H140














1925
R14


422
I139
V1













1926
G14


423
N138
D1
X1












1927
N13
G1


424
R81
S59













1928
H13
X1


425
T138
A1
S1












1929
V14


426
A140














1930
S14


427
L140














1931
P14


428
N140














1932
T14


429
C140














1933
H14


430
N137
S2
D1












1934
Y14


431
D70
A48
E20
T1
S1










1935
V14


432
S137
N2
T1












1936
P14


433
L136
H2
I2












1937
E14


434
N59
D41
E12
T9
H9
Q4
S3
Y1
K1
X1





1938
S13
N1


435
T135
A4
S1












1939
D14


436
G140














1940
A13
T1


437
W128
F12













1941
A14


438
I49
L44
V39
M4











1942
A13


439
A129
T4
V2
G1











1943
R13


440
G129
S3
R2
A2











1944
V13


441
L135
P1













1945
T13


442
F95
L26
I11
S1
M1
X1
V1








1946
A10
T2
V1


443
Y132
H4













1947
I13


444
Y70
H40
R8
S4
Q4
N3
V2
F2
P1
T1
A1




1948
L13


445
H82
N37
Y9
D2
R2
S1
K1
T1
L1






1949
S11
G2


446
K107
R13
Q5
N3
H3
S2
G2
T1







1950
S13


447
F136














1951
L13


448
N132
D3
K1












1952
T12


449
S130
F2
D2
A2











1953
V12


450
S125
T11
X1












1954
T11
N1


451
G137














1955
Q9
X2
K1


452
C137














1956
L9
P2


453
P111
S15
T3
A3
L2
F1
R1
V1







1957
L11


454
E134
Q1
G1
A1











1958
R9
X2


455
R135
M1
G1












1959
R8
X1
G1


456
L81
M54
S1
V1











1960
L8


457
A134
S2
T1












1961
H8


458
S135
G1
T1












1962
Q8


459
C133
S1
X1
R1
Y1










1963
W8


460
R118
K12
Q3
X1
H1
L1









1964
I6
V2


461
P96
R28
S6
L2
X1
N1









1965
S8


462
L124
I4
F3
V1











1966
S8


463
T79
A49
S2
I1
D1










1967
E8


464
D111
Y5
N4
H4
A3
S2
R1
T1
G1






1968
C7
S1


465
F128
L2
Y2












1969
T8


466
D69
A60
V1
S1
G1










1970
S10
T8


467
Q132














1971
M10
P8


468
G132














1972
G10
C8
M3


469
W130
S1
R1












1973
S22
A3


470
G131
X1













1974
G25


471
P131
L1













1975
S25


472
I131
M1













1976
W25


473
S99
G12
R7
T6
N2
I2
X1
D1
K1
E1





1977
L25
I1


474
Y84
H47
F1












1978
R26


475
A99
T21
V8
G2
N1
I1









1979
D26


476
N110
D17
T2
E2
Y1










1980
I26


477
G125
T2
V1
R1
P1
I1
Q1








1981
W26


478
S111
G9
T7
N4
D1










1982
D26


479
G124
D3
S2
N1
R1
E1









1983
W26


480
P110
S11
L8
F1
R1
A1









1984
I26


481
D87
E40
A2
G2
K1










1985
C47


482
Q57
H53
E21
G1











1986
E47


483
R132














1987
V47


484
P132














1988
L46
V1


485
Y131
H1













1989
S46
C1


486
C132














1990
D47


487
W131














1991
F47


488
H132














1992
K46
E1


489
Y130
C1
H1












1993
T47


490
P103
T16
S6
A6
X1










1994
W47


491
P111
X20
T1












1995
L26
X21


492
K77
R35
L1












1996
K47


493
P109
R1
L1
S1











1997
A47


494
C110
W1
S1












1998
K46
Q1


495
G111
D1













1999
L47


496
I103
X5
V2
F1
T1










2000
M46
V1


497
V102
E2
X1
G1











2001
P47


498
P100
S3
X1
D1











2002
Q46
R1


499
A100
X3
T1












2003
L47


500
K47
Q26
R14
E3
L3
M2
X2
G2
S1






2004
P47


501
S51
T20
N19
Q2
G2
X1
R1
A1







2005
G47


502
V93
D1
X1
I1











2006
I46
L1


503
C72
W1
X1












2007
P47


504
G67
A1













2008
F42
L5


505
P65
R2
Q1












2009
V46
M1


506
V65
X3













2010
S47


507
Y58
F1
V1












2011
C47


508
C57
Y1
G1
X1











2012
Q47


509
F53
X7













2013
R46
Q1


510
T52
A1













2014
G47


511
P52
L1













2015
Y47


512
S53














2016
R33
K14


513
P53














2017
G26
X21


514
V51














2018
V45
A2


515
V48
A3













2019
W47


516
V51














2020
R46
Q1


517
G51














2021
G44
V2
A1


518
T51














2022
D46
E1


519
T51














2023
G47


520
D47
N4













2024
I43
V4


521
R35
K14
G1
H1











2025
M47


522
S33
L11
A4
F1
M1
T1









2026
H46
Y1


523
G51














2027
T47


524
A38
V11
M1
T1











2028
R46
H1


525
P50
A1













2029
C47


526
T51














2030
H43
Y2
N1
P1


527
Y51














2031
C46


528
N32
S15
R3
T1











2032
G46
Q1


529
W51














2033
A46
T1


530
G51














2034
E45
A1
D1


531
E24
A15
S10
V1
Q1










2035
I46
G1


532
N50
T1













2036
T44
A2
S1


533
D39
E11
R1












2037
G47


534
T50
A1













2038
H47


535
D51














2039
V47


536
V49
I1
F1












2040
K29
X18


537
F44
L7













2041
N44
T2
M1


538
V45
I4
L1
X1











2042
G26
X21


539
L51














2043
T46
S1


540
N50
T1













2044
M47


541
N49
Y1
S1












2045
R47


542
T50
X1













2046
I45
T1
V1


543
R50














2047
V40
A4
I2
F1


544
P50














2048
G47


545
P49
R1













2049
P47


546
L42
M3
S2
Q1
G1
W1









2050
R24
K23


547
G49
A1













2051
T47


548
N48
G1
L1












2052
C47


549
W48
V1













2053
R46
K1


550
F46
S2
G1












2054
N47


551
G45
X3













2055
M44
T1
V1
I1


552
C41
X2













2056
W26
X21


553
T40
N1













2057
S39
N7
D1


554
W41














2058
G47


555
M41














2059
T46
A1


556
N41














2060
F45
L2


557
S40
A1













2061
P46
L1


558
T27
S13
C1












2062
I47


559
G41














2063
N47


560
F33
Y8













2064
A46
P1


561
T41














2065
Y47


562
K41














2066
T47


563
V36
T3
A2












2067
T47


564
C41














2068
G47


565
G41














2069
P47


566
A40
G1













2070
C47


567
P41














2071
T44
N2
V1


568
P41














2072
P47


569
C41














2073
L26
X18
S3


570
V23
A6
D5
N4
T2
F1









2074
P44
X3


571
I41














2075
A47


572
G39
R1
K1












2076
P47


573
G41














2077
N45
S2


574
V24
A7
G5
M2
S1
K1
I1








2078
Y47


575
G38
S2
N1












2079
T30
K13
S3
E1


576
N41














2080
F47


577
N37
T2
D1
H1











2081
A47


578
T40
F1













2082
L47


579
L40
W1













2083
W47


580
H23
Y7
R4
L3
S2
Q1
T1








2084
R26
X21


581
C41














2085
V47


582
P41














2086
S47


583
T41














2087
A47


584
D41














2088
E47


585
C41














2089
E46
D1


586
F41














2090
Y47


587
R41














2091
V46
A1


588
K41














2092
E46
A1


589
H41














2093
I44
V3


590
P41














2094
R26
X21


591
E40
D1













2095
Q35
R12


592
A41














2096
V27
X20


593
T41














2097
G46
X1


594
Y41














2098
D47


595
S39
A2













2099
F47


596
R40
K1













2100
H47


597
C41














2101
Y47


598
G41














2102
V47


599
S41














2103
T45
S1
V1


600
G41














2104
G47


601
P40
A1













2105
M46
V1


602
W41














2106
T47


603
I34
L6
V1












2107
T39
A8


604
T41














2108
D47


605
P41














2109
N43
D4


606
R40
K1













2110
L47


607
C41














2111
K44
R3


608
L37
M3
I1












2112
C47


609
V41














2113
P47


610
H16
D14
N11












2114
C47


611
Y41














2115
Q47


612
P38
A2
S1












2116
V46
I1


613
Y41














2117
P47


614
R41














2118
S45
T1
A1


615
L41














2119
P47


616
W38














2120
E47


617
H38














2121
F47


618
X25
Y13













2122
F47


619
P13














2123
T47


620
C13














2124
E47


621
T13














2125
L46
V1


622
I8
V2
L2
M1











2126
D47


623
N13














2127
G47


624
Y12
F1













2128
V47


625
T12
S1













2129
R47


626
I8
L4
T1












2130
L45
I2


627
F13














2131
H47


628
K13














2132
R26
X21


629
V9
I4













2133
F26
X21


630
R13














2134
A47


631
M13














2135
P47


632
Y13














2136
P47


633
V13














2137
C47


634
G13














2138
K47


635
G13














2139
P47


636
V13














2140
L47


637
E13














2141
L47


638
H13














2142
R46
A1


639
R13














2143
E31
D16


640
L13














2144
E47


641
E8
G1
Q1
D1
N1










2145
V28
X19


642
A11
V1













2146
S24
T15
A6
X2


643
A12














2147
F27
X20


644
C12














2148
R46
S1


645
N12














2149
V47


646
W12














2150
G47


647
T12














2151
L47


648
R12














2152
H46
N1


649
G12














2153
E24
D15
A5
S1
V1
T1


650
E12














2154
Y47


651
R12














2155
P47


652
C12














2156
V28
X19


653
D10
N2













2157
G45
X2


654
L12














2158
S47


655
E9
D3













2159
Q47


656
D12














2160
L47


657
R12














2161
P47


658
D12














2162
C47


659
R12














2163
E47


660
S12














2164
P47


661
E12














2165
E47


662
L12














2166
P26
X21


663
S12














2167
D47


664
P12














2168
V47


665
L12














2169
A45
T2


666
L12














2170
V47


667
L12














2171
L45
V2


668
S10
T2













2172
T47


669
T12














2173
S47


670
T12














2174
M46
T1


671
Q11
E1













2175
L47


672
W12














2176
T47


673
Q12














2177
D47


674
V9
I3













2178
P46
S1


675
L12














2179
S47


676
P12














2180
H47


677
C12














2181
I44
V2
L1


678
S12














2182
T47


679
F12














2183
A47


680
T12














2184
E47


681
T12














2185
A25
X21
T1


682
L12














2186
A47


683
P12














2187
G33
R7
K4
A3


684
A12














2188
R47


685
L12














2189
R47


686
S9
T3













2190
L47


687
T12














2191
A41
X4
E1
T1


688
G12














2192
R30
X17


689
L12














2193
G47


690
I12














2194
S47


691
H12














2195
P46
S1


692
L12














2196
P33
X13
L1


693
H12














2197
S26
X16


694
Q12














2198
V24
E11
M6
L3
X3


695
N12














2199
A47


696
I12














2200
S47


697
V12














2201
S47


698
D12














2202
S47


699
V12














2203
A46
V1


700
Q12














2204
S47


701
Y12














2205
Q47


702
L12














2206
L47


703
Y12














2207
S47


704
G12














2208
A46
T1


705
V11
I1













2209
P46
L1


706
G12














2210
S46
P1


707
S12














2211
L47


708
S11
A1













2212
K46
R1


709
I8
V4













2213
A47


710
A7
V4
T1












2214
T46
A1


711
S12














2215
C47


712
W11
Y1













2216
T47


713
A12














2217
A24
T17
V3
I2
G1


714
I12














2218
N44
H1
R1
D1


715
K12














2219
H45
Y1
R1


716
W12














2220
D44
V1
E1
Y1


717
E8
D4













2221
S47


718
Y12














2222
P46
T1


719
V12














2223
D47


720
V8
I3
L1












2224
A46
V1


721
L12














2225
E45
D2


722
L12














2226
L47


723
F12














2227
I47


724
L12














2228
E42
A2
Q2
T1


725
L12














2229
A47


726
L12














2230
N47


727
A12














2231
L47


728
D12














2232
L47


729
A12














2233
W47


730
R12














2234
R46
N1


731
V7
I5













2235
Q47


732
C12














2236
E45
A2


733
S12














2237
M47


734
C12














2238
G47


735
L12














2239
G46
C1


736
W12














2240
N47


737
M12














2241
I47


738
M12














2242
T47


739
L12














2243
R47


740
L12














2244
V47


741
I12














2245
E47


742
S12














2246
S47


743
Q12














2247
E46


744
A11
V1













2248
N46
S1


745
E12














2249
K47


746
A12














2250
V47


747
A10
X1













2251
V45
A1
X1


748
L10














2252
I40
V6


749
E10














2253
L46


750
N10














2254
D46


751
L10














2255
S25
X21


752
V9
I1













2256
F46


753
V5
I4
L1












2257
D43
E2
G1


754
L10














2258
P46


755
N10














2259
L28
X18


756
A10














2260
V24
X18
C2
R1
T1


757
A10














2261
A41
X4
E1


758
S10














2262
E25
X20


759
L10














2263
E44
K1
X1


760
A10














2264
D46


761
G10














2265
E46


762
T10














2266
R45
Q1


763
H8
R1
Q1












2267
E45
D1


764
G10














2268
X21
I14
V11


765
L10














2269
S45
A1


766
V7
A3













2270
V43
T1
A1
I1


767
S9
X1













2271
P42
A3
T1


768
F9














2272
A46


769
L9














2273
E45
G1


770
V8
M1













2274
I45
M1


771
F9














2275
L46


772
F9














2276
R45
L1


773
C9














2277
K43
R2
T1


774
F8
L1













2278
S42
Y2
T1
R1


775
A9














2279
R45
G1


776
W9














2280
R44
K2


777
Y9














2281
F25
L15
X6


778
L9














2282
A27
X11
P4
T4


779
K9














2283
P13
Q12
X10
R7
S2
A1
E1


780
G9














2284
A25
G11
X10


781
R5
K4













2285
L46


782
W9














2286
P45
A1


783
V9














2287
V25
I21


784
P9














2288
W46


785
G9














2289
A25
X21


786
A8
M1













2290
R46


787
A5
V4













2291
P46


788
Y9














2292
D44
E2


789
A8
T1













2293
Y46


790
L4
F4
I1












2294
N46


791
Y8
F1













2295
P25
X21


792
G9














2296
P46


793
M9














2297
L46


794
W9














2298
L23
V12
I10
M1


795
P9














2299
E46


796
L8
F1













2300
T39
A4
S1
L1
E1


797
L9














2301
W46


798
L9














2302
K32
X14


799
L9














2303
K38
X4
S3
E1


800
L9














2304
P43
X3


801
L9














2305
D44
G2


802
A9














2306
Y46


803
L9














2307
E44
K2


804
P9














2308
P46


805
Q9














2309
P46


806
R9














2310
V45
T1


807
A9














2311
V46


808
Y9














2312
H45
Y1


809
A9














2313
G46


810
L9














2314
C34
X12


811
D9














2315
P37
X9


812
T9














2316
L46


813
E9














2317
P46


814
V7
M2













2318
P45
S1


815
A8
X1













2319
P44
S2


816
A8














2320
Q30
K8
R8


817
S8














2321
S45
P1


818
C8














2322
P45
X1


819
G8














2323
P25
X20
S1


820
G8














2324
V46


821
V8














2325
P46


822
V8














2326
P45
S1


823
L8














2327
P46


824
V8














2328
R25
X21


825
G8














2329
K39
R7


826
L8














2330
K45
R1


827
M8














2331
R46


828
A7
V1













2332
T44
M2


829
L8














2333
V45
I1


830
T8














2334
V45
I1


831
L8














2335
L46


832
S8














2336
T44
S2


833
P8














2337
E46


834
Y7
H1













2338
S46


835
Y8














2339
T36
S4
N3
P1
A1
L1


836
K8














2340
V32
L14


837
R7
H1













2341
S43
P3


838
Y8














2342
T42
S2
A2


839
I7
V1













2343
A46


840
S8














2344
L46


841
W8














2345
A46


842
C8














2346
E45


843
L4
F3
M1












2347
L46


844
W8














2348
A46


845
W8














2349
T45
S1


846
L8














2350
K44
R1
I1


847
Q8














2351
S31
X13
T2


848
Y8














2352
F38
X8


849
F8














2353
G46


850
L8














2354
S44
D1
G1


851
T8














2355
S45
A1


852
R8














2356
S46


853
V6
A2













2357
T26
X20


854
E8














2358
S45
X1


855
A8














2359
G46


856
Q5
H2
L1












2360
I44
V1
T1


857
L8














2361
T40
A3
S2
V1


858
H7
Q1













2362
G38
S5
A3


859
V8














2363
D42
G4


860
W8














2364
N40
D4
S2


861
V6
I1
F1












2365
T45
A1


862
P8














2366
T41
A4
P1


863
P8














2367
T37
A6
S2
V1


864
L8














2368
S38
P8


865
N8














2369
S43
T1
H1
P1


866
V7
A1













2370
E46


867
R8














2371
P38
A5
L2
S1


868
G8














2372
A37
T5
S3
P1


869
G8














2373
P31
S15


870
R8














2374
S39
P6
T1


871
D8














2375
G32
V8
D5
I1


872
A8














2376
C39
R4
X2
H1


873
V7
I1













2377
P23
X14
S5
L3
F1


874
I8














2378
P41
X5


875
L8














2379
D46


876
L8














2380
S46


877
M8














2381
D46


878
C8














2382
A36
V5
T3
D1
N1


879
A4
V4













2383
E24
D21
G1


880
V6
I2













2384
S25
X21


881
H8














2385
Y32
C12
F1
N1


882
P8














2386
S46


883
T6
A1
S1












2387
S46


884
L8














2388
M46


885
V8














2389
P27
X19


886
F8














2390
P45
X1


887
D8














2391
L45
X1


888
I8














2392
E46


889
T8














2393
G46


890
K8














2394
E46


891
L8














2395
P46


892
L8














2396
G46


893
L8














2397
D45
G1


894
A8














2398
P46


895
I4
V3
A1












2399
D46


896
F7
L1













2400
L46


897
G8














2401
S46


898
P8














2402
D44
G1
E1


899
L8














2403
G45
A1


900
W8














2404
S46


901
I8














2405
W46


902
L8














2406
S46


903
Q8














2407
T46


904
A6
T2













2408
V46


905
S8














2409
S45
G1


906
L8














2410
S45
N1


907
L8














2411
E25
G19
R2


908
K8














2412
A41
T2
D1
P1
S1


909
V8














2413
G23
D22
N1


910
P8














2414
T37
A8
G1


911
Y8














2415
E46
X2


912
F8














2416
D47
G1
X1


913
V8














2417
V50


914
R8














2418
V49
L1


915
V8














2419
C50


916
Q8














2420
C29


917
G8














2421
S12


918
L8














2422
M12


919
L7
I1













2423
S12


920
R8














2424
Y12


921
I6
F1
V1












2425
S11
T1


922
C8














2426
W12


923
A7
V1













2427
T12


924
L6
A1
I1












2428
G12


925
A7
S1













2429
A12


926
R7
P1













2430
L12


927
K8














2431
V7
I5


928
M7
I1













2432
T12


929
A5
V2
I1












2433
P12


930
G8














2434
C12


931
G7
L1













2435
A11
T1


932
H7
R1













2436
A12


933
Y8














2437
E12


934
V8














2438
E11
G1


935
Q8














2439
Q12


936
M8














2440
K12


937
A6
V2













2441
L12


938
I6
M1
V1












2442
P12


939
I8














2443
I12


940
K8














2444
N12


941
L5
V1
A1
I1











2445
A12


942
G8














2446
L12


943
A8














2447
S12


944
L8














2448
N12


945
T8














2449
S12


946
G8














2450
L12


947
T8














2451
L12


948
Y8














2452
R12


949
V6
I2













2453
H12


950
Y8














2454
H12


951
N7
D1













2455
N11
X1


952
H8














2456
L12


953
L8














2457
V12


954
T8














2458
Y12


955
P8














2459
S12


956
L8














2460
T12


957
R8














2461
T12


958
D8














2462
S12


959
W8














2463
R12


960
A8














2464
S12


961
H8














2465
A12


962
N7
S1













2466
C11
G1


963
G7
S1













2467
Q11
L1


964
L8














2468
R12


965
R8














2469
Q12


966
D8














2470
K12


967
L8














2471
K12


968
A8














2472
V12


969
V8














2473
T12


970
A8














2474
F12


971
V8














2475
D12


972
E8














2476
R12


973
P8














2477
L11
V1


974
V8














2478
Q12


975
V8














2479
V12


976
F8














2480
L12


977
S8














2481
D12


978
Q5
R3













2482
S10
N2


979
M8














2483
H12


980
E8














2484
Y12


981
T8














2485
Q11
R1


982
K8














2486
D12


983
L8














2487
V12


984
I8














2488
L12


985
T8














2489
K12


986
W8














2490
E12


987
G8














2491
V12


988
A7
G1













2492
K12


989
D8














2493
A12


990
T8














2494
A12


991
A8














2495
A12


992
A8














2496
S12


993
C8














2497
K12


994
G8














2498
V12


995
D8














2499
K12


996
I8














2500
A12


997
I8














2501
N12


998
N7
D1













2502
L12


999
G7
N1













2503
L12


1000
L8














2504
S12


1001
P8














2505
V12


1002
V8














2506
E12


1003
S8














2507
E12


1004
A8














2508
A12


1005
R8














2509
C12


1006
R8














2510
S12


1007
G8














2511
L12


1008
R6
Q2













2512
T12


1009
E8














2513
P12


1010
I8














2514
P12


1011
L8














2515
H12


1012
L8














2516
S12


1013
G8














2517
A12


1014
P8














2518
K8
R4


1015
A8














2519
S12


1016
D8














2520
K12


1017
G8














2521
F12


1018
M8














2522
G12


1019
V7
A1













2523
Y12


1020
S8














2524
G12


1021
K8














2525
A12


1022
G8














2526
K12


1023
W8














2527
D14


1024
R8














2528
V14


1025
L8














2529
R14


1026
L7
Q1













2530
C14


1027
A122
X2













2531
H14


1028
P124














2532
A14


1029
I124














2533
R14


1030
T124














2534
K14


1031
A123
X1













2535
A14


1032
Y124














2536
V14


1033
A122
T2













2537
N8
A5
T1


1034
Q123
X1













2538
H14


1035
Q124














2539
I14


1036
T124














2540
N14


1037
R122
X2













2541
S14


1038
G124














2542
V14


1039
L124














2543
W14


1040
L122
X2













2544
K14


1041
G123
R1













2545
D14


1042
C123
X1













2546
L14


1043
I124














2547
L14


1044
I108
V14
X2












2548
E14


1045
T124














2549
D13
A1


1046
S123
N1













2550
S11
N3


1047
L124














2551
V14


1048
T124














2552
T14


1049
G124














2553
P14


1050
R124














2554
I14


1051
D124














2555
D14


1052
K122
R1
X1












2556
T14


1053
N123
D1













2557
T14


1054
Q124














2558
I14


1055
V102
A22













2559
M14


1056
E124














2560
A14


1057
G124














2561
K14


1058
E124














2562
N14


1059
V120
I3
A1












2563
E14


1060
Q122
H1
X1












2564
V14


1061
I122
X1
V1












2565
F14


1062
V124














2566
C14


1063
S124














2567
V14


1064
T124














2568
Q14


1065
A123
T1













2569
P14


1066
A100
T21
X3












2570
E14


1067
Q124














2571
K14


1068
T122
S1
X1












2572
G14


1069
F123
X1













2573
G14


1070
L124














2574
R13
C1


1071
A123
X1













2575
K14


1072
T120
X4













2576
P14


1073
C123
S1













2577
A14


1074
I124














2578
R14


1075
N124














2579
L14


1076
G124














2580
I14


1077
V123
X1













2581
V14


1078
C124














2582
F13
Y1


1079
W124














2583
P14


1080
T124














2584
D14


1081
V124














2585
L14


1082
Y124














2586
G14


1083
H124














2587
V14


1084
G124














2588
R14


1085
A123
X1













2589
V14


1086
G123
R1













2590
C13
F1


1087
T121
X1
A1
S1











2591
E13
K1


1088
R120
K4













2592
K14


1089
T124














2593
M14


1090
I103
M19
L2












2594
A14


1091
A124














2595
L14


1092
S122
T2













2596
Y14


1093
S70
P34
A20












2597
D12
E2


1094
K124














2598
V13
L1


1095
G124














2599
V14


1096
P124














2600
S11
T2
R1


1097
V123
X1













2601
K14


1098
I124














2602
L14


1099
Q124














2603
P14


1100
M123
T1













2604
L12
P1
V1


1101
Y123
X1













2605
A14


1102
T124














2606
V14


1103
N123
X1













2607
M14


1104
V123
I1













2608
G14


1105
D124
X1













2609
S13
G1


1106
Q133
K23
G1
R1











2610
S14


1107
D157
P1













2611
Y14


1108
L157
X1













2612
G14


1109
V157
G1













2613
F14


1110
G157
X1













2614
Q14


1111
W156
R1
G1












2615
Y14


1112
P152
X2
V2
A1
S1










2616
S14


1113
A144
T11
P1
G1
X1










2617
P14


1114
P128
L21
X3
Q2
A2
H1
M1








2618
G13
A1


1115
Q130
P21
L3
X3
E1










2619
Q14


1116
G154
X1
E1
N1
S1










2620
R14


1117
A89
S36
T27
N1
I1
C1
V1
G1
R1






2621
V14


1118
R158














2622
E14


1119
S158














2623
F14


1120
L158














2624
L14


1121
T155
V1
A1
X1











2625
V14


1122
P156
X1
A1












2626
Q13
K1


1123
C158














2627
A14


1124
T152
A3
X2
P1











2628
W14


1125
C157
G1













2629
K14


1126
G157
X1













2630
S14


1127
S155
X3













2631
K14


1128
S156
X1
K1












2632
K13
R1


1129
D156
N1
X1












2633
T12
N2


1130
L158














2634
P14


1131
Y158














2635
M14


1132
L157
P1













2636
G14


1133
V157
A1













2637
F14


1134
T158














2638
S14


1135
R158














2639
Y14


1136
H158














2640
D14


1137
A158














2641
T14


1138
D157
E1













2642
R14


1139
V157
L1













2643
C14


1140
I157
V1













2644
F14


1141
P157
X1













2645
D14


1142
V158














2646
S14


1143
R158














2647
T14


1144
R158














2648
V14


1145
R153
Q3
X2












2649
T13
N1


1146
G157
D1













2650
E14


1147
D157
X1













2651
S14


1148
S154
G2
X2












2652
D14


1149
R157
T1













2653
X91
I14


1150
G155
A2
X1












2654
R106


1151
S156
T1
G1












2655
T105
X1


1152
L155
X2
Q1












2656
E104
X2


1153
L155
X2
P1












2657
E105
G1


1154
S158














2658
A105
L1


1155
P157
H1













2659
I103
X2
V1


1156
R157
X1













2660
Y105
X1


1157
P158














2661
Q105
L1


1158
I157
V1













2662
C106


1159
S156
X1
P1












2663
C105
X2


1160
Y158














2664
D106
X1


1161
L157
X1













2665
L106
X2


1162
K156
X2













2666
D108


1163
G158














2667
P108


1164
S157
X1













2668
Q104
R2
X2


1165
S158














2669
A108


1166
G158














2670
R108


1167
G158














2671
V106
T1
K1


1168
P157
X1













2672
A105
V1
X1
P1


1169
L157
X1













2673
I107
X1


1170
L156
X2













2674
K71
R37


1171
C158














2675
S108


1172
P157
S1













2676
L108


1173
A155
X1
P1
V1











2677
T107
I1


1174
G155
X3













2678
E106
Q1
X1


1175
H157
N1













2679
R108


1176
A157
X1













2680
L107
X1


1177
V157
X1













2681
Y107
X1


1178
G156
S2













2682
V107
I1


1179
I138
L20













2683
G108


1180
F157
X1













2684
G108


1181
R156
X2













2685
P108


1182
A155
P2
G1












2686
L108


1183
A157
X1













2687
T108


1184
V158














2688
N107
X1


1185
C154
X4













2689
S108


1186
T158














2690
R99
K8
X1


1187
R157
P1













2691
G108


1188
G157
X1













2692
E108


1189
V158














2693
N108


1190
A158














2694
C106
X2


1191
K157
X1













2695
G108


1192
A156
X2













2696
Y107
X1


1193
V158














2697
R108


1194
D155
X2
E1












2698
R107
X1


1195
F153
X5













2699
C106
X2


1196
I157
V1













2700
R108


1197
P154
X4













2701
A106
X2


1198
V157
A1













2702
S107
R1


1199
E157
V1













2703
G107
I1


1200
N87
S48
G18
X3
D2










2704
V108


1201
L156
X2













2705
L107
X1


1202
E157
G1













2706
T108


1203
T158














2707
T108


1204
T156
X2













2708
S108


1205
M157
X1













2709
C108


1206
R158














2710
G108


1207
S158














2711
N107
X1


1208
P42














2712
T108


1209
V44














2713
L108


1210
F43
X1













2714
T108


1211
T41
X2
S1












2715
C108


1212
D43
X1













2716
Y108


1213
N43
X1













2717
I108


1214
S44














2718
K108


1215
S41
A1
T1
X1











2719
A108


1216
P44














2720
Q81
R24
K2
L1


1217
P44














2721
A108


1218
A38
X4
V2












2722
A107
X1


1219
V43
X1













2723
C108


1220
P44














2724
R108


1221
Q43
E1













2725
A107
S1


1222
S42
T2













2726
A108


1223
F41
Y2
X1












2727
G100
R7
X1


1224
Q44














2728
L106
X1
R1


1225
V44














2729
R83
Q25


1226
A43
X1













2730
D104
X2
G1
E1


1227
H44














2731
C108


1228
L44














2732
T108


1229
H43
X1













2733
M108


1230
A44














2734
L103
X3
V2


1231
P44














2735
V107
X1


1232
T43
X1













2736
C106
X2


1233
G44














2737
G107
X1


1234
S44














2738
D107
X1


1235
G44














2739
D108


1236
K41
X2
N1












2740
L107
X1


1237
S43
X1













2741
V105
X2
I1


1238
T44














2742
V105
X2
A1


1239
K43
X1













2743
I105
X1


1240
V44














2744
C105
X1


1241
P44














2745
E105
X1


1242
A44














2746
S105
X1


1243
A42
X2













2747
A59
Q44
V1
X1
E1


1244
Y42
X2













2748
G106


1245
A44














2749
V104
I1
X1


1246
A44














2750
Q103
P3


1247
Q43
X1













2751
E106


1248
G44














2752
D103
X3


1249
Y44














2753
A106


1250
K44














2754
A102
X1
P1
G1
M1


1251
V44














2755
S102
N2
X1
C1


1252
L44














2756
L106


1253
V44














2757
R105
X1


1254
L44














2758
A106


1255
N44














2759
F105
X1


1256
P44














2760
T105
X1


1257
S44














2761
E106


1258
V44














2762
A106


1259
A43
F1













2763
M106


1260
A44














2764
T105
I1


1261
T43
X1













2765
R106


1262
L43
X1













2766
Y105
N1


1263
G43
S1













2767
S105
X1


1264
F44














2768
A105
V1


1265
G44














2769
P105


1266
A41
V2
T1












2770
P105


1267
Y44














2771
G104
R1


1268
M44














2772
D105


1269
S44














2773
P104
H1


1270
K41
R2
Q1












2774
P105


1271
A44














2775
Q101
R3
X1


1272
H43
Y1













2776
P105


1273
G44














2777
E105


1274
I25
V19













2778
Y105


1275
D41
E2
X1












2779
D105


1276
P42
X2













2780
L105


1277
N41
S2
X1












2781
E102


1278
I43
S1













2782
L101
R1


1279
R44














2783
I102


1280
T43
X1













2784
T102


1281
G44














2785
S101
X1


1282
V44














2786
C102


1283
R44














2787
S101
X1


1284
T44














2788
S102


1285
I44














2789
N102


1286
T44














2790
V102


1287
T44














2791
S102


1288
G43
S1













2792
V101
X1


1289
S44














2793
A102


1290
P41
S2
X1












2794
H102


1291
I44














2795
D101
X1


1292
T43
X1













2796
G97
S2
X1
D1
E1


1293
Y44














2797
A92
T9
V1


1294
S44














2798
G102


1295
T44














2799
K101
R1


1296
Y44














2800
R102


1297
G44
X1













2801
V102


1298
K45














2802
Y102


1299
F45














2803
Y102


1300
L45














2804
L101
X1


1301
A45














2805
T102


1302
D44
X1













2806
R102


1303
G43
X2













2807
D102


1304
G45














2808
P101
X1


1305
C44
X1













2809
T101
A1


1306
S45














2810
T102


1307
G45














2811
P102


1308
G44
X1













2812
L97
F4
I1


1309
A45














2813
A100
V1
X1


1310
Y45














2814
R102


1311
D45














2815
A102


1312
I45














2816
A102


1313
I45














2817
W102


1314
I44
X1













2818
E100
X2


1315
C44
X1













2819
T102


1316
D45














2820
A102


1317
E45














2821
R100
K2


1318
C45














2822
H101
R1


1319
H43
X2













2823
T102


1320
S45














2824
P102


1321
T44
P1













2825
V88
I14


1322
D45














2826
N102


1323
A43
X1













2827
S102


1324
T43
X1
P1












2828
W102


1325
S44
X1













2829
L102


1326
I39
V5
X1












2830
G102


1327
L45














2831
N101
X1


1328
G45














2832
I101
V1


1329
I45














2833
I102


1330
G45














2834
M102


1331
T45














2835
F101
Y1


1332
V43
A2













2836
A102


1333
L42
X3













2837
P102


1334
D45














2838
T102


1335
Q45














2839
L102


1336
A43
X2













2840
W102


1337
E43
D1
X1












2841
A79
V23


1338
T43
X2













2842
R101
K1


1339
A44
X1













2843
M102


1340
G45














2844
I97
V5


1341
A45














2845
L95
M7


1342
R43
T1
X1












2846
M86
L14
A1
T1


1343
L45














2847
T102


1344
V44
L1













2848
H102


1345
V45














2849
F102


1346
L45














2850
F102


1347
A45














2851
S101
G1


1348
T45














2852
V68
I33
X1


1349
A44
P1













2853
L102


1350
T44
P1













2854
I81
M19
L2


1351
P45














2855
A100
T1
X1


1352
P45














2856
R102


1353
G45














2857
D101
X1


1354
S44
X1













2858
Q102


1355
V36
I7
X2












2859
L102


1356
T43
S1
X1












2860
E102


1357
V44
X1













2861
Q102


1358
P40
S5













2862
A102


1359
H44
X1













2863
L101
X1


1360
P44
S1













2864
D98
N3
X1


1361
N45














2865
C100
G1


1362
I45














2866
E101


1363
E45














2867
I101


1364
E45














2868
Y100


1365
V43
A1
I1












2869
G101


1366
A45














2870
A97
X2


1367
L45














2871
C92
X3


1368
S43
P2













2872
Y92


1369
T44
X1













2873
S91
X1


1370
T45














2874
I92


1371
G45














2875
E91
X1


1372
E45














2876
P92


1373
I43
V2













2877
L91
X1


1374
P44
X1













2878
D88
X4


1375
F45














2879
L91
I1


1376
Y45














2880
P87
X5


1377
G45














2881
P73
X6
S3
Q3
L2
A1


1378
K45














2882
I79
X3
V1


1379
A45














2883
I80
X2


1380
I45














2884
Q80
X2


1381
P45














2885
X61
R6


1382
L45














2886
L6


1383
E44
G1













2887
H5


1384
V32
A11
X2












2888
G5


1385
I44
X1













2889
L5


1386
K43
R1
X1












2890
S5


1387
G45














2891
A5


1388
G45














2892
F5


1389
R45














2893
S5


1390
H45














2894
L5


1391
L45














2895
H5


1392
I45














2896
S5


1393
F45














2897
Y5


1394
C44
X1













2898
S5


1395
H45














2899
P5


1396
S45














2900
G5


1397
K44
R1













2901
E5


1398
K44
R1













2902
I5


1399
K45














2903
N5


1400
C43
X2













2904
R5


1401
D41
N2
X2












2905
V5


1402
E43
D2













2906
A5


1403
L45














2907
A5


1404
A45














2908
C5


1405
A42
T2
X1












2909
L4
X1


1406
K44
X1













2910
R4


1407
L45














2911
K4


1408
V42
X3













2912
L4


1409
A42
V1
G1
X1











2913
G4


1410
L43
M2













2914
V4


1411
G44
X1













2915
P4


1412
I28
V16
L1












2916
P3
A1


1413
N45














2917
L4


1414
A45














2918
R4


1415
V45














2919
A4


1416
A44
X1













2920
W4


1417
Y43
X2













2921
R4


1418
Y45














2922
H4


1419
R45














2923
R4


1420
G45














2924
A4


1421
L45














2925
R4


1422
D45














2926
S4


1423
V45














2927
V4


1424
S45














2928
R4


1425
V44
A1













2929
A4


1426
I45














2930
R3
K1


1427
P45














2931
L4


1428
T43
A2













2932
L4


1429
S44
N1













2933
S4


1430
G44
X1













2934
R4


1431
D44
X1













2935
G3
V1


1432
V45














2936
G4


1433
V45














2937
R4


1434
V45














2938
A4


1435
V45














2939
A4


1436
A39
S4
X2












2940
I4


1437
T45














2941
C4


1438
D45














2942
G4


1439
A44
X1













2943
K4


1440
L45














2944
Y4


1441
M45














2945
L4


1442
T43
X2













2946
F4


1443
G45














2947
N4


1444
Y26
F19













2948
W4


1445
T44
X1













2949
A4


1446
G45














2950
V4


1447
D45














2951
R4


1448
F44
X1













2952
T4


1449
D45
X1













2953
K4


1450
S46














2954
L4


1451
V46














2955
K4


1452
I46














2956
L4


1453
D46














2957
T4


1454
C46
X2













2958
P4


1455
N47
X1













2959
I4


1456
T48














2960
A3
T1


1457
C47
R1













2961
A4


1458
V48














2962
A4


1459
T48














2963
G4


1460
Q47
X2













2964
Q3
R1


1461
T47
X2













2965
L4


1462
V49














2966
D4


1463
D49














2967
L4


1464
F49














2968
S4


1465
S49














2969
G4


1466
L48
F1













2970
W4


1467
D48
H1













2971
F4


1468
P48
X1













2972
T4


1469
T47
X1
A1












2973
A4


1470
F49














2974
G4


1471
T49














2975
Y4


1472
I48
X1













2976
S4


1473
E46
D3













2977
G4


1474
T48
I1













2978
G4


1475
T46
S2
I1












2979
D4


1476
T49














2980
I4


1477
L48
X1













2981
Y4


1478
P49














2982
H4


1479
Q48
X1













2983
S4


1480
D49














2984
V4


1481
A47
X2













2985
S4


1482
V47
X2













2986
R3
H1


1483
S49














2987
A4


1484
R46
X3













2988
R4


1485
T48
S1













2989
P4


1486
Q49














2990
R4


1487
R47
X2













2991
W4


1488
R48
X1













2992
F4


1489
G49














2993
W4


1490
R49














2994
F4


1491
T49














2995
C4


1492
G49














2996
L4


1493
R48
X1













2997
L4


1494
G49














2998
L4


1495
K44
R5













2999
L4


1496
P48
A1













3000
A4


1497
G49














3001
A4


1498
I47
F1
X1












3002
G4


1499
Y49














3003
V4


1500
R49














3004
G3
C1


1501
F49














3005
I4


1502
V49














3006
Y4


1503
A46
T3













3007
L4


1504
P49














3008
L4


1505















3009
P4


1506















3010
N4


1507















3011
R4
















TABLE 6







HCV 1b Consensus Sequences


























A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R





























1
M235
R1


















2
S235
Q1


3
T235
E1


4
N230
I2
T2
D1
F1


5
P234
L1
G1


6
K236


7
P236


8
Q236


9
R236


10
K227
Q8
R1


11
T232
I3
S1


12
K233
I1
N1
Y1


13
R236
L1


14
N236


15
T236


16
N229
Y4
I1
D1
S1


17
R234
L2


18
R236


19
P236


20
Q236


21
D236


22
V233
I2
L1


23
K236


24
F236


25
P236


26
G235
A1


27
G236


28
G236


29
Q234
K1
R1


30
I235
V1


31
V236


32
G236


33
G236


34
V236


35
Y236


36
L235
V1


37
L234
F1
M1


38
P234
T2


39
R235
P1


40
R236


41
G235
C1


42
P236


43
R226
K5
T2
A2
S1


44
L236


45
G236


46
V236


47
R236


48
A233
T1
P1
X1


49
T224
P8
I2
R1
L1


50
R236


51
K235
Q1


52
T234
A1
I1


53
S235
W1


54
E236


55
R236


56
S233
P3


57
Q236


58
P234
A2


59
R235
A1


60
G235
E1


61
R235
S1


62
R235
P1


63
Q235
H1


64
P235
L1


65
I235
V1


66
P236


67
K233
R1
N1
E1


68
A228
V8


69
R236


70
R140
Q90
H4
K1
X1


71
P229
L2
H2
S2
R1


72
E236


73
G236


74
R235
W1


75
A112
T110
V5
S5
N4


76
W234
C1
S1


77
A236


78
Q236


79
P236


80
G235
W1


81
Y235
H1


82
P236


83
W236


84
P236


85
L236


86
Y235
F1


87
G231
A5


88
N234
D2


89
E235
Q1


90
G235
S1


91
M125
L105
C3
I1
X1
F1


92
G236


93
W236


94
A233
T2
P1


95
G235
E1


96
W235
L1


97
L235
F1


98
L235
V1


99
S236


100
P236


101
R227
H5
Q2
Y2


102
G235
S1


103
S236


104
R236


105
P235
S1


106
S216
N18
R2


107
W236


108
G236


109
P236


110
T205
N16
S12
R1
K1
I1


111
D236


112
P236


113
R236


114
R234
C2


115
R230
K5
G1


116
S236
X1


117
R236
S1


118
N236
I1


119
L236
V1


120
G237


121
K237


122
V236
I1


123
I237


124
D237


125
T236
S1


126
L236
F1
P1


127
T238


128
C238


129
G238


130
F227
L8
V3


131
A236
P2


132
D238


133
L236
P1
F1


134
M236
V1
L1


135
G238
X12


136
Y249
H1


137
I246
V2
L2


138
P250


139
L242
R5
P1
F1
V1


140
V250


141
G250


142
A242
P4
G4


143
P250


144
L248
V1
I1


145
G247
R2
A1


146
G249


147
A202
V48


148
A245
S3
V2


149
R249
K1


150
A241
V9


151
L250
X1


152
A250
V1


153
H251


154
G250
S1


155
V251
X1


156
R252


157
V246
A5
X3
I2
L1


158
L241
V14
R2


159
E255
L2


160
D255
G1
V1


161
G233
S24


162
V257


163
N257


164
Y257


165
A256
P2


166
T258
X1


167
G259


168
N259
X1


169
L255
M3
I1
F1


170
P259
A1


171
G260


172
C260


173
S253
P7
X1


174
F261


175
S260
Y1


176
I257
X8
L3
V1


177
F272
S3


178
L269
F6
X2


179
L268
W3
S2
M2
G1
I1


180
A271
V2
P2
F1
G1


181
L273
F3
V1


182
L270
V5
M2


183
S277


184
C275
G2


185
L275
V1
M1


186
T275
P1
I1


187
I216
T40
V16
A1
H1
N1
X1
M1


188
P275
Q1


189
A246
V21
T7
N1
G1


190
S275
F1


191
A274
T2
G1


192
Y182
I31
H9
X6
P1


193
E215
Q5
D2
V1
K1


194
V221
G1
T1
M1


195
R208
H9
G4
A1
N1
C1


196
N220
X5
H3
K1


197
V188
A35
X1
I1
L1
G1
T1
S1


198
S226
X2
F1
P1


199
G226
R2
D1
E1


200
V144
I43
A17
M15
L7
G3
D1


201
Y229
V1


202
H221
Q6
Y2
L1


203
V229
G1


204
T229
S1


205
N227
Y1
H1
S1


206
D229
Y1


207
C230


208
S228
P2


209
N222
Y3
S2
H1
K1
Q1


210
S179
A44
T6
L1


211
S228
N1
G1


212
I229
V1


213
V226
A2
G2


214
Y218
F10
S1
Q1


215
E227
K1
D1
G1


216
A193
T33
V3
P1


217
A208
S6
V5
E4
D3
Q2
T1
K1


218
D224
G4
H1
N1


219
M161
V35
L19
I12
T2
S1


220
I229
M1


221
M215
L14
I1


222
H228
Q1
D1


223
T183
A19
S15
I7
L3
F1
N1
V1


224
P230


225
G230


226
C229
F1


227
V225
M2
A1
T1
L1


228
P229
A1


229
C228
F1
S1


230
V230


231
R220
Q4
L2
W2
S1
G1


232
E225
D4
V1


233
N106
G49
D40
S16
A11
K3
E2
Q2
Y1


234
N224
D2
G2
Y1
K1


235
S183
F12
A7
I6
T6
H5
L5
V4
Y1
C1


236
S231


237
R224
S3
H2
K1
Q1


238
C229
Y2


239
W231


240
V223
A5
I2
T1


241
A230
E1


242
L229
V1
I1


243
T225
A6


244
P230
L1


245
T231


246
L228
V1
P1
H1


247
A229
S2


248
A228
T1
S1
G1


249
R228
K3


250
N228
D2
S1


251
S92
A85
T14
G9
I9
N7
V5
R3
F3
D1
H1
L1
P1


252
S217
T10
N3
R1


253
V132
I98
L1


254
P229
S2


255
T228
V2
I1


256
T186
A24
K15
M6


257
T197
A29
S4
Q1


258
I212
L10
M5
V4


259
R231


260
R225
H4
D1
C1


261
H231


262
V226
I5


263
D230
N1


264
L230
V1


265
L230
H1


266
V231


267
G242


268
A190
T47
V4
G1


269
A242


270
A222
T16
V4


271
F211
L31


272
C228
S7
W5
L1
R1


273
S241
A1


274
A235
V6
S1


275
M241
L1


276
Y245


277
V243
G1
M1


278
G243
W1
E1


279
D244
N1


280
L236
F8
X1


281
C245


282
G244
R1


283
S245


284
V236
I5
A3
X1


285
F232
L13


286
L244
P1


287
V215
I27
A2
L1


288
S239
G3
A3


289
Q245


290
L245


291
F243
I1
L1


292
T241
V3
A1


293
F233
L9
I3


294
S244
L1


295
P242
A2
T1


296
R243
S1
P1


297
R220
Q14
W3
L3
H2
V2
G1


298
H185
Y60


299
E206
Q9
M6
V6
W4
T3
A3
N2
G2
K2
R2


300
T242
I2
V1


301
V216
I8
L7
T7
A5
E2


302
Q243
H1
R1


303
D228
E12
N2
Y1
T1
S1


304
C245


305
N243
K1
D1


306
C245


307
S245


308
I202
L42
H1


309
Y244
F1


310
P238
L3
S2
A1
V1


311
G245


312
H241
R3
K1


313
V194
L34
I15
A2


314
S164
T81


315
G243
V1
S1


316
H244
Y1


317
R244
A1


318
M249
I1


319
A250
X5


320
W254
X10
L1


321
D261
X24
N3
E1


322
M288
V1
X1


323
M286
I3
R2
X1


324
M291
X3


325
N290
T1


326
W290
C1


327
S291


328
P313
L4
A1
H1


329
T315
A5
K2
Q1
S1


330
T209
A115
S3
X2
I1
Q1
L1
K1
V1


331
A329
G3
T3
L2
D1
S1
P1
V1


332
L332
I5
P4
M2
X1


333
V334
I4
M2
G2
S1
L1


334
V328
M6
L5
A4
P1
G1


335
S326
A9
V3
L2
C1
X1
G1
H1
D1


336
Q338
W2
H2
Y1
E1
V1


337
L324
I6
X6
V5
M1
Y1
A1
F1


338
L331
F3
P2
M1
G1


339
R335
P1
M1
Q1


340
I320
L8
V7
N1
X1
Y1


341
P330
X7
R1


342
Q333
E1


343
A325
T6
S2
V1


344
V261
I64
A7
T2


345
V231
M76
L24
I3
G1


346
D332
X1
N1
L1


347
M313
I10
V10
T1
E1


348
V309
I18
M4
L4


349
A301
V14
T13
G6
S1


350
G335


351
A326
G6
S2
X1


352
H331
Y1
Q1
L1


353
W334


354
G332
E1
X1


355
V316
I17
A1


356
L331
M1
P1
X1


357
A329
V4
F1


358
G333
D1


359
L315
I18
P1


360
A331
G2
V1


361
Y333


362
Y318
F15


363
S322
A7
T2
P2


364
M333


365
V290
A38
I6
Q1


366
G330
A4
R1


367
N332
K1
S1
A1


368
W335


369
A334
X4


370
K338


371
V334
A2
L1


372
L331
V4
M1
F1


373
I299
V32
L6


374
V329
X5
A2
I1


375
M306
L25
I1


376
L332


377
L329
P2
R1


378
F331
A1


379
A324
S6
L1
V1


380
G330
S1
R1


381
V329
A3


382
D329
N2
A1


383
G304
A19
R3
E1
Q1
T1
L1
P1
X1


384
S43
T41
E40
G34
D31
N30
H28
Q21
R20
A16
K4
V4
Y2
M1
P1


385
T311
D1
S1
N1
I1
A1


386
H144
Y69
R59
T14
V6
S5
L5
N3
Q3
I2
M1
A1
F1
K1
G1


387
V181
T107
A12
I5
L4
M3
Q1
S1


388
T183
S56
V26
I21
M14
L7
A5
W1
X1


389
G314
V1


390
G278
A27
E6
S2
V1
K1


391
A110
T86
V49
S29
Q20
E6
K5
H4
R3
X1
N1
M1


392
A110
Q109
V31
T28
S14
E6
P4
L3
N2
H2
M2
K1
Y1
R1
I1


393
A143
G130
S40
T1
X1


394
R144
H88
Y28
Q15
F12
S11
L7
K7
M1
D1
A1


395
T147
N48
S41
A37
G11
V6
D5
H4
Q3
Y3
L3
M1
R1
I1
K1
E1
C1
F1


396
T196
A59
L22
I18
V17
S1
G1
M1


397
R89
S85
Q33
H30
Y25
L17
N10
F7
A6
G4
W3
V2
K1
M1
T1
I1


398
G180
S63
R38
T13
Q6
K5
A3
V2
I2
L1
M1
E1


399
L143
F133
I27
V10
H1
M1


400
T180
A94
V34
M3
S3
R1


401
S245
G29
T16
N10
A7
R5
D1
H1
Y1


402
L235
F44
I24
M9
W2
X1


403
F293
L21
Y1


404
S133
T119
A28
N14
R9
Q3
D3
M2
K1
L1
I1
V1


405
P135
S58
L33
R23
V18
F17
A10
Q8
T7
C2
I2
H1
Y1


406
G314
E1


407
P173
A103
S36
Q1
X1
R1


408
S196
A61
K16
Q13
T12
R6
H4
N3
V2
E1
L1


409
Q310
H4
E1


410
K193
N85
R26
T3
D3
E2
H2
P1


411
I274
V20
L20
N1


412
Q311
K2
E1


413
L312
I2


414
V163
I145
M3
X2


415
N303
K5
H2
Y1


416
T291
S15
N2
A1


417
N305
S1
E1


418
G305
D1
X1


419
S304
N2
I1


420
W304
R1


421
H305


422
I274
V31


423
N305


424
R298
S7


425
T305


426
A303
S1
X1


427
L304


428
N298
S5
D1


429
C303
W1


430
N289
D10
K3
Y1
X1


431
D283
A11
E8
G1
X1


432
S284
T16
A2
P1


433
L292
I4
F3
H3
Y1


434
Q117
N108
K34
H24
D7
S6
E3
R3


435
T295
S1
A1


436
G297


437
F277
W20


438
L185
I93
V10
F9


439
A289
T4
S3
G1


440
A275
G16
S5
T1


441
L296
V1


442
F283
L6
I5
S1
V1


443
Y295
F1


444
T159
A83
V25
Y8
H8
K5
R4
F2
Q1
L1


445
H201
N41
R37
Y9
K6
S2


446
K149
R105
S19
N16
Q3
E2
M1
G1


447
F283
I6
L6
V1


448
N294
K1
D1


449
A144
S133
T7
D7
G2
E1
P1
M1


450
S287
T6
C1
A1
P1


451
G294
E1
R1


452
C293
X2
G1


453
P258
S20
L6
T3
A2
R2
I2
V1
G1


454
E270
Q13
A6
G3
K1
D1
P1


455
R287
K3
H2
G2
P1


456
M234
L60
I1


457
A283
S5
X3
V2
N1
T1


458
S278
T4
G3
Q2
R1
H1
V1
N1
X1


459
C291


460
R272
S6
K6
H3
Q2
L1
C1


461
P206
S75
T3
R2
N1
A1
X1
F1
H1


462
I267
L20
V3


463
D253
A11
S6
T6
N5
E4
G3
V1


464
K150
E51
T32
Q17
A10
D9
R7
G5
N3
S2
Y1
H1
W1


465
F288
Y1


466
A149
D87
S25
N10
V5
T5
E3
H3
G1
R1


467
Q288
I1


468
G289


469
W287
F1
S1


470
G288
X1


471
P275
S12
A1
L1


472
I284
V3
L2


473
T266
S13
A3
N2
R2
Q1
H1
K1


474
Y248
H37
N1
F1
C1
S1


475
A165
V38
T36
D28
G11
N6
I2
H1
S1
Y1


476
E153
K33
V24
G15
N15
Q12
R9
A9
D7
T4
S4
M3
P1


477
P175
S48
G26
A8
D6
Q6
R6
H5
Y3
T2
N2
L1
V1


478
D68
N42
S35
G35
P31
R23
H18
T16
A7
E4
V4
Q3
K2
L1


479
S120
G50
D41
N27
I27
V8
R5
T3
A2
K2
P2
H1
L1


480
S116
L70
P58
Q26
V4
R4
M3
T3
K2
E1
X1
W1


481
D271
E13
X3
G1


482
Q269
H14
R1
M1


483
R235
K48
G1
S1


484
P284
L1


485
Y285


486
C285


487
W283
S1
G1


488
H283
S1
P1


489
Y271
X13
N1


490
A230
P30
R1


491
P259
Q1
L1


492
R157
Q62
K36
P3
E2
S1


493
P193
Q40
R10
K9
L6
A1
X1
S1


494
C259


495
G249
S3
D2
K1
V1
X1
N1
T1


496
I238
T11
V10


497
V233
I22
X1
A1
T1
E1


498
P249
S6
X2
A1
H1


499
A248
P2
X2
T2
W1


500
S211
A15
L13
Q7
R3
K2
X1


501
Q133
E74
K20
T7
N7
G3
D3
R1
S1


502
V243
A1
C1
M1
X1


503
C228
X14
P1


504
G227
V5
A1
X1
S1


505
P228
Q6
G1


506
V223
X8
M3


507
Y183
H1
X1


508
C183
R1


509
F179
L1
X1


510
T175
X5


511
P174


512
S171
T2
A1


513
P174


514
V171
I2


515
V171
A2


516
V171
N1
X1


517
G171
W1


518
T168
S2
K1
X1


519
T171
S1


520
D170
N1


521
R161
H5
S2
K1
G1
P1


522
F111
S45
L9
Y2
H2
A1
T1


523
G168
S2


524
V112
A55
N2
T1


525
P169
R1


526
T170


527
Y158
X1
S1


528
S77
N36
T24
R18
K2
D1
V1


529
W158
X10
R1


530
G165
X3
R1


531
E131
A16
D6
G5
V4
N3
S1


532
N164
S2


533
E158
D3
K2
G2
V1


534
T163
S2
I1


535
D166


536
V163
M1
A1
I1


537
L165
F1


538
L142
I12
V7
F2
P2
Y1


539
L166


540
N164
T1
S1


541
N162
S4


542
T157
S4
M2
A2
X1


543
R163
G2
L1


544
P163
L2
A1


545
P155
X11


546
Q112
R28
H7
L7
P1


547
G153
A1
D1


548
N151
T2
S2


549
W154
K1


550
F155


551
G155


552
C154
Y1


553
T154
S1


554
W154
Y1


555
M153
I1
V1


556
N154
S1


557
S99
G44
A8
T3
N1


558
T153
S2


559
G154
R1


560
F150
Y4
S1


561
T154
A1


562
K154
M1


563
T153
V1
A1


564
C155


565
G154
E1


566
G142
A10
R1
D1
V1


567
P155


568
P151
S2
A1
L1


569
C155


570
N130
D9
K7
G3
H2
T2
V1
E1


571
I155


572
G152
E1
S1
R1


573
G155


574
V105
A33
I6
G5
E2
L2
N1
R1


575
G153
S1
D1


576
N153
S1


577
N113
D34
K3
T2
L1
S1
E1


578
T150
S3
R1
A1


579
L154
F1


580
T85
I58
V10
H1
L1


581
C155


582
P153
S2


583
T154
S1


584
D155


585
C155


586
F155


587
R155


588
K153
X1
E1


589
H154


590
P153
S1


591
E145
G3
D3
A2
Q1


592
A153
S1


593
T153
A1


594
Y154


595
T141
A9
S3
X1


596
K144
R10


597
C154


598
G153
X1


599
S154


600
G154


601
P154


602
W154


603
L152
S1
I1


604
T149
R1
I1


605
P151


606
R150
K1


607
C151


608
M69
L51
I31


609
V149
I2


610
D126
H24
N1


611
Y151


612
P151


613
Y151


614
R151


615
L150
P1


616
W147
L1


617
H147
Y1


618
Y143
X5


619
P143


620
C142
S1


621
T141
A2


622
V131
L5
I4
A2
F1


623
N142
H1


624
F141
Y2


625
T118
S24
A1


626
I126
V12
T3
L2


627
F142
L1


628
K139
T3
Q1


629
V131
I12


630
R142
K1


631
M143


632
Y142
N1


633
V143


634
G143


635
G143


636
V134
I3
A2
L2
M2


637
E142
G1


638
H142
Q1


639
R142
K1


640
L140
F3


641
N79
D21
E13
S13
T8
K5
I2
H1
V1


642
A142
V1


643
A141
G1
V1


644
C140
S1
G1
R1


645
N142
K1


646
W143


647
T143


648
R141
W1


649
G142
R1


650
E139
D3
A1


651
R143


652
C142
R1


653
D103
N36
G2
A1
E1


654
L140
V3


655
E126
T5
D5
V2
G2
S1
R1
A1


656
D143


657
R143


658
D143


659
R143


660
S134
A5
T2
P1
L1


661
E143


662
L143


663
S143


664
P143


665
L143


666
L142
I1


667
L142
R1


668
S141
A1
F1


669
T142
I1


670
T143


671
E143


672
W142


673
Q143


674
I77
V65
T1


675
L142
M1


676
P143


677
C143


678
S138
A4
T1


679
F137
Y6


680
T143


681
T140
A1
P1
G1


682
L143


683
P143


684
A141
G1
V1


685
L141
V1
P1


686
S142
F1


687
T143


688
G143


689
L143


690
I143


691
H143


692
L143


693
H143


694
Q108
R35


695
N142
G1


696
I126
V10
T6
F1


697
V143


698
D142
H1


699
V132
I10
A1


700
Q143


701
Y142
H1


702
L142
R1


703
Y141
H1
D1


704
G142
C1


705
I102
V41


706
G143


707
S143


708
A124
V16
T2
G1


709
V133
I5
A2
G1
L1
F1


710
V138
A4
I1


711
S141
P2


712
F68
Y32
I19
V16
L6
A1


713
A99
V42
T1


714
I134
V8


715
K126
R16


716
W142


717
E140
D2


718
Y141


719
V113
I28
F1


720
L99
V38
I4
M1


721
L141
I1


722
L137
F4
S1
X1


723
F141
S1
X1
L1


724
L140
F3
N1


725
L110
F24
Y6
P3
S1


726
L143
A1


727
A143
M1


728
D142
Y1
A1


729
A143
E1


730
R143
S1


731
V137
I4
F3


732
C144


733
A141
P1
G1
S1


734
C137
A3
W3
G1


735
L144


736
W144


737
M142
T1
R1


738
M143
I1


739
L140
M4


740
L142
M2


741
I124
V18
X1
K1


742
A138
P2
V1
G1
T1


743
Q143


744
A142
V1


745
E140
D2
K1


746
A140
G1
P1


747
A133
T5
V1
L1
P1


748
L140
M1


749
E138
G2
K1


750
N139
K1
I1


751
L141


752
V140
A1


753
V135
I4
A2


754
L140
F1


755
N141


756
A140
S1


757
A138
S1
G1
P1


758
S139
Y1
A1


759
V127
L11
M3


760
A139
T1
V1


761
G138
A1
D1
R1


762
A94
T36
V4
E2
M2
K1
S1
R1


763
H138
R2
D1


764
G138
S2
D1


765
I114
V9
T8
F5
L4
M1


766
L134
F4
P2
S1


767
S128
P11
F1
T1


768
F140
L1


769
L138
I2
F1


770
V139
M2


771
F131


772
F131


773
C131


774
A129
E2


775
A131


776
W129
E2


777
Y127
F2
C2


778
I128
M1


779
K128
Q1


780
G129


781
R99
K29
E1


782
L129


783
V121
X5
A2
I1


784
P123


785
G121
A2


786
A120
V3


787
A118
T4
V1


788
Y123


789
A121
V2


790
F61
L48
I8
V4
Y1
S1


791
Y123


792
G118
S5


793
V118
A3
M1
I1


794
W122
C1


795
P123


796
L123


797
L121
F2


798
L122
M1


799
L123


800
L123


801
L120
M3


802
A111
T8
S3
X1


803
L123


804
P123


805
P118
A2
Q1
S1
H1


806
R120
L1
Q1
X1


807
A122
V1


808
Y121
F2


809
A123


810
M122
L7
A2
I1


811
D127
E4
A1


812
R129
T1
Q1
P1


813
E130
G2


814
M124
V8


815
A130
T1
V1


816
A126
P3
G2


817
S130


818
C129
S1


819
G129
E1


820
G130


821
A100
G13
V10
M4
T3


822
V130


823
F120
L10


824
V68
I55
L4
A1
M1
T1


825
G130


826
L130


827
A70
V46
I7
M5
T2


828
L112
F15
I3


829
L130


830
T130


831
L126
M4


832
S130


833
P130


834
H76
Y52
Q2


835
Y125
X3
C1


836
K127
R3


837
V106
A14
L3
E3
F1
M1
P1
T1


838
F121
L6
V2
I1


839
L130


840
A128
T2


841
R110
K20


842
L123
I3
F3
P1


843
I124
M3
L3


844
W128
R2


845
W129
R1


846
L125
S4
M1


847
Q130


848
Y129
F1


849
F103
L26
Y1


850
I112
L13
T4
S1


851
T127
A3


852
R118
I11
K1


853
A123
T3
V2
S2


854
E130


855
A130


856
H104
L16
Q4
C2
Y2
I1
D1


857
L128
V2


858
Q114
H14
C1
L1


859
V127
M2
I1


860
W129
S1


861
I73
V54
T1
A1
L1


862
P128
S2


863
P130


864
L130


865
N129
S1


866
V109
I18
A3


867
R130


868
G128
X2


869
G130


870
R130


871
D128
N2


872
A130


873
I124
V6


874
I130


875
L128
I1
F1


876
L129
I1


877
T81
M36
A10
V3


878
C130


879
A98
V31
M1


880
V77
I38
A12
F1
L1
T1


881
H127
Y2
R1


882
P122
S7
A1


883
E121
G8
D1


884
L128
R1
S1


885
I121
V8
T1


886
F129
S1


887
D77
A17
E17
T17
S2


888
I129
V1


889
T129
S1


890
K129
R1


891
I62
L58
N4
T2
F2
Y2


892
L129
V1


893
L124
I4
V2


894
A125
S4
T1


895
I111
T11
V6
M1
A1


896
L108
F21
V1


897
G130


898
P130
K7


899
L129
P4
I3
V1


900
M116
L10
T8
V1
Y1
P1


901
V130
M5
I2


902
L129
F8


903
Q134
L1
R1
P1


904
A132
V3
T1
G1


905
G114
S12
A8
V2
R1


906
I101
L32
M4


907
T100
A16
I16
V2
S1
N1


908
R104
K27
A3
Q2


909
V123
M9
A2
L1
I1


910
P136


911
Y135
D1


912
F136


913
V136


914
R134
P2


915
A136


916
Q90
H46


917
G136


918
L135
V1


919
I135
V1


920
R134
H2


921
A121
V13
E1
M1


922
C136


923
M109
L23
V2
A1
I1


924
L136


925
V120
L11
A5


926
R134
W2


927
K129
Q4
E3


928
V109
A24
T2
I1


929
A128
V5
L1
S1
G1


930
G136


931
G135
D1


932
H131
Q4
L1


933
Y136


934
V116
I13
A3
F3
L1


935
Q136


936
M136


937
A130
V6


938
F75
L61


939
M116
V19
I1


940
K94
R42


941
L136


942
A127
G8
T1


943
A133
E1
P1
G1


944
L131
M5


945
T132
A2
S1
K1


946
G135
S1


947
T136


948
Y135
S1


949
V117
L12
I7


950
Y136
X1


951
D110
N25
T1
A1


952
H137


953
L137


954
T120
A13
S4


955
P137


956
L137


957
R90
Q44
K2
G1


958
D111
H11
Y9
G4
N2


959
W136
C1


960
A133
P3
R1


961
H134
R3


962
A76
E28
T16
V11
S3
G3


963
G131
S5
A1


964
L137


965
R135
Q2


966
D135
G1
E1


967
L137


968
A133
V3
T1


969
V135
A2


970
A137


971
V137


972
E137


973
P137


974
V136
I1


975
V112
I23
L1
T1


976
F137


977
S137


978
D131
A5
N1


979
M136
T1


980
E137


981
T125
I12


982
K137


983
I122
V14
L1


984
I135
V1
T1


985
T137


986
W136
G1


987
G136
R1


988
A135
G1
P1


989
D136
E1


990
T137
X1


991
A138
X1


992
A137
T1
E1


993
C139


994
G138
V1


995
D138
N1


996
I138
F1


997
I138
L1


998
L69
S55
Q10
A2
X1
M1
W1


999
G135
A2
C1
S1


1000
L137
C1
Y1


1001
P135
A2
R1
L1


1002
V136
A2
S1


1003
S139


1004
A135
G4


1005
R139


1006
R138
K1


1007
G138
R1


1008
R106
K32
N1


1009
E138


1010
I128
V9
L1


1011
L124
F9
H5


1012
L138
X1


1013
G139


1014
P139


1015
A139


1016
D139


1017
S128
G5
N3
R2
K1


1018
L82
F53
H2
I1
Y1


1019
E100
G24
D7
K5
V2
R1


1020
G106
E30
H2
W1


1021
Q127
R6
H4
K1
S1


1022
G139


1023
W138
L1


1024
R139


1025
L139
K2


1026
L141


1027
A517
G1
S1
X1


1028
P517
X1
A1
H1


1029
I519
X1


1030
T515
A2
R2
S1


1031
A520


1032
Y520


1033
S303
A211
C3
X1
T1
I1


1034
Q517
H2
R1


1035
Q518
H1
R1


1036
T520


1037
R519
L1


1038
G516
X2
C1
A1


1039
L515
M2
P1
I1
V1


1040
L476
F40
V3
I1


1041
G517
A1
R1
C1


1042
C519
S1


1043
I518
V2
X1


1044
I451
V70


1045
T521


1046
S521


1047
L521


1048
T521


1049
G520
R1


1050
R519
P1
Q1


1051
D521


1052
K508
R12
Q1


1053
N518
X2
K1


1054
Q516
X3
P1
R1


1055
V517
X1
T1
F1
A1


1056
E505
D15
G1


1057
G521


1058
E518
X1
K1
G1


1059
V519
I2


1060
Q520
X1


1061
V506
I11
M3
E1


1062
V506
L15


1063
S519
X2


1064
T519
I1
P1


1065
A520
X1


1066
T515
A5
K1


1067
Q517
H3
K1


1068
S513
T8


1069
F521


1070
L521


1071
A516
S4
G1


1072
T516
S4
N1


1073
C519
X1
R1


1074
V368
I115
L35
A2
T1


1075
N515
S3
G2
I1


1076
G521


1077
V485
A35
T1


1078
C519
F1
X1


1079
W520
R1


1080
T515
S6


1081
V517
I2
X2


1082
Y380
F141


1083
H520
X1


1084
G521


1085
A520
X1


1086
G520
C1


1087
S442
T46
A28
P4
C1


1088
K513
R8


1089
T511
A8
D1
I1


1090
L517
I4


1091
A516
T1
D1
X1
V1
G1


1092
G512
S6
A2
P1


1093
P461
Q33
I13
S6
A4
R1
L1
G1
K1


1094
K476
G38
X2
Q2
R1
M1
N1


1095
G520
X1


1096
P520
Q1


1097
I467
V53
T1


1098
T391
I102
V18
A7
S1
X1
N1


1099
Q521


1100
M519
K1
I1


1101
Y521


1102
T520
X1


1103
N520
X1
S1


1104
V522


1105
D521
N1


1106
Q498
L24
X1


1107
D522
G1


1108
L523


1109
V521
D1
L1


1110
G520
D2
X1


1111
W523


1112
Q320
P156
H25
L12
A5
M1
W1
S1
X1
R1


1113
A517
R2
S1
E1
K1
V1


1114
P517
A3
T1
L1
R1


1115
P410
S103
Q6
A2
H1
X1


1116
G522
R1


1117
A483
S36
X2
T1
M1


1118
R519
X2
H1


1119
S522


1120
L438
M82
I2


1121
T497
S20
I3
A2


1122
P519
X1
A1
S1


1123
C511
R7
X1
W1
Y1
G1


1124
T519
X1
N1
A1


1125
C522


1126
G521
V1


1127
S515
G4
X1
I1
A1


1128
S521
X1


1129
D521
V1


1130
L521
P1


1131
Y519
F2
H1


1132
L521
X1


1133
V520
I1
G1


1134
T520
S1
P1


1135
R520
X1
W1


1136
H516
Q2
Y2
X1
N1


1137
A521
T1


1138
D522


1139
V520
I2


1140
I513
V9


1141
P522


1142
V521
A1


1143
R495
H23
C4


1144
R521
X1


1145
R521
G1


1146
G517
S5


1147
D522


1148
S441
G39
N23
T16
X1
V1
C1


1149
R520
Q1
K1


1150
G521
X1


1151
S492
A28
G1
P1


1152
L520
X2


1153
L522


1154
S520
P2


1155
P520
X2


1156
R516
K4
X1
G1


1157
P520
H2


1158
V343
I178
L1


1159
S521
X1


1160
Y521
R1


1161
L522


1162
K521
X1


1163
G522


1164
S522


1165
S521
P1


1166
G522


1167
G522


1168
P521
T1


1169
L522


1170
L519
P2
X1


1171
C521
W1


1172
P520
X1
S1


1173
S415
L96
A9
F2


1174
G520
V1
R1


1175
H519
X1
R1
L1


1176
A346
V162
L8
I4
T2


1177
V513
A8
X1


1178
G520
S1
X1


1179
I508
V8
L5
N1


1180
F520
L2


1181
R520
P2


1182
A521
T1


1183
A518
P4


1184
V519
L1
X1
A1


1185
C521
R1


1186
T521
N1


1187
R514
T4
P3
G1
X1


1188
G521
R1
M1


1189
V523
X1


1190
A520
T2
X1
S1


1191
K520
M2
Q1
N1


1192
A520
T2
S1
X1


1193
V518
L4
M1
X1


1194
D515
E9


1195
F523
L1


1196
V347
I173
X1
G1
A1
L1


1197
P522
X1
S1


1198
V521
A3


1199
E523
D1


1200
S501
A8
C4
G3
F2
N1
X1
P1
Y1
T1
H1


1201
M517
L7


1202
E521
A1
X1
D1


1203
T524


1204
T521
X1
I1
S1


1205
M494
A18
T7
I4
L1


1206
R523
W1


1207
S362
P1
A1


1208
P134
T1
L1


1209
V138
I2


1210
F136
Y4


1211
T140


1212
D140


1213
N139
T1


1214
S138
P2


1215
S135
T5


1216
P140


1217
P140


1218
A139
G1


1219
V140


1220
P140
X2


1221
Q141
E1


1222
T124
A10
S8


1223
F139
Y3


1224
Q142


1225
V142


1226
A141
G1


1227
H142


1228
L141
I1


1229
H142


1230
A142


1231
P142


1232
T142


1233
G142


1234
S142


1235
G142


1236
K142


1237
S141
T1


1238
T140
A1
N1


1239
K133
R9


1240
V141
X1


1241
P141
X1


1242
A141
V1


1243
A141
E1


1244
Y141
X1


1245
A142


1246
A137
T3
S2


1247
Q142


1248
G142


1249
Y142


1250
K141
M1


1251
V142


1252
L140
R2


1253
V142


1254
L142


1255
N142


1256
P142


1257
S142


1258
V142


1259
A141
T1


1260
A142


1261
T133


1262
L133


1263
G122
S10
A1


1264
F133


1265
G133


1266
A131
T2


1267
Y133


1268
M131
T1
V1


1269
S133


1270
K133


1271
A133


1272
H111
Y22


1273
G133


1274
V72
I32
T28
M1


1275
D122
E10


1276
P132


1277
N120
S12


1278
I122
V5
L4
T1


1279
R131
S1


1280
T132


1281
G132


1282
V116
I8
A6
T2


1283
R132


1284
T129
A3


1285
I126
V5
T1


1286
T132
X1


1287
T132
X1


1288
G133


1289
A121
S6
G6


1290
P126
S6
G1


1291
I131
M1
V1


1292
T133


1293
Y133


1294
S133


1295
T133
X1


1296
Y134


1297
G132
C2


1298
K134


1299
F134


1300
L133
F1


1301
A134


1302
D133
X1


1303
G134


1304
G134


1305
C132
G2


1306
S134


1307
G134


1308
G134


1309
A134


1310
Y134


1311
D133
G1


1312
I134


1313
I133
V1


1314
I111
M23


1315
C133
X1


1316
D133


1317
E133


1318
C133


1319
H132
X1


1320
S132


1321
T131
I1


1322
D132


1323
S128
A2
W2


1324
T132


1325
T86
S46


1326
I128
V4


1327
L128
Y2
M2


1328
G131
S1


1329
I130
V2


1330
G132


1331
T131
A1


1332
V131
A1


1333
L132


1334
D132


1335
Q131
R1


1336
A132


1337
E131
K1


1338
T132


1339
A130
V2


1340
G132


1341
A132


1342
R132


1343
L131
F1


1344
V132


1345
V131
L1


1346
L132


1347
A130
S2


1348
T131
N1


1349
A131
R1


1350
T132


1351
P132


1352
P132


1353
G132


1354
S132


1355
V121
I10
T1


1356
T132


1357
V131
F1


1358
P132


1359
H132


1360
P127
S3
L2


1361
N130
S2


1362
I132


1363
E127
Q5


1364
E132


1365
V123
I5
A3
S1


1366
A126
G5
P1


1367
L131
Q1


1368
S129
P1
T1
A1


1369
N125
T4
S3


1370
T108
I17
V3
A2
S2


1371
G132


1372
E132


1373
I127
V5


1374
P133


1375
F133


1376
Y133


1377
G132
A1


1378
K131
R2


1379
A131
G1
T1


1380
I132
F1


1381
P133


1382
I112
L20
V1


1383
E124
D6
A2
G1


1384
T64
A34
V28
N3
I2
F1
L1


1385
I132
V1


1386
K127
R6


1387
G130
E3


1388
G133


1389
R133


1390
H133


1391
L133


1392
I133


1393
F133


1394
C132
W1


1395
H131
R1
P1


1396
S133


1397
K113
R20


1398
K132
E1


1399
K132
E1


1400
C132
Y1


1401
D133


1402
E132
D1


1403
L131
P1
V1


1404
A132
P1


1405
A117
T10
G3
S1
V1
E1


1406
K130
Q2
R1


1407
L133


1408
S120
T6
V4
L3


1409
G60
A47
S21
T3
D1
N1


1410
L133


1411
G133


1412
L83
I30
V19
M1


1413
N132
H1


1414
A133


1415
V132
I1


1416
A131
V1
S1


1417
Y133


1418
Y133


1419
R133


1420
G133


1421
L131
F2


1422
D133


1423
V133


1424
S133


1425
V127
I5
D1


1426
I133


1427
P133


1428
T115
A10
S6
V1
P1


1429
S128
N4
I1


1430
G133


1431
D120
N13


1432
V130
X1
D1
A1


1433
V130
I3


1434
V131
A1
I1


1435
V133


1436
A132
T1


1437
T132
S1


1438
D133


1439
A133


1440
L133


1441
M133


1442
T133


1443
G134


1444
F78
Y56


1445
T134


1446
G134


1447
D132
N2


1448
F132
L1
S1


1449
D133
X1
V1


1450
S135


1451
V133
W1
X1


1452
I134
T1


1453
D135


1454
C134
R1


1455
N135


1456
T131
V3
X1


1457
C134
X1


1458
V135


1459
T130
I3
N1
A1


1460
Q135


1461
T135


1462
V135


1463
D135


1464
F134
L1


1465
S135


1466
L134


1467
D134


1468
P134


1469
T134


1470
F134


1471
T134


1472
I133
M1


1473
E131
D3


1474
T133
X1


1475
T131
M1
A1


1476
T133


1477
V128
L3
M2


1478
P133


1479
Q132
H1


1480
D132
Y1


1481
A131
S2


1482
V133


1483
S132
A1


1484
R131
S1
A1


1485
S123
T8
A2


1486
Q133


1487
R131
L1
X1


1488
R133


1489
G133


1490
R132
K1


1491
T133


1492
G132
S1


1493
R132
S1


1494
G133


1495
R132
T1


1496
R41
G30
T25
A12
M9
P4
S4
Q3
I2
E2
X1


1497
G133


1498
I126
T5
M1
V1


1499
Y133


1500
R133


1501
F132
Y1


1502
V133


1503
T126
A4
I2
X1


1504
P132


1505
G131
X1


1506
E131


1507
R131


1508
P129
T2


1509
S129
L2


1510
G129
A2


1511
M131


1512
F130
L1


1513
D131


1514
S131


1515
S131


1516
V131


1517
L131


1518
C131


1519
E130
D1


1520
C129
Y1
S1


1521
Y131


1522
D131


1523
A130
S1


1524
G131


1525
C131


1526
A129
T1


1527
W130


1528
Y130
D1


1529
E130
D1


1530
L130
V1


1531
T130
A1


1532
P131


1533
A130
S1


1534
E131


1535
T130
A1


1536
S101
T30


1537
V129
I2


1538
R131


1539
L129
V2


1540
R131


1541
A131


1542
Y131


1543
L130
M1


1544
N130
X1


1545
T129
A1


1546
P130


1547
G130


1548
L129
F1


1549
P129
A1


1550
V129
F1


1551
C129
W1


1552
Q130


1553
D130


1554
H129
Y1


1555
L130


1556
E130


1557
F130


1558
W128
S2


1559
E130


1560
S73
G56
D1


1561
V130


1562
F129
S1


1563
T130


1564
G130


1565
L130


1566
T129
N1


1567
H128
K1
R1


1568
I130


1569
D129
E1


1570
A129
S1


1571
H130
X8


1572
F138


1573
L137
M1


1574
S138


1575
Q138


1576
T138


1577
K138


1578
Q138


1579
A138


1580
G137
R1


1581
D129
E9


1582
N135
D1
H1
S1


1583
F134
L4


1584
P138


1585
Y138


1586
L137
M1


1587
V133
I2
T2
A1


1588
A138


1589
Y138


1590
Q138


1591
A138


1592
T138


1593
V138


1594
C137
R1


1595
A138


1596
R137
K1


1597
A135
S2
X1


1598
Q133
K5


1599
A138


1600
P133
L3
R1
S1


1601
P138


1602
P138


1603
S138


1604
W138


1605
D138


1606
Q134
L2
E2


1607
M139


1608
W139


1609
K137
R2


1610
C138
S1


1611
L139


1612
I117
T20
V1
L1


1613
R138
W1


1614
L138
X1


1615
K138
R1


1616
P138
A1


1617
T135
V4


1618
L139


1619
H137
Q2


1620
G138
X1


1621
P137
X1


1622
T137


1623
P137


1624
L137


1625
L137


1626
Y136
C1


1627
R137


1628
L136
M1


1629
G137


1630
A131
G3
S1
T1
P1


1631
V135
A1
L1


1632
Q136
H1


1633
N135
H1
S1


1634
E129
D8


1635
V119
I17
T1


1636
T116
I13
V4
N2
X1
S1


1637
L130
T3
F2
P1


1638
T136


1639
H136


1640
P136


1641
I115
V19
M2


1642
T135
A1


1643
K136


1644
Y71
F61
L3


1645
I132
V3


1646
M132
T3


1647
A105
T30


1648
C134
S1


1649
M135


1650
S125
A9
X1


1651
A134


1652
D134


1653
L134


1654
E134


1655
V132
I2


1656
V102
A32


1657
T134


1658
S129
G1


1659
T130


1660
W130


1661
V129
X1


1662
L130


1663
V130


1664
G129
S1


1665
G130


1666
V130


1667
L129
P1


1668
A130


1669
A130


1670
L130


1671
A129
T1


1672
A130


1673
Y130


1674
C130


1675
L130


1676
T129
P1


1677
T128
A2


1678
G130


1679
S127
C3


1680
V130


1681
V130


1682
I130


1683
V130


1684
G130


1685
R130


1686
I124
V4
T1
L1


1687
I123
V5
T1
N1


1688
L130


1689
S130


1690
G130


1691
K68
R61
S1


1692
P130


1693
A129
V1


1694
V68
I62


1695
I79
V47
L4


1696
P130


1697
D129
H1


1698
R130


1699
E128
D1
G1


1700
V123
A5
L1
I1


1701
L130


1702
Y125
H5


1703
Q73
R57


1704
E101
Q27
A1
D1


1705
F130


1706
D130


1707
E130


1708
M130


1709
E130


1710
E128
A1
D1


1711
C129
G1


1712
A128
G1
S1


1713
S94
T34
A1
P1


1714
H127
Q3


1715
L130


1716
P130


1717
Y130


1718
I128
F2


1719
E130


1720
Q130


1721
G130


1722
M126
I3
V1


1723
Q124
H5
L1


1724
L130


1725
A130


1726
E128
A1


1727
Q129
E1


1728
F130


1729
K128
R2


1730
Q130


1731
K129
R1


1732
A130


1733
L125
I4
F1


1734
G130


1735
L130


1736
L128
P1
X1


1737
Q130


1738
T122
I4
V2
M1
A1


1739
A130


1740
T128
S2


1741
K128
T1
N1


1742
Q130


1743
A130


1744
E122
X8


1745
A118
V4


1746
A121
V1


1747
A118
V3
X1


1748
P122


1749
V117
A3
I1
M1


1750
V121
M1


1751
E122


1752
S121
T1


1753
K118
R4


1754
W122


1755
R112
Q8
K2


1756
A104
T16
G2


1757
L122


1758
E122


1759
T44
A41
S31
V4
G2


1760
F122


1761
W122


1762
A120
E1
G1


1763
K120
N2


1764
H120
D2


1765
M120
X2


1766
W119
R1


1767
N119
S1


1768
F120


1769
I120


1770
S120


1771
G120


1772
I114
V6


1773
Q120


1774
Y120


1775
L120


1776
A120


1777
G119
A1


1778
L120


1779
S120


1780
T120


1781
L120


1782
P119
H1


1783
G120


1784
N120


1785
P119
L1


1786
A120


1787
I119
M1


1788
A118
R2


1789
S120


1790
L118
P2


1791
M120


1792
A118
E1
V1


1793
F120


1794
T120


1795
A119
S1


1796
S119
A1


1797
I108
V12


1798
T120


1799
S119
X1


1800
P119


1801
L116
F3


1802
T117
S1
A1


1803
T118
I1


1804
Q117
N2


1805
H57
S30
N22
Y7
Q2
T1


1806
T119


1807
L118
F1


1808
L117
M2


1809
F118
L1


1810
N118
D1


1811
I119


1812
L118
W1


1813
G119


1814
G119


1815
W119


1816
V118
L1


1817
A119


1818
A119


1819
Q118
X1


1820
L117
I1
P1


1821
A118
P1


1822
P119


1823
P118
A1


1824
S111
G4
R3
N1


1825
A119


1826
A117
S1
V1


1827
S118
T1


1828
A119


1829
F119


1830
V119


1831
G119


1832
A119


1833
G119


1834
I117
S1
V1


1835
A112
V5
T2


1836
G117
X1
R1


1837
A118
X1


1838
A119


1839
V109
I10


1840
G119


1841
S116
T3


1842
I116
V3


1843
G119


1844
L117
V1
F1


1845
G119


1846
K116
R3


1847
V119


1848
L117
I2


1849
V119


1850
D118
E1


1851
I115
V3
M1


1852
L118
V1


1853
A119


1854
G119


1855
Y119


1856
G119


1857
A119


1858
G119


1859
V118
X1


1860
A119


1861
G119


1862
A119


1863
L119


1864
V119


1865
A118
D1


1866
F119


1867
K119


1868
V95
I24


1869
M119


1870
S119


1871
G119


1872
E104
D15


1873
M82
A17
V12
T4
L2
R1
I1


1874
P119


1875
S118
A1


1876
T103
A14
P1
S1


1877
E119


1878
D118
E1


1879
L109
M8
R1
I1


1880
V113
I5
D1


1881
N119


1882
L118
M1


1883
L118
I1


1884
P117
H1
L1


1885
A119


1886
I118
V1


1887
L118
F1


1888
S119


1889
P119


1890
G119


1891
A119


1892
L119


1893
V119


1894
V119


1895
G119


1896
V114
I5


1897
V119


1898
C119


1899
A119


1900
A119


1901
I118
V1


1902
L118
Q1


1903
R119


1904
R118
G1


1905
H119


1906
V119


1907
G118
D1


1908
P118
X1


1909
G118


1910
E118


1911
G118


1912
A118


1913
V118


1914
Q117
H1


1915
W118


1916
M117
V1


1917
N118


1918
R118


1919
L118


1920
I118


1921
A118


1922
F119


1923
A119


1924
S118
F1


1925
R118
A1


1926
G119


1927
N119


1928
H119


1929
V118
D1


1930
S119


1931
P119


1932
T116
R2
A1


1933
H119


1934
Y119


1935
V119


1936
P119


1937
E119


1938
S118
N1


1939
D117
E2


1940
A117
P2


1941
A119


1942
A105
V11
S1
G1
Q1


1943
R118
P1


1944
V119


1945
T119


1946
Q119


1947
I115
V4


1948
L119


1949
S119


1950
S104
N11
G4


1951
L119


1952
T119


1953
I117
V2


1954
T119


1955
Q119


1956
L119


1957
L119


1958
K111
R8


1959
R118
K1


1960
L119


1961
H119


1962
Q117
R1
H1


1963
W119


1964
I119


1965
N118
S1


1966
E119


1967
D119


1968
C119


1969
S119


1970
T119
X1


1971
P119
M4


1972
C119
G4
M1


1973
S236
A1


1974
G233
S3
D1


1975
S236
T1


1976
W237


1977
L237


1978
R228
K9


1979
D234
E3


1980
V222
I15


1981
W237


1982
D235
E2


1983
W237


1984
I234
V3


1985
D237


1986
T229
S2
A2
M2
I1
X1


1987
V235
A1
M1


1988
L236
V1


1989
T171
S33
A28
V2
I2
N1


1990
D237


1991
F235
L2


1992
K236
R1


1993
T234
N1
S1
A1


1994
W236
R1


1995
L236
I1


1996
Q230
K5
R2


1997
S231
T6


1998
K234
R3


1999
L233
V3
I1


2000
L227
M9
V1


2001
P236
X1


2002
R222
Q10
K4
L1


2003
L228
M6
X1
I1
F1


2004
P237


2005
G237


2006
V215
I15
L5
A2


2007
P237


2008
F236
X1


2009
F115
L109
I8
Y4
M1


2010
S235
X2


2011
C236
W1


2012
Q237


2013
R237


2014
G237


2015
Y236
F1


2016
K190
R47


2017
G237


2018
V200
I36
A1


2019
W237


2020
R233
Q3
L1


2021
G234
E2
S1


2022
D235
E2


2023
G237


2024
I214
V23


2025
M236
V1


2026
Q154
H70
Y8
L2
N1
X1
C1


2027
T235
A2


2028
T226
I8
V1
A1
N1


2029
C237


2030
P225
S5
Q3
L3
A1


2031
C237


2032
G234
X2
A1


2033
A233
G3


2034
Q231
R2
H1
D1
E1


2035
I233
L2
M1


2036
T192
A32
S12


2037
G236


2038
H236


2039
V235
F1


2040
K234
R1
T1


2041
N234
T2


2042
G236


2043
S236


2044
M236


2045
R226
K10


2046
I231
X3
L1
V1


2047
V215
A8
T6
F2
I2
X1
Y1
W1


2048
G236


2049
P233
S2
X1


2050
K147
R89


2051
T227
A8
S1


2052
C236


2053
S236


2054
N235
S1


2055
T216
M17
X1
V1
A1


2056
W233
C3


2057
H215
Y8
L3
C3
R2
Q2
S1
N1
D1


2058
G236


2059
T231
A3
X1
S1


2060
F236


2061
P234
S2


2062
I231
V4
T1


2063
N236


2064
A229
T5
V1
G1


2065
Y224
H11
C1


2066
T236


2067
T236


2068
G235
D1


2069
P226
S8
R1
H1


2070
C228
S7
W1


2071
T227
S4
V3
M1
A1


2072
P234
X1
A1


2073
S220
T13
A3


2074
P236


2075
A235
T1


2076
P232
X3
S1


2077
N232
S2
X1
D1


2078
Y236


2079
S229
T5
F2


2080
R201
K33
N1
T1


2081
A234
G2


2082
L236


2083
W235
L1


2084
R235
W1


2085
V234
M1
R1


2086
A224
T7
S3
V2


2087
A233
S1
P1
F1


2088
E235
D1


2089
E234
G1
D1


2090
Y236


2091
V233
L3


2092
E236


2093
V226
I10


2094
T229
R3
K2
A1
V1


2095
R230
Q6


2096
V231
M3
L2


2097
G236


2098
D232
E4


2099
F231
S3
C1
Y1


2100
H236


2101
Y235
H1


2102
V231
I5


2103
T236


2104
G233
D2
S1


2105
M224
V7
I4
L1


2106
T236


2107
T224
A7
N2
V1
I1
S1


2108
D236


2109
N235
D1


2110
V182
I44
L10


2111
K230
R3
X2
E1


2112
C236


2113
P236


2114
C235
X1


2115
Q236


2116
V235
E1


2117
P236


2118
A230
T2
P2
S2


2119
P234
X1
L1


2120
E236
D1


2121
F236
X1


2122
F236


2123
T234
K1
S1


2124
E234
X1
A1


2125
V187
L49


2126
D236


2127
G236


2128
V234
G1
I1


2129
R232
Q4


2130
L236


2131
H236


2132
R235
X1
T1


2133
Y234
F2
N1


2134
A237


2135
P237


2136
A189
P29
V16
E1
X1
T1


2137
C234
S2
G1


2138
K209
R26
X1
G1


2139
P233
X2
A1
T1


2140
L235
X1
V1


2141
L238
P1
X1
I1


2142
R239
P1
H1


2143
E147
D86
V3
X2
Q1
K1
T1


2144
E231
D9
G1


2145
V239
T1
A1


2146
T198
V19
S15
D4
A3
M1
K1


2147
F238
Y2
L1


2148
Q213
L17
M9
R1
T1


2149
V298


2150
G297
R1


2151
L294
C4


2152
N278
H20


2153
Q292
R2
X1
E1
K1
H1


2154
Y288
F8
X2


2155
L201
P56
V29
A5
T5
C1
I1


2156
V295
I2
X1


2157
G296
X1
E1


2158
S297
A1


2159
Q297
P1


2160
L298


2161
P298


2162
C297
X1


2163
E298


2164
P298


2165
E298


2166
P296
L2


2167
D294
N2
X1
G1


2168
V294
A2
X1
T1


2169
A156
T134
V4
S2
X1
E1


2170
V297
T1


2171
L274
V16
I4
X2
F2


2172
T293
A4
X1


2173
S297
A1


2174
M296
L2


2175
L293
F3
R1
H1


2176
T294
S2
A1
I1


2177
D298


2178
P298


2179
S292
P4
A1
T1


2180
H298


2181
I298


2182
T298


2183
A296
X1
G1


2184
E298


2185
T238
A57
M1
X1
V1


2186
A298


2187
K249
R31
G17
A1


2188
R297
X1


2189
R295
K2
M1


2190
L294
P3
V1


2191
A286
D9
P1
E1
X1


2192
R297
K1


2193
G298


2194
S294
X2
C1
P1


2195
P295
S2
G1


2196
P297
G1


2197
S297
X1


2198
L295
M2
X1


2199
A297
G1


2200
S296
R1
G1


2201
S297
X1


2202
S297
X1


2203
A298


2204
S296
X1
T1


2205
Q296
X2


2206
L296
X1
P1


2207
S297
X1


2208
A298


2209
P283
L12
V1
A1
F1


2210
S296
X1
P1


2211
L296
X1
S1


2212
K288
R9
W1


2213
A295
V3


2214
T288
A7
K2
S1


2215
C297
X1


2216
T295
P2
I1


2217
T279
A11
I4
Y2
S1
X1


2218
R146
H128
C11
Q4
L3
N3
X1
Y1
G1


2219
H283
N5
R5
L2
X1
P1
Q1


2220
D283
G8
N2
V2
X1
H1
T1


2221
S285
A6
P3
C1
H1
T1
F1


2222
P296
X1
T1


2223
D296
N1
E1


2224
A270
V17
T6
L2
X1
I1
P1


2225
D295
E2
G1


2226
L297
X1


2227
I287
V8
L2
X1


2228
E293
D5


2229
A298


2230
N293
X2
S2
H1


2231
L295
X2
F1


2232
L294
M4


2233
W298


2234
R291
W4
K1
L1
H1


2235
Q296
R1
H1


2236
E293
X2
A1
G1
V1


2237
M295
L2
K1


2238
G296
X2


2239
G292
S3
D2
X1


2240
N287
E3
D3
S3
K1
I1


2241
I296
V1
H1


2242
T295
S2
N1


2243
R298


2244
V298


2245
E298


2246
S297
P1


2247
E294
X3
D1


2248
N293
T2
X2
S1


2249
K296
E1
R1


2250
V288
I9
T1


2251
V284
I13
X1


2252
I281
V17


2253
L295
M2
X1


2254
D298


2255
S297
X1


2256
F295
X2
L1


2257
D218
E74
V4
G1
C1


2258
P296
Q1
S1


2259
L291
I4
V1
P1
C1


2260
R275
Q13
H6
V2
G1
Y1


2261
A292
V4
T2


2262
E294
A1
D1
G1
V1


2263
E290
K2
X2
G2
D2
R1


2264
D284
E8
G5
S1
N1


2265
E279
D9
V5
G4
Q2


2266
R271
G19
K6
W2
N1


2267
E299


2268
V218
I62
M8
P8
E1
R1
L1


2269
S299


2270
V282
I13
L2
X1
A1


2271
P218
A64
E16
T1


2272
A296
S2
E1


2273
E296
X2
D1


2274
I295
V3
X1


2275
L298
P1


2276
R288
L7
Q2
W2


2277
K230
R68
T1


2278
S230
T60
P8
N1


2279
R244
K53
G2


2280
K262
R25
N3
E3
X1
D1
A1
V1
S1
G1


2281
F298
Y1


2282
P296
X2
T1


2283
P169
R70
S27
A16
L8
Q6
T2
M1


2284
A298
X1


2285
M186
L92
I18
V3


2286
P298
L1


2287
I228
V70
E1


2288
W299


2289
A297
T1
X1


2290
R295
P2
H1
Q1


2291
P298
A1


2292
D294
E3
G1
N1


2293
Y298
H1


2294
N298
Y1


2295
P299


2296
P299


2297
L297.
I1
V1


2298
L224
I40
V34
M1


2299
E295
Q4


2300
S263
P28
T4
A3
X1


2301
W299


2302
K286
R12
E1


2303
D255
N24
A6
K5
S5
R2
G2


2304
P299


2305
D285
A5
E4
N4
G1


2306
Y298
S1


2307
V276
I8
A5
T4
D2
E1
S1
N1
P1


2308
P298
L1


2309
P297
X2


2310
V292
A5
G1
L1


2311
V298
X1


2312
H296
R1
Y1
L1


2313
G298


2314
C298


2315
P297
A1


2316
L297
F1


2317
P297
S1


2318
P291
S7


2319
T209
A63
V13
I8
P2
N1
S1
D1


2320
K265
R18
T8
E3
G3


2321
A255
T20
G10
V7
N1
I1
E1
S1
D1


2322
P292
A3
H1
L1


2323
P297


2324
I266
V28
L2
T1


2325
P295
L1
S1


2326
P297


2327
P296
Q1


2328
R296
X1


2329
R274
K23


2330
K294
R3


2331
R283
K14


2332
T293
A2
V1


2333
V283
I9
X1
F1


2334
V282
I10
A1


2335
L293


2336
T282
S11


2337
E288
D4
G1


2338
S293


2339
T278
N8
S6
A1


2340
V288
L3
M2


2341
S291
P1
A1


2342
S291
T1
A1


2343
A288
V4
S1


2344
L293


2345
A293


2346
E291
G1
D1


2347
L291
F2


2348
A290
X1
P1
G1


2349
T272
A10
V5
K3
I3


2350
K287
R2
E1
T1
S1
Q1


2351
T287
A5
S1


2352
F293


2353
G259
S31
D2
C1


2354
S264
G20
N7
D2


2355
S290
P2
F1


2356
E152
G128
K5
D3
T1
R1
X1
M1
V1


2357
S282
P4
T3
L3
A1


2358
S278
P6
A3
T2
L2
R1
E1


2359
A270
G9
I5
S3
T3
V2
C1


2360
V222
A52
I10
G4
X1
D1
S1
P1
T1


2361
D287
G3
A2
N1


2362
S285
G4
N3
R1


2363
G287
S5
R1


2364
T275
A6
V6
M5
I1


2365
A283
V5
S2
E1
T1
M1


2366
T264
S20
A4
I4
P1


2367
A274
G14
T3
D1
I1


2368
P273
S13
L5
Y1
C1


2369
P284
H4
L4
S1


2370
D265
G20
N3
E3
T2


2371
Q282
L4
R2
E2
H2
G1


2372
P162
A73
S23
T19
L12
V3
I1


2373
S257
P19
L8
F6
A1
Y1
T1


2374
D242
N19
G14
E6
S5
A5
K2


2375
D210
N48
G10
E9
S6
A5
C3
V1
Y1


2376
G278
D12
E1
S1
V1


2377
D245
G43
N2
A1
H1
E1


2378
A124
T109
K44
R5
S4
P1
M1
G1
V1
L1
N1
Q1


2379
G255
E28
R4
S2
C1
D1
A1
K1


2380
S290
T3


2381
D293


2382
V229
A58
I4
G1
T1


2383
E245
G48


2384
S291
A1
E1


2385
Y276
C8
H6
F2
A1


2386
S292
P1


2387
S293


2388
M291
T2


2389
P293


2390
P293


2391
L293


2392
E293


2393
G293


2394
E293


2395
P292
Q1


2396
G293


2397
D292
T1


2398
P292
L1


2399
D293


2400
L289
F4


2401
S291
N2


2402
D292
E1


2403
G236


2404
S236


2405
W234
G2


2406
S236
X1


2407
T236
S1


2408
V229
M8


2409
S235
G1
N1


2410
E226
G10
D1


2411
E234
X10
G1
Q1


2412
A232
D5
V4
G3
P2


2413
S168
G70
T2
N2
E1
D1
A1
R1


2414
E236
D5
Q4
G1


2415
D238
S2
V2
N2
G2


2416
V241
I5
A1
X1


2417
V245
I3


2418
C248


2419
C244
W3
X1


2420
S170
P2
X1


2421
M171
L1


2422
S172


2423
Y166
H1
S1


2424
T155
S10
A1
P1


2425
W167


2426
T166
A1


2427
G167


2428
A167


2429
L166
M1


2430
I164
V2
N1


2431
T166
P1


2432
P167


2433
C167


2434
A135
S31
G1


2435
A167


2436
E167


2437
E165
G2


2438
S159
T5
N3


2439
K165
Q1
E1


2440
L167


2441
P167


2442
I167


2443
N166
I1


2444
A128
P38
S1


2445
L167


2446
S165
I1
N1


2447
N166
Y1


2448
S162
P4
T1


2449
L166
Q1


2450
L167


2451
R167


2452
H163
N3
R1


2453
H166
R1


2454
N164
S3


2455
M159
L8


2456
V162
I5


2457
Y167


2458
A157
S8
T1
V1


2459
T166
A1


2460
T167


2461
S167


2462
R167


2463
S166
I1


2464
A167


2465
S136
G26
C4
V1


2466
Q107
L60


2467
R167


2468
Q167


2469
K162
R5


2470
K167


2471
V167


2472
T166
A1


2473
F165
I2


2474
D167


2475
R167


2476
L153
M9
Q5


2477
Q167


2478
V167


2479
L166
Q1


2480
D166
E1


2481
D162
N3
S1
K1


2482
H167


2483
Y167


2484
R159
Q7
W1


2485
D166
G1


2486
V167


2487
L167


2488
K167


2489
E164
D2
G1


2490
M163
V3
I1


2491
K166
N1


2492
A164
V3


2493
K166
E1


2494
A167


2495
S167


2496
T166
K1


2497
V167


2498
K163
R2
N1


2499
A166


2500
K144
R22


2501
L165
F1


2502
L166


2503
S163
P2
T1


2504
V124
I41
L1


2505
E166


2506
E165
X1


2507
A165
P1


2508
C166


2509
K136
M22
R6
Q1
N1


2510
L165
W1


2511
T166


2512
P166


2513
P166


2514
H164
Q1
L1


2515
S166


2516
A166


2517
K108
R57
S1


2518
S165
L1


2519
K163
Q3


2520
F156
Y8
V1
S1


2521
G166


2522
Y166
X1


2523
G167


2524
A166
G1


2525
K166
S1


2526
D167


2527
V167


2528
R167


2529
N136
S31


2530
L167


2531
S164
T3


2532
S148
G13
N3
R3


2533
K132
R34
G1


2534
A167


2535
V121
I37
T8
L1


2536
N152
D5
K3
S3
R2
T2


2537
H167


2538
I167


2539
R117
H39
N6
S2
L2
T1


2540
S167


2541
V167


2542
W167


2543
K134
E33


2544
D167


2545
L166
S1


2546
L165
Q2


2547
E166
D1


2548
D167


2549
T147
S9
N8
P2
D1


2550
E133
V12
D12
Q6
A2
K1
I1


2551
T167


2552
P167


2553
I161
L5
F1


2554
D138
N17
Q4
S3
E2
T1
I1
P1


2555
T167


2556
T166
V1


2557
I151
V16


2558
M167


2559
A167


2560
K167


2561
N144
S23


2562
E167


2563
V162
I5


2564
F166
I1


2565
C164
V2
D1


2566
V161
I5
L1


2567
Q164
D1
K1
E1


2568
P167


2569
E165
T1
X1


2570
K165
M1
R1


2571
G167


2572
G167


2573
R167


2574
K167


2575
P160
A5
S2


2576
A167


2577
R167


2578
L163
F4


2579
I167


2580
V167


2581
F164
Y3


2582
P167


2583
D167


2584
L167


2585
G167


2586
V167


2587
R167


2588
V166
I1


2589
C167


2590
E167


2591
K167


2592
M167


2593
A167


2594
L167


2595
Y167


2596
D166
N1


2597
V166
X1


2598
V167


2599
S167


2600
T163
N3
I1


2601
L167


2602
P167


2603
Q159
H6
K1
R1


2604
A163
V2
P1
T1


2605
V167


2606
M167


2607
G167


2608
S150
P13
A4


2609
S161
A5
L1


2610
Y166
F1


2611
G167


2612
F166
C1


2613
Q167


2614
Y167


2615
S167


2616
P165
L1
S1


2617
G143
K21
A2
S1


2618
Q166
H1


2619
R167


2620
V167


2621
E163
D3
Q1


2622
F167


2623
L167


2624
V166
L1


2625
N155
K11
D1


2626
A129
T38


2627
W167


2628
K166
S1


2629
S131
K29
A5
Q2


2630
K167


2631
K158
R7
N1
E1


2632
N66
C57
S34
T6
V2
A1
K1


2633
P166
A1


2634
M167


2635
G165
A1
X1


2636
F165
X1
S1


2637
A96
S71


2638
Y167


2639
D167


2640
T166
A1


2641
R167


2642
C166
R1


2643
F167


2644
D166
E1


2645
S167
X1


2646
T168


2647
V168


2648
T168
X2


2649
E169
Q1


2650
N89
S80
H1


2651
D170


2652
I170


2653
R170


2654
V140
T24
I5
F1


2655
E169
G1


2656
E170


2657
S169
P1


2658
I170


2659
Y170


2660
Q170


2661
C156
S14


2662
C170
X13


2663
D182
E1


2664
L183


2665
A174
V3
D3
T1
S1
G1


2666
P183


2667
E180
D3
X1


2668
A184


2669
R164
K19
S1


2670
Q168
L10
R3
K2
T1


2671
A165
V19


2672
I184


2673
R139
K45


2674
S184


2675
L184


2676
T184


2677
E184


2678
R184


2679
L184


2680
Y184


2681
I162
V22


2682
G184


2683
G184


2684
P184


2685
L177
M7


2686
T183
I1


2687
N184


2688
S184


2689
K184


2690
G184


2691
Q184


2692
N173
S8
D2
H1


2693
C184


2694
G184


2695
Y184


2696
R184


2697
R184


2698
C184


2699
R184


2700
A181
V2
X1


2701
S183
T1


2702
G183
V1


2703
V184


2704
L184


2705
T184
R1


2706
T184
R1


2707
S184
R1


2708
C184
X1


2709
G183
S1
A1


2710
N184
X1


2711
T184
X1


2712
L182
I2
X1


2713
T185


2714
C185


2715
Y182
F2
H1


2716
L185


2717
K185


2718
A185


2719
S149
T30
A6


2720
A185


2721
A185


2722
C185


2723
R185


2724
A183
D1
V1


2725
A184
G1


2726
K183
G1
X1


2727
L184
X1


2728
Q165
R20


2729
D183
G1
S1


2730
C184
Y1


2731
T185


2732
M182
L2
T1


2733
L185


2734
V185


2735
C119
N65
H1


2736
G185


2737
D185


2738
D185


2739
L184
X1


2740
V182
I2
X1


2741
V183
X1
I1


2742
I184
X1


2743
C184
X1


2744
E179
D5
X1


2745
S183
X1
C1


2746
A181
E2
T1
X1


2747
G184


2748
T181
A1
V1
I1


2749
Q181
E2
X1


2750
E184


2751
D184


2752
A169
E14
P1


2753
A183
E1


2754
S151
N26
A4
R2
C1


2755
L184


2756
R184


2757
V163
A20
F1


2758
F184


2759
T183
A1


2760
E184


2761
A184


2762
M184


2763
T184


2764
R184


2765
Y176
N7
X1


2766
S182
X1


2767
A182
G1


2768
P176
H7


2769
P175
G5
A2
X1


2770
G174
X4
H3
A1
S1


2771
D174
X5
G1
K1
E1
S1


2772
P162
L18
C2
S1


2773
P179
V1
L1


2774
Q117
K47
R16
X2


2775
P181


2776
E176
A3
V1
T1


2777
Y181


2778
D181


2779
L179
Q1
K1


2780
E181


2781
L176
S2
X1
R1
M1


2782
I181


2783
T179
I1
S1


2784
S180
A1


2785
C180
R1


2786
S173
X8


2787
S172
P1


2788
N173


2789
V173


2790
S173


2791
V173


2792
A171
X2


2793
H172
Y1


2794
D173


2795
A171
V2


2796
S166
A3
L2
T1
N1


2797
G171
N2


2798
K171
R2


2799
R172
X1


2800
V173


2801
Y173


2802
Y173


2803
L173


2804
T173


2805
R173


2806
D171
N2


2807
P173


2808
T161
A5
I4
S2
X1


2809
T166
I5
N1
S1


2810
P172
X1


2811
L155
I14
F4


2812
A169
S2
G2


2813
R173


2814
A171
X1
T1


2815
A173


2816
W173


2817
E172
A1


2818
T172
A1


2819
A167
V4
S1
X1


2820
R167
K6


2821
H168
S5


2822
T172
I1


2823
P171
L1
S1


2824
V167
I4
T2


2825
N173


2826
S172
T1


2827
W173


2828
L173


2829
G173


2830
N173


2831
I172
V1


2832
I172
M1


2833
M173


2834
Y172
F1


2835
A172
G1


2836
P173


2837
T172
A1


2838
L172
I1


2839
W173


2840
A162
V11


2841
R173


2842
M173


2843
I156
V17


2844
L168
M3
I2


2845
M171
L2


2846
T170
P2
I1


2847
H173


2848
F171
I2


2849
F172
S1


2850
S173


2851
I171
V1
N1


2852
L172
X1


2853
L171
M1
I1


2854
A167
V3
X1
P1
F1


2855
Q172
R1


2856
E173


2857
Q172
Y1


2858
L173


2859
E161
G8
D4


2860
K170
R2
Q1


2861
A165
T6
V2


2862
L170
X2
Q1


2863
D166
E5
X2


2864
C172
F1


2865
Q172
H1


2866
I173


2867
Y173


2868
G171
R1
E1


2869
A172
T1


2870
C116
T38
Y8
I5
X2
S2
H1
V1


2871
Y170
H2


2872
S172


2873
I170
V1


2874
E155
Q3
G1


2875
P159


2876
L159


2877
D159


2878
L158
I1


2879
P159


2880
Q156
P2
L1


2881
I158
V1


2882
I159


2883
Q99
E59
G1


2884
R159


2885
L159


2886
H159


2887
G158
S1


2888
L159


2889
S158
G1


2890
A159


2891
F159


2892
S159


2893
L159


2894
H159


2895
S156
N1
I1
T1


2896
Y159


2897
S158
P1


2898
P159


2899
G158
R1


2900
E159


2901
I159


2902
N159


2903
R159


2904
V159


2905
A159


2906
S121
A35
T2
X1


2907
C158
S1


2908
L159


2909
R159


2910
K158
Q1


2911
L159


2912
G159


2913
V159


2914
P159


2915
P159


2916
L159


2917
R159


2918
V146
A10
T2
I1


2919
W159


2920
R158
I1


2921
H158
L1


2922
R159


2923
A159


2924
R158
K1


2925
S156
N1
G1
R1


2926
V159


2927
R159


2928
A159


2929
K148
R11


2930
L159


2931
L159


2932
S159


2933
Q152
R6
P1


2934
G158
R1


2935
G159


2936
R157
K1
E1


2937
A157
Y1
G1


2938
A158
S1


2939
T124
N26
I9


2940
C159


2941
G158
X1


2942
K138
R19
X1
T1


2943
Y159


2944
L159


2945
F159


2946
N159


2947
W159


2948
A159


2949
V159


2950
R117
K42


2951
T159


2952
K159


2953
L159


2954
K159


2955
L159


2956
T159


2957
P159


2958
I156
N3


2959
P159


2960
A153
E4
G2


2961
A159


2962
S157
Y1
F1


2963
Q110
R45
H2
K2


2964
L159


2965
D159


2966
L159


2967
S159


2968
G93
S51
N13
D1
K1


2969
W159


2970
F159


2971
V149
I6
T4


2972
A159


2973
G159


2974
Y159


2975
S143
G10
N6


2976
G159


2977
G159


2978
D159


2979
I155
V3
T1


2980
Y159


2981
H158
R1


2982
S159


2983
L133
V24
P2


2984
S157
P2


2985
R147
H10
P1
C1


2986
A155
X2
T2


2987
R157


2988
P157


2989
R157


2990
W157


2991
F149
L7
P1


2992
M153
L4


2993
L84
W71
F2


2994
C157


2995
L157


2996
L155
F1
P1


2997
L157


2998
L157


2999
S154
T1
F1
X1


3000
V157


3001
G157


3002
V157


3003
G157


3004
I152
V5


3005
Y156
N1


3006
L156
C1


3007
L156
X1


3008
P156


3009
N154
A1
K1


3010
R156








Claims
  • 1. An isolated nucleic acid encoding an HCV polyprotein or a fragment thereof wherein the HCV polyprotein comprises the consensus sequence 1a (SEQ ID NO: 1).
  • 2. The nucleic acid of claim 1 wherein the encoded HCV polyprotein or fragment thereof comprises one or more of the non-synonymous changes shown in Table 5.
  • 3. An isolated nucleic acid encoding an HCV polyprotein or a fragment thereof wherein the HCV polyprotein comprises the consensus sequence 1b (SEQ ID NO: 2).
  • 4. The nucleic acid of claim 3 wherein the encoded HCV polyprotein or fragment thereof comprises one or more of the non-synonymous changes shown in Table 6.
  • 5. An isolated HCV protein having the amino acid sequence comprising the consensus sequence 1a (SEQ ID NO:1) or a fragment thereof.
  • 6. An isolated HCV protein having the amino acid sequence comprising the consensus sequence 1b (SEQ ID NO:2) or a fragment thereof.
  • 7. An isolated HCV 1a core protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 8. An isolated HCV 1a E1 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 9. An isolated HCV 1a E2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 10. An isolated HCV 1a p7 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 11. An isolated HCV 1a NS2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 12. An isolated HCV 1a NS3 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 13. An isolated HCV 1a NS4a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 14. An isolated HCV 1a NS4b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 15. An isolated HCV 1a NS5a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 16. An isolated HCV 1a NS5b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 5.
  • 17. An isolated HCV 1b core protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 18. An isolated HCV 1b E1 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 19. An isolated HCV 1b E2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 20. An isolated HCV 1b p7 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 21. An isolated HCV 1b NS2 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 22. An isolated HCV 1b NS3 protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 23. An isolated HCV 1b NS4a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 24. An isolated HCV 1b NS4b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 25. An isolated HCV 1b NS5a protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 26. An isolated HCV 1b NS5b protein sequence or fragments thereof comprising the consensus sequence or non-synonymous changes as shown in Table 6.
  • 27. An isolated consensus protein which comprises at least 5 contiguous amino acids of a hepatitis C 1a virus.
  • 28. An isolated expression construct comprising the following operably linked elements: a. transcription promoter andb. a nucleic acid encoding the consensus protein of claim 27 or a fragment thereof.
  • 29. An isolated consensus protein or fragment thereof from Hepatitis C 1a virus which is immunogenic.
  • 30. An isolated consensus protein which comprises at least 5 contiguous amino acids of a hepatitis C 1b virus.
  • 31. An isolated expression construct comprising the following operably linked elements: a. transcription promoterb. a nucleic acid encoding the consensus protein of claim 30 or a fragment thereof.
  • 32. An isolated consensus protein or fragment thereof from Hepatitis C 1b virus which is immunogenic.
  • 33. An isolated nucleic acid encoding an HCV epitope capable of eliciting an immunogenic response in an individual wherein the sequence of the epitope is selected from any of the epitopes listed in Table 7.
  • 34. An isolated HCV epitope capable of eliciting an immunogenic response in an individual wherein the sequence of the epitope is selected from any of the epitopes listed in Table 7.
  • 35. A composition comprising at least one HCV protein or a fragment thereof encoded by the polynucleotide of claim 1 or claim 2.
  • 36. A composition comprising at least one HCV protein or a fragment thereof encoded by the polynucleotide of claim 3 or claim 4.
  • 37. A composition comprising at least one HCV protein or a fragment thereof as recited in claims 7-26.
  • 38. A composition comprising at least one nucleic acid sequence as recited in claim 1 or claim 2.
  • 39. A composition comprising at least one nucleic acid sequence as recited in claim 3 or claim 4.
  • 40. A composition comprising at least one nucleic acid sequence which codes for an HCV protein or a fragment thereof as recited in claims 7-26.
  • 41. A vaccine comprising all or a portion of consensus sequence 1a (SEQ ID NO:1).
  • 42. The vaccine of claim 41 wherein there is a non-synonymous change at a modal consensus sequence.
  • 43. A vaccine comprising all or a portion of consensus sequence 1b (SEQ ID NO:2).
  • 44. The vaccine of claim 43 wherein there is a non-synonymous change at a modal consensus sequence.
  • 45. A method of identifying an immunogen for use as a vaccine comprising: a) Obtaining a sequence of HCV that is derived from a subject and is longer than 500 nucleotides;b) Obtaining the primary open reading frame of the sequence;c) Removing sequences that contain more than 1% ambiguous sites or more than 1 frameshift;d) Converting terminal “gap” characters to “missing”;e) Removing sequences that are redundant by identifying identical sequences and checking related publications and removing linked sequences;f) Generating predicted polyprotein sequences by using standard eukaryotic genetic code; andg) Identifying majority-rule consensus sequence for each subtype to identify modal amino acid residue at each site.
  • 46. A method of inducing or augmenting an immune response against hepatitis C virus comprising administering an effective amount of a vaccine according to any one of the vaccines recited in claims 41 to 44.
  • 47. A method for protecting an individual from hepatitis C virus infection comprising administering an effective amount of a vaccine according to any one of the vaccines recited in claims 41 to 44.
  • 48. A method of lessening the probability that a HCV-infected individual will enter a chronic phase of hepatitis C infection comprising administering an effective amount of a vaccines according to any one of the vaccine recited in claims 41 to 44.
  • 49. A method for diagnosing an individual infected with hepatitis C virus 1a comprising: a. obtaining a biological sample from the individual andb. using PCR primers to consensus sequences of HCV 1a to amplify nucleic acids in the biological sample to determine if the individual has been infected with hepatitis C virus 1a.
  • 50. A method for diagnosing an individual infected with hepatitis C virus 1b comprising: a. obtaining a biological sample from the individual andb. using PCR primers to consensus sequences of HCV 1b to amplify nucleic acids in the biological sample to determine if the individual has been infected with hepatitis C virus 1b.
  • 51. A kit comprising a HCV vaccine and instructions for the administration thereof.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit and priority of U.S. Provisional Application Ser. No. 60/648,550, filed Jan. 31, 2005, and Provisional Application Ser. No. 60/648,877, filed Feb. 2, 2005, both of which are herein incorporated by reference for all purposes.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention was made with Government support under DA-11880, DK-57998 and AI-40035 awarded by the PHS. The Government has certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/US2006/003514 1/31/2006 WO 00 10/13/2008
Provisional Applications (2)
Number Date Country
60648550 Jan 2005 US
60648877 Feb 2005 US