This application is a national phase application under 35 U.S.C. 371 of International Application No. PCT/IB2010/051158 filed Mar. 17, 2010, which claims priority to Canadian Application No. 2,658,714 filed Mar. 17, 2009.
The present invention relates to Hepatitis C virus (HCV)-derived polypeptides and nucleic acid molecules encoding same which advantageously comprises a cd81-binding region. In this connection, the present invention specifically relates to the use of the polypeptides or nucleic acid molecules in compositions and methods for the prevention, the treatment and the diagnosis of HCV infections.
More than 170 million people worldwide are infected with the Hepatitis C Virus (HCV), a major human pathogen against which there is currently no vaccine and no sufficiently effective and tolerable therapeutic treatment available. In most cases, the infection causes chronic liver disease that often develops into cirrhosis and hepatocellular carcinoma. HCV is a small enveloped virus in the Hepacivirus is genus within the Flaviviridae family of positive-strand RNA viruses [2]. The viral genome is a messenger RNA of 9.5 kilobases, containing a single long open reading frame which is translated into a precursor polyprotein of ˜3010 amino acids. Maturation of the precursor into the individual viral proteins is carried out by cellular and viral proteases and takes place both co- and post-translationally [3]. The structural proteins are derived from the N-terminal portion of the precursor, and include the core (C) protein and the envelope glycoproteins, E1 and E2, arranged in this order from the N-terminus of the polyprotein.
Circulating HCV virions are associated with cellular components, in particular low- and very low-density lipoproteins (LDL and VLDL) [4], which results in heterogeneous infectious particles of low buoyant density. The virus targets essentially human hepatocytes, the entry process into which is not fully understood. A number of cellular entry factors (or putative receptors) have been identified, including the tetraspanins CD81 [5], Claudins 1, 6 and 9 [6,7], occludin [8], the scavenger receptor B1 (SR-B1) [9], the LDL receptor [10], and glycosaminoglycans (GAGs) [11]. The current data suggest that several of these cellular factors are recruited sequentially for virus entry [12], however the precise order and timing of the relevant interactions is not fully understood. The major players of the virion are the envelope proteins E1 and E2, but their individual specific roles during entry have not been experimentally demonstrated. It has been shown that after initial attachment to glycosaminoglycans [11] E2 binds to SR-BI, an interaction involving a segment called “hypervariable region 1” (HVR1) at the N-terminus of E2 [9, 12, 13]. Furthermore, E2 also interacts with CD81, the binding site of which includes three discontinuous stretches in E2 that are distant in the primary structure [14-17]. It has been reported that CD81 and SR-BI act cooperatively to initiate the entry process [18]. The HCV virion is then internalized by receptor-mediated endocytosis via clathrin-coated vesicles [19,20]. The low pH environment of the endosome is believed to trigger a fusogenic conformational change in the envelope proteins, inducing fusion of the viral and endosomal membranes and the release of the genomic RNA into the cytoplasm of the target cell.
The 3D organization of the HCV envelope has been poorly studied, essentially because of the difficulties in producing enough material for the relevant structural analyses. Several properties of the HCV envelope glycoproteins as well as of viral particles have therefore been inferred by extrapolation from better-studied members of the Flaviviridae family, namely the viruses forming the flavivirus genus. In spite of the lack of sequence conservation in the structural protein region, the members of the different genera within this family have the same genomic organization as HCV, encoding the structural proteins in the same order in the N-terminal portion of the precursor polyprotein. Moreover, the organization of the structural genes in HCV is also similar to members of the related Togaviridae family of small enveloped, positive-strand RNA viruses, comprising the alphaviruses genus for which structural studies are also available. Similar to HCV, the envelope proteins of viruses belonging to these families fold as a heterodimer in the ER of the infected cell and in both cases the first envelope protein has been shown to play a chaperone role in the folding of the second one [21,22].
The envelope proteins of flavi- and alphaviruses appear to have diverged from a distant common ancestor—as suggested by the crystal structure of their corresponding membrane fusion proteins, E and E1, respectively, which display the same 3D fold and are the prototype of the class II membrane fusogenic proteins. The acid pH induced fusogenic conformational changes of flavivirus E and alphavirus E1, have both been structurally characterized [23-25]. These structural studies have provided insight into the process of membrane fusion induced by the beta-rich class II fusion proteins, revealing important mechanistic similarities to that of the predominantly alpha-helical “class I” proteins (reviewed in [26]). It is widely believed that viruses belonging to other genera within these families—including HCV—are likely to code for class II fusion proteins as well. The tertiary structure of class II proteins features 3 distinct domains folded essentially as beta sheets, with a central domain I containing the N-terminus, a fusion domain II that is made from two polypeptide segments emanating from domain I, and a C-terminal domain III displaying an immunoglobulin superfamily fold located at the opposite side of domain I in the pre-fusion conformation. The conformational change leads to a trimerization during which the subunits adopt a hairpin conformation, bringing together the fusion loop and the trans-membrane segment, with domain III displaced by about 30-40 Å with respect to the other two domains, stabilizing the post-fusion homotrimer.
The similarities mentioned above have led to the proposal of a theoretical atomistic model of HCV E2 based on the class II fold, derived from the crystal structure of the flavivirus virus E protein homodimer [1]. This model was used to fit a low-resolution cryo-EM 3D reconstruction of HCV-like particles [27]. However, no experimental data supporting these models have been obtained so far.
Several studies have addressed the mechanism of membrane fusion initiated by the HCV glycoproteins [28-30], however the identity of the HCV fusion protein remains to be experimentally determined. Structural studies on E2 can provide important insights into its role during entry. Such studies can only come from the use of recombinant proteins, complemented by low resolution studies of authentic HCV virions. X-ray crystallography analyses on the individual proteins are however difficult, mainly because both E1 and E2 are heavily glycosylated [31]- and the presence of several glycans has been shown to be essential for folding in the ER lumen [32]. Their 3D fold is further stabilized by an important number of disulfide bridges—E1 and E2 display 8 and 18 strictly conserved cysteines, which are believed to be involved in 4 and 9 intramolecular disulfide bridges, respectively. These features concur to make production of the purified glycoproteins in sufficient quantities for structural studies a very difficult task.
The inventors have designed Hepatitis C virus (HCV)-derived polypeptides and nucleic acid molecules encoding same which advantageously comprises a cd81-binding region. In this connection, the present invention specifically relates to the use of said polypeptides or nucleic acid molecules in compositions and methods for the prevention, the treatment and the diagnosis of HCV infections.
Definitions
The term “isolated” is meant to describe a nucleic acid construct or a polypeptide that is in an environment different from that in which the nucleic acid construct or the polypeptide naturally occurs.
The term “specifically binds to” or “having binding specificity for” refers to antibodies that bind with a relatively high affinity to one or more epitopes of the polypeptide of the invention, but which do not substantially recognize and bind molecules other than the HCV-derived polypeptides of the invention. As used herein, the term “relatively high affinity” means a binding affinity between the antibody and the polypeptide of at least 106 M−1, or may be of at least about 107 M−1 or even may be at least about 108 M−1 to about 1010 M−1. Determination of such affinity may be conducted under standard competitive binding immunoassay conditions which are common knowledge to one skilled in the art.
The term “treating” refers to a process by which the symptoms of an infection or a disease associated with a HCV strain are alleviated or completely eliminated. As used herein, the term “preventing” refers to a process by which symptoms of an infection or a disease associated with a HCV strain are obstructed or delayed.
Polypeptides and Polynucleotides of the Invention
It is therefore an object of the invention to provide Hepatitis C virus (HCV)-derived polypeptides which advantageously comprise a cd81-binding region. Such a HCV-derived polypeptide consists of an isolated polypeptide comprising or consisting of a peptide chosen from:
(a) a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 1 or 2;
(b) a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 3 or 4; or
(c) a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 5, 6 or 7.
By “substantially identical” when referring to an amino acid sequence, it will be understood that the polypeptide of the present invention preferably has an amino acid sequence having at least 75% identity, or even preferably 85% identity, or even more preferably 95% identity to part or all of the sequence of SEQ ID NO: 1 to 7.
The polypeptide of the invention also comprises or consists of a peptide chosen from:
(d) a peptide comprising or consisting of a sequence chosen from SEQ ID NO: 1, 2, 3, 4, 5, 6 or 7; or
(e) a peptide generating anti-HCV antibodies (e.g. neutralizing antibodies) having binding specificity for a peptide having or consisting of an amino acid sequence chosen from SEQ ID NO: 1, 2, 3, 4, 5, 6 or 7. It will be understood that the neutralizing antibodies advantageously inhibit the HCV binding to cd81. By “inhibit” is meant having the ability to interfere with the binding of a HCV strain to the B-cell cd81 receptor.
SEQ ID NO: 1 and 2 respectively represent domain I of the E2 protein of HCV H77 (genbank accession GI:130461) and UKN2b—2.8 (genbank accession AY734983) strains. SEQ ID NO: 3 and 4 respectively represent domain III of the E2 protein of HCV H77 and UKN2b—2.8 strains. SEQ ID NO: 5 and 6 respectively represent domain I+III of the E2 protein of HCV H77 strain without and with a linker. SEQ ID NO: 7 represents domain I+III of the E2 protein of HCV UKN2b—2.8 strain with a linker.
The HCV-derived polypeptide of the present invention also relates to a soluble fragment of HCV E2 protein, wherein the fragment consists of the contiguous amino acids from the N-terminus of the E2 protein to the last amino acid before the transmembrane domain of the E2 protein, and is produced, in particular recombinantly produced, in an insect cell, in particular a Drosophila cell, more particularly in a Schneider 2 (S2) cell.
In particular, the soluble fragment of HCV E2 protein corresponds to amino acids 384 to 715 of the polyprotein of HCV H77 strain. The polyprotein of HCV H77 strain is notably represented by genbank accession number GI:130461 and amino acids 348 to 715 of the polyprotein of HCV H77 strain are represented by SEQ ID NO: 15. It is well within the common skills of one of skill in the art to identify sequences from other HCV strains corresponding to, homologous to, or aligning with amino acids 384 to 715 of the polyprotein of HCV H77 strain. By way of example,
As will be clear to one of skill in the art, as intended herein a “soluble” polypeptide is preferably such that it does not precipitate in an aqueous medium, such as cytoplasm, a cell culture medium, or a standard protein conservation medium, in particular for a period of at least 1 month at 4° C.
It is also an object of the invention to provide an isolated nucleic acid molecule which encodes for the polypeptides of the invention. More particularly, the nucleic acid molecule of the invention comprises a polynucleotide chosen from:
(a) a polynucleotide encoding a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 1 or 2 or fragments or analogs thereof;
(b) a polynucleotide encoding a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 3 or 4 or fragments or analogs thereof; or
(c) a polynucleotide encoding a peptide substantially identical to an amino acid sequence comprising SEQ ID NO: 5, 6 or 7 or fragments or analogs thereof.
The nucleic acid molecule of the invention also comprises a polynucleotide chosen from:
(d) a polynucleotide encoding a peptide comprising a sequence chosen from SEQ ID NO: 1, 2, 3, 4, 5, 6 or 7 or fragments or analogs thereof; or
(e) a polynucleotide encoding a peptide generating anti-HCV antibodies having binding specificity for a peptide having an amino acid sequence chosen from SEQ ID NO: 1, 2, 3, 4, 5, 6 or 7 or fragments or analogs thereof.
More particularly, the nucleic acid molecule of the invention may comprise a nucleotide sequence substantially identical to SEQ ID NOS 8, 9, 10, 11, 12, 13 or 14.
By “substantially identical” when referring to a nucleic acid sequence, it will be understood that the polynucleotide of the invention preferably has a nucleic acid sequence which is at least 65% identical, more particularly 80% identical and even more particularly 95% identical to part or all of the sequence shown in SEQ ID NOS 8 to 14 or functional fragments thereof.
A “functional fragment”, as is generally understood and used herein, refers to a nucleic acid sequence that encodes for a functional biological activity that is substantially similar to the biological activity of the whole nucleic acid sequence. In other words, and within the context of the present invention, it preferably refers to a nucleic acid or fragment(s) thereof that substantially retains the capacity of encoding a polypeptide/protein which elicits antibodies, and more preferably neutralizing antibodies, to a HCV strain challenge when administered to an animal.
In another object, the invention is further directed to a vector (e.g., a cloning or expression vector) comprising a polynucleotide of the invention as defined above.
As used herein, the term “vector” refers to a polynucleotide construct designed for transduction/transfection of one or more cell types. Vectors may be, for example, “cloning vectors” which are designed for isolation, propagation and replication of inserted nucleotides, “expression vectors” which are designed for expression of a nucleotide sequence in a host cell, or a “viral vector” which is designed to result in the production of a recombinant virus or virus-like particle, or “shuttle vectors”, which comprise the attributes of more than one type of vector.
A number of vectors suitable for stable transfection of cells and bacteria are available to the public (e.g., plasmids, adenoviruses, baculoviruses, yeast baculoviruses, plant viruses, adeno-associated viruses, retroviruses, Herpes Simplex Viruses, Alphaviruses, Lentiviruses), as are methods for constructing such cell lines. It will be understood that the present invention encompasses any type of vector comprising any of the polynucleotide molecule of the invention.
Another object of the present invention is to provide a host cell transfected with a vector as defined above. It is understood that any suitable cell to one skilled in the art may be used in accordance with the present invention. In a related aspect, there is provided a method for producing a polypeptide as defined above, comprising culturing a host cell of the invention under conditions suitable for expression of said polypeptide and harvesting said expressed polypeptide. It will be understood that the conditions for expression may be those described in the Example section.
Compositions and Methods of Use of the Invention
The HCV-derived polypeptides, in particular the soluble fragment of HCV E2 protein, and nucleic acid molecules encoding same of the invention may be used in many ways in the treatment and/or prevention of infection caused by HCV.
For instance, and according to an aspect of the invention, the HCV-derived polypeptides, in particular the soluble fragment of HCV E2 protein, of the invention may be used as immunogens for the production of specific antibodies (e.g. neutralizing antibodies) for the treatment and/or prevention of a HCV infection.
The present invention thus also relates to the HCV-derived polypeptides, in particular the soluble fragment of HCV E2 protein, of the invention for use for the prevention and/or treatment of a HCV infection.
In a related aspect, there is provided a method for producing antibodies neutralizing entry of HCV into a cell. Such a method comprises the steps of:
(a) administering to a suitable host a polypeptide or a soluble fragment of HCV E2 protein of the invention, a nucleic acid molecule of the invention, or a composition as defined above to produce HCV neutralizing antibodies and
(b) harvesting said HCV neutralizing antibodies.
The present invention also relates to a method for producing anti-HCV antibodies or B-lymphocytes, comprising a step of harvesting said anti-HCV antibodies or B-lymphocytes in a biological sample, such as a blood, serum or plasma sample, from a non-human animal which has been administered a HCV-derived polypeptide, in particular a soluble fragment of HCV E2 protein, according to the invention. In a further step, anti-HCV monoclonal antibodies may be generated from the B-lymphocytes.
Suitable antibodies may be determined using appropriate screening methods, for example by measuring the ability of a particular antibody to neutralize the HCV infection in a cellular test model. Examples of such cellular test model are well known to one skilled in the art and will not be discussed further.
The present invention also relates to the in vitro use of a HCV-derived polypeptide, in particular a soluble fragment of HCV E2 protein, according to the invention for generating specific ligands of HCV. The ligands may be of any nature. However, it is preferred that the ligands are scFv fragments, or peptide or nucleotide aptamers.
According to another aspect, the nucleic acid molecules encoding polypeptides of the invention or derivatives thereof may be used in a DNA immunization method. That is, they can be incorporated into a vector which is replicable and expressible upon injection thereby producing the antigenic polypeptide in vivo. For example polynucleotides may be incorporated into a plasmid vector under the control of the CMV promoter which is functional in eukaryotic cells. For instance, the vector may be injected intramuscularly. The use of a nucleic acid molecules of the invention in genetic immunization will preferably employ a suitable delivery method or system such as direct injection of plasmid DNA into muscles [Wolf et al. H M G (1992) 1: 363, Turnes et al., Vaccine (1999), 17: 2089, Le et al., Vaccine (2000) 18: 1893, Alves et al., Vaccine (2001) 19: 788], injection of plasmid DNA with or without adjuvants [Ulmer et al., Vaccine (1999) 18: 18, MacLaughlin et al., J. Control Release (1998) 56: 259, Hartikka et al., Gene Ther. (2000) 7: 1171-82, Benvenisty and Reshef, PNAS USA (1986) 83: 9551, Singh et al., PNAS USA (2000) 97: 811], targeting cells by delivery of DNA complexed with specific carriers [Wa et al. J Biol Chem (1989) 264: 16985, Chaplin et al., Infect. Immun. (1999) 67:6434], injection of plasmid complexed or encapsulated in various forms of liposomes [Ishii et al., AIDS Research and Human Retroviruses (1997) 13: 142, Perrie et al., Vaccine (2001) 19:3301], administration of DNA with different methods of bombardment [Tang et al., Nature (1992) 356: 152, Eisenbraun et al., DNA Cell Biol (1993) 12: 791, Chen et al., Vaccine (2001) 19:2908], and administration of DNA with lived vectors [Tubulekas et al., Gene (1997) 190: 191, Pushko et al., Virology (1997) 239: 389, Spreng et al. FEMS (2000) 27: 299, Dietrich et al., Vaccine (2001) 19: 2506].
In this connection, another aspect of the present invention relates to a composition, in particular a pharmaceutical composition, more particularly a vaccine composition, for preventing or treating such HCV infections. The composition of the present invention advantageously comprises an acceptable carrier and a polypeptide(s) of the invention or a soluble fragment of HCV E2 protein of the invention. Alternatively, the composition of the invention can comprise a nucleic acid molecule and/or an expression vector of the invention.
In a preferred embodiment, the composition of the invention further comprises an adjuvant. As used herein, the term “adjuvant” means a substance added to the composition of the invention to increase the composition's immunogenicity. The mechanism of how an adjuvant operates is not entirely known. Some adjuvants are believed to enhance the immune response (humoral and/or cellular response) by slowly releasing the antigen, while other adjuvants are strongly immunogenic in their own right and are believed to function synergistically. Known adjuvants include, but are not limited to, oil and water emulsions (for example, complete Freund's adjuvant and incomplete Freund's adjuvant), Corytzebactei-ium parvuin, Quil A, cytokines such as IL 12, Emulsigen-Plus®, Bacillus Calmette Guerin, aluminum hydroxide, glucan, dextran sulfate, iron oxide, sodium alginate, Bacto Adjuvant, certain synthetic polymers such as poly amino acids and co-polymers of amino acids, saponin, paraffin oil, and muramyl dipeptide. Adjuvants also encompass genetic adjuvants such as immunomodulatory molecules encoded in a co-inoculated DNA, or as CpG oligonucleotides. The coinoculated DNA can be in the same plasmid construct as the plasmid immunogen or in a separate DNA vector.
Yet, a further aspect of the present invention is to provide a method for treating and/or preventing a Hepatitis C virus (HCV) infection in a host. The method of the invention comprises the step of administering to the host a polypeptide, a soluble fragment of HCV E2 protein and/or a nucleic acid molecule and/or a composition as defined above. The host may be an animal such as a human.
Further agents can be added to the composition of the invention. For instance, the composition of the invention may also comprise agents such as drugs, immunostimulants (such as α-interferon, β-interferon, γ-interferon, granulocyte macrophage colony stimulator factor (GM-CSF), macrophage colony stimulator factor (M-CSF), and interleukin 2 (IL2)), antioxidants, surfactants, flavoring agents, volatile oils, buffering agents, dispersants, propellants, and preservatives. For preparing such compositions, methods well known in the art may be used.
The amount of the components or the elements of the composition of the invention is preferably a therapeutically effective amount. A therapeutically effective amount of the contemplated component is the amount necessary to allow the same to perform their immunological role without causing overly negative effects in the host to which the composition is administered. The exact amount of the components to be used and the composition to be administered will vary according to factors such as the type of condition being treated, the type and age of the host to be treated, the mode of administration, as well as the other ingredients in the composition.
The composition of the invention may be given to the host through various routes of administration. For instance, the composition may be administered in the form of sterile injectable preparations, such as sterile injectable aqueous or oleaginous suspensions. These suspensions may be formulated according to techniques known in the art using suitable dispersing or wetting agents and suspending agents. The sterile injectable preparations may also be sterile injectable solutions or suspensions in non-toxic parenterally-acceptable diluents or solvents. They may be given parenterally, for example intravenously, intramuscularly or sub-cutaneously by injection, by infusion or per os. Suitable dosages will vary, depending upon factors such as the amount of each of the components in the composition, the desired effect (short or long term), the route of administration, the age and the weight of the host to be treated. Any other methods well known in the art may be used for administering the composition of the invention.
Methods of Detection or Diagnosis and Kits
The HCV polypeptides and nucleic acid molecules encoding same of the invention may also be used in different ways in the detection and diagnosis of HCV infection.
In this connection and in a further aspect, the present invention provides a method for diagnostic of HCV infection in a host susceptible to HCV infection comprising the steps of:
(a) incubating an antibody or fragment thereof that specifically binds to a polypeptide or soluble fragment of HCV E2 protein as defined above with a biological sample obtained from a host to form a mixture; and
(b) detecting specifically bound antibody or bound fragment in the mixture which indicates the presence of HCV.
As used herein, the term “sample” refers to a variety of sample types obtained from the host and can be used in a diagnostic or detection assay. The definition encompasses blood and other liquid samples of biological origin, solid tissue samples such as a biopsy specimen or tissue culture or cells derived therefrom.
Yet, in another embodiment, the present invention provides a method for detection of antibody specific to HCV antigen in a biological sample comprising the steps of:
(a) incubating a polypeptide of the invention or fragments thereof, or a soluble fragment of HCV E2 protein, with a biological sample obtained from a host to form a mixture; and
(b) detecting specifically bound polypeptide or bound fragment in the mixture which indicates the presence of antibody specific to HCV.
One skilled in the art will recognize that this diagnostic test may take several forms, including an immunological test such as an enzyme-linked immunosorbent assay (ELISA) or a radioimmunoassay, essentially to determine whether antibodies specific for the HCV protein (such as E2) are present in an organism.
The present invention further provides kits for use within any of the above diagnostic methods. Such kits typically comprise two or more components necessary for performing a diagnostic assay. Components may be compounds, reagents, containers and/or equipment. For example, one container within a kit may contain an antibody or fragment thereof that specifically binds to a HCV polypeptide of the invention. One or more additional containers may enclose elements, such as reagents or buffers, to be used in the assay.
In this connection, the present invention also provides a kit comprising a polypeptide and/or a soluble fragment of HCV E2 protein and/or a nucleic acid molecule of the invention for detection or diagnosis of HCV infection. Such a kit may further comprise a reagent to detect polypeptide-antibody immune complex, a biological reference sample lacking antibodies that immunologically bind with the HCV peptide. The kit may also comprise a comparison sample comprising antibodies which can specifically bind to the HCV peptide. It will be understood that the HCV polypeptide, reagent, biological reference sample, and comparison sample are advantageously present in an amount sufficient to perform said detection.
Soluble E2 eluted from a Strep-Tactin column was separated in a size exclusion chromatography using a Superdex 200 column (GE Healthcare). Chromatograms showing absorption at 280 nm (top curve) and at 254 nm (bottom curve) as well as SDS-PAGE of corresponding fractions under non-reducing conditions followed by Coomassie staining are presented for isolates UKN2b—2.8 and UKN4—11.1, respectively. Arrows indicate aggregated (A), dimeric (D) and monomeric (M) forms of the protein.
Predicted trypsin cleavage sites (σ) and N-glycosylation sites (⋄) are indicated, cysteines are boxed and the respective disulfide bridges are shown (-SS-). Peptides identified after tryptic cleavage are shaded, named according to the respective isolate and numbered sequentially following the amino acid sequence of E2.
Based on the experimentally disulfide connectivity pattern the ectodomain of HCV E2 was modelled using the class II viral fusion protein fold as template. N- and C-Terminus are indicated. The same colour code for the three distinct domains was used as in similar figures showing TBEV E and SFV E1 [51, 60] and a schematic drawing is appended to illustrate the domain organization in HCV E2. While in TBEV E as well as SFV E1 two long insertions between strands in the central domain form the dimerization domain II, in HCV E2 only one insertion forms the tip domain II. In contrast, domain III, which has previously been predicted not to be present, has got a similar size as in TBEV E or SFV E1, however the extended stem region seems to contain one of the nine disulfide bridges. Residues that have been previously reported to be involved into CD81 binding are encircled, the disulfide bridges are indicated by black bars.
Envelope glycoproteins are key players in the replication cycle of enveloped viruses. In addition to carrying the main antigenic determinants, these proteins are responsible for receptor recognition and triggering fusion of the viral and cellular membrane during viral entry. The inventors report here a biochemical and preliminary structural characterization of recombinant glycoprotein E2 from the hepatitis C virus, a major human pathogen. An expression system for the ectodomain of HCV E2 (sE2) was established using Drosophila S2 cells, which allows the production of large quantities of correctly folded monomeric protein, as assayed by a number of conformational and functional tests. sE2 was used to analyze the secondary structure composition of the folded protein and to determine the connectivity of the disulfides. Together, these data have strong implications for the overall 3D fold of the protein, and the possible tertiary structure based in the class II fold is discussed.
Materials and Methods
Cells, Viruses and Media
Drosophila Schneider 2 (S2) cells were purchased from Invitrogen and cultured at 28° C. in Schneiders Drosophila medium (Invitrogen, Carlsbad, USA). Stable cell lines were transferred to serum-free Insect Xpress media (Lonza, Basel, Switzerland), which was also used for protein production. Human hepatoma cells (Huh 7.5) [33] were cultured in Dulbecco's Modified Eagle Medium supplemented with 10% fetal bovine serum (FBS), 100 U/ml penicillin, 100 μg/ml streptomycin, and a 100 μM mixture of non-essential amino acids (DMEM-10%, all reagents from Invitrogen, Cergy-Pontoise, France) and maintained at 37° C. in a 5% CO2 atmosphere.
HCV stocks (HCVcc) were produced after electroporation of cB76.1/Huh7 cells [34] with RNA transcripts synthesized in vitro from pJFH1 [35]. Cell supernatants were harvested at 7 days post-transfection and virus titers (in focus forming units (FFU)/mL) were determined by indirect immunofluorescence on Huh7.5 cells, as described below.
Plasmids
For improved purification efficiency the region encoding the V5-His Tag in the plasmid pMT/BiP/V5-His was replaced by a region encoding an Enterokinase cleavage site and a double Strep Tag separated by a linker region (GGS)4 (SEQ ID NO: 18), resulting in the following amino acid sequence downstream of the ApaI and BstBI sites . . . DDDDKAGWSHPQFEKGGGSGGGSGGGSWSHPQFEK-COOH (SEQ ID NO: 19). All synthetic HCV glycoprotein genes were purchased from GeneCust (Dudelange, Luxemburg) and amplified by PCR using strain specific 5′-oligonucleotides containing BgI II, which allows insertion immediately downstream of the BiP secretion signal, and strain specific 3′-oligonucleotides containing Apa I.
A full list of oligonucleotides used in this study is available upon request.
Generation of Inducible Drosophila S2 Cell Lines Producing sE2
2 μg of the respective plasmid was transfected into Drosophila S2 cells using Effectene (Qiagen, Hilden, Germany) according to the manufacturers recommendations. A plasmid encoding either Blasticidin S deaminase or puromycin acetyltransferase, respectively, was cotransfected as dominant selectable marker. Stable sE2 expressing cell lines were selected by addition of 6 μg/ml Puromycin or 25 μg/ml Blasticidin S (Invivogen, San Diego, USA) to the culture medium 72 h after transfection. Adaptation of the cell lines to serum free Insect Xpress media was performed stepwise as recommended by Invitrogen. Expression systems for West-Nile virus E protein, E2 of bovine viral diarrhea virus as well as for E1 of Chikungunya virus were designed in a similar way.
Production and Purification of Envelope Glycoprotein
For large scale production of sE2 the cells were cultured in spinner flasks or in Wave Bioreactors (2/10, Wave Biotech, Somerset, USA) and induced with 4 μM CdCl2 at a density of approximately 7×106 cells per ml. After 8 days at 28° C. cells were pelleted and sE2 in the supernatant was purified by affinity chromatography using a StrepTactin Superflow column (IBA, Goettingen, Germany) followed by gel filtration chromatography using a Superdex200 column (GE Healthcare, Uppsala, Sweden). Pure protein was quantified using adsorption at UV280nm and concentrated to approximately 1 mg/ml.
Pull-Down Assay of Antibodies and CD81 Using sE2
25 μg of sE2 was bound to a StrepTactin Superflow mini column and washed with 10 column volumes of washing buffer. Subsequently, 10 μg of CD81 large extracellular loop (produced as described before [36]) or 50 μg conformation dependent antibodies CBH-4B, CBH-4D against HCV E2 (kindly provided by S. Foung, Stanford, USA) or a control antibody were added followed by washing with 10 column volumes. Complexes were eluted in 4.5 column volumes elution buffer and concentrated 20-fold by ultrafiltration. This concentrate was analysed by SDS-PAGE followed by Coomassie staining.
Inhibition of HCVcc Infection by sE2
Huh7.5 cells plated on glass coverslips in 24-well plates (4.5×104 cells per well) were incubated for 1 h at RT in the absence or presence of increasing concentrations (0.05-2 μM) of HCV sE2, or BVDV sE2 or West Nile virus sE used as controls. Cells were subsequently washed and infected with 7×103 FFU of HCVcc in the presence of identical concentrations of respective viral glycoproteins. After a 5 h adsorption period at 37° C., the viral inoculum was removed and replaced with fresh medium. At 3 days post-infection, cells were fixed with 4% paraformaldehyde, permeabilized in 0.2% Triton X-100 in PBS, and subsequently processed for core detection by indirect immunofluorescence using 0.5 μg/mL of anti-core monoclonal antibody 1851 (Abcys, Paris, France) and Alexafluor 555-conjugated anti-mouse IgG (Invitrogen) at a 1:500 dilution. Coverslips were mounted on slides using ProLong Gold Antifade Reagent with DAPI (Invitrogen) allowing counterstaining of cell nuclei. Virus titers were determined by counting labeled foci following acquisition of mosaic images spanning the entire surface of each coverslip using an Axioplan2i microscope (Zeiss) with a Wide Field ApoTome (Zeiss) and the Axiovision software (Zeiss).
Far-UV Circular Dichroism of Envelope Glycoproteins
HCV UKN2b—2.8 E2 or either of the control proteins in a concentration of 0.7 mg/ml in 20 mM phosphate pH 7.5, 150 mM NaF were used for circular dichroism of envelope glycoproteins. CD spectra were obtained with an AVIV CD spectrophotometer model 215 using a 0.02 cm path length cell at room temperature. Five successive scans were averaged and the background spectrum of the sample buffer, acquired under identical conditions, was subtracted. The resulting corrected CD intensities were then converted to Δε per residue. Secondary structure contents were estimated from the far-UV CD spectra using the CDSSTR routine of the DICHROWEB server [37, 38] run on the SP175 reference dataset [39]. Quality of fit was judged from normalised root mean square deviation with values <0.1 (HCV E2 0.04±0.03, for CHIK V sE1 0.060±0.02, and for WNV sE 0.05±0.00) considered as a very good fit for all three proteins.
Fourier Transform Infrared Spectroscopy of Envelope Glycoproteins
HCV E2 and E protein of Yellow Fever virus in a concentration of 5-10 mg/ml in 10 mM Tris pH 8.0, 150 mM NaCl were used for FTIR analysis. Attenuated total reflectance FTIR spectra were measured with a Bruker vector 22 spectrophotometer equipped with a 45° diamond ATR attachment (National Instruments, U.K.). The spectra shown represent the average of 100 scans after removal of the buffer signal.
Deglycosylation and Proteolytic Digestion of sE2
200 μg of sE2 were boiled 10 minutes at 95° C. in 2.5% SDS and the denatured protein was incubated with His-tagged PNGase F in excess at 37° C. overnight. Subsequently, PNGase F was removed by ion metal affinity purification, the deglycosylated protein was concentrated to a concentration of approximately 1.5 mg/ml and analysed by SDS-PAGE followed by staining with Amido Black. Bands containing approximately 15 μg sE2 were cut out of the gel and subsequently digested at 37° C. in 0.01% Tween20, 50 mM TrisHCl with 0.5 μg Trypsin. In order to stabilize existing disulfide bonds the pH of the reaction was set to the lowest possible value (pH 7.0 for experiment 1, pH 7.6 for experiment 2 and pH 8.6 for experiment 3), which resulted in efficient digestion. After 8 h 0.25 μg Trypsin were added and the digestion was continued for further 16 h. The peptides were eluted from the gel using 200 μl water and two times 100 μl 60% Acetonitril.
For a control experiment the tryptic digest was performed as described above, but in presence of 5% DMSO, acting as oxidizing agent. For digestion in Nitrocellulose membrane the deglycosylated protein was analysed by SDS-PAGE followed by transfer onto a Nitrocellulose membrane and staining with Poinceau-Red. Bands containing approximately 15 μg sE2 were cut out of the membrane and saturated with 1 ml of 0.2% PVP K30 for 15 minutes followed by four washes with water and two washes with 50 mM TrisHCl pH 7.6. Subsequently the protein was digested with 1 μg Trypsin for 3 h at 37° C. in the absence or presence of 5 mM NEM. The peptides were eluted from the membrane using 200 μl water.
HPLC of Resulting Peptides and Subsequent Proteomic Analysis
Half of the tryptic digest was separated by reverse-phase HPLC using DEAE-C18 columns (1 mm diameter) and a gradient of 2 to 70% Acetonitrile in TFA 0.1%. The second half of the tryptic digest was reduced by addition of TCEP to a final concentration of 2.5 mM (for the control experiment including NEM 10 mM TCEP were used) for 30 minutes at RT and subsequently subjected to reverse-phase HPLC under identical conditions (
N-Terminal Sequencing and SELDI-TOF Analysis
N-terminal sequencing was performed using a ABI 494 Protein Sequencer (Applied BioSystems, Foster City, USA). SELDI-TOF analysis was performed on a Protein ChipReader System 4000 using a H4 (reversed phase, Ciphergen Biosystems, Fremont, Calif., USA) surface and a SPA matrix, which was prepared according to the manufacterer's instructions. Peak identification was carried out using ProteinChip Software 3.1 (Ciphergen). Molecular weight prediction of disulfide-connected peptides was performed using MS-BRIDGE [40], while molecular weight of reduced peptides was predicted using PeptideMass [41].
Results
Expression and Characterization of Soluble HCV E2
The inventors adapted the Drosophila Expression System (DES, Invitrogen) for expression of the full ectodomain of HCV E2 glycoproteins from a number of strains for large-scale production and purification from the cell culture medium. The Invitrogen pMT/BiP vector was modified to encode for an engineered sE2 protein such that it can be efficiently purified upon induction of the metallothionin promoter with divalent cations (Cu2+ or Cd2+). The original vector encodes the Drosophila BiP signal sequence (SS) at the N-terminus of the construct for efficient translocation into the ER of the S2 cells, in frame with the gene of the secreted protein of interest (replacing the endogenous SS). At the C-terminal end, the pMT/BiP vector includes a segment coding for the V5 epitope followed by a 6-Histidine tag allowing NTA affinity purification. In order to avoid an interaction of the divalent metal ions used for induction with the histidine tag in the secreted protein, this region was replaced in our construct by a segment coding for a specific proteolytic cleavage site, followed by a double strep-tag peptide (IBA, www.iba-go.com). This is a tandem strep-tag with a linker region (GlyGlySer)4 (SEQ ID NO: 18) in between. The proteolytic cleavage site was added to allow the specific removal of the tag for structural studies. Because of the high susceptibility of class II viral envelope proteins to reducing agents, the inventors avoided using cysteine-proteases such as the TEV or the picornavirus 3C (or “Prescission” protease)—which require a reducing agent for their activity. The inventors included instead an enterokinase (EK) cleavage site, which is a serine-protease that is relatively specific for the sequence (Asp)4Lys↓X, cleaving at the site indicated by the arrow with a cleavage efficiency between 60 and 80% for X being any amino acid [42]. The engineered production vector was termed pMT/BiP/EK2ST.
Numerous studies on HCV E2 have been reported in the literature using soluble E2 truncated at aa661 (protein sE2661; this numbering refers to the precursor polyprotein of HCV strain H77c, genotype 1a, and is used throughout this manuscript). This protein is efficiently secreted into the supernatant and was shown to be recognized by conformational antibodies and to bind to the candidate cellular receptors CD81 and SR-BI [43, 44]. sE2661 lacks the C-terminal cysteine (Cys677), leaving an unpaired cysteine residue (out of the 18 strictly conserved cysteines present in the protein), resulting in the formation of non-specific disulfide linked multimers. To avoid these difficulties, the inventors cloned the intact E2 ectodomain, truncated just upstream of the putative TM segment.
The inventors inserted the coding sequence spanning residues 191 to 715 (or equivalent after multiple alignments)—beginning at the authentic N-terminus of E1 generated by signalase—from 9 HCV strains into the pMT/BiP/EK2ST vector (
The inventors characterized the purified sE2715 by investigating the binding of two anti HCV E2 monoclonal antibodies recognizing conformational epitopes (CBH-4B and CBH-4D, [47] kindly provided by S. Foung), as well as recombinant CD81-LEL produced as described previously [36].
For this purpose sE2715 was bound to a Strep-Tactin column and an equimolar ratio of either CD81 or monoclonal antibody was loaded. After washing the complex was eluted, concentrated approximately 20-fold and analyzed in SDS-Page followed by Coomassie staining. sE2 was able to pull down CD81 as well as both conformation-dependent antibodies, but not a control antibody (
Inhibition of HCVcc by HCV E2 Produced in Drosophila Cells
As an additional control of the functionality of the recombinant sE2715 protein, the inventors tested its capacity to inhibit HCVcc infection of Huh7.5 cells. It was previously reported that soluble E2715 produced in mammalian cells does not inhibit HCVcc entry, even at high concentrations (≦1 μM) [48]. Inhibition of HCVcc infection of Huh7.5 cells by our Drosophila-produced sE2715 was done as described in materials and methods, using as a control its pestivirus and flavivirus counterpart, BVDV sE2 and West Nile Virus sE protein, respectively. The latter two proteins were produced in the inventors' laboratory in the same way for other purposes. Irrespective of the protein concentration used, both control proteins reduced the number of HCVcc-infected foci to approximately 50-60%, as compared to foci obtained upon infection in the absence of exogenous viral glycoprotein (100%,
Analysis of Secondary Structure of HCV E2
After confirming the functional properties of the recombinant Drosophila-produced sE2715, which suggest that it has achieved a native conformation, the inventors set out to analyze the characteristics of its 3D fold more closely. The inventors carried out comparative circular dichroism analyses using class II proteins of known structure, the WNV sE (PDB 2HG0 and 2I69 [49, 50]) and the chikungunya virus (CHIK V) sE1. The latter is an alphavirus very close to Semliki Forest Virus, for which the structure of sE1 is known (PDB 2Ala [51, 52]).
The far-UV CD spectra of the three proteins exhibited considerable differences (
Far-UV CD is known to be very sensitive to the presence of α-helices, but less sensitive to β-sheet structures. The inventors therefore used Fourier Transform Infrared Spectroscopy (FTIR), which is more sensitive to β-sheet conformations, to complement the secondary structure analysis.
Disulfide Mapping Strategy
The inventors based the analysis of the disulfide connectivity pattern of HCV E2 on trypsin digestion of the soluble ectodomain sE2715 under denaturing conditions and HPLC analysis of the resulting peptides, both under reducing and non-reducing conditions. The peaks in the elution chromatograms that were affected by reduction with TCEP (exemplified in
As indicated in
Determination of Disulfide Bridges in JFH-1 E2
In a first experiment the inventors performed a tryptic digestion of sE2715 of the JFH-1 isolate (genbank accession GI:116078059). The HPLC chromatogram of the resulting digest revealed peaks 6-3, 12-3 and 16-3 to be TCEP sensitive and disappear upon reduction.
Peak 6-3 revealed a mixture of peptides, the N-terminal sequencing of which showed that only J1 and J2 (Table 1) contained a cysteine residue (position 452 and 459, respectively). In the respective mass spectrum a peak corresponding to the disulfide linked dipeptide could be identified (1471.71 Da), which disappeared upon reduction, indicating a disulfide bridge between Cys452 and Cys459.
Peak 12-3 contained peptides J6 and J7, each of which with one cysteine (position 607 and 644, respectively). While peptides J6 and J7 were found as single peptides in the mass spectrum, indicating partial reduction, the inventors also observed a peak at the predicted molecular weight of the two peptides linked by a disulfide bond (Fig. S1, 2045.37 Da). This peak disappeared, as expected, upon reduction. This clearly suggested a disulfide bridge between Cys607 and Cys644.
Peak 16-3 contained a mixture of peptides with one dominant sequence corresponding to peptide J4, containing two cysteines (position 503 and 508, respectively) and a praline residue in between. This peptide was unambiguously identified in the mass spectrum of peak 16-3 (
Determination of Disulfide Bridges in UKN2b—2.8 E2
Subsequently the inventors subjected the sE2715 from isolate UKN2b—2.8 to trypsin digestion. HPLC separation of the resulting peptides revealed that peaks 13-1, 20-1, 29-3, 42-3 and 19-1 were TCEP sensitive and disappeared upon reduction.
N-terminal sequencing identified two peptides, U6 and U7, containing Cys607 and Cys644, in peak 13-1 (which correspond to J6 and J7, see Table 1), confirming the data obtained with JFH-1 sE2715. Mass spectrometry of U6 and U7 confirmed the presence of this disulfide bond in the UKN2b—2.8 isolate (2037.59 Da). However, the observed partial reduction of this disulfide bond in these strains suggested that experimentally induced disulfide shuffling may have occurred. In order to assess this, the inventors performed three different control experiments limiting this effect: (1) in-gel digestion in the presence of 5% DMSO, which acts as oxidizing agent, (2) digestion on a Nitrocellulose membrane in order to reduce incubation time to 3 h in the absence or (3) in the presence of NEM (N-ethylmaleimide), which covalently binds to free cysteines and thus blocks any disulfide rearrangements. The disulfide bridge between Cys607 and Cys644 was observed by N-terminal sequencing in all three control experiments (data not shown), strongly suggesting that it is also present in the native protein.
Peak 20-1 contained exclusively peptide U3, which corresponds to peptide J4 in JFH-1 sE2715, thereby confirming the presence of a disulfide bridge between Cys503 and Cys508 (Table 1, 2194.94 Da).
Analysis of peak 29-3 revealed two TCEP sensitive peptides, U2 and U3. The inventors were identified U3 previously to carry an internal disulfide bridge, thus identifying an additional internal disulfide bond between Cys486 and Cys494 in peptide U2.
Peak 42-3 contained a mixture of three different peptides: U1, U3 and U4. Previous experiments showed that the two cysteines in U3 (Cys503 and Cys508) form an intrapeptidic disulfide bridge. Since U1 and U4 each contain one cysteine (position 429 and 552, respectively) this suggested a disulfide bond between Cys429 and Cys552. Although the disulfide linked peptides could not be identified by mass spectrometry, upon reduction a peak corresponding to the reduced peptide U1 was observed (Fig. S4, 2308 Da). Likely the high molecular weight of the disulfide linked dipeptide (U1+U4−6890.68 Da) prevented its appearance in the spectrum.
One peak (19-1) was found to contain a mixture of sequences, with one dominant sequence corresponding to peptide U5, in which two cysteines (position 581 and 585) are present. The inventors observed a peak corresponding to the peptide harboring an intrapeptidic disulfide bridge in the mass spectrum (Table 1, 1849.64 Da). Reduction resulted as expected in an increase of the molecular weight by 2 Da. In addition, peptide U2 was found in the same peak, which has previously been shown to carry an intrapeptidic disulfide bond.
Determination of Disulfide Bridges in H77 E2
Finally, the inventors performed a tryptic digestion of the of sE2715 of H77 (genbank accession GI:130461) followed by HPLC of the resulting peptides, which revealed that peaks 15-2, 6-2, 26-2, 32-2, 43-2 and 33-2 disappeared upon reduction.
Peptides H5 and H6, which correspond to J6/J7 and U6/U7 were identified in peak 15-2. For both E2 of JFH-1 and UKN2b—2.8 a disulfide bridge between the respective cysteines (position 607 and 644) was shown in this study. Mass spectrometry clearly demonstrated the presence of a disulfide bridge between Cys607 and Cys644 in the ectodomain of H77 E2 as well (Table 1 and Fig. S5, 2110.35 Da).
Peptide H1 was found in two different peaks. Together with peptide H2, which corresponds to peptide J2, it was observed in peak 6-2, suggesting the presence of a disulfide bridge between Cys452 and Cys459, which has already been identified in sE2715 from strain JFH-1. A peak in the mass spectrum corresponding to this peptide confirmed the presence of this disulfide bond (1544.29 Da). However, peptide H1 was also found together with peptide H6 in peak 26-2, which clearly suggested a disulfide rearrangement for these cysteines. In order to verify the actual disulfide bonding partner of Cys452 present in H1, the three control experiments mentioned above were performed. In all control experiments the disulfide bridge between peptides H1 and H2 was observed by N-terminal sequencing, while the presence of peptides H1 and H6 in the same peak disappeared under the control experiment conditions. Thus the inventors conclude that Cys452 is effectively linked to Cys459.
In peak 32-2 the inventors found only peptide H3, which corresponds to U2 (Table 1), in which the inventors were already identified an intrapeptidic disulfide bond. Mass spectrometry confirmed the presence of this disulfide bond between Cys486 and Cys494 (4321.98 Da).
Peak 43-2 consisted of the peptides H7 and H8, each containing one cysteine residue (position 652 and 677, respectively). In the mass spectrum the inventors observed a peak corresponding to the disulfide linked dipeptide (Table 1, 6849.91 Da), unambiguously identifying a disulfide bridge between Cys652 and Cys677 in the ectodomain of H77 E2.
Comparing the sequence alignment of E2 in the region between Cys569 and Cys581 the inventors noticed that while UKN2b—2.8 and JFH-1 E2 contain three trypsin cleavage sites, H77 E2 has no cleavage sites in this region (
Tryptic digestion followed by reverse-phase HPLC and further analysis of single peaks of the reverse phase HPLC by N-terminal sequencing and SELDI-TOF analysis allowed us to identify 8 out of 9 disulfide bridges present in HCV E2 (Table 2). 5 of them could be confirmed with different isolates.
Discussion
Very little structural information is currently available on glycoprotein E2, a key player in Hepatitis C virus entry. E2 is the major viral antigen recognized by neutralizing antibodies in the infected organism and is also responsible for interactions with the cellular receptors SR-BI and CD81 during entry. Although many studies have provided important elements for understanding the HCV entry process, a detailed view of the molecular interactions is missing, in part due to the lack of structural information on the viral envelope proteins. Structural analyses of these proteins have been limited by the lack of purified samples in sufficient quantities, and the initial aim of this work was to develop a suitable protein production system of the HCV E2 glycoprotein for structural studies, taking advantage of the Drosophila S2 cell expression system. This system has been shown previously to result in the production of properly folded envelope glycoproteins for other members of the families Flaviviridae [54], Togaviridae [55], (Dubois et al., unpublished data), and of HIV gp120 [56], leading to the determination of the corresponding crystal structures. The advantage of the Drosophila system compared to others is that it is relatively easy to obtain stable transfectants (compared to mammalian cells) inducibly secreting the protein of interest without saturation of the chaperoning capacity of the ER (compared to the baculovirus infected lepidopteran cells). This is important for proteins with slow folding kinetics, as is the case for many of the viral class II membrane-fusogenic proteins, and which has also been shown for the HCV envelope proteins [57, 58]. Induced S2 cells remain healthy during a period of several weeks of recombinant protein secretion, without cell lysis which causes release of misfolded material into the medium—a problem with baculovirus infected insect cells, where lysis begins after about 3 days post-infection. Given that sE2661 lacks the C-terminal cysteine (Cys677) the inventors decided to truncate our sE2 constructs at aa715, although previous studies reported inefficient folding and secretion of HCV E2715 expressed in mammalian cells [43]. In our hands, the drosophila expressed recombinant HCV sE2 protein behaved as a soluble monomeric protein in gel filtration and did not form disulfide linked oligomeric aggregates, as observed previously with E2 made in different expression systems [43, 59] and remains monomeric at 4° C. for several months. The inventors found, however, that this quality of the fold of the secreted E2 protein depends on the used strain. For instance, sE2 from isolate UKN4—11.1 displayed a tendency to form disulfide-linked oligomers—which could however be separated from the monomeric form by size exclusion chromatography as shown in
In order to ensure that the Drosophila expressed E2 proteins adopt a correct conformation, and therefore correct disulfide bonding pattern, the inventors selected isolates that had previously been shown to be functional in a HCVpp assay [46]. In addition, the inventors showed that the purified proteins were able to bind the large extracellular loop (LEL) of CD81 as well as two monoclonal conformation-sensitive anti-HCV E2 antibodies, which do not overlap with the CD81 binding site in HCV E2. This strongly suggests that the 3D fold—and therefore its disulfide connectivity pattern—is identical to the one present in E2 protein in virions. The functionality of our recombinant sE2 proteins was further confirmed by an in vivo assay, which demonstrated their capacity to compete with HCVcc for receptor binding and thus block infection of Huh-7.5 cells. The fact that sE2 efficiently inhibits HCVcc infection in a dose dependent way—in contrast to the effect of relevant controls like the soluble WNV E glycoprotein or the soluble pestiviral E2—strongly supports the notion that the recombinant HCV sE2 protein has acquired a functional conformation. Interestingly, previous studies failed to demonstrate effective reduction of infectivity using sE2 produced in a different expression system [48] and used at concentrations identical to some of the data points displayed in
Disulfide bonds are key structural elements stabilizing the functional native conformation of viral envelope proteins. The inventors determined the disulfide connectivity pattern of HCV E2 by trypsin digestion followed by N-terminal sequencing and mass spectrometry. One disadvantage of this method is the occurrence of disulfide shuffling presumably taking place during trypsin digestion, which the inventors overcame by performing control experiments including N-ethylmaleimide whenever the results were ambiguous. The analysis was performed using recombinant E2 of three different strains representing two genotypes (1 and 2) and two subtypes of the latter, 2a and 2b. The ambiguities and difficulties in the identification of the peptides resulting from trypsin digestion of the different strains were different for each strain and gave complementary information, which together resulted in an unambiguous assessment. The inventors were thus able to experimentally identify eight disulfide bridges corresponding to 16 out of 18 cysteines. The last disulfide was therefore identified by exclusion, since the initial assumption is that the 18 strictly conserved cystein residues present in the HCV E2 ectodomain are all involved in disulfide bridges, as is the case in many other viral envelope proteins studied to date. Furthermore, five out of eight experimentally determined disulfide bridges were independently observed in more than one strain (Table 2). To the inventors knowledge this is the first report characterizing the disulfide connectivity pattern of a glycoprotein simultaneously for several strains of the same virus, experimentally confirming that the disulfide connectivity pattern and thus the overall three-dimensional fold of E2 is, as expected, strictly conserved within all genotypes of Hepatitis C virus.
The disulfide connectivity pattern strongly constrains the possible overall fold of the protein. The longer the distance along the primary structure between two cysteine residues forming a disulfide bridge, the stronger the impact for understanding the tertiary structure will be. In HCV E2 five out of nine disulfide bridges are formed by two consecutive cysteines, which are separated by less than 10 aa enclosing in each case one proline residue. The disulfide bridge connecting the two C-terminal cysteines and the two disulfide bridges Cys597-Cys620 as well as Cys 607-Cys644 span longer distances in the primary structure. In addition, there is one disulfide bridge (Cys429-Cys552) connecting the N-terminal with the C-terminal half of the protein, thus implying strong restraints on the overall fold of the protein.
The structural restraints implied by the disulfide connectivity pattern are not sufficient to clearly demonstrate a particular fold of the HCV E2 ectodomain, thus a number of different overall protein folds would be conceivable. All of these need to combine the disulfide connectivity pattern determined in this study with the results of the secondary structure analysis of HCV E2 as well as a previously suggested composite CD81 binding platform [14-17]. One possible fold of HCV E2 predominantly consisting of beta sheets, which possess the determined disulfide pattern and contains a composite CD81 binding site is shown in
However, the present experimental observation by circular dichroism and Fourier-Transform Infra-Red spectroscopy that HCV E2 consists mainly of beta sheets, could also be interpreted to further support the hypothesis that its fold is belongs to the class II fold observed in the related flaviviruses. The hall mark of the latter is the presence of a central domain (“domain I”) folded as a beta barrel with up-and-down topology, flanked on each side by two other beta-rich domains [60]. Two long loops (insertions) connecting sequential strands in domain one (D0-E0 and H0-I0 loops) form the complex “domain II”, with the fusion loop at the tip; domain III, which has an Ig-like fold, is connected via a flexible linker to the opposite side of domain I, following strand I0.
The disulfide connectivity of HCV E2 is consistent with such a fold only if the inventors assume that the second loop forming domain II (H0-I0 loop) is absent, or is very short. There are a number of possibilities to arrange the polypeptide chain to conform to both, a class II fold and the constraints introduced by the disulfide bonding pattern. The inventors present in
In flaviviruses, domain I contains an N-terminal extension, which includes strand A0 (absent in the alphaviruses). In HCV E2, the N-terminal region is variable (termed “hypervariable region 1” (HVR1)) and has been shown not to be essential for the folding of E2. Deletions mutants in which the HVR1 is absent appear to fold correctly and to be infectious in HCVpp assays [61], suggesting that this region is also an N-terminal extension of domain I. The HVR1 extends to about residue 410 [62], indicating that the N-terminus of domain I would begin roughly at this position. Besides the HVR1 two more variable regions are described in HCV E2, the “hypervariable region 2” (HVR2), located within Cys459 and Cys486 (residues 474-482, [63]), and the intergenotypic variable region (IgVR) located within Cys 569 and Cys 581 [64]. A core domain of HCV E2 has been described lacking all three variable regions [64] based on the assumption that the cysteines flanking HVR2 and IgVR are disulfide bonded and the protein would attain its global fold in spite of the deletion of these loops, as in the case of HIV gp120 [56]. However, although in contrast to the surface loops in gp120 the cysteines flanking HVR2 and IgVR are not disulfide bonded, as assumed by the authors, such a deletion mutant still appears to fold and retains a functional conformation as attested by the binding of CD81 or antibodies recognizing conformational epitopes [64]. This observation suggests that the region containing the HVR2, representing a part of domain II (in yellow in
The third variable region, the IgVR represents a variable linker between domain I and domain III, which displays a polymorphism of linker length in different HCV strains and isolates, thus suggesting that removal of this region would not impair overall protein fold or function. In class II fusion proteins, this linker region between domains I and III, becomes stretched after the conformational change to allow DII to reach a position on the side of the post-fusion class II homotrimer.
In the prefusion form of flavi- and alphaviruses envelope proteins one side of each domain I and domain III are juxtaposed [51, 60]. The presented model of HCV E2 contains two of the three discontinuous regions of the polypeptide forming the CD81 binding interface [14-17] located in domain I, the third one in domain III, the three of them forming a binding site crossing the interface between these two domains.
The mechanism of membrane fusion induced by the HCV glycoproteins and the identity of the fusion loop has been subject of several studies spanning both glycoproteins E1 and E2 [28,29,65]. In the flavi- and alphaviruses the fusion loop is located in domain II. The determined disulfide connectivity pattern provides little information about this region, containing only three disulfide bridges formed by consecutive cysteines. Two regions have been proposed previously in E2 to act as fusion determinants (416-430 and 600-620) [29]. Due to their location within the receptor binding interface of the E2 molecule, which is likely an exposed platform in order to facilitate binding to CD81, both of these regions have been placed in domain I and domain III, respectively, suggesting that the identity of the fusion loop remains to be elucidated. The structural information available to date, in particular with respect to domain II, does not allow its identification, but will be highly useful to design further experiments.
The N-glycosylation sites in E2 are located exclusively on the outward-facing surface of the molecule or in the connecting regions between the beta strands thus allowing the placement of E2 flat onto the viral membrane as shown for flavivirus E protein [66]. It has been shown previously that mutation of single N-glycosylation sites reduces entry of HCVpp without affecting the overall fold of HCV E2 by modulating binding to CD81 [32, 67], indicating a close proximity between these N-linked carbohydrates and the CD81 binding site. The N-glycosylation sites N1 (N417) and N6 (N532) are reported to increase the affinity to CD81 and N11 have been shown to modulate entry of HCVpp [67, 68]. All of these N-glycosylation sites are located close to the proposed CD81 binding site, thus explaining the modified HCVpp entry.
The combination of MS and N-terminal sequencing of peptide fragments has allowed the inventors to assign all disulfide bonds in HCV E2. Based on this knowledge and the secondary structure analysis of the HCV E2 ectodomain the inventors propose an overall fold that is consistant with HCV E2 being a class II viral fusion protein. This model enlarges understanding of receptor binding as well as glycoprotein interactions and other putative functions of HCV E2 during viral entry and will be highly useful for design of new experiments. One important implication is that the residues interacting with CD81 are found in two domains, domain I and domain III, which have to move apart during the fusogenic conformational change. This suggest that CD81 may have to dissociate away for such a conformational change to take place, or on the contrary, that its binding may help to lower the energy barrier for the conformational change to occur upon exposure to low pH in the endosomes.
Peptides resulting from tryptic digestion of soluble E2 from three different isolates were identified by N-terminal sequencing and mass spectrometry (n.f.—not found in mass spectrometry). Peptides were named according to the isolate and numbered sequentially according to their appearance in the full ectodomain amino acid sequence. Molecular mass predictions were performed using MS-BRIDGE [40] for disulfide-connected peptides and using PeptideMass for reduced peptides [41].
Number | Date | Country | Kind |
---|---|---|---|
2658714 | Mar 2009 | CA | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2010/051158 | 3/17/2010 | WO | 00 | 11/23/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/106509 | 9/23/2010 | WO | A |
Number | Date | Country |
---|---|---|
WO 2006039326 | Apr 2006 | WO |
WO 2008022401 | Feb 2008 | WO |
2008108918 | Sep 2008 | WO |
Entry |
---|
Rudinger, Peptide Hormones, JA Parsons, Ed., 1976, pp. 1-7. |
SIGMA, 2004, pp. 1-2. |
Berendsen, A Glimpae of the Holy Grail?, Science, 1998, 282, pp. 642-643. |
Voet et al, Biochemistry, John Wiley & Sons Inc., 1995, pp. 235-241. |
Ngo et al, Computational Complexity, Protein Structure Protection, and the Levinthal Paradox, 1994, pp. 491-494. |
Bradley et al., Limits of Cooperativity in a Structurally Modular Protein: Response of the Notch Ankyrin Domain to Analogous Alanine Substitutions in Each Repeat, J. Mol. BIoL (2002) 324, 373-386. |
McCaffrey et al., “Expression and Characterization of a Minimal Hepatitis C Virus Glycoprotein E2 Core Domain That Retains CD81 Binding,” Journal of Virology, vol. 81, No. 17, pp. 9584-9590, Sep. 2007. |
Drummer et al., “A Conserved Gly436-Trp-Leu-Ala-Gly-Leu-Phe-try Motif in Hepatitis C Virus Glycoprotein E2 is a Determinant of CD81 Bniding and Viral Entry,” Journal of Virology, vol. 80, No. 16, pp. 7844-7853, Aug. 2006. |
Flint et al., “Characterization of Hepatitis C Virus E2 Glycoprotein Interaction with a Putative Cellular Receptor, CD81,” Journal of Virology, vol. 72, No. 8, pp. 6235-6244, Aug. 1999. |
Grollo et al., “Exploiting information inherent in binding sites of virus-specific antibodies: design of an HCV vaccine candidate cross-reactive with multiple gentotypes,” Antiviral Therapy, vol. 11, pp. 1005-1014, Jan. 2006. |
Owsianka et al., “Monoclonal Antibody AP33 Defines a Broadly Neutralizing Epitope on the Hepatitis C Virus E2 Envelope Glycoprotein,” Journal of Virology, vol. 79, No. 17, pp. 11095-11104, Sep. 2005. |
Krey et al., “The Disulfide Bonds in Glycoprotein E2 of Hepatitis C Virus Reveal the Tertiary Organization of the Molecule,” Plos Pathogens, vol. 6, No. 2, p. e1000762, Feb. 2010. |
Whidby et al., “Blocking Hepatitis C Virus Infection with Recombinant Form of Envelope Protein 2 Ectodomain,” Journal of Virology, vol. 83, No. 21, pp. 11078-11089, Nov. 2009. |
Fenouillet et al., “Contribution of Redox Status to Hepatitis C Virus E2 Envelope Protein Function and Antigenicity,” The Journal of Biological Chemistry, vol. 283, No. 39, pp. 26340-26348, Sep. 2008. |
International Search Report issued in application No. PCT/IB2010/051158 on Nov. 10, 2010. |
Number | Date | Country | |
---|---|---|---|
20120088718 A1 | Apr 2012 | US |