The present invention concerns wild-strains of Chikungunya virus isolated from patients exhibiting severe forms of infection and stemming from a human arbovirosis epidemy. The present invention also concerns polypeptide sequences and fragment thereof derived from their genome, the polynucleotide encoding same and their use as diagnostic products, as vaccine and/or as immunogenic compositions.
Chikungunya virus (CHIKV) is a mosquito-transmitted Alphavirus belonging to family Togaviridae [1,2]. It was isolated for the first time from a Tanzanian outbreak in 1952 [3]. It is responsible for an acute infection of abrupt onset, characterized by high fever, arthralgia, myalgia, headache and rash [4,5]. Poly-arthralgia, the pathognomonic sign of the disease, is very painful. Symptoms are generally self-limiting and last 1 to 10 days. However, arthralgia or arthritic symptoms may persist for months or years. In some patients, minor hemorrhagic signs such as epistaxis or gingivorrhagia have also been described.
CHIKV is geographically distributed in Africa, India and South East Asia. In Africa, the virus is maintained through a sylvatic transmission cycle between wild primates and mosquitoes such as Aedes luteocephalus, Ae. furcifer or Ae. taylori [4]. In Asia, CHIKV is mainly transmitted from human to human by Ae. aegypti and to a lesser extent by Ae. albopictus through an urban transmission cycle. Since the 1952 Tanzania outbreak, CHIKV has caused outbreaks in East Africa (Tanzania, Uganda), in Austral Africa (Zimbabwe, South Africa), in West Africa (Senegal, Nigeria) and in Central Africa (Central African Republic, Democratic Republic of the Congo) [4]. The most recent epidemic re-emergence was documented in 1999-2000 in Kinshasa, where an estimated 50,000 persons were infected [6]. Since the first documented Asian outbreak in 1958 in Bangkok, Thailand, outbreaks have been documented in Thailand, Cambodia, Vietnam, Laos, Myanmar, Malasia, Philippines and Indonesia [4,5]. The most recent epidemic re-emergence was documented in 2001-2003 in Java after 20 years [7]. Either in Africa or Asia, the re-emergence was unpredictable, with intervals of 7-8 years to 20 years between consecutive epidemics.
Since the end of 2004, Chikungunya virus (CHIKV) has emerged in the islands of the south-western Indian Ocean. Between January and March 2005, more than 5,000 cases were reported in Comoros. Later in 2005, the virus has circulated in the other islands, i.e Mayotte, Seychelles, Réunion and Mauritius. Starting in December 2005, the rainy season gave rise to a renewed epidemic circulation of the virus. Between January 1st and Mar. 1, 2006, 2,553, 3,471, and 4,650 cases have been reported in Mauritius, Mayotte and Seychelles (Mar. 12, 2006). The most affected island is Reunion with an estimated 212,000 cases until Mar. 12, 2006 (total population: 770,000). More recently, circulation of the virus has been documented in Madagascar.
In Reunion Island, the first documented cases were patients coming 1 ng back from Comoros in March 2005. More than 3,000 cases were reported from March to June. The transmission was limited during the winter season of the southern hemisphere and a major upsurge has been observed since mid-December, with an estimated 210,000 cases between January and March 2006 [8]. Since March 2005, 85 patients with a confirmed CHIKV infection have developed severe clinical signs (meningoencephalitis or fulminant hepatitis) which justified hospitalization in an intensive care unit. Several cases of meningo-encephalitis and major algic syndrome have been associated with vertical transmission of the virus 9.
To date, two CHIKV complete nucleotide sequences have been determined, for the strains Ross (accession no: AF490259) and S27 [9], both isolated from patients during the 1952 Tanzania outbreak. Another complete nucleotide sequence has been determined for a strain isolated in Ae. furcifer during the Senegal 1983 outbreak (accession no AY726732). Khan and coworkers [9] showed that the S27 genome was similar in its structure to that of other alphaviruses and that O'nyong-nyong virus (ONN) was the closest relative to CHIKV. In addition, phylogenetic analyses based on partial E1 sequences from African and Asian isolates revealed the existence of three distinct CHIKV phylogroups, one containing all isolates from West Africa, one containing isolates from Asia, and one corresponding to Eastern, Central and Southern African isolates [10]. Strains isolated in 1999-2000 in the Democratic Republic of the Congo belonged to the latter phylogroup [6].
An aspect of the invention is to provide new diagnostic and immunologic tools against CHIK virus associated diseases, such as arbovirosis.
Such an aspect is particularly achieved by providing an isolated and purified wild strain of Chikungunya virus (CHIK) capable of in vitro infecting human cells; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
Another aspect of the invention concerns an isolated and purified strain of CHIKV comprising at least one mutation in structural protein E1 and/or structural protein E2; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
Another aspect of the invention concerns an isolated and purified polynucleotide comprising all or part of the sequence of SEQ ID NOS: 1, 2, 3, 4, 5 or 6; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
Another aspect of the invention concerns a fragment of the polynucleotide of the invention wherein it codes for the ectodomain of glycoprotein E2 or E1; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
Other aspects of the invention concern a vector or plasmid comprising a polynucleotide or fragment contemplated by the present invention, and host cell comprising said vector or plasmid; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
Yet another aspect of the invention concerns a purified polypeptide encoded by a polynucleotide or fragment of the invention; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
A further aspect of the invention concerns a monoclonal or polyclonal antibody or fragment thereof that specifically binds to a polypeptide of the invention; and its use for the detection of a CHIKV associated to an arbovirus, or for the preparation of a composition that prevents and/or treats an arbovirus.
A) Ribbon diagram of E1, with domain I colored red, domain II yellow and domain III, blue. Green tubes mark the disulfide bonds. The fusion peptide, at the tip of the molecule (in domain II) is colored orange and labeled. The N-terminus and the C-terminus observed in the crystal (which is 30 aa upstream of the transmembrane region) are also labeled. The 2 unique changes observed in the Indian Ocean isolates are indicated by stars and labeled: positions 226 (white) and 284 (magenta).
B) Partial representation (one octant, slightly extended) of the icosahedral E1 scaffold at the surface of the virion, viewed down a 5-fold symmetry axis. One E1 protomer is highlighted in colors, as in A); all the others are represented in grey. The location of some of the icosahedral symmetry axes are drawn as solid black symbols: pentagon for 5-fold axis, triangle for 3-fold axes, ellipse for 2-fold axes (which in the T=4 lattice of alphaviruses are coincident with quasi 6-fold axes). Open triangles indicate roughly the location of the E2 trimers that interact tightly with E1, covering domain II and the fusion peptide, and presenting the main antigenic sites. The open triangles mark also quasi 3-fold symmetry axes of the T=4 surface icosahedral lattice. A magenta ball marks the location of Glu 284, at an inter-E1 protomer contact site. This contact is propagated 240 times at the surface lattice (note all pink balls drawn on the grey protomers). Note that the fusion peptide, in orange, is pointing up and away from contacts with other E1 protomers. This is more easily seen at the periphery of the virion, where one of them is labeled (FP). In the virion, this region of E1 is not accessible, covered underneath the E2 molecule [19].
A. Alignment of Repeat Sequence Elements found in the 3′NTR region of chikungunya virus genome. All sequences form conserved and stable stem-loop structures in which the less conserved nucleotides around position 20 constitute the loop. Three RSE are found in all chikungunya genomes. The first one (RSE1) is inserted before the internal poly-A sequence of S27 genome [9], whereas the two others are found downstream this motif.
B. Predicted secondary structure for RSE1 of isolate 05-115.
Mosquito AP61 cells in 24-well plates were infected with CHIK virus stocks grown on mosquito cells (virus titers 2−5×10%8 FFU. mL-1) at 0.0001 (top well) or 0.00001 (bottow well) multiplicity of infection. Infected cells were overlaid with CMC in Leibovitz L15 growth medium with 2% FBS for 2 days to allow focus development at 28° C. The cells were fixed with 3% PFA in PBS, permeabilized with Triton X-100 in PBS, and foci of CHIK virus replication were immunostained with mouse anti-CHIK HMAF (dilution 1:2,000) and peroxidase-conjugated goat anti-mouse Ig (dilution 1:100).
In the present study, the inventors determined the nearly complete nucleotide sequences of viruses isolated from six patients originating from Reunion and Seychelles Islands. The present invention allows to determine the genome structure as well as the unique molecular features of the Indian Ocean outbreak isolates, which distinguish them from other reported CHIKV and alphavirus sequences.
As one in the art may appreciate, the originality of the present invention is the identification of novel strains of the Chikungunya (CHIK) virus which are distinguished from CHIK virus of the prior art, and the use of these CHIK strains and the polypeptides and the polynucleotides encoding same derived from their genome in the diagnostic, prevention and/or treatment of arbovirosis.
According to a first aspect, the present invention concerns an isolated and purified wild strain of chikungunya virus (CHIKV) capable of in vitro infecting human cells. Preferably, the present invention concerns a wild strain of CHIK virus which exhibits the same characteristics than those selected from the group consisting of the isolates 05.115, 05.61, 05.209, 06.21, 06.27 and 06.49. According to a preferred embodiment, the strains that are within the scope of the present invention are characterized in that their genome comprises at least one mutation when compared to the sequence of the genome of the CHIK virus strain S-27 (GenBank AF339485). Also within the scope of the invention, is any strain grown or obtained by cell culture from a sample of a preferred CHIK strain of the invention. The genome of the preferred strains according to the present invention comprises a sequence as shown in
According to another aspect, the present invention provides an isolated and purified strain of chikungunya virus (CHIKV) comprising at least one mutation in structural protein E1 and/or in structural protein E2, and more particularly in their ectodomain region. According to a preferred embodiment, the strain of the invention is characterized by the fact that its genome comprises at least one mutation in the E2 protein at a position homologous to amino acid position 382, 399, 404, 485, 489, 506, 536, 624, 637, 669, 700 or 711 of SEQ ID NO: 23 (
As use herein, the expression “at a position homologous to an amino acid position” of a protein, refers to amino acid positions that are determined to correspond to one another based on sequence and/or structural alignments with a specified reference protein. For instance, in a position corresponding to an amino acid position of a CHIK virus structural protein set forth as SEQ ID NO: 1 can be determined empirically by aligning the sequences of amino acids set forth in SEQ ID NO: 1 with a particular CHIK virus structural protein. Homologous or corresponding positions can be determined by such alignment by one of skill in the art using manual alignments or by using the numerous alignment programs available (for example, BLASTP). Homologous or corresponding positions also can be based on structural alignment, for example by using computers simulated alignments of protein structure. Recitation that amino acids of a polypeptide correspond to amino acids in a disclosed sequence refers to amino acids identified upon alignment of the polypeptide with the disclosed sequence to maximize identity or homology (where conserved amino acids are aligned) using a standard algorithm, such as the GAP algorithm. As used herein, “at a position homologous to” refers to a position of interest (i.e., base number or residue number) in a nucleic acid molecule or protein relative to the position in another reference nucleic acid molecule or protein. The position of interest to the position in another reference protein can be in, for example, an amino acid sequence from the same protein of another CHIK strain. Homologous positions can be determined by comparing and aligning sequences to maximize the number of matching nucleotides or residues, for instance, such that identity between the sequences is greater than 95%, preferably greater than 96%, more preferably greater than 97%, even more preferably greater than 98% and most preferably greater than 99%. The position of interest is then given the number assigned in the reference nucleic acid molecule.
Another aspect of the invention concerns an isolated and purified polynucleotide comprising all or part of the sequence as shown in
Another aspect of the invention concerns a fragment of the polynucleotide of the invention characterized by the fact that it codes for the glycoprotein E1 or E2, and more preferably for their ectodomain region. Advantageously, the fragment of the invention when coding for the E2 ectodomain, comprises, or more preferably, consists of a nucleotide sequence as shown in
Yet another aspect of the invention concerns a fragment of the polynucleotide of the invention characterized by the fact that it codes for a soluble form of glycoprotein E2. According to a preferred embodiment, the soluble fragment of glycoprotein E2 comprises or more preferably consists of a nucleotide sequence as shown in
As one skilled in the art may appreciate, a fragment as contemplated by the present invention may be obtained by:
According to another aspect, the present invention is concerned with an isolated and purified polypeptide encoded by a polynucleotide or by a fragment of the invention. As used herein, the terms “polypeptide” and “protein” are used interchangeably to denote an amino acid polymer or a set of two or more interacting or bound amino acid polymers.
By “isolated” is meant, when referring to a polypeptide, that the indicated molecule is separate and discrete from the whole organism with which the molecule is found in nature or is present in the substantial absence of other biological macro-molecules of the same type. The term “isolated” with respect to a polynucleotide is a nucleic acid molecule devoid, in whole or part, of sequences normally associated with it in nature; or a sequence, as it exists in nature, but having heterologous sequences in association therewith; or a molecule disassociated from the chromosome.
Broadly defined, the terms “purified polypeptide” or “purified polynucleotide” refer to polypeptides or polynucleotides that are sufficiently free of other proteins or polynucleotides, or carbohydrates, and lipids with which they are naturally associated. The polypeptide or polynucleotide may be purified by any process by which the protein or polynucleotide is separated from other elements or compounds on the basis for instance, of charge, molecular size, or binding affinity.
The preferred peptides of the invention comprise at least one amino acid substitution compared with the amino acid sequence of strain S-27 (GenBank AF339485) and are derived from the sequence of a protein coded by a fragment of the invention. Preferably, a purified polypeptide of the invention comprises all or part of the amino acid sequence of a CHIK virus ORF 1 or 2 contemplated by the present invention such as one defined in any one of SEQ ID NOS 24 to 29 (ORF 2) or of SEQ ID NOS 30 to 34 and 78 (ORF 1). More preferably, a purified polypeptide of the invention comprises all or part of the amino acid sequence of a glycoprotein E2 contemplated by the present invention such as one defined in any one of SEQ ID NOS 15 to 18 (
The present invention is also concerned with a vector comprising a polynucleotide of the invention or a fragment of a polynucleotide of the invention. As used herein, the term “vector” refers to a polynucleotide construct designed for transduction/transfection of one or more cell types. Vectors may be, for example, “cloning vectors” which are designed for isolation, propagation and replication of inserted nucleotides, “expression vectors” which are designed for expression of a nucleotide sequence in a host cell, or a “viral vector” which is designed to result in the production of a recombinant virus or virus-like particle, or “shuttle vectors”, which comprise the attributes of more than one type of vector. Preferred vector are those deposited at the CNCM (Collection Nationale de Cultures de Microorganismes), 28 rue du Docteur Roux, 75724 PARIS Cedex 15, France, on Mar. 15, 2006 under accession numbers I-3587, I-3588, I-3589 and I-3590.
Another preferred vector contemplated by the present invention is the plasmid called TRIP-CHIK.sE2 which has been deposited at the CNCM (Collection Nationale de Cultures de Microorganismes), 28 rue du Docteur Roux, 75724 PARIS Cedex 15, France, on Mar. 14, 2007, under accession number I-3733. Such a vector comprises a fragment which codes for a soluble form of the glycoprotein E2 of the invention. This preferred vector has been optimised for efficient production of the recombinant E2 protein into mammalian cells. As used herein, the term “optimised” means that the vector incorporates regulation sequences, such as a signal peptide sequence, in order to provide adequate expression of the desired encoded protein.
In a related aspect, the present invention provides a host cell comprising a vector as defined above. The term “host cell” refers to a cell that has a new combination of nucleic acid segments that are not covalently linked to each other in nature. A new combination of nucleic acid segments can be introduced into an organism using a wide array of nucleic acid manipulation techniques available to those skilled in the art. A host cell can be a single eukaryotic cell, or a single prokaryotic cell, or a mammalian cell. The host cell can harbor a vector that is extragenomic. An extragenomic nucleic acid vector does not insert into the cell's genome. A host cell can further harbor a vector or a portion thereof that is intragenomic. The term intragenomic defines a nucleic acid construct incorporated within the host cell's genome. A preferred host cell of the invention E. coli such as the one containing a vector of the invention and deposited at the CNCM (Collection Nationale de Cultures de Microorganismes), 28 rue du Docteur Roux, 75724 PARIS Cedex 15, France, on Mar. 15, 2006 under accession numbers I-3587, I-3588, I-3589 and I-3590 and on Mar. 14, 2007 under accession number I-3733.
The present invention is further concerned with a monoclonal antibody or polyclonal antibodies, or fragments thereof, that specifically bind to a polypeptide of the invention. As used herein, the term “specifically binds to” refers to antibodies that bind with a relatively high affinity to one or more epitopes of a protein of the invention, but which do not substantially recognize and bind to molecules other than the one(s) of interest. As used herein, the term “relatively high affinity” means a binding affinity between the antibody and the protein of interest of at least 10−6 M, and preferably of at least about 10−7 M and even more preferably 10−8 M to 10−10 M. Determination of such affinity is preferably conducted under standard competitive binding immunoassay conditions which is common knowledge to one skilled in the art.
As used herein, the term “antibody” refers to a glycoprotein produced by lymphoid cells in response to a stimulation with an immunogen. Antibodies possess the ability to react in vitro and in vivo specifically and selectively with an antigenic determinant or epitope eliciting their production or with an antigenic determinant closely related to the homologous antigen. The term “antibody” is meant to encompass constructions using the binding (variable) region of such an antibody, and other antibody modifications. Thus, an antibody useful in the method of the invention may comprise a whole antibody, an antibody fragment, a polyfunctional antibody aggregate, or in general a substance comprising one or more specific binding sites from an antibody. The antibody fragment may be a fragment such as an Fv, Fab or F(ab′)2 fragment or a derivative thereof, such as a single chain Fv fragment. The antibody or antibody fragment may be non-recombinant, recombinant or humanized. The antibody may be of an immunoglobulin isotype, e.g., IgG, IgM, and so forth. In addition, an aggregate, polymer, derivative and conjugate of an immunoglobulin or a fragment thereof can be used where appropriate.
Another aspect of the invention is the use of an element selected from the group consisting of a strain, a polynucleotide, a fragment, a vector, a host cell, a polypeptide and an antibody of the invention for either the detection of a CHIKV associated to an arbovirosis, or for the preparation of a composition that prevents and/or treats an arbovirosis.
Another aspect of the present invention relates to a composition for treating and/or preventing an arbovirosis. The composition of the present invention advantageously comprises at least one element selected from the group consisting of a strain, a polynucleotide, a fragment, a vector, a host cell, a polypeptide and an antibody of the invention. The composition of the invention may further comprise an acceptable carrier. In a related aspect, the invention provides a method for treating and/or preventing an arbovirosis. The method comprises the step of administering to a subject in need thereof a composition of the invention.
As used herein, the term “treating” refers to a process by which the development of an infection from a CHIKV is affected or completely eliminated. As used herein, the term “preventing” refers to a process by which the CHIKV infection is obstructed or delayed.
As used herein, the expression “an acceptable carrier” means a vehicle for containing the components (or elements) of the composition of the invention that can be administered to a animal host without adverse effects. Suitable carriers known in the art include, but are not limited to, gold particles, sterile water, saline, glucose, dextrose, or buffered solutions. Carriers may include auxiliary agents including, but not limited to, diluents, stabilizers (i.e., sugars and amino acids), preservatives, wetting agents, emulsifying agents, pH buffering agents, viscosity enhancing additives, colors and the like.
The amount of components of the composition of the invention is preferably a therapeutically effective amount. A therapeutically effective amount of components of the composition of the invention is the amount necessary to allow the same to perform their preventing and/or treating role against a CHIKV infection without causing overly negative effects in the host to which the composition is administered. The exact amount of components to be used and the composition to be administered will vary according to factors such as the mode of administration, as well as the other ingredients in the composition.
The composition of the invention may be given to a host (such as a human) through various routes of administration. For instance, the composition may be administered in the form of sterile injectable preparations, such as sterile injectable aqueous or oleaginous suspensions. These suspensions may be formulated according to techniques known in the art using suitable dispersing or wetting agents and suspending agents. The sterile injectable preparations may also be sterile injectable solutions or suspensions in non-toxic parenterally-acceptable diluents or solvents. They may be given parenterally, for example intravenously, intramuscularly or sub-cutaneously by injection, by infusion or per os. Suitable dosages will vary, depending upon factors such as the amount of each of the components in the composition, the desired effect (short or long term), the route of administration, the age and the weight of the host to be treated. Any other methods well known in the art may be used for administering the composition of the invention.
Yet another aspect of the invention is the use of a composition as defined hereabove for the preparation of a medicament for treating and/or preventing an arbovirosis in a subject in need thereof.
Yet another aspect of the invention is to provide a kit for the detection of a CHIKV associated to an arbovirosis, comprising at least one element selected from the group consisting of a strain, a polynucleotide, a fragment, a vector, a host cell, a polypeptide and an antibody of the invention. Kits according to this embodiment of the invention may comprise packages, each containing one or more of the above mentioned elements (typically in concentrated form) which are required to perform the respective diagnostic tests.
The examples here below will highlight other characteristics and advantages of the present invention, and will serve to illustrate the scope of the use of the present invention and not to limit its scope. Modifications and variations may be made without departing from the spirit and the scope of the invention. Although it is possible to use other methods or products equivalent to those that are found here below to test or to realize the present invention, the preferred material and methods are described.
The inventors (as sometimes referred therein as “we”) report the nearly complete genome sequence of six selected clinical isolates, along with partial sequences of glycoprotein E1 from a total of 60 patients from Reunion, Seychelles, Mauritius, Madagascar and Mayotte Islands. The present results indicate that the outbreak was initiated by a strain related to East-African isolates, from which viral variants have evolved following a traceable microevolution history. Unique molecular features of the outbreak isolates were identified. Notably, in the region coding for the non-structural proteins, ten amino acid changes were found, three of which being located in alphavirus conserved positions of nsP2 (which contains helicase, protease and RNA triphosphatase activities) and of the polymerase nsP4. The sole isolate obtained from the cerebrospinal fluid of a patient showed unique changes in nsP1 (T301I), nsP2 (Y642N) and nsP3 (E460 deletion). In the structural protein region, two noteworthy changes (A226V and D284E) were observed in the membrane fusion glycoprotein E1. Homology 3D modelling allowed mapping of these two changes to regions that are important for virion assembly and for membrane fusion. Change E1-A226V was absent in the initial strains but was observed in >85% of subsequent viral sequences from Reunion, denoting evolutionary success possibly due to adaptation to the mosquito vector.
Material and Methods
Patients.
The 60 patients for whom partial or complete CHIKV nucleotide sequences were determined originated from Reunion (N=43), Seychelles (N=3), Madagascar (N=7), Mayotte (N=4) and Mauritius (N=3). Characteristics of the patients and biological samples are listed in Table 1.
Virus Isolation and RNA Extraction.
Viruses were isolated either from serum or cerebrospinal fluid (CSF) (Table 1). Briefly, C6-36 Aedes albopictus cells were inoculated with 1 ml of serum or CSF diluted 1:10 in L15 medium (Gibco). The cells were grown at 28° C. in L15 supplemented with 5% foetal bovine serum and 10% tryptose-phosphate. Cells and supernatants were harvested after the first passage (5 days) and the second passage (7 days). The virus isolates were identified as CHIKV by indirect immunofluorescence, using CHIKV hyper immune ascitic fluid. In the case of isolates 05.115, 06.21, 06.27 and 06.49 whose genomes were sequenced, absence of yellow fever, dengue and West Nile viruses was confirmed by indirect immunofluorescence using specific sera. RNA was extracted using the QIAAmp Viral Minikit (Qiagen, France).
Nucleotide Sequencing.
Primers (Table 4) were designed based on the nucleotide sequence 20 of the S27 strain. RT-PCR was performed using the Titan One Tube RT-PCR kit (Roche, France). RT-PCR fragments were purified by ultrafiltration prior to sequencing (Millipore, France). Sequencing reactions were performed using the BigDye Terminator v1.1 cycle sequencing kit (Applied Biosystems, USA) and purified by ethanol precipitation. Sequence chromatograms were obtained on automated sequence analysers ABI3100 or ABI3700 (Applied Biosystems). All amplicons were sequenced on both strands.
Assembly of Genome Sequences and Sequence Analysis.
Contig assembly was performed independently by distinct operators and software, using either BioNumerics version 4.5 (Applied-Maths, Sint-Martens-Latem, Belgium) or PhredPhrap/Consed [11]. Both analyses yielded exactly the same consensus sequence for all strains. A single contig of 11,601 nt was obtained for five isolates, whereas for strain 05.61, a sequence portion was missing, between S27 positions 5,246 to 5,649 (positions 390 to 524 of nsP3). Sequence alignments and computation of substitution tables were performed using programs BioNumerics, DNASP version 4.10 [12] and DAMBE version 4.2.13 [13]. Alignments of nucleotide and amino acid sequences against selected alphavirus sequences were performed with the ClustalW1.7 software [14]. Sequence identities were computed with the Phylip package [15]. RNA secondary structure was predicted with the Vienna RNA secondary structure server [16]. Neighbor-joining trees were constructed using MEGA version 3.1 [17] with the Kimura-2 parameter corrections of multiple substitutions. Reliability of nodes was assessed by bootstrap resampling with 1,000 replicates. Amounts of synonymous substitutions per synonymous site (Ks) and of non synonymous substitutions per non synonymous site (Ka) were estimated using DNASP. RDP2 [18] was used to detect putative mosaic sequences.
3D Structure Modeling.
The crystallographic structure of the ectodomain of the glycoprotein E1 of Semliki Forest Virus (SFV) at neutral pH [19]; Protein Data Bank code 2ALA) was used as a template to model and analyze the two amino acid mutations of the Indian Ocean isolates.
Detection of Viral Foci by Immunological Staining.
Aedes pseudoscutellaris AP61 cells were grown in a 24-well tissue culture plates in Leibovitz L-15 growth medium with 10% heat inactivated fetal calf serum (FCS) for 24 h. Mosquito cell monolayers were washed once with Leibovitz L-15 and 0.2 ml Leibovitz L-152% FCS were added. Cells were infected with CHIK virus in 0.2 ml of Leibovitz L-152% FCS and incubated at 28° C. for 1 h. Overlay medium consisting of 0.4 ml of Leibovitz L-152% FBS and carboxymethylcellulose (CMC) (1.6%) was then added and the tissue culture plates were incubated at 28° C. for 2 days. Foci of infected cells were visualized by focus immunoassay (FIA). The cells were washed with PBS, fixed with 3% paraformaldehyde (PFA) in PBS for 20 min, and permeabilized with 0.5% Triton X-100 in PBS for 4 min at room temperature. The fixed cells were incubated for 20 min at 37° C. with 1:2,000 dilution of hyperimmune mouse ascitic fluid (HMAF) directed against CHIKV. Goat anti-mouse IgG, horseradish peroxidase conjugated was used as the second antibody (1:100 dilution) at 37° C. for 20 min. Foci were visualized with DAB. Peroxidase Substrate (Sigma).
1. Genome structure and molecular signatures of the Indian Ocean outbreak chikungunya viruses
Genome Organization.
We determined the nearly complete genome sequences of six CHIKV isolates (05.115, 05.61, 05.209, 06.21, 06.27 and 06.49) representing distinct geographic origins, time points and clinical forms (Table 1) of the Indian Ocean outbreak of chikungunya virus. 11,601 nucleotides were determined, corresponding to positions 52 (5′NTR) to 11,667 (3′NTR, end of third Repeat Sequence Element) in the nucleotide sequence of the 1952 Tanzanian isolate S27 (total length 11,826 nt). There were three insertion/deletion events between S27 and Réunion isolates, two of which were observed in the 3′NTR. First, the internal poly-A stretch of 14 nucleotides observed in S27 (11,440-11,443) and corresponding to a probable internal poly-A site [9] was replaced by a stretch of only 5 A in Indian Ocean isolates, similar to what was observed in other chikungunya viruses, e.g. the Ross strain (accession no.: AF490259). Second, one A was missing in Indian Ocean isolates in a 5-A stretch at S27 position 11,625. Finally, one codon was missing in isolate 06.27, corresponding to nsP3 codon 460, at which all other Indian Ocean isolates analyzed and available alphavirus sequences are GAA, coding for Glu.
The genome sequences of the six isolates presented therein was similar to those previously reported for alphaviruses [9, 21, 22]. Coding sequences consisted of two large open reading frames (ORF) of 7,422 nt and 3,744 nt encoding the non-structural polyprotein (2,474 amino-acids) and the structural polyprotein (1,248 amino-acids), respectively. The non structural polyprotein is the precursor of proteins nsP1 (535 aa), nsP2 (798 aa), nsP3 (530 aa) and nsP4 (611 aa), and the structural polyprotein is the precursor of proteins C (261 aa), p62 (487 aa, precursor to E3-64 aa- and E2-423 aa), 6K (61 aa), and E1 (439 aa). Cleavage sites characteristic of the alphavirus family in the non-structural and structural polyproteins were conserved. Glycosylation sites in E3, E2 and E1 were also conserved. A 65 nt junction sequence was identified between the stop codon (TAG, 7499-7501) of the non-structural ORF and the start codon (7567-7569) of the structural ORF. The 5′ non-translated region (5′NTR) ended at position 76. The 3′NTR region started at position 11,314 and contained three repeat sequence elements (RSE) with predicted secondary structures (
Differences Between Indian Ocean Outbreak Isolates and Strain S27.
Compared to strain S27, Reunion isolate 05.115 showed 28 aa changes (1.13%) in the non-structural proteins (Table 5, with the highest proportion in nsP3 (2.26%) and the lowest in nsP2 (0.6%). Ten out of 12 amino acid changes in nsP3 were concentrated between positions 326 and 524 (5.0% variation), similar to findings in ONN viruses [23]. One important difference with S27 was that the Indian Ocean isolates exhibited an opal stop codon (UGA) at nsP3 codon 524, instead of Arg (CGA) in S27. This opal codon was observed in related alphaviruses [9, 22, 23], and is believed to regulate the expression of nsP4, the putative RNA polymerase, by a read-through mechanism [21, 24].
Compared to S27, the structural proteins showed 21 (1.68%, for 05.115) to 22 (1.76%, for other isolates) amino-acid substitutions in Indian Ocean isolates (Table 6). Notably, envelope protein E2 showed the highest variation, with 14 (3.3%) aa changes, higher than envelope protein E1 (0.68%) and the capsid protein (0.38%). The ratio of rates of evolution of synonymous and non-synonymous sites (Ks/Ka) between S27 and 05.115 isolates was 11.0 for the whole polyprotein, whereas it was only 6.12 for protein E2, probably indicative of a positive selection in favor of amino-acid changes in this immunogenic protein. By comparison, Ks/Ka was 18.75 for the non-structural polyprotein.
Indian Ocean Outbreak Molecular Signatures in Non-Structural Proteins and Phenotypic Variation.
Ten positions (excluding polymorphic positions) had aa that were unique to the non-structural proteins of outbreak isolates, when compared to other CHIKV sequences (Table 2). First, nsP2-54 was Asn in Indian Ocean isolates and in SFV, but was Ser in all other sequences. Second, nsP2-374 was Tyr in Indian Ocean isolates, but was His or Asn in other alphavirus sequences (Table 2). Third, position 500 in nsP4 was Leu in the Indian Ocean sequences instead of Gln in the four other reported CHIKV sequences. Interestingly, this position, which is about 30 aa from the catalytic “GDD” motif, is a strictly conserved Glu in all other alphaviruses. The remaining seven changes took place in relatively variable regions.
Additional specific changes were observed in isolates 05.209 (S358P) and 06.27 (nsP1-T301I, nsP2-Y642N, and nsP3-460del). Notably, our phenotypic assays conducted in parallel showed differences for strain 06.27. Focus immunoassay showed that CHIKV stocks 05.115, 06.21, 06.27 and 06.49 formed mixtures of foci with different sizes on Ae. Albopictus C636 (data not shown) and Ae. pseudoscuterallis AP61 cells (
Indian Ocean Molecular Signatures in Structural Proteins and 3D Modelling.
When analyzing the aa sequences of the structural proteins, seven positions (four in E2, one in 6K and two in E1) were found to be unique to isolates from the Indian Ocean outbreak (Table 2). Two of these were located in the E2 ectodomain, with Thr 164 and Met 312 being identified in our isolates instead of Ala and Thr, respectively, in all other available CHIKV sequences (Table 2). The first of these two positions is variable in alphaviruses; it lies in a region defined previously as containing neutralizing epitopes [5, 25]. At position 312, Thr is present in other CHIKV, in ONNV and in SFV, but varies in other alphaviruses; it lies in a region identified as important for E1-E2 oligomerization [5, 25].
In E1, two crucial substitutions were observed, one at residue 284, specific to Indian Ocean isolates, and one at residue 284, present in 3 out of 6 Indian isolates (06.21, 06.27 and 06.49). Both mutations were mapped on the 3D structure (modeled from the crystal structure of SFV E1) in
The other unique aa observed in E1 from Indian Ocean isolates was Glu 284. This is a highly conserved position in E1, which displays an Asp in the majority of alphaviruses or an Asn in SIN (Table 2). This amino acid is located at the interface between E1 protomers at the surface of the virion, participating in contacts that make up the icosahedral E1 scaffold (
2. Phylogenetic Analysis
Previous work based on E1 protein sequences showed strong phylogeographic structure of the chikungunya virus species [6, 10]. In order to determine the progenitor phylogroup from which the Indian Ocean outbreak isolates emerged, we compared a 1,044 nt region within the E1 coding sequence (positions 271 to 1314, i.e., codons 91 to 438) from 63 biological specimens from 60 patients from Reunion, Seychelles, Madagascar, Mayotte and Comoros (Table 1) with 29 other available chikungunya sequences (Table 7). Phylogenetic analysis (
Comparison of the sequences of Indian Ocean outbreak isolates to the S27 sequence revealed 316 (2.7%) nucleotide substitutions in isolate 05.115 (Table 8). The Asian Glade Nagpur strain showed 5.1% average nucleotide divergence from 05.115, whereas the West-African Glade Senegal strain 37997 displayed 15% difference (Table 8). Interestingly, the latter strain showed complete conservation of an 87 nucleotides portion (9,958-10,045, at the junction between structural proteins 6K and E1) with East-African and Indian Ocean outbreak isolates. Sequence identity in this portion may reflect a past event of genetic recombination between West-African and East/Central-African strains. Differently, we did not find statistical support (P>7E-2) for sequence mosaicism or recombination since the split between S27 and Reunion isolates, although some genomic regions differed in their density of nucleotide polymorphisms.
3. Genotypic and Phenotypic Variation Among Indian Ocean Outbreak Isolates and Microevolutionary Scenario
Specific aa changes in the non-structural proteins were observed in the isolates 05209 (S358P) and 06.27 (nsP1-T3011, nsP2-Y642N, and nsP3-460del). In the structural proteins, change E1-A226V was observed in isolates 06.21, 06.27 and 06.49, and change E2-Q146R in the Seychelles isolate 05.209. In addition to these non-synonymous changes, there were 8 silent substitutions, observed in 05.209, 06.27 and 06.49 (Table 3).
A history of probable sequence evolution that occurred during the outbreak (
Since Reunion isolates had E1-226A at the beginning of the outbreak and E1-266V A at the beginning of the outbreak and E1-266V later in the epidemics, we compared residue 226 in 57 additional sequences (57 sequences from 54 sera and 3 CSF) from the Indian Ocean epidemic. Remarkably, the nature of E1-226 differed totally on Reunion Island before and after the winter season. Five sequences from patients sampled from March to June 2005 (including the sequence originating from a traveller back from Comoros) had E1-226A. Between September and end December 2005, 21 sequences showed E1-226V. Among 17 Reunion sequences from 2006, E1-226V was observed 12 times and E1-226A 5 times (Table 1). On Madagascar and Seychelles sequences, for which the samples were collected when the first clinical cases were suspected (i.e probably at the beginning of the outbreaks), only the E1-226 Ala was observed. On Mayotte 2006 sequences, only the E1-226 V was observed. On Mauritius 2006 sequences, both E1-226 Ala and Val were observed.
To date, only CHIKV laboratory strains, passaged many times on mosquito or mammalian cells, had been entirely sequenced [9]. We provide for the first time nearly complete nucleotide sequences of six clinical isolates passaged in-vitro only once or twice (see M&M section). The presence in infected patients of a mixed viral population, called quasispecies [31-33], with genotypes co-existing in an equilibrium governed by a balance between mutation and natural selection. The presence in S27 of an Arg codon instead of the opal stop codon in Indian Ocean isolates is probably explained by numerous in-vitro passages of S27, as evolution of opal to Arg was observed experimentally in ONN viruses [23]. Whereas it may be advantageous for viral quasispecies to maintain the opal codon in-vivo, an Arg codon probably confers a selective advantage in-vitro, as observed for the closely related Semliki Forest virus [34]. Chikungunya virus quasispecies situation in-vivo could also explain the nsP1-T3011 polymorphism observed for the LCR isolate 06.27. Indeed, it is likely that selection for a subset of genotypes harboring this change may be associated with invasion of the LCR [33]. These results underscore that the genome sequence of laboratory “reference” strains may not accurately reflect the natural situation, as the genotypic complexity of quasispecies in-vivo is subject to erosion by in-vitro selection. Since the Indian Ocean isolates sequenced here were subjected to in-vitro selection for only a few generations, they probably correspond more closely to the in-vivo genotypes than previously sequenced chikungunya strains.
The amino acid (aa) differences detected among the outbreak 1 isolates may relate to biological or pathogenic characteristics of the virus. Although our viral culture results are preliminary, they clearly show phenotypic differences between the unique isolate from CSF (06.27), isolated from a neonatal encephalopathy case, and three other isolates, associated with either the classical form of the disease or encephalopathy. The larger foci observed in culture with 06.27 could reflect a higher replication rate of the virus and be linked to the specific amino acid changes identified in nsP1, nsP2 and nsP3. Single amino-acid changes in nsP1, including a Thr/Ile change (residue 538 of Sindbis virus) [35,36] and a 18-nt deletion in nsP3 have previously been shown to affect neurovirulence in other alphaviruses [35-37]. However, in the absence of nsP1 structural data, it is difficult to predict the structural or functional impact of the I301T change observed in 06.27 isolate. It should also be noted that all the viral sequences determined from either the serum or the isolates from three neonatal encephalopathy cases and an adult meningo-encephalitis case had E1-226 Val. However, as this genotype is observed also in classical forms of the disease, a potential link of E1-226 Val with neuropathogenesis needs further studies. Host factors have to be considered in the occurrence of neurological forms of the disease. For example, the blood-brain crossing may be favoured by young age or hypertension.
Unique molecular signatures of the Indian Ocean outbreak genomes were identified when they were compared to all other reported alphavirus sequences. These features represent interesting targets for future functional studies, as well as for epidemiological follow-up. One particularly interesting feature was the E1-226 Val residue (see above). Another interesting molecular signature of Indian Ocean outbreak genomes was E1-284 Asp. Although pseudo-atomic model of the scaffold used is of modest resolution (the resolution of the crystal structure is limited—approaching 3 Å—and the model results of fitting this structure into a 9 Å resolution cryo-electron microscopy reconstruction), it appears that the side-chain of Asp 284 interacts with the main chain of an adjacent E1 polypeptide in the virion. Indeed, it is in a position compatible with acceptance of a hydrogen bond from main chain amide 379 from the neighboring E1 protomer. Because the packing is very tight (see
The TOPO/CHIK-21.pE2 (CNCM I-3587) plasmid containing the cDNA coding for the pE2 glycoprotein (E3+E2) from the CHIK 21 virus strain (Schuffenecker et al., Plos Med., 3:1058, 2006) was used as a template for the amplification by PCR of the ectodomain sequence of the E2 envelope glycoprotein (
Drosophila S2 cells were transfected with the recombinant plasmid pMT/BiP/CHIK-sE2 in the presence of the plasmid coding for the blasticidin resistance gene. The S2/CHIK-sE2 stable cell line was obtained by successive passages in presence of blasticidin. The cell line was selected for its capacity to promote efficient secretion of the CHIK-sE2 virus following the activation of the metallothioneine promoter.
The S2/CHIK-sE2 cells in suspension were induced for the secretion of sE2 during 21 days in the presence of Cu2+. The cellular supernatant is filtered at 0.22 μM and concentrated for 16 hours on an affinity column of 5 ml HiTrap Chelating HP (Amersham Biosciences) with the help of a peristaltic pump. The CHIK sE2 protein is eluded from the affinity column in the presence of increasing concentrations of imidazole (50, 100 and 500 mM, pH 8). The CHIK sE2 protein is specifically eluded at a concentration of 500 mM imidazole (E3 elution) from the E37 fraction (
The gene coding the CHIK sE2 protein has been optimised by the Genecust firm so as to provide a synthetic DNA with an enriched G+C content in comparison to the cDNA obtained from the viral genomic RNA. The G+C rich codons (amino acids E2-1 to E2-364, soluble gp-E2 ectodomain, sE2) were fused to the signal peptide sequence of the human calreticuline (ssCRT) MLLSVPLLLLGLLGLAA (SEQ ID NO: 77) for translocation of the viral protein into the secretion pathway. The enzyme restriction sites BamHI in 5′ and XhoI in 3′ have been added at their respective ends of the sequences coding for the fusion ssCRT+sE2 protein.
The synthetic gene was cloned into the TRIP vector between the BamHI and XhoI sites under the transcription of the ieCMV promoter. The non-replicative and integrative TRIP/CHIK.sE2 plasmid thus produced was validated for the expression of the sE2 protein following transduction of 293 cells.
As shown in
The inventors have generated the stable inducible S2/CHIK.sE2 cell line which releases the soluble form of the envelope E2-glycoprotein (sE2) from Reunion CHIK virus strains. The inventors have also generated a stable cell line 293A/CHIK.sE2 which was transducted by the recombinant lentiviral vector TRIP/CHIK.sE2. A synthetic sE2 gene that was modified for optimal codon usage in mammalian cells had to be used in order to obtain efficient expression of CHIK virus sE2 in human fibroblastic 293A cells. The TRIP/CHIK.sE2 vector is currently assessed for its capability to induce protective immunity in a murine model of experimental infection. Viral suspension mainly enriched in CHIK pE2 (E2 precursor or E3E2) was obtained by solubilizing CHIK virions grown in mosquito cells with Triton X-100. Adult mice were hyperimmunized with CHIK pE2 in the presence of adjuvant in order to generate hybridoma directed against CHIK structural proteins. Anti-CHIK E2 monoclonal antibodies produced by mouse hybridoma were characterized by ELISA assay on highly purified CHIK virion and Western blot on secreted sE2 from stable cell line S2/CHIK.sE2. (
Number | Date | Country | Kind |
---|---|---|---|
2538898 | Mar 2006 | CA | national |
2545597 | Apr 2006 | CA | national |
This is a Division of application Ser. No. 12/225,111, filed Sep. 29, 2009, which is a §371 of PCT/IB2007/001716, which claims the benefit of Canadian Application No. 2,545,597, filed Apr. 4, 2006 and Canadian Application No, 2,538,898, filed Mar. 15, 2006, all of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
20100233209 | Higgs | Sep 2010 | A1 |
20150111197 | Despres | Apr 2015 | A1 |
Entry |
---|
Tsetsarkin et al. (Vector-Borne and Zoonotic Diseases. 2006; 6 (4): 325-337). |
Khan, A.H., Complete nucleotide sequence of chikungunya virus and evidence for an inernal polyadenylation site, Journal of General Virology (2002), 83: 3075-3084. |
Powers, Ann M., Re-emergence of chikungunya virus and o'nyong-nyong viruses: evidence for distinct geographical lineages and distant evolutionary relationships, Journal of General Virology (2000), 81:471-479. |
Schuffenecker, I., Genome Microevolution of Chikungunya viruses Causing the Indian Ocean Outbreak, PLoS Medicine, (2006) 3:1058-1070. |
Kan, A.H., NCBI Sequence No. AF369024 Report (Jan. 14, 2003), pp. 1-6. |
Yuzhen, Z., Abstract, Susceptibility of GC32 cell to three strains of arbovirus, (Sep. 1998). |
Number | Date | Country | |
---|---|---|---|
20150111197 A1 | Apr 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 12225111 | Sep 2009 | US |
Child | 14335065 | US |