The present invention lies in the medical field, and in particular relates to an in vitro method for detecting the presence of genotype variants of the Epstein Barr virus (EBV), and to the use thereof as possible risk predictors, biomarkers, and therapeutic target in multiple sclerosis.
Multiple sclerosis (MS) is a demyelinating disease of the central nervous system, it belongs to the group of autoimmune diseases and has unknown aetiology, however there is evidence of an association both with genetic and environmental factors.
Epidemiological, clinical and laboratory studies support the aetiological role of the Epstein Barr virus (EBV) in MS1. In fact, human beings are the only natural host of the virus, which may explain why MS is a pathology which manifests itself exclusively in humans.
The studies carried out until now have shown that the onset of infectious mononucleoses and primary infection at young-adult age correlate with an increased risk of developing multiple sclerosis: in Northern Europe (in areas with a high prevalence of MS), a third of the population is infected with EBV during or after puberty, whilst in Southern Europe (areas in which the prevalence is low), subclinical infections occur in the first years of life. In addition, seroepidemiological evidence suggests an aetiological role for the virus: perspective studies have demonstrated that the increase of serum antibody titres against the virus precede the onset of multiple sclerosis. Viral proteins are a target of the antibody response detected in the cerebrospinal fluid of patients, and post-mortem studies have revealed the infection by EBV in the cerebral tissues of the affected subjects.
Patent application WO2008/125366 describes that the infection of the B cells by EBV leads to autoimmune diseases, including multiple sclerosis, and proposes the use of anti-EBV substances for the treatment of the autoimmune diseases.
However, the discrepancy between global spread of the infection (more than 90% of the adult population) and the relatively limited prevalence of MS remains a large obstacle for interpreting the pathogenetic mechanisms forming the basis of the association between the ubiquitous virus and the disease. The presence of genotype variants of the virus specifically associated with the pathology can contribute to the explanation of this paradox.
Given the methodological difficulties in the study of the variants of EBV through sequencing of the whole viral genome, an approach focused on a particular gene (candidate-gene approach) is followed by various groups2-7.
In the gene coding for the latent protein of the “Epstein Barr nuclear antigen 2” (EBNA2) resides one of the most variable regions of the entire virus, which allows to discriminate between the two main types of EBV (type 1 and 2). In addition, EBNA2 is expressed during the first phase of infection of the B cells, having a key role in the activation and in the proliferation of these cells.
Five major variants of the type 1 virus (the most widespread in the Caucasian population)8 have been identified within the sequence of EBNA29 defining the following alleles: 1.1, 1.2, 1.3A, 1.3B and 1.3E. These variants are characterised by changes with respect to the sequence of the reference prototype B95.8 (defined as allele 1.1) in the most polymorphic region (B95.8 coordinates 48.959-49.208) of the coding portion of EBNA2.
Previous studies2-7, based on multiple sclerosis, have analysed other regions of the virus without revealing particular correlations between multiple sclerosis and the gene variants of EBV.
There is thus a need to provide a method for identifying subjects who are “at risk” or pathological situations having diagnostic or prognostic peculiarities, and for identifying therapeutic targets, also for preventative strategies.
The authors have identified a connection between MS and gene variants of EBV. The authors have investigated the link between the variants of EBNA2 and MS in a population of continental Italy. The present invention is based on the identification of gene variants of the Epstein Barr virus (EBV), identified in the coding sequence of the Epstein Barr nuclear antigen 2 (EBNA2) protein, in a population of subjects affected by multiple sclerosis (MS). The peculiarity of the EBV virus genotype that characterises the subjects with MS compared to controls paired by age, sex and geographical area can be utilised as a potential biomarker (risk-predictive condition for the development of the disease, a trait that may contribute to the diagnosis and/or prognosis of MS).
The authors have demonstrated a correlation between an EBV-associated pathology, in particular MS, and viral gene variants in the region of EBNA2, selected for being notoriously the most polymorphic region of the viral genome.
The “profile” of the gene variants of EBNA2 may contribute to the identification of subjects who are “at risk” or pathological situations having specific diagnostic or prognostic peculiarities.
In addition, the gene variants identified by the authors can be used as a therapeutic target, even for preventative therapies. For example, such variants can be used to produce anti-viral drugs or vaccines.
Object of the present invention is a compound consisting of:
a) a nucleic acid coding for the variant of the 1.2 sub-type of Epstein Barr nuclear antigen 2 (EBNA2), characterised in that it comprises at least one substitution of the triplet coding for one or more amino acids selected from the group consisting of:
aa. 134, aa. 236, aa. 245 and aa. 267 of SEQ ID NO: 2, or
b) a messenger RNA transcribed from said nucleic acid, or
c) a protein coded by said nucleic acid.
The nucleotide sequence of EBNA2-1.2 (SEQ ID NO: 1) is as follows:
The amino acid sequence of EBNA2-1.2 (SEQ ID NO: 2) is as follows:
In a preferred embodiment the protein is coded by a nucleic acid wherein said substitution of the triplet coding for at least one of the amino acids selected from the group consisting of: aa. 134, aa. 236, aa. 245 and aa. 267 of SEQ ID NO: 2 does not correspond to a silent mutation. More preferably, said protein in position aa. 245 presents an amino acid S or T and/or in position aa. 267 presents an amino acid I.
In a preferred aspect of the invention the substitution of the triplet corresponds to at least one of the following:
the substitution of the triplet CTT with the triplet CTG coding for aa. 134 of SEQ ID NO: 2, the substitution of the triplet ACC with the triplet ACT coding for aa. 236 of SEQ ID NO: 2, the substitution of the triplet CCA with the triplet TCA or ACA coding for aa. 245 of SEQ ID NO: 2, the substitution of the triplet ACC with the triplet ATC coding for aa. 267 of SEQ ID NO: 2.
More preferably, said substitution of the triplet corresponds to at least the substitution of the triplet ACC with the triplet ATC coding for aa. 267 of SEQ ID NO: 2 and/or the substitution of the triplet CTT with the triplet CTG coding for aa. 134 of SEQ ID NO: 2. The compound according to the invention is preferably a biomarker of multiple sclerosis.
A further object of the invention is a compound consisting of:
a) a nucleic acid coding for a variant of the Epstein Barr nuclear antigen 2 (EBNA2), characterised in it comprises:
the sequence coding for a nuclear antigen 2 of the sub-type 1.2 or variants thereof, characterised by at least one substitution of the triplet coding for one or more amino acids selected from the group consisting of: aa. 134, aa. 236, aa. 245 and aa. 267 of SEQ ID NO: 2, or
b) a messenger RNA transcribed from said nucleic acid, or
c) a protein coded by said nucleic acid
for use in a method for predicting the risk of contracting or developing multiple sclerosis and/or for screening and/or for the diagnosis and/or the prognosis of multiple sclerosis. Said compound is preferably a biomarker of multiple sclerosis.
Said substitution of the triplet preferably corresponds to at least one of the following:
the substitution of the triplet CTT with the triplet CTG coding for aa. 134 of SEQ ID NO: 2, the substitution of the triplet ACC with the triplet ACT coding for aa. 236 of SEQ ID NO: 2, the substitution of the triplet CCA with the triplet TCA or ACA coding for aa. 245 of SEQ ID NO: 2, the substitution of the triplet ACC with the triplet ATC coding for aa. 267 of SEQ ID NO: 2.
More preferably, the substitution of the sequence corresponds to at least the substitution of the triplet ACC with the triplet ATC coding for aa. 267 of SEQ ID NO: 2 and/or the substitution of the triplet CTT with the triplet CTG coding for aa. 134 of SEQ ID NO: 2.
The compound of the present invention, DNA, RNA or protein, finds industrial applicability, for example in the identification of primers and/or probes or as an antigenic compound.
A further object of the invention is an in vitro method for predicting the risk of contracting or developing multiple sclerosis in a subject, wherein the detection of the compound as above defined, in a biological sample isolated from the subject, is an indication of increased risk.
In a preferred embodiment of the method according to the present invention, the presence of the nucleic acid as defined above is detected through DNA genotyping. In a preferred aspect of the method according to the invention, the detection of the presence of the sequence coding for the nuclear antigen 2 of the sub-type 1.3 B or of the protein coded by it is an indication of low risk.
The nucleotide sequence of EBNA2-1.3B (SEQ ID NO: 4) is as follows:
The amino acid sequence of EBNA2-1.3B (SEQ ID NO: 5) is as follows:
Another object of the invention is an in vitro method for screening and/or for the diagnosis and/or prognosis of multiple sclerosis in a subject through detection of the presence of the compound as defined above, in a biological sample isolated from the subject. In a preferred embodiment of the method according to the present invention, the presence of the nucleic acid as defined above is detected through DNA genotyping.
In the present invention a “method for screening” or the “screening” preferably includes the screening of subjects potentially at risk for multiple sclerosis.
In a preferred embodiment of the in vitro method according to the invention, the subject in which the presence of the compound as defined above has been detected is then subjected to further methods for diagnosis and/or prognosis of multiple sclerosis, including, for example, magnetic resonance of the brain and of the spinal cord, and examination of the cerebrospinal fluid. Such diagnostic methods currently used are costly and invasive. Thus, the method according to the invention allows to carry out a first screening of subjects potentially at risk that is effective, fast and non-invasive.
In the methods according to the present invention, the biological sample is preferably selected from: blood cells, biological fluids, preferably serum, plasma, saliva or cerebrospinal fluid, cerebral tissue, epithelial cells. The biological sample can also be any cell type able to be infected by the Epstein Barr virus.
A further object of the invention is a kit for predicting the risk of contracting or developing multiple sclerosis and/or for screening and/or diagnosis and/or prognosis of multiple sclerosis comprising the detection of the compound as defined above, comprising detection means for said compound and optionally control means.
In the kit according to the invention, the detected compound is preferably a nucleic acid, and the kit further comprises means for amplifying said nucleic acid.
The control means can be used to compare the presence of the compound as defined above with a proper control. The control can be obtained for example, with reference to the known standards, either from a normal subject or from a normal population. If the compound is a protein, the detection means are, for example, at least one antibody specific for the protein, functional analogues or derivatives thereof.
In the present invention, the detection of the compound as defined above may refer to the detection of the presence of one, two, three, four, five, six, etc. compounds as defined above. Any combination of the compounds is suitable for the purpose of the invention.
A further object of the invention is an inhibitor for the Epstein Barr virus having as a specific target the compound as defined above, for use in the prevention and/or treatment of multiple sclerosis, said inhibitor preferably being a vaccine, an antibody, a siRNA or a drug with low atomic weight.
The variants described above may comprise one of the substitutions of the triplets mentioned above, or two of the substitutions of the triplets mentioned above (for example the substitution of the triplets coding for aa. 134 and 236, aa. 134 and 245, aa. 134 and 267, aa. 236 and 245, aa. 236 and 267, aa. 245 and 267), or three of the substitutions of the triplets mentioned above (for example the substitution of the triplets coding for aa. 134, 236 and 245; for aa. 134, 236 and 267; for aa. 134, 245 and 267; for aa. 236, 245 and 267), or four of the substitutions of the triplets mentioned above (for example the substitution of the triplets coding for aa. 134, 236, 245 and 267).
The variants of the nucleotide sequence EBNA2-1.2 are preferably characterised in that they have a nucleotide substitution from T to G in position 402 of SEQ ID No. 1 and/or a nucleotide substitution of a C with a T in position 708 of SEQ ID No. 1 and/or a nucleotide substitution of a C with an A or a T in position 733 of SEQ ID No. 1, and/or a nucleotide substitution of a C with a T in position 800 of SEQ ID No. 1.
The variant of the amino acid sequence EBNA2-1.2 corresponding to the nucleotide sequence where the nucleotide substitution of a C with a T or with an A is present in nucleotide position 733 of SEQ ID No. 1 is characterised in that it has in position aa 245 of SEQ ID No. 2, respectively, an amino acid S or T, instead of an amino acid P. The variant of the amino acid sequence EBNA2-1.2 corresponding to the nucleotide sequence wherein in the nucleotide position 800 of SEQ ID No. 1, is present a nucleotide substitution of a C with a T is characterised in that it has in position aa 267 of SEQ ID No. 2 an amino acid I instead of an amino acid T.
In the present invention, the expression “detection” in relation to a protein or to a nucleic acid (DNA or RNA) refers, for example, to any method of observation, ascertainment, or quantification of the signals indicative of the presence of a protein in a sample or the absolute or relative quantity of said target protein in a sample. The methods may be combined with methods for labelling proteins or nucleic acids for providing a signal, for example: immunohistochemical staining, ELISA, cell suspension, cytology, fluorescence, radioactivity, colorimetry, gravimetry, X-ray diffraction or adsorption, magnetism, enzyme activity and the like. In the present invention, the detection of the presence of messenger RNA transcribed from the nucleic acid as defined above can be performed by means of any technique known to a person skilled in the art, for example Northern blotting or quantitative or semi-quantitative RT-PCR methods using suitable oligonucleotide primers. DNA detection can be carried out for example through DNA genotyping. For DNA genotyping is intended any technique which allows to detect the genotype from an organism by means of biological tests (genotype tests). These techniques include, for example: PCR, analysis of a DNA fragment, allele-specific oligonucleotide (ASO) probes, DNA sequencing and microarray.
The present invention will be described in non-limiting examples with reference to the following figure:
Subjects and Methods
Blood samples were taken from patients with relapsing-remitting multiple sclerosis MS11 and healthy donors (HD) matched by age, sex and geographical origin. At the same time as the blood sampling, the patients underwent magnetic resonance imaging (MRI) with contrast agent (gadolinium). The Ethics Committee approved the study, and all the participating subjects gave written informed consent.
Peripheral blood mononuclear cells (PBMC) were obtained by density centrifugation over Ficoll-Hypaque according to standard procedure. The cells were stained with human anti-CD19 antibodies (Miltenyi Biotec) and separated by magnetic beads in accordance with the manufacturer's recommended protocol.
Genomic DNA was extracted from PBMCs, CD19+ B cells and cell lines (B95-8 EBV-positive cell line and BJAB EBV-negative cell line) by a commercially available kit (QIAmp DNA mini kit, Qiagen). All the samples were analysed by a “semi-nested PCR” approach using specific primers of EBNA29:
All PCR products were assayed by means of sequence analyses.
The sequences were aligned with the multiple sequence alignment program ClustalW2 (www.ebi.ac.uk/Tools/msa/clustalw2/) and the variants were considered when there was a deviation from EBV B95.8 prototype (NCBI accession number V01555; V01555-EBNA-2 (SEQ ID NO: 3, published):
All the participants of the study (patients and healthy subjects) were subjected to human leukocyte antigen (HLA) loci typing. The haplotypes of classes I HLA-A, HLA-B and HLA-C and of class II DRB1 were analysed by means of standard sequence specific primer polymerase chain reaction (SSP-PCR)12, using “Histo Type DNA” wells plates (BAG, Formedic diagnostici, Milan, Italy) in accordance with manufacturer's instructions.
It was possible to detect the alleles recognised by the specific primers after amplification in a GeneAmp PCR 9700 thermocycler (Applied Biosystems, Foster City, Calif. U.S.A.) and gel electrophoresis on 2% agarose. Fisher's exact test (Graph Pad Prism 5) was used to compare the proportion of variants in all patients vs HD, whilst Barnard's exact test13 was used to compare only the donors infected with the sub-type 1.2.
A Bayes classifier14 was used for partitioning the dataset: 50% of the data was used to train the classifier, and the remaining 50% was then used to test its predictive capability. The receiver operating characteristic (ROC) curve and the area under the curve (AUC) were then computed to quantify the predictive potential of EBNA2 genotyping as a marker of disease status.
Results
By means of “semi-nested PCR”, the authors amplified a fragment of approximately 500 bp corresponding to the fragment having coordinates 48.810-49.310 located within the hypervariable region of EBNA2, which was then subjected to sequencing.
Blood samples were obtained from 58 patients affected by MS (6 undergoing immunomodulatory therapy) and 49 healthy donors (HD) selected by age, sex, ethnicity and geographic origin, comparable with the group of patients. The success rate of the EBNA2 genotyping was comparable between the two groups: 53/91 (58%) in the group of patients and 37/56 in the healthy subjects (66%).
Of the 53 patients with a successful genotyping, 7 were under therapy with Interferon beta, whilst the others had never been treated. The mean age was 35±10.7 in the patients and 37±9.5 in the HD; the male/female ratio was 13/40 in the patients, 11/26 in the controls. Seven patients with MS and three HD were analysed at different time points (from one to twelve months between each sampling): two patients and one HD presented infection from various strains (1.2, 1.3B, GD1) (Table 3) at different time points, whilst the majority of the donors showed stability of the EBV sub-type.
Considering each sampling as an independent observation, a total of 107 blood samples (66 blood samples from patients with MS and 41 from HD) were analysed.
A significant difference in the distribution of the EBV sub-types was observed between patients and controls (Table 1): the allele 1.2 was prevalent in the patients with MS (26/53 vs 6/37, p=0.0004), whereas the allele 1.3B was prevalent in the controls (26/37 vs 18/53, p=0.0012; Fisher's exact test). When the samples obtained from the same subject at different time points were added to the analyses (66 from patients with MS and 41 from HD), comparable results in the EBV sub-types and in the distribution of the variants were obtained, as well as in the potential of predicting the state of the disease (shown in Tables 1-2).
The obtained data was analysed by comparing the distribution of the viral strains between healthy or diseased women and men, observing a distribution comparable with that obtained over the entire population. The gender therefore did not appear to modify the aforementioned distribution (Table 1B).
Extending the observation beyond the region 48990-49170 (where the variants described in literature reside)9, new sequence variations, never observed before, were identified, in correspondence of the amino acids 134, 236, 245, 256 and 267 (Table 2; Table 3). Whilst the variation at position 256 (Asp/Asn) is present in sub-types 1.2, 1.3B and GD1, without differences between patients and controls, the other four variants resulted detectable only in the presence of the sub-type 1.2, and therefore correlated with MS (Table 2; Table 3).
To verify the consistency of the EBNA2 genotyping data, a Bayes classifier14 was tested for its capability to predict the state of the disease. The predictive potential of the EBV genetic variants was therefore quantified and illustrated in a ROC curve (
In general, the risk of developing the disease is higher in carriers of sub-type 1.2 (odd ratio[OR]=5) and variants thereof (in particular the divergence of amino acid 245, OR=9.4), whilst the presence of sub-type 1.3B demonstrated to be protective, OR=0.22.
No correlation was found between EBV genetic variants and HLA haplotypes (included those known to be associated with MS)15 of the donors. No correlation was found between EBV genotypes and the clinical characteristics of the patients (MRI data, disability scales, disease duration).
The subjects included in this study were characterised by the HLA haplotype and by their clinical and neuroradiological characteristics at the moment of blood sampling. The authors have demonstrated that the state of the disease correlates with an excess of EBNA2 sub-type 1.2 and a deficiency of sub-type 1.3B, irrespective of the clinical characteristics of the patients and their HLA haplotype. In addition, the patients have a greater probability of being infected with viral strains presenting new genotype variants correlated to sub-type 1.2 (in particular that corresponding at amino acid position 245).
Previous studies on a possible link between EBV genotypes and MS have provided conflicting results: no link with EBNA6,2 EBNA1 and LMP1,3,4 EBNA2;5 “marginally different frequencies” for BRRF2 and EBNA1;6 most frequent co-infection EBNA2 of type 1 and 2 in patients compared with the healthy subjects.7
Differences in the geographic variants can explain, in part, the incoherencies, as has been reported for malignant EBV-correlated tumours and the different geographical distribution thereof.16
In a region of EBNA2 not studied by the previous studies, the authors found genotypes with associations that are stronger than those reported beforehand: for a model that includes EBNA2 sub-types and new identified EBNA2 polymorphisms, the ROC analysis provides increased reliability in the prediction of the state of the MS.
Aside from the predictive potential of the EBNA2 variants, the probability that they can induce functional consequences, contributing in this way to the aetiology of the disease, merits further consideration. In a virus with a low propensity to mutate, new variants have greater probability of having a certain functional impact insofar as they tend to be maintained and fixed in the viral genome.
There are only two main types of EBV (type 1 and 2), which seem to be identical for the majority of the genome, but demonstrate allele polymorphism in a sub-group of latent genes: EBNA-LP, EBNA2, EBNA3A, EBNA3B and EBNA3C. In particular, the two types of EBV share 64% of the nucleotide sequences and 53% of the amino acids sequences predicted of EBNA2.17 EBNA2 sequence mutations are able to influence its interaction with other host proteins.18
The authors studied the most polymorphic region of EBNA2, where the sequence divergence between type 1 and type 2 resides. These differences can have immunological consequences.19,20
In addition, this region is involved in interactions with cellular proteins, such as Nur7721 and SMARCB122, that have been associated respectively with MS23 and antiviral responses.24
These results are complementary to data showing that part of the genetic predisposition to MS can be attributable to variants in genes that interact with EBV.
Number | Date | Country | Kind |
---|---|---|---|
RM2013A000206 | Apr 2013 | IT | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2014/060489 | 4/7/2014 | WO | 00 |