The invention relates to methods for determining whether an individual has an increased risk of developing chronic autoimmune diseases.
Autoimmune diseases are generally understood to be diseases where the target of the disease is “self” or “self antigen”. Among the many types of autoimmune diseases, there are a number of diseases that are believed to involve T cell immunity directed to self antigens, including, for example, multiple sclerosis (MS), Type I diabetes, and rheumatoid arthritis (RA).
RA is a chronic inflammatory disorder characterized by joint pain. The course of the disease is variable, but can be both debilitating and mutilating. According to conservative estimates approximately 50,000,000 individuals are afflicted with RA worldwide. Those individuals are not only subjected to life-long disability and misery, but as current evidence suggests, their life expectancy is compromised as well. Systemic lupus erythematosus (SLE) is a chronic inflammatory disease that can affect various parts of the body including skin, blood, kidneys, and joints. SLE may manifest as a mild disease or be serious and life-threatening. More than 16,000 cases of SLE are reported in the United States each year, with up to 1.5 million cases diagnosed.
Although SLE can occur at any age, and in either sex, it has been found to occur 10-15 times more frequently in women.
SLE is characterized by the production of auto-antibodies having specificity for a wide range of self-antigens. SLE auto-antibodies mediate organ damage by directly binding to host tissues and by forming immune complexes that deposit in vascular tissues and activate various immune cells. SLE induced damage to the host targets the skin, kidneys, vasculature, joints, various blood elements, and the central nervous system (CNS). The severity of disease, the spectrum of clinical involvement, and the response to therapy vary widely among patients. The clinical heterogeneity of SLE makes it challenging to diagnose, monitor and manage.
When a patient is diagnosed with an autoimmune disease such as RA and SLE, the choice of appropriate therapeutic interventions would be considerably facilitated by diagnostic and prognostic indicators that accurately predict future severity. Thus, there is a need in the art for reliable prognostic methods to predict suffering from chronic autoimmune diseases in individuals.
The present inventors have determined that a number of genes are indicative of an increased risk of developing chronic autoimmune diseases, in particular Rheumatoid Arthritis (RA). They found that the occurrence of RA may be predicted even long before an individual presents with symptoms of RA. A set of genes that may be used in the invention is presented herein, in particular a gene selected from the group consisting of genes ISG15, EPSTI1, IFI6, OAS3, IFI44L, RSAD2, IFIT1, MX1, CD274, SERPING1, IFI27, CD19, CD79A, CD79B, MS4A1, FCRL5, DARC, BCL2L1, RBM38, BAG1, TESC, KLF1, ERAF, SELENBP1, CCL5, IFNG, GZMH and NKG7 was found to be useful in a method according to the invention.
Hence, the invention relates to an in vitro method for determining whether an individual has an increased risk of developing symptoms of Rheumatoid Arthritis comprising the steps of determining the expression level of at least the ISG15 gene in a sample obtained from said individual and comparing said expression level with a predetermined reference value wherein an increased level of expression of said ISG15 gene in said sample is indicative of an increased risk.
In a broad sense, a method is provided for determining whether an individual has an increased risk of developing symptoms of a chronic autoimmune disease within a time frame of 10 years after the completion of said method compared to the average risk of a reference population, comprising determining in a sample of said individual the levels of expression products of a collection of genes and determining whether the expression products are present at a level that is different when compared to the level of the same expression products of a control.
Upon analysis of the gene set indicative of an increased risk as shown in table 8, it was found that the genes clustered into 4 distinct categories. They appeared to be involved in (i) IFN-mediated immunity, (ii) hematopoiesis, (iii) B-cell mediated immunity and/or (iv) cytokine mediated immunity.
It is therefore postulated that other genes involved in those cellular processes which are not necessarily shown in table 8 will also prove to be indicative of an increased risk of developing chronic autoimmune diseases, in particular RA.
Higher levels of expression products of genes involved in IFN-mediated immunity, hematopoiesis and cytokines are predictive for an increased risk and higher levels of expression products of genes involved in the cellular process of B-cell mediated immunity are predictive for a decreased risk.
A method according to the invention may be used to determine an increased risk of developing symptoms of a chronic autoimmune disease within a time frame of 10 years after the completion of said method.
The expression “a predetermined reference value” in this context means that the expression level of a particular gene is determined in at least one sample from a normal individual, more in particular an individual that has proven not to develop any symptoms of RA in a number of years after the sample was taken from the individual, in particular wherein the number of years is more than 5 years, such as more than 10 years. Preferably, the reference value is an average of more than one such as 2, 3, 4,or more than 5 samples taken from different normal individuals.
Preferred samples include whole blood, saliva, faecal material, buccal smears, skin, and biopsies of specific organ tissues, such as muscle or nerve tissue and hair follicle, because these samples comprise relevant expression products. Preferably, said sample comprises a cell sample, because cell samples comprise proteins and nucleic acids. Most preferably, the biological sample is a blood sample, because a blood sample is easy obtainable and comprises large amounts of relevant expression products.
Expression products comprise nucleic acids and proteins. Nucleic acids in the invention preferably comprise RNA. The levels of the expression products may be determined separately for each different expression product or as a single measurement for more different expression products simultaneously. Preferably, the determination of the level of the expression products is performed for each different expression product separately, resulting in a separate measurement of the level of the expression product for each different expression product. This enables a more accurate comparison of expression levels of expression products with the expression levels of the same expression products in a control. A control may be a single individual or group comprising of at least two individuals, preferably four individuals. If the expression level is determined of an expression product from more than one individual, usually the median or mean expression level of these individuals is used for comparison.
Determination of the level of the expression products according to a methods of the invention may comprise the measurement of the amount of nucleic acids or of proteins. In a preferred embodiment of the invention, determination of the level of the expression products comprises determination of the amount of RNA, preferably mRNA. A level can be the absolute level or a relative level compared to the level of another mRNA. mRNA can be isolated from the samples by methods well known to those skilled in the art as described, e.g., in Ausubel et al., Current Protocols in Molecular Biology, Vol. 1 , pp. 4.1.1-4.2.9 and 4.5.1-4.5.3, John Wiley & Sons, Inc. (1996). Methods for detecting the amount of mRNA are well known in the art and include, but are not limited to, northern blotting, reverse transcription PCR, real time quantitative PCR and other hybridization methods. The amount of mRNA is preferably determined by contacting the mRNAs with at least one sequence-specific oligonucleotide which hybridises to said mRNA. In a preferred embodiment said mRNA is determined with two sequence-specific oligonucleotides which hybridise to different sections of said mRNA. The sequence-specific oligonucleotides are preferably of sufficient length to specifically hybridize only to the RNA or to a cDNA prepared from said mRNA. As used herein, the term “oligonucleotide” refers to a single-stranded nucleic acid. Generally the sequence-specific oligonucleotides will be at least 15 to 20 nucleotides in length, although in some cases longer probes of at least 20 to 25 nucleotides will be desirable. Said sequence-specific oligonucleotides may also comprise non-specific nucleic acids. Such non-specific nucleic acids can be used for structural purposes, for example as an anchor to immobilise the oligonucleotides. The sequence-specific oligonucleotide can be labelled with one or more labelling moieties to permit detection of the hybridized probe/target polynucleotide complexes. Labelling moieties can include compositions that can be detected by spectroscopic, biochemical, photochemical, bioelectronic, immunochemical, and electrical optical or chemical means. Examples of labelling moieties include, but are not limited to, radioisotopes, e.g., 32P, 33P, 35S, chemiluminescent compounds, labelled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent markers and dyes, linked enzymes, mass spectrometry tags, and magnetic labels. Oligonucleotide arrays for mRNA or expression monitoring can be prepared and used according to techniques which are well known to those skilled in the art as described, e.g., in Lockhart et al., Nature Biotechnology, Vol. 14, pp. 1675-1680 (1996); McGall et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 13555-13460 (1996); and U.S. Pat. No. 6,040,138.
A preferred method for determining the amount of mRNA involves hybridization of labelled mRNA to an ordered array of sequence-specific oligonucleotides. Such a method allows the simultaneously determination of the mRNA amounts. The sequence-specific oligonucleotides utilized in this hybridization method typically are bound to a solid support. Examples of solid supports include, but are not limited to, membranes, filters, slides, paper, nylon, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, polymers, polyvinyl chloride dishes, etc.
According to a preferred embodiment of the invention the determining the level(s) of the expression products is performed by measuring the amount of protein. The term “protein” as used herein may be used synonymously with the term “polypeptide” or may refer to, in addition, a complex of two or more polypeptides which may be linked by bonds other than peptide bonds, for example, such polypeptides making up the protein may be linked by disulfide bonds. The term “protein” may also comprehend a family of polypeptides having identical amino acid sequences but different post-translational modifications, particularly as may be added when such proteins are expressed in eukaryotic hosts. These proteins can be either in their native form or they may be immunologically detectable fragments of the proteins resulting, for example, from proteolytic breakdown. By “immunologically detectable” is meant that the protein fragments comprise an epitope which is specifically recognized by e.g. mass spectrometry or antibody reagents as described below. Proteins levels can be determined by methods known to the skilled person, comprising but not limited to: mass spectrometry, Western blotting, immunoassays, protein expression assay, protein microarray etc.
A preferred embodiment of the invention provides a protein microarray (Templin et al. 2004; Comb. Chem. High Throughput Screen., vol. 7, no. 3, pp. 223-229) for simultaneous binding and quantification of the at least two biomarker proteins according to the invention. The protein microarray consists of molecules (capture agents) bound to a defined spot position on a support material. The array is then exposed to a complex protein sample. Capture agents such as antibodies are able to bind the protein of interest from the biological sample. The binding of the specific analyte proteins to the individual spots can then be monitored by quantifying the signal generated by each spot (MacBeath 2002; Nat. Genet, vol. 32 Suppl, pp. 526-532; Zhu & Snyder 2003; Curr. Opin. Chem. Biol., vol. 7, no. 1, pp. 55-63). Protein microarrays can be classified into two major categories according to their applications. These are defined as protein expression microarrays, and protein function microarrays (Kodadek 2001; Chem. Biol., vol. 8, no. 2, pp. 105-115). Protein expression microarrays mainly serve as an analytic tool, and can be used to detect and quantify proteins, antigen or antibodies in a biological fluid or sample. Protein function microarrays on the other hand can be used to study protein-protein, enzyme-substrate and small molecule-protein interactions (Huang 2003; Front Biosci., vol. 8, p. d559-d576). Protein microarrays also come in many structural forms. These include two-dimensional microarrays constructed on a planar surface, and three-dimensional microarrays which use a Flow-through support.
Types of protein microarray set-ups: reverse phase arrays (RPAs) and forward phase arrays (FPAs) (Liotta et al. 2003; Cancer Cell, vol. 3, no. 4, pp. 317-325). In RPAs a small amount of a tissue or cell sample is immobilized on each array spot, such that an array is composed of different patient samples or cellular lysates. In the RPA format, each array is incubated with one detection protein (e.g., antibody), and a single analyte endpoint is measured and directly compared across multiple samples. In FPAs capture agents, usually an antibody or antigen, are immobilized onto the surface and act as a capture molecule. Each spot contains one type of immobilized antibody or capture protein. Each array is incubated with one test sample, and multiple analytes are measured at once.
One of the most common forms of FPAs is an antibody microarray. Antibody microarrays can be produced in two forms, either by a sandwich assay or by direct labelling approach. The sandwich assay approach utilizes two different antibodies that recognize two different epitopes on the target protein. One antibody is immobilized on a solid support and captures its target molecule from the biological sample. Using the appropriate detection system, the labelled second antibody detects the bound targets. The main advantage of the sandwich assay is its high specificity and sensitivity (Templin, Stoll, Bachmann, & Joos 2004; Comb. Chem. High Throughput. Screen., vol. 7, no. 3, pp. 223-229). High sensitivity is achieved by a dramatic reduction of background yielding a high signal-to noise ratio. In addition, only minimal amounts of labelled detection antibodies are applied in contrast to the direct labelling approach were a huge amount of labelled proteins are present in a sample. The sandwich immunoassay format can also be easily amenable to the field of microarray technology, and such immunoassays can be applied to the protein microarray format to quantify proteins in conditioned media and/or patient sera (Huang et al. 2001; Clin. Chem. Lab Med., vol. 39, no. 3, pp. 209-214; Schweitzer et al. 2002; Nat Biotechnol., vol. 20, no. 4, pp. 359-365).
In the direct labelling approach, all proteins in a sample are labelled with a fluorophore. Labelled proteins that bind to the protein microarray such as to an antibody microarray are then directly detected by fluorescence. An adaptation of the direct labeling approach is described by Haab and co-workers (Haab, Dunham, & Brown 2001; Genome Biol., vol. 2, no. 2, p). In this approach, proteins from two different biological samples are labelled with either Cy3 or Cy5 fluorophores. These two labelled samples are then equally mixed together and applied to an antibody microarray. This approach, for example, allows comparisons to be made between diseased and healthy, or treated and untreated samples. Direct labelling has several advantages, one of which is that the direct labelling method only requires one specific antibody to perform an assay.
Miniaturized and multiplexed immunoassays may also used to screen a biological sample for the presence or absence of proteins such as antibodies (Joos et al. 2000; Electrophoresis, vol. 21, no. 13, pp. 2641-2650; Robinson et al. 2002; Nat. Med., vol. 8, no. 3, pp. 295-301).
In a preferred embodiment of the invention, the detection or capture agents such as the antibodies are immobilized on a solid support, such as for example on a polystyrene surface. In another most preferred embodiment, the detection or capture agents are spotted or immobilized in duplicate, triplicate or quadruplicate onto the bottom of one well of a 96 well plate.
An increased expression level of a particular gene or group of genes may also be determined by measuring other factors than RNA or protein directly expressed by the gene. WO 2008/147206 provides a number of polymorphisms that can be used to determine an increased interferon signature. In particular, the invention relates to a method for classifying an individual as preclinical RA patient with an increased type 1 interferon response signature, said method comprising the steps of determining in a nucleic acid sample from said individual one or more polymorphisms in the IFR5 gene related to an increased type 1 interferon gene signature and determining, based on the genotype of said polymorphism(s), if the individual has increased type 1 interferon response signature.
In a preferred embodiment said one or more polymorphisms comprises a polymorphism in SNP rs2004640, in SNP rs4728142, or in a 30 bp insertion-deletion polymorphism in exon 6.
We found that the T-allele and in particular the TT genotype of rs2004640 and/or the A-allele and in particular the AA genotype of rs4728142 is associated with an increased IFN response signature in the blood. Also patients homozygous for the 5 bp CGGGG deletion show a increased IFN response activity response.
In a preferred embodiment more than one polymorphism is determined to indicate whether an individual is a preclinical RA patient. It is preferred to determine the haplotype for the indicated polymorphisms and classify the cells of the individual on the basis of of the determined haplotype. In this embodiment Haplotype A (rs4728142 (A) rs2004640 (T) exon 6 indel (del) rs10954213 (A)) associated with an increased risk and Haplotype B (rs4728142 (G) rs2004640 (G) exon 6 indel (in) rs10954213 (G)) is associated with prevention to develop arthritis.
Thus in one aspect the invention provides a method wherein a polymorphism is used to indicate whether an individual is likely to be a preclinical RA patient. In a preferred embodiment said polymorphism is a polymorphism of rs4728142, rs2004640, exon 6 indel (del) or exon 6 indel (in), or rs10954213. In a preferred embodiment the haplotype for at least two and preferably at least 3 and more preferably all of the polymorphisms mentioned are determined.
The polymorphism may also determined in a method according to the invention by analysis of the complementary strand. In this case, the correlations and predictions as indicated herein above, are of course associated with the presence of the respective complementary nucleotide(s).
The combining feature is that these polymorphisms are all situated in or close by the IRF5 gene.
The invention further provides the use of a polymorphism that discriminates alleles of the IFR5 gene and that correlates for more than 90% with a polymorphism associated with development of RA (preclinical RA patient), for classifying an individual as preclinical RA patient.
In a method according to the invention, an individual has an increased risk of developing the symptoms of a chronic autoimmune disease if expression level of the expression products of a said collection of genes are different compared to the levels of the same expression products of a control. An expression level is classified as different when said expression level of said expression product is statistically significantly increased or decreased in said individual compared to the level of the same expression product found in control individuals. The term “significantly” or “statistically significant” refers to statistical significance and generally means a two standard deviation (SD) above normal, or higher, or below, or lower concentration of the expression product. In preferred embodiments, said difference is classified as statistically significant if the expression level is at least a 20 percent increased or decreased compared to expression level of the same expression product in control individuals. Preferably, the increase or decrease is at least 20, 25, 30, 35, 40, 45, 50, 75, 100, 150, 200 or 250 percent. Most preferably, said increase or decrease is at least 100 percent.
It is also possible to determine an increased risk of developing the symptoms of a chronic autoimmune disease by determining if the number of expression products having a different expression level, differs significantly from the same number in control individuals. Preferably, said numbers are compared within the same collection of genes involved in the same cellular process. Said numbers can be related to the total number of expression products of said collection of genes from which the levels of expression products are determined or a selection thereof. It is preferred that said numbers are compared to numbers of control individuals. Said levels can be compared with cut-off levels of reference intervals of said expression products, wherein said reference intervals are based on the levels of the same expression products in a control group. Said cut-off levels are usually the values corresponding to 5% or 95% of a reference interval (which is usually a statistically determined confidence interval) of the levels the expression products in a control group. Alternatively, a cut-off level can also be based on the level of the expression products in a control individual or a control group. Preferably the cut-off level is at least a factor two higher or lower than the level of the same expression product in a control individual or mean or median level of the expression product in a control group. The number of said levels is determined that are outside said intervals within said at least one collection of genes. Said number is compared with a cut off value, wherein said cut off value is based on the number of levels of the same expression products that are outside said intervals in a control group within said at least one collection of genes, wherein a higher number than said cut off value is predictive for an increased risk.
Preferably said at least one gene comprises at least 10 genes, more preferably at least 15, 20, 25, 30, 35, 45 or 52 genes of Table 8. If more genes are used in the method, the method will generate less false positive and false negative results.
In a preferred embodiment, the chronic autoimmune disease is Rheumatoid Arthritis (RA) or Systemic Lupus Erythematosus (SLE). The method is very predictive for these diseases. Preferably, said autoimmune disease is RA.
Even more preferred is a method, wherein said genes involved in the cellular process of IFN-mediated immunity comprise at least 1 gene from
Table 4. More preferably 2, 3, 5, 7, 10, 16 genes of
Table 4. If more genes involved in IFN-mediated immunity are used, the method is more accurate.
Another more preferred embodiment is a method, wherein said genes involved in the cellular process of B-cell mediated immunity comprise at least 1 gene from Table 5. More preferably 2, 3, 5, 7, 10 genes of Table 5. If more genes involved in B-cell mediated immunity are used, the method is more accurate.
Another more preferred embodiment is a method, wherein said genes involved in the cellular process of hematopoiesis comprise at least 1 gene from Table 6. More preferably 2, 3, 5, 7, 10 genes of Table 6. If more genes involved in hematopoiesis are used, the method is more accurate.
Another more preferred embodiment is a method, wherein said genes involved in the cellular process of cytokine mediated immunity comprise at least 1 gene from Table 7, more preferably 2, 3, 5, 7, 9 genes of Table 7. If more genes involved in cytokine mediated immunity are used, the method is more accurate.
Another preferred embodiment is a method, wherein said genes are selected from the group comprising ISG15, EPSTI1, IFI6, OAS3, IFI44L, RSAD2, IFIT1, MX1, CD274, SERPING1, IFI27, CD19, CD79A, CD79B, MS4A1, FCRL5, DARC, BCL2L1, RBM38, BAG1, TESC, KLF1, ERAF, SELENBP1, CCL5, IFNG, GZMH and NKG7. When using these genes, the method is more accurate.
Another preferred embodiment is a method wherein said collection of genes comprises genes coding for a secreted protein. An advantage thereof is that this method is very well suited for determining the risk based on a secreted protein as indicated in Table 8. Said secreted protein can easily be obtained. The levels of said secreted protein can be compared with control levels. Significantly different levels of a secreted protein are predictive of an increased risk of developing RA.
Another preferred embodiment is a method wherein said collection of genes comprises genes coding for a membrane bound protein. An advantage thereof is that this method is very well suited for determining risk using for example FACS analysis. Levels of membrane bound proteins can be established using a FACS and compared to a control and determine whether said level is elevated. It is also possible to determine said levels in cellular subsets based on the expression of surface markers.
Also preferred is a method as described above, further comprising testing a sample of said individual for a further factor associated with an increased risk for developing RA. By combining the method with testing a further factor that is associated with an increased risk for developing RA, the method is more accurate. Any said further factor can be used. Preferably, said further factor comprises detecting the presence of an anti-citrullinated protein/peptide antibody, and/or rheumatoid factor (RF). An advantage is that risk of developing RA within 2-5 years when using a combined test using ACPA and/or RF with the method according to the invention is increased to 95-100%. Examples of anti-citrullinated protein/peptide antibody are antiperinuclear factor (APF, Nienhuis and Mandema, Ann. Rheum Dis 1964; 23: 302-5), antikeratin antibody (AKA, Aho K J. Rheumatol 1993; 20:1278-8). These antibodies bind to substrates containing modified cittruline (Schellekens et al. J. Clin Invest 1998; 101: 273-281. Methods of using anti-citrullinated protein/peptide antibody to predict RA are described in Meyer et al., Ann Rheum Dis 2003;62:120-126 or Alexiou et al. Clinical rheumatology 2008, vol. 27, no4, pp. 511-513 or Alexiou et al. Clinical rheumatology 2008, vol. 27, no4, pp. 511-513. A method to use RF for the prediction of RA is described in Rantapaa-Dahlqvist 2003 Arthritis Rheum 48: 2741-2749 and Nielen et al. Arthritis Rheum. 2004 August; 50(8):2423-7. Such further factor may also comprise detecting an elevated level of MCP-1 , IL-10, FGF2 and/or Flt-3L or a polymorphism of a gene associated with susceptibility to RA. MCP-1 polymorphism is associated with the susceptibility to RA in patients lacking the HLA SE. Therefore, if an individuals is further tested on the presence of an MCP-1 polymorphism (described in González-Escribano et al., Human Immunology Volume 64, Issue 7, July 2003, Pages 741-744), the method of the invention will be more powerful. The use of for the presence of polymorphisms of the IL-10 gene for determining the risk of developing RA in an individual is described in MacKay Rheumatology 2003; 42: 149-153. In addition, synovial fluid (SF) levels of IL-10 are increased in patients of early RA (G A Mittal and V R Joshi J Indian Rheumatol Assoc 2002: 10: 59-60). The levels of IL-10 in SF can also be determined as a further factor in the method. FLT3-mediated conditions in autoimmune diseases are described in WO/2008/016665 and WO/2002/067760.
Preferably, a method according to the invention is combined with a medical treatment of the patient. Preferably, the patient is treated without any clinical symptoms being apparent. Such may be done by administering to said individual a composition for the treatment or prevention of RA. Said composition may be any composition that is used for the treatment or prevention of RA. A non limiting list of treatments comprises:
a) nonsteroidal anti-inflammatory drugs (NSAIDs);
b) disease-modifying anti-rheumatic drugs (DMARDs);
c) steroids; and
d) analgesics.
Preferably, said administration is done at a time point when said individual does not suffer from a clinical symptom of RA. An advantage thereof is that treatment of RA is more successful if it is started before symptoms have developed.
Gene expression levels for the selected genes in Table 2 were measured in the total cohort of autoantibody positive arthralgia patients (n=109) and compared to ACPA and RF negative healthy controls (n=25). For each sample the mean expression values were calculated for the genes selected from the two different PAM analyses i.e. 6 controls vs. 19 autoantibody positive arthralgia patients (A) and 6 controls vs. 6 ACPA+RF− arthralgia patients (B) as well as for the IFN-induced gene set (C). Mean expression values of each gene set were compared between autoantibody positive arthralgia patients and healthy controls.
*P<0.05; **P<0.01; ***P<0.001; ns not significant.
Arthritis free survival in ACPA and/or IgM-RF positive arthralgia patients, stratified for gene expression profiles I+II versus III+IV, as described in the examples section.
Arthritis free survival in ACPA and/or IgM-RF positive arthralgia patients, stratified for gene expression profiles I+II+IV versus III, as described in the examples section
Between September 2004 and March 2007, ACPA and/or IgM-RF positive arthralgia patients were included for prospective follow-up of arthritis development. Inclusion and exclusion criteria for this cohort have been described previously [19]. In summary, a trained medical doctor (WB) and a senior rheumatologist (DS) independently scored for absence of arthritis (swollen joint count [SJC]=0) in 44 joints at physical examination at the baseline visit [20]. The senior rheumatologist was blinded for the reported joint complaints and the autoantibody status. Exclusion criteria were: arthritis revealed by chart review or baseline physical examination, erosions on hand or feet X-ray examination and previous treatment with a disease modifying anti-rheumatic drug (DMARD). In total, 109 patients were available for analysis. Arthritis development during follow-up was defined as a SJC of ≧1 and was independently confirmed by both physicians.
For comparison, 25 ACPA and IgM-RF negative healthy lab donors and 25 RA patients with established disease were included. The RA patients consisted of 25 randomly selected patients starting anti-TNF treatment at the Jan van Breemen Institute. All fulfilled the ACR criteria for RA [21] and the median disease duration was 11 years. As expected, only the age of the RA patients was different from controls and autoantibody positive arthralgia patients.
An overview of the subjects' characteristics is given in Table 1.
Baseline laboratory parameters (determined batch wise at the end of the study period using the blood samples obtained at inclusion) included IgM-RF by in house enzyme-linked immunosorbent assay (ELISA) and ACPA by ELISA (second generation anti-CCP ELISA, Axis Shield, Dundee, United Kingdom). IgM-RF was calibrated with a national reference serum containing 200 IU/ml [22]. The cut-off level for IgM-RF antibody positivity was set at 30 IU/ml determined on the basis of ROC curves as described previously [6]. The cut-off level for ACPA positivity was set at 5 Arbitrary Units/ml (AU/ml) according to the manufacturer's instructions. High sensitive CRP levels were measured using CRPHS reagents with module C501 on a COBAS 6000 platform (Roche Diagnostics GmbH, Mannheim, Germany).
2.5 ml blood was drawn at baseline in PAXgene blood RNA isolation tubes (PreAnalytix, GmbH, Germany) and stored at −20° C. Tubes were thawed for 2 hours at room temperature prior to RNA isolation. Next, total RNA was isolated using the PAXgene RNA isolation kit according to the manufacturer's instructions including a DNAse (Qiagen, Venlo, Netherlands) step to remove genomic DNA.
We used 43K cDNA microarrays from the Stanford Functional Genomics Facility (http://microarray.org/sfgf/) printed on aminosilane-coated slides containing ˜20.000 unique genes. Only one batch of arrays was used for all experiments. First DNA spots were UV-crosslinked to the slide using 150-300 mJoules. Prior to sample hybridization, slides were pre-hybridized at 42 degrees Celsius for 15 minutes in a solution containing 40% ultra-pure formamide (Invitrogen, Breda, Netherlands), 5% SSC (Biochemika, Sigma), 0.1% SDS (Fluka Chemie, GmbH, Switzerland) and 50 g/ml BSA (Panvera, Madison, USA). After pre-hybridization slides were briefly rinsed in MilliQ water, thoroughly washed in boiling water and 95% ethanol and air-dried. Sample preparation and microarray hybridization was performed as described previously [23] apart from the different post-processing and pre-hybridization described above.
Data storage and filtering was performed using the Stanford Microarray Database (SMD at: http://genome-www5.stanford.edu//) [24] as described previously [25]. Raw data can be downloaded from the publicly accessible Stanford database website. We used the Q-score tool from the database as a quality measure to remove low quality spots. Q-score determined the appropriate filter criteria for: the regression correlation between channels 1 and 2, the background settings and the minimal channel intensities. After removing the low quality spots, data values with the same Unigene Identifier were averaged and all array data was median centered (genes and arrays) resulting in good quality data for 19,648 gene transcripts. Statistical Analysis of Microarrays (SAM) [26] was used to determine significantly differential expressed genes. A gene was considered as significantly differential expressed if the False Discovery Rate (FDR) was equal to or less than 5%. Cluster analysis [27] was used to define clusters of coordinately expressed genes after which the data was visualized using Treeview. PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System (Applied Biosystems, Foster City, Calif., USA) was used at http://PANTHER.appliedbiosystems.com [28,29] to interpret our data. This analysis uses the binomial statistics tool to compare the list of significantly up- or downregulated genes to a reference list in order to statistically determine over- or under representation of PANTHER classification categories such as biological processes. A Bonferroni correction was applied to correct for multiple testing and a significant p-value (p<0.05) indicates that a given category may be of biological interest. For comparison the Ontology TermFinder tool from the SMD database was used, which, in the same manner as PANTHER when given a list of genes, identifies significantly over- or under represented gene ontology terms (using an FDR cut-off value of 5%). Both the PANTHER and TermFinder tools use Gene Ontology terms ([30] at http://www.geneontology.org/), but the PANTHER classification is greatly abbreviated and simplified to facilitate high-throughput analyses.
Microarray analysis revealed 6313 significantly differential expressed genes between autoantibody negative healthy individuals and arthralgia patients positive for ACPA and/or RF (supplementary Table S1). In order to narrow down the list of genes for validation, class prediction analysis of microarrays (PAM) was applied using the gene expression data as a training set. Applying a ten-fold cross validation this analysis identified a set of only 17 genes that could correctly classify our controls from autoantibody positive arthralgia patients while only two of the 19 arthralgia patients were classified as controls. Thus, the expression of these 17 genes could predict with a class error rate of only 10.5% if a sample was derived from an autoantibody negative control or an autoantibody positive arthralgia patient at risk of developing RA. Although this analysis could predict the sample class rather well, the proportions between the two sample groups are very unbalanced. Therefore we performed a second PAM analysis in which only data was used of the 6 autoantibody negative controls and the 6 ACPA+ but RF negative persons at risk resulting in a more balanced analysis. Strikingly, the expression of 14 genes could correctly classify these two different groups with a class error rate of 0%. Interestingly, the majority of these 14 genes are interferon-induced and only one gene overlapped with the first PAM analysis. When we performed a PAM analysis between the 6 controls and the 9 ACPA−RF+ patients, most of the genes overlapped with the first PAM analysis (6 controls versus 19 autoantibody positive arthralgia patients). Genes from both PAM analyses were selected for confirmation analysis in a larger cohort using the Taqman Low Density Array (TLDA) technology which is based on quantitative real-time PCR. Available predesigned Taqman primers and probes were used to validate gene expression levels of the above PAM analyses. For 10 selected genes no predesigned TLDA assays were available and therefore these genes were excluded from analysis.
We included 20 IFN related genes derived from comparisons between active RA patients and healthy controls performed previously [23]. In addition, IFN specific genes IFNA2, IFNb1 and IFNg were included and housekeeping genes GAPDH and 18SRNA were added for normalization. Detailed information for selected target genes is listed in Table 2.
Per TLDA card it is possible to analyze the expression of 48 genes in eight samples simultaneously. The expression of selected target genes (Tables 2 and 3) was validated in the total study cohort using TLDA (Applied Biosystems). Corresponding predesigned primers and probes (Table S2) were selected from the Applied Biosystems database to set up custom TLDA cards. In 160 samples randomly dispersed over the TLDA cards the expression levels of selected genes were measured at the outsourcing company ServiceXS B.V. (Leiden, Netherlands). Total RNA (0.5 μg) was reverse transcribed into cDNA using a Revertaid H-minus cDNA synthesis kit (MBI Fermentas, St. Leon-Rot, Germany) according to the manufacturer's instructions. From each sample diluted cDNA corresponding to 100 ng total RNA was used per TLDA card. For normalization the housekeeping genes GAPDH and 18SRNA were included in the analysis. Since the inter-sample variation in expression for GAPDH was much higher than for 18SRNA, the latter was chosen for normalization. In addition, the correlation between array and TLDA data was much better with 18SRNA normalized data (data not shown). An arbitrarily chosen ACPA-/RF− control sample was selected as calibrator sample. Data was analyzed using RQ manager 1.2 (Applied Biosystems) and since this program can only analyze 10 TLDA cards in one experiment, the calibrator sample was analyzed in duplicate on two different TLDA cards (Pearson R=0.9936 between the two experiments).
Data with a Gaussian distribution, expressed as the mean and SD, were analyzed using a T test or one-way ANOVA for multiple comparisons. Outcome measurements with a non-Gaussian distribution were expressed as the median and interquartile range (IQR) and were analyzed by the Mann-Whitney U test or Kruskal-Wallis test for multiple comparisons. Categorical variables were compared using the Chi-square or Fisher's exact test as appropriate. Cox-regression hazard analysis assessed the relative risk for arthritis development in subgroups of autoantibody positive arthralgia patients clustering together with RA patients compared to the subgroups of patients who did not cluster together with RA patients. Data were analyzed using the Statistical Package for Social Sciences version 14.0 (SPSS; Chicago, Ill., United States) and were considered significant with two sided p-values less than 0.05.
The gene expression profiles of peripheral blood cells derived from 19 arthralgia patients positive for ACPA and/or RF were analyzed and compared to the profiles of 6 ACPA and RF negative healthy controls. A total of 3484 genes were significantly upregulated in the arthralgia group whereas 2829 genes were significantly downregulated compared to healthy controls (false discovery rate (FDR) 5%).
In order to visualize these results in a comprehensive manner, 255 significantly differential genes were selected whose transcript levels were at least two-fold differentially expressed between the two groups and unsupervised two-way hierarchical cluster analysis was applied, All autoantibody negative healthy controls concentrated together in one arm of the dendogram. The autoantibody positive arthralgia patients were characterized by differential expression of two gene clusters designated as cluster A, which consisted of genes that are upregulated in the risk group and cluster B with genes that are downregulated compared to the control group.
To interpret the biological function of the genes in clusters A and B, two different pathway-level analysis programs were used: TermFinder and Protein ANalysis THrough Evolutionary Relationships (PANTHER) classification analysis. These analyses look within each gene cluster for significant overrepresentation of genes involved in a biological process. Cluster A contained genes involved in several immune defense related processes: immune (system) response, (cellular) defense response, regulation of the MAPKKK cascade, protein maturation via proteolysis, interferon (IFN) mediated immunity, NK cell mediated immunity and immunity and defense. Cluster B contained genes that are downregulated in most of the persons at risk. These genes could not be classified into a biological process and apparently this cluster contains many genes with unknown function.
To confirm the differential gene expression profiles between autoantibody positive arthralgia patients and controls in a larger cohort we carefully selected 46 genes based on SAM and PAM analysis or biological relevance supplemented with an RA associated set of 20 type I IFN response genes (Table 2). Detailed information on the criteria for gene selection is described above. The expression levels of these genes were measured using TLDA in a larger cohort of autoantibody positive arthralgia patients (n=109) and autoantibody negative healthy controls (n=25). As shown in Table 2, the average expression levels of 25 of the selected genes were significantly differentially expressed between autoantibody positive arthralgia patients and controls, thereby confirming the microarray results. In addition, this analysis showed that the expression levels were highly variable within the autoantibody positive arthralgia patient group (
The previous analyses already revealed that not all persons at risk display similar expression profiles. To investigate the molecular heterogeneity within the risk group in more detail, a two-way hierarchical cluster analysis was performed using the microarray data of the autoantibody positive arthralgia patients only (n=19). Therefore, 554 genes were selected whose transcript levels deviated more than two-fold from the median expression level in at least four patients. The structure of the dendogram indicated that the autoantibody positive arthralgia patients were separated in subgroups. Indicative for the robustness of sub-classification, application of different gene selection criteria did not alter this result (data not shown).
The position of any autoantibody-positive arthralgia patient in a dendogram was determined by differentially expressed genes that were categorized in four clusters (A, B, C and D). Pathway level analysis on the coordinately regulated genes in each cluster provides a basis for the biological interpretation. Cluster A is characterized by the expression of genes involved in B-cell mediated immunity. Genes involved in macrophage, T-cell, NK-cell and granulocyte mediated immunity were characteristic for cluster B. Cluster C contained genes of yet unknown processes, although some of these genes are known for their role in inflammation (e.g. PBEF1, SSP1 and S100A12). Genes that represent chemokine and cytokine mediated immunity, such as CCL5, IFNg and IL32, were characteristic for cluster D. In aggregate, the two-way hierarchical cluster analysis demonstrated molecular heterogeneity among autoantibody positive arthralgia patients based on differential expression of genes that are involved in diverse arms of the immune response.
To confirm the molecular heterogeneity among arthralgia patients we validated these results in a large cohort of 109 auto-antibody positive arthralgia patients. Therefore, we used 87 genes, consisting of 46 genes most representative for the gene clusters A, B, C and D, which were selected based on PAM, SAM or biological relevance for further analysis using TLDA (Table 3), supplemented with 21 genes (Table 2) that were shown to be differentially expressed within the at-risk group based on the comparison between healthy controls (
Moreover, pathway-level analysis revealed that heterogeneity was based on differential expression of clusters of genes involved in essentially the same processes as mentioned earlier, i.e. IFN-mediated immunity, B-cell activation, cytokine/chemokine mediated immunity and some genes with unknown function. The gene cluster involving genes in macrophage, T-cell, NK-cell and granulocyte mediated immunity was not prominently present in this cluster diagram. Instead, the theme “hematopoiesis” was represented.
This analysis confirmed the molecular heterogeneity between autoantibody positive arthralgia patients based on the differential expression of genes reflecting skewed immune processes.
Next, we wanted to investigate the existence of commonalities between (subsets of) autoantibody positive arthralgia patients and RA patients with established disease. Therefore, we analyzed the resemblance between the gene expression characteristics of the heterogeneous at risk subgroup and those of RA patients with established disease (n=25) utilizing all 87 genes (Tables 2 and 3). Two-way hierarchical cluster analysis of the autoantibody positive arthralgia patients and RA patients on the basis of these genes divided the patients into essentially similar subgroups as observed for the subclassification of the arthralgia patients only. In order to reduce the gene set that could be used for classification purposes we performed the same cluster analysis using a selected set of 52 genes, consisting of those genes that had the highest variance in the different gene clusters. A two-way hierarchical cluster analysis based on this gene set divided the autoantibody positive arthralgia patients and RA patients into four subgroups. Most remarkable, RA patients with established disease preferentially co-clustered with at risk individuals of groups I and II compared to those in groups III and IV (Fisher's exact test P<0.01). Groups I and II were characterized by an increased expression of genes involved IFN-mediated immunity and hematopoiesis, respectively, whereas groups III and IV are associated with increased expression of genes involved in cytokine/chemokine mediated immunity and B cell activation, respectively.
These data show that increased expression of genes involved IFN-mediated immunity and/or hematopoiesis reflects a common denominator present in both, a subgroup of autoantibody positive arthralgia patients and patients with established RA. Hence the patients in this subgroup may be more likely to develop RA.
In order to study the association between the arthralgia subgroups and the development of arthritis we performed an interim analysis to determine the distribution over the different subgroups of those autoantibody positive arthralgia patients who have developed arthritis in the course of the study. Interim analysis revealed that 20 autoantibody positive arthralgia patients have developed arthritis after a median of 7 months (IQR 4-15; median follow-up of all patients is 30 [IQR 22-39] months) in a median of 3 joints (IQR 3-5).
Cox-regression analysis showed that subgroups I and II were associated with arthritis development (Hazard Ratio [HR] 5.1; 95% confidence interval [C.I.] 1.2-21.9; P=0.03). Correcting for ACPA decreased the HR to 4.1 (95% C.I. 1.0-17.9; P=0.06), resulting in a strong trend independent of ACPA status, implicating that in the presence of ACPA, gene expression profiles specific for subgroups I and II have predictive value for identifying those patients at risk for the development of RA. The presence of the ‘shared epitope’ genotype, as well as mean ACPA and RF levels were similar in both arthralgia subgroups (I and II versus III and IV, data not shown) and did not influence these results. Excluding those autoantibody positive arthralgia patients who had received two intramuscular dexamethasone injections (n=25) in a trial of primary prevention of RA did not alter these results.
Essentially, similar results were found when excluding the RA patients from the two-way hierarchical cluster analysis. This analysis with auto-antibody arthralgia patients only, resulted in the formation of a subgroup (n=25) without arthralgia patients who have converted to RA. Inclusion of a “dummy” RA converter case in this group, which is required for Cox regression analysis resulted in borderline significance (Hazard Ratio [HR] 7.0; 95% confidence interval [C.I.] 0.94-52.2; P=0.057;
These analyses reveal that autoantibody positive arthralgia patients with high expression of genes involved in IFN mediated immunity, cytokine/chemokine mediated immunity, or hematopoiesis are more likely to develop arthritis. Conversely, autoantibody positive arthralgia patients with high expression of genes involved in B-cell mediated immunity may be protected against development of arthritis or progression towards disease pathogenesis may be suppressed.
In this document we demonstrate the heterogeneous nature of ACPA+ and/or RF+ arthralgia patients at risk for development of RA. We identified a set of genes whose expression profiles segregate arthralgia patients at risk for RA into four different groups. Most interestingly, the group of patients that is characterized by increased expression of genes involved in humoral immunity are devoid of cases who have developed arthritis in the follow-up period. Subgroups that are characterized by a gene signature of IFN-mediated immunity, cytokine and chemokine activity, and haematopoiesis all contain at risk persons who have developed arthritis. Our results indicate that predisposition for the development of arthritis can already be observed in the peripheral blood gene expression profile of ACPA+ and/or RF+ individuals at risk. Thus, the gene set that distinguishes autoantibody positive arthralgia patients can be used to predict the diagnosis in preclinical RA. This signature can be used to decide on preventive treatment for the development of RA. Moreover, genes that are associated with development of RA are potential targets for the rational to development of new arthritis drugs. The presence of these gene expression characteristics increases the risk for arthritis development ˜5-fold. These results may provide insight in the pathogenic mechanism leading to development of RA and are indicative for an additional biomarkers (set) for the detection of individuals at risk for the development of rheumatoid arthritis.
The functional annotation for the genes provides insight into the underlying biological mechanism leading to progression to and prevention of arthritis. Genes involved IFN-mediated immunity, cytokine and chemokine activity, and haematopesis with the in B-cell immunology are significantly upregulated in the preventive signature
In this study we performed a comparative analysis of the individuals at risk for RA and patients with established RA. Patients with established RA exhibit similar expression patterns to the autoantibody positive arthralgia patients who cluster in the groups containing arthritis follow-up cases. We conclude from this comparison that changes in the peripheral blood transcript level that are characteristic for established RA are already present in the asymptomatic phase of disease.
The different gene expression profiles provide evidence for the assignment of at least four subgroups of persons at risk that display highly distinct gene expression profiles reflecting pronounced differences in the immune status. Previously, we have shown molecular heterogeneity of RA patients based on differential expression of type I IFN induced genes [23]. In the present study we could not only confirm this sub-classification of RA patients with established disease, but we could also classify the autoantibody positive arthralgia patients who later developed arthritis into an IFN-high and IFN-low group. The IFN-low group of RA and autoantibody positive arthralgia patients (group II) was characterized by increased expression levels of genes involved in hematopoiesis. The IFN-high RA patients had a significantly higher mean age, a longer disease duration and a higher mean sedimentation rate, but did not differ with respect to disease activity score, CRP levels, RF and ACPA levels compared to the IFN-low RA patients (data not shown). However, the autoantibody positive arthralgia patients in the different subgroups did not differ in mean age, ACPA, IgM-RF and CRP levels (data not shown). Therefore, these two different molecular profiles—IFN and hematopoiesis—separating the RA patients may represent two different mechanisms involved in disease pathogenesis.
Previously, several research groups, including our own, have shown increased expression levels of IFN related genes in different chronic autoimmune diseases such as RA [23], SLE [33], SSc [34], myositis [35] and MS [25]. Whereas in SLE type I IFN is associated with disease severity, in MS it is used as a treatment strategy with partial beneficial effects [36]. These opposing effects of type I IFN in autoimmunity are highly intriguing and suggest that the in vivo interplay between type I IFN and other inflammatory cytokines may determine the balance between protective immunity and autoimmunity [37]. From the current study it is tempting to speculate that type I IFN is involved in inducing disease, although several studies have demonstrated also a possible protective role for type I IFN in RA [38,39]. Type I IFNs are known to upregulate MHC expression and to induce differentiation of monocytes into antigen presenting dendritic cells (DCs) [40,41]. In addition, since immature DCs control peripheral tolerance by deletion of circulating autoreactive T cells, continued type I IFN induced maturation of DCs may lead to a break of peripheral tolerance through activation of autoreactive cells resulting in immunity to self-antigens. Accordingly, a recent study showed that IFN induced protein IFIT4 might play a role in promoting monocyte differentiation into DC-like cells and subsequent regulation of Th1 cell differentiation [42]. These properties of type I IFN may contribute to the initiation of arthritis in those autoantibody positive arthralgia patients who display increased levels of IFN induced genes.
Hence, whereas ACPA/RF seems involved in the development of arthritis the concomitant activation of the type I IFN system might contribute by facilitating a break of tolerance that triggers the process of autoimmunity leading to chronic inflammation.
The group of genes classified as being involved in hematopoiesis contains two genes (KLF1 and ERAF) involved in erythrocyte development, whereas two other genes (BCL2L1 and BAG1) have anti-apoptotic capacities. Interestingly, the gene DARC encodes for an antigen receptor for chemokines and is present on the surface of venular endothelial cells, cerebral neurons and erythrocytes of Duffy antigen positive individuals [43]. On erythrocytes, DARC is described to act as a sink for free pro-inflammatory chemokines present in the bloodstream in order to limit dissemination through blood into other organs and tissues as well as to prevent leukocyte desensitization [44]. A temporal expression pattern was shown for DARC in the endothelial venules of inflamed synovium with increased expression in the early stage of RA and lower expression in patients with prolonged disease [45]. Moreover, the expression of DARC on endothelial cells was essential for the recruitment of neutrophils in a multicellular model of RA synovium [46]. Our results in peripheral blood extend these findings in inflamed synovium and suggest that DARC expression in both the tissue as well as in the peripheral blood compartment plays a functional role in leukocyte migration into the synovium. How these genes are related to disease pathogenesis in combination with the others in the gene cluster (RBM38, TESC and SELENBP1) remains to be established.
Another group of autoantibody positive arthralgia patients revealed abundant expression of genes indicative for alterations in the humoral immunity i.e. B-cell mediated immunity. Both CD79A and CS79B are necessary for the expression and function of the B-cell antigen receptor and MS4A1 encodes for the B-cell antigen CD20. Whereas CD19 is involved in activating the B-cell receptor, FCRL5 inhibits B-cell activation [47]. This subgroup of autoantibody positive arthralgia patients clustered together with only one autoantibody negative RA patient and none of the 24 autoantibody positive arthralgia patients developed arthritis so far. This suggests that patients with a relatively high expression of genes involved in B-cell mediated immunity may be protected against developing disease. This is in line with the hypothesis that B-cells can act as regulators of autoimmune pathology and protect against disease [48]. A recent study using an established mouse model showed that B-cell receptor revision occurs in antigen-activated B-cells and requires the function of IL7R [49]. Interestingly, the receptor revision acted as an important mechanism of peripheral tolerance in order to diminish autoimmune humoral responses. In addition, although RA patients with active disease can successfully be treated with B-cell depleting agents [50], the rodent model of collagen-induced arthritis (CIA) has shown that transfer of activated B-cells has protective effects [51,52]. This could mean that the role of B-cells in protecting against or inducing autoimmunity may depend of the phase of disease.
Another explanation for the relative increased expression of genes involved in B-cell activation may be that it reflects the amount of B-cells and possible migration of B-cells from the periphery to affected joints in those autoantibody positive arthralgia patients who will develop arthritis.
The invention therefore provides another method for determining whether an individual has an increased risk of developing RA by determining the amount of B lymphocytes in circulation and comparing that amount with a normal value and concluding that an individual has an increased risk of developing RA if the amount in said individual is decreased in comparison to the normal value.
The amount of B-cells in circulation may be determined by conventional techniques employing specific B-cell markers but also by using secretion products such as immunoglobulines.
The last subgroup of autoantibody positive arthralgia patients showed evidence for increased cytokine and chemokine mediated immunity especially related to NK-cells and apoptosis. This data is partially in line with a previous study wherein early rheumatoid arthritis patients showed increased blood levels of pro-inflammatory cytokines associated with autoantibodies targeting citrullinated antigens [53]. On the other hand, using a prospective cohort of Norwegian blood donors, Jorgensen and co-workers showed that cytokines and cytokine related markers (i.e. several interleukins, TNF and IFNg) are generally negative in case sera obtained more than 5 years before the diagnosis of RA [11]. Only TNF was significantly increased in case sera less than 5 years before diagnosis compared to controls. Most other cytokines were only significantly increased after the diagnosis of RA, suggesting that these markers appear to be upregulated rather late in RA development. In our study, the identified genes involved in cytokine mediated immunity are, except for IFNg, distinct from the cytokines measured in the previous studies, making a direct comparison impossible. Although this patient group did not cluster together with any RA patient with active disease, two of the autoantibody positive arthralgia patients developed arthritis meaning that these cytokine related genes may be involved in inducing arthritis.
The current study demonstrates that especially ACPA and/or IgM positive arthralgia patients who display increased expression levels of genes involved in IFN-mediated immunity or hematopoiesis have an increased risk for development of arthritis.
An additional set of genes that may be used in a method according to the invention is provided in tables 9 and 10. The expression of the genes presented therein were found to be increased in patients who developed symptoms of RA, some even after a considerable amount of time, even up to 5 or 10 years (Table 9). The expression of the genes shown in table 10 were decreased in patients that did not develop symptoms of RA in the same amount of time.
The genes that serve a role as classifiers between patients who developed RA and those individuals that did not develop RA based on significance analysis using Prediction Analysis of Microarrays (PAM) (Tibshirani R. et al., Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl. Acad. Sci. USA 2002, 99: 6567-6572)) are tabulated in Table 11.
†Reason for selection (See M&M for details): 1. PAM analysis: 6 controls vs. 19 persons at risk; 2. PAM analysis: 6 controls vs. 6 ACPA+/RF− at risk; 3. Active RA vs. healthy controls: IFN-induced genes [23]; 4. Explorative SAM analyses between subgroups of patients; IFN: type I/II IFN specific gene; ND: Not detected; NA: Not applicable; ns: not significant; control: gene used for normalization
‡PCR amplification failed in five cDNA samples (2 controls, 2 autoantibody positive arthralgia patients and 1 RA patient).
Homo sapiens translocase of inner mitochondrial membrane 10 homolog
Homo sapiens lymphocyte antigen 6 complex. locus E mRNA.
Homo sapiens tripartite motif-containing 22 mRNA.
Homo sapiens sterile alpha motif domain containing 9-like mRNA
Homo sapiens myxovirus (influenza virus) resistance 1. interferon-induc
Homo sapiens 2′,5′-oligoadenvlate svnthetase 1, 40/46 kDa. transc
Homo sapiens 2′-.5′-oligoadenvlate svnthetase-like. transcript va
Homo sapiens 2′-5′-oligoadenvlate svnthetase-like, transcript va
Homo sapiens epithelial stromal interaction 1 (breast), transc
Homo sapiens interferon-induced protein with tetratricopeptide repeats
Homo sapiens interferon. alpha-inducible protein 6, transcript v
Homo sapiens interferon-induced protein 44, mRNA.
Homo sapiens ISG15 ubiquitin-like modifier, mRNA.
Homo sapiens 2′-5′-oligoadenvlate svnthetase 3, 100 kDa, mRNA.
Homo sapiens interferon-induced protein with tetratricopeptide repeats
Homo sapiens hect domain and RLD 5, mRNA.
Homo sapiens radical S-adenosyl methionine domain containing 2,
Homo sapiens interferon-induced protein 44-like. mRNA.
Homo sapiens interferon-induced protein with tetratriconeptide repeats
Homo sapiens XIAP associated factor 1, transcript variant 2, mRNA
Homo sapiens signal transducer and activator of transcription 2, 113 kDa
Homo sapiens poly (ADP-ribose) polymerase family, member 12, m
Homo sapiens interferon induced transmembrane protein 3 (1-8U)
Homo sapiens eukarvotic translation initiation factor 2-alpha kinase 2
Homo sapiens poly (ADP-ribose) polymerase family, member 14, m
Homo sapiens myocardial infarction associated transcript (non-protein c
Homo sapiens inositol 1,4,5-triphosphate receptor. type 3, mRNA
Homo sapiens family with sequence similarity 39. member E, mRN
Homo sapiens S100 calcium binding protein A4. transcript varia
Homo sapiens actin related protein 2/3 complex, subunit 1B, 41 kDa
Homo sapiens SEC14-like 1 (S. cerevisiae). mRNA.
Homo sapiens DEAD (Asp-Glu-Ala-Asp) box-polypeptide 23, mRNA.
Homo sapiens protein phosphatase 1. catalytic subunit, alpha isoform
Homo sapiens diaphanous homolog 1 (Drosophila). mRNA.
Homo sapiens isocitrate dehvdrogenase 3 (NAD+) beta. nuclear ge
Homo sapiens Janus kinase 1 (a protein tyrosine kinase). mRNA.
Homo sapiens diacylqlvcerol kinase. alpha 80 kDa, transcript vari
Homo sapiens Rho GTPase activating protein 30, transcript va
Homo sapiens macrophage erythroblast attacher, transcript varian
Homo sapiens phosphogluconate dehvdrogenase. mRNA.
Homo sapiens dysferlin, limb girdle muscular dystrophy 2B (autosomal
Homo sapiens sema domain, immunoglobulin domain (Ig),
Homo sapiens y-ral simian leukemia viral oncogene homolog B (ras
Homo sapiens neutrophil cytosolic factor 1. (chronic granulomatous dise
Homo sapiens formvl peptide receptor 2. transcript variant 1, mR
Homo sapiens zinc finger, NFX1-type containing 1. mRNA.
Homo sapiens adenosine deaminase, RNA-specific, transcript varia
Homo sapiens cathepsin A, mRNA.
Homo sapiens plasminogen activator. urokinase receptor. transcr
Homo sapiens aldolase A. fructose-bisphosphate, transcript vari
Homo sapiens phosphoinositide-3-kinase, catalytic. delta polypeptide (P
Homo sapiens chemokine (C—X—C motif) receptor 4, transcript vat
Homo sapiens interleukin 8 receptor. beta, mRNA.
Homo sapiens 0-linked N-acetylglucosamine (GlcNAc) transferase
Homo sapiens T cell receptor alpha locus. mRNA (cDNA clone MGC:
Homo sapiens full-length cDNA clone CS0CAP005YH21 of Thymus
Homo sapiens voltage-dependent anion channel 1, mRNA.
Homo sapiens DnaJ (Hsp40) homolog, subfamily A. member 3, mRNA
Homo sapiens PTC7 Protein phosphatase homolog (S. cerevisiae),
Homo sapiens alkaline phosphatase. liver/bone/kidney, mRNA
Homo sapiens carbonic anhydrase IV. mRNA
Homo sapiens peptidoglycan recognition protein 1, mRNA.
Homo sapiens solute carrier family 11 (proton-coupled divalent metal io
Homo sapiens KIAA1324, mRNA.
Homo sapiens peptidyl arginine deiminase. type IV, mRNA.
Homo sapiens matrix metallopeptidase 9 (gelatinase B, 92 kDa
Homo sapiens cvtidine deaminase, mRNA.
Homo sapiens cvsteine-rich secretory protein LCCL domain containing 2 (
Homo sapiens myosin IF, mRNA
Homo sapiens membrane bound 0-acyltransferase domain containing 7 (
Homo sapiens interleukin 8 receptor, alpha, mRNA.
Homo sapiens hect domain and RLD 5 (HERC5), mRNA
Homo sapiens serum deprivation response (phosphatidylserine
Homo sapiens Janus kinase 1 (a protein tyrosine kinase) (JAK1),
Homo sapiens dysferlin, limb girdle muscular dystrophy 2B (autosomal
Homo sapiens v-ral simian leukemia viral oncogene homolog B (ras
Homo sapiens Rho GTPase activating protein 30 (ARHGAP30),
Homo sapiens ribonuclease, RNase A family, 2 (liver, eosinophil-
Homo sapiens T cell receptor alpha locus, mRNA (cDNA clone
Homo sapiens hCG39912 (LOC642250), mRNA.
Homo sapiens ribosomal protein L17 (RPL17), transcript variant 2,
Homo sapiens cytochrome P450, family 27, subfamily A, polypeptide 1
Homo sapiens hairy and enhancer of split 4 (Drosophila) (HES4),
Homo sapiens KIAA1324 (KIAA1324), mRNA.
Homo sapiens interferon-induced protein 44 (IF144), mRNA.
Homo sapiens chromosome 15 open reading frame 15 (C15orf15),
Homo sapiens regulator of G-protein signaling 18 (RGS18), mRNA.
Homo sapiens cathepsin Z (CTSZ), mRNA.
Homo sapiens eukaryotic translation initiation factor 2C, 2 (EIF2C2),
Homo sapiens Charcot-Leyden crystal protein (CLC), mRNA.
Homo sapiens mediator complex subunit 25 (MED25), mRNA.
Homo sapiens protein phosphatase 1, catalytic subunit, alpha isoform
Homo sapiens aldolase A, fructose-bisphosphate (ALDOA),
Homo sapiens leukocyte specific transcript 1 (LST1), transcript variant
Homo sapiens additional sex combs like 2 (Drosophila) (ASXL2),
Homo sapiens eukaryotic translation initiation factor 2-alpha kinase 2
Homo sapiens matrix metallopeptidase 9 (gelatinase B, 92 kDa
Homo sapiens interferon induced protein with tetratricopeptide repeats
Homo sapiens coiled-coil domain containing 72 (CCDC72), mRNA
Homo sapiens interferon-induced protein 44-like (IF144L), mRNA.
Homo sapiens ISG15 ubiquitin-like modifier (ISG15), mRNA.
Homo sapiens lymphocyte antigen 96 (LY96), mRNA.
Homo sapiens histidine triad nucleotide binding protein 1 (HINT1),
Homo sapiens C-type lectin domain family 12, member A (CLEC12A),
Homo sapiens basigin (Ok blood group) (BSG), transcript variant 3,
Homo sapiens interferon, alpha-inducible protein 6 (IFI6), transcript
Homo sapiens XIAP associated factor 1 (XAF1), transcript variant 2,
Homo sapiens small nuclear ribonucleoprotein polypeptide G
Homo sapiens neutrophil cytosolic factor 1, (chronic granulomatous
Homo sapiens peptidyl arginine deiminase, type IV PADI4),
Homo sapiens ribosomal protein S15a (RPS15A), transcript variant 1,
Homo sapiens proteasome (prosome, macropain) subunit, alpha type,
Homo sapiens voltage-dependent anion channel 1 (VDAC1), mRNA.
Homo sapiens endoplasmic reticulum aminopeptidase 2 (ERAP2),
Homo sapiens SEC14-like 1 (S. cerevisiae) (SEC14L1), mRNA.
Homo sapiens cytochrome c oxidase subunit VIIc (COX7C), nuclear
Homo sapiens CD79a molecule, immunoglobulin-associated alpha
Homo sapiens syndecan binding protein (syntenin) (SDCBP),
Homo sapiens S100 calcium binding protein A4 (S100A4), transcript
Homo sapiens orosomucoid 1 (ORM1), mRNA
Homo sapiens ubiquinol-cytochrome c reductase, complex III
Homo sapiens C-type lectin domain family 12, member A (CLEC12A),
Number | Date | Country | Kind |
---|---|---|---|
08167570.4 | Oct 2008 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2009/064099 | 10/26/2009 | WO | 00 | 5/17/2011 |