AUTO-ANTIGEN BIOMARKERS FOR LUPUS

Abstract
The presence of certain auto-antibodies indicates that a subject has lupus. The auto-antibodies recognise antigens listed in Table 1 herein. These auto-antibodies and/or the antigens themselves can be used as biomarkers for assessing lupus in a subject.
Description

This application claims the benefit of UK application 1017520.6 (filed 15 Oct. 2010), the complete contents of which are hereby incorporated herein by reference for all purposes.


TECHNICAL FIELD

The invention relates to biomarkers useful in diagnosis, monitoring and/or treatment of lupus.


BACKGROUND

Systemic lupus erythematosus (SLE) or lupus is a chronic autoimmune disease that can affect the joints and almost every major organ in the body, including heart, kidneys, skin, lungs, blood vessels, liver, and the nervous system. As in other autoimmune diseases, the body's immune system attacks the body's own tissues and organs, leading to inflammation. A person's risk to develop lupus appears to be determined mainly by genetic factors, but environmental factors, such as infection or stress may trigger the onset of the disease. The course of lupus varies, and is often characterised by alternating periods of flares, i.e. increased disease activity, and periods of remission. Subjects with lupus may develop a variety of conditions such as lupus nephritis, musculoskeletal complications, haematological disorders and cardiac inflammation.


Lupus occurs approximately 10 times more frequently in women than in men. It is part of a family of closely related disorders known as the connective tissue diseases which also includes rheumatoid arthritis (RA), polymyositis-dermatomyositis (PM-DM), systemic sclerosis (SSc or scleroderma), Sjogren's syndrome (SS) and various forms of vasculitis. These diseases share a number of clinical symptoms and abnormalities. Subjects suffering from lupus can present with a variety of diverse symptoms, many of which occur in other connective tissue diseases, fibromalgia, dermatomyositis or haematological conditions such as idiopathic thrombocytopenic purpura. Diagnosis can therefore be challenging.


It takes on average 4 years to obtain a correct diagnosis for lupus, in part due to the range and complexity of symptoms and the necessity to discount other possible causes. The American College of Rheumatologists has established eleven criteria to assist in the diagnosis of lupus for the inclusion of patients in clinical trials and developed the SLE Disease Activity Index (SLEDAI) to assess lupus activity. In addition to considering medical history, the subject's age and gender and a physical examination, a number of laboratory tests are also available to assist in diagnosis. These include tests for the presence of antinuclear antibodies (ANA) and tests for other auto-antibodies such as anti-DNA, anti-Sm, anti-RNP, anti-Ro (SSA), anti-Lb (SSB) and anti-cardiolipin antibodies. Other diagnostic tools include tests for serum complement levels, urine analysis, and biopsies of an affected organ. Some of these criteria are very specific for lupus but have poor sensitivity, but none of these tests provides a definitive diagnosis and so the results of multiple differing tests must be integrated to enable a clinical judgement by an expert. For example, a positive ANA test can occur due to infections or rheumatic diseases, and even healthy people without lupus can test positive. The ANA test has high sensitivity (93%) but low specificity (57%) [1]. Antibodies to double-stranded DNA and/or nucleosomes were associated with lupus over 50 years ago and active lupus is generally associated with IgG. The sensitivity and specificity of the Farr test for anti-DNA is 78.8% and 90.9%, respectively [2]. Thus it is clear that the status of multiple autoantibody species can provide information on the lupus status of a patient but to date these clinical analyses are performed individually in a piecemeal fashion. The necessity for a unified test offering both high sensitivity and specificity for lupus is clear.


Many autoantibody species have been described in connection with lupus [3] and their cognate antigens include numerous classes of proteins, subcellular organs such as the nucleus and non-protein species such as phospholipid and DNA. Frequently the antigen is either poorly described or uncharacterised at the molecular level e.g. antimitochondrial antibodies. Given the challenges in obtaining a correct diagnosis, there is a need for new or improved in vitro tests with better specificity and sensitivity to enable non-invasive diagnosis of lupus. Such tests can be based on biomarkers that can be used in methods of diagnosing lupus, for the early detection of lupus, subclinical or presymptomatic lupus or a predisposition to lupus, or for monitoring the progression of lupus or the likelihood to transition from remission to flare or vice versa, or the efficacy of a therapeutic treatment thereof. Such improved diagnostic methods would provide significant clinical benefit by enabling earlier active management of lupus while reducing unnecessary intervention caused by mis-diagnosis. It is an object of the invention to meet these needs.


DISCLOSURE OF THE INVENTION

The invention is based on the identification of correlations between lupus and the level of auto-antibodies against certain auto-antigens. The inventors have identified antigens for which the level of auto-antibodies can be used to indicate that a subject has lupus. Auto-antibodies against these antigens are present at significantly different levels in subjects with lupus and without lupus and so the auto-antibodies and their antigens function as biomarkers of lupus. Detection of the biomarkers in a subject sample can thus be used to improve the diagnosis, prognosis and monitoring of lupus. Advantageously, the invention can be used to distinguish between lupus and other autoimmune diseases, particularly other connective tissue diseases such as rheumatoid arthritis (RA), polymyositis-dermatomyositis (PM-DM), systemic sclerosis (SSc or scleroderma), Sjogren's syndrome and vasculitis where inflammation and similar symptoms are common.


The inventors have identified 50 such biomarkers and the invention uses at least one of these to assist in the diagnosis of lupus by measuring level(s) of auto-antibodies against the antigen(s) and/or the level(s) of the antigen(s) themselves. The biomarker can be (i) auto-antibody which binds to an antigen in Table 1 and/or (ii) an antigen in Table 1, but is preferably the former.


The invention thus provides a method for analysing a subject sample, comprising a step of determining the level of a Table 1 biomarker in the sample, wherein the level of the biomarker provides a diagnostic indicator of whether the subject has lupus.


Analysis of a single Table 1 biomarker can be performed, and detection of the auto-antibody/antigen can provide a useful diagnostic indicator for lupus even without considering any of the other Table 1 biomarkers. The sensitivity and specificity of diagnosis can be improved, however, by combining data for multiple biomarkers. It is thus preferred to analyse more than one Table 1 biomarker. Analysis of two or more different biomarkers (a “panel”) can enhance the sensitivity and/or specificity of diagnosis compared to analysis of a single biomarker. Each different biomarker in a panel is shown in a different row in Table 1 i.e. measuring both auto-antibody which binds to an antigen listed in Table 1 and the antigen itself is measurement of a single biomarker rather than of a panel.


Thus the invention provides a method for analysing a subject sample, comprising a step of determining the levels of x different biomarkers of Table 1, wherein the levels of the biomarkers provide a diagnostic indicator of whether the subject has lupus. The value of x is 2 or more e.g. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more (e.g. up to 50). These panels may include (i) any specific one of the 50 biomarkers in Table 1 in combination with (ii) any of the other 49 biomarkers in Table 1. Suitable panels are described below and panels of particular interest include those listed in Tables 2 to 16. Preferred panels have from 2 to 15 biomarkers, as using >15 of them adds little to sensitivity and specificity.


The Table 1 biomarkers can be used in combination with one or more of: (a) known biomarkers for lupus, which may or may not be auto-antibodies or antigens; and/or (b) other information about the subject from whom a sample was taken e.g. age, genotype (genetic variations can affect auto-antibody profiles [4]), weight, other clinically-relevant data or phenotypic information; and/or (c) other diagnostic tests or clinical indicators for lupus. Such combinations can enhance the sensitivity and/or specificity of diagnosis. Thus the invention provides a method for analysing a subject sample, comprising a step of determining:

    • (a) the level(s) of y Table 1 biomarker(s), wherein the levels of the biomarkers provide a diagnostic indicator of whether the subject has lupus; and also one or more of:
    • (b) if a sample from the subject contains a known biomarker selected from the group consisting of autoantibodies including ANA, anti-Smith, anti-dsDNA, anti-phospholipid, anti-ssDNA, anti-RNP, anti-Ro, anti-Lb, anti-cardiolipis, and/or anti-histone (and optionally, any other known biomarkers e.g. see above); wherein detection of the known biomarker provides a second diagnostic indicator of whether the subject has lupus;
    • (c) if the subject has one or more of a false positive serological test for syphilis, serositis, pleuritis, pericarditis, oral ulcers, nonerosive arthritis of two or more peripheral joints, photosensitivity, hemolytic anemia, leukopenia, lymphopenia, thrombocytopenia, hypocomplementemia, renal disorder, seizures, psychosis, malar rash, and/or discoid rash, wherein a positive test for these provides a third diagnostic indicator of whether the subject has lupus;
    • (d) the subject's age and gender,
    • and combining the different diagnostic indicators to provide an aggregate diagnostic indicator of whether the subject has lupus.


The samples used in (a) and (b) may be the same or different.


The value of y is 1 or more e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 (e.g. up to 50). When y>1 the invention uses a panel of different Table 1 biomarkers.


The invention also provides, in a method for diagnosing if a subject has lupus, an improvement consisting of determining in a sample from the subject the level(s) of y biomarker(s) of Table 1, wherein the level(s) of the biomarker(s) provide a diagnostic indicator of whether the subject has lupus.


The invention also provides a method for diagnosing a subject as having lupus, comprising steps of: (i) determining the levels of y biomarkers of Table 1 in a sample from the subject; and (ii) comparing the determination from step (i) to data obtained from samples from subjects without lupus and/or from subjects with lupus, wherein the comparison provides a diagnostic indicator of whether the subject has lupus. The comparison in step (ii) can use a classifier algorithm as discussed in more detail below.


The invention also provides a method for monitoring development of lupus in a subject, comprising steps of: (i) determining the levels of z1 biomarker(s) of Table 1 in a first sample from the subject taken at a first time; and (ii) determining the levels of z2 biomarker(s) of Table 1 in a second sample from the subject taken at a second time, wherein: (a) the second time is later than the first time; (b) one or more of the z2 biomarker(s) were present in the first sample; and (c) a change in the level(s) of the biomarker(s) in the second sample compared with the first sample indicates that lupus is in remission or is progressing. Thus the method monitors the biomarker(s) over time, with changing levels indicating whether the disease is getting better or worse.


The disease development can be either an improvement or a worsening, and this method may be used in various ways e.g. to monitor the natural progress of a disease, or to monitor the efficacy of a therapy being administered to the subject. Thus a subject may receive a therapeutic agent before the first time, at the first time, or between the first time and the second time. Increased levels of antibodies against a particular antigen may be due to “epitope spreading”, in which additional antibodies or antibody classes are raised to antigens against which an antibody response has already been mounted [5].


The value of z1 is 1 or more e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 (e.g. up to 50). The value of z2 is 1 or more e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 (e.g. up to 50). The values of z1 and z2 may be the same or different. If they are different, it is usual that z1>z2 as the later analysis (z2) can focus on biomarkers which were already detected in the earlier analysis; in other embodiments, however, z2 can be larger than z1 e.g. if previous data have indicated that an expanded panel should be used; in other embodiments z2=z1 e.g. so that, for convenience, the same panel can be used for both analyses. When z1>1 or z2>1, the biomarkers are different biomarkers.


The invention also provides a method for monitoring development of lupus in a subject, comprising steps of: (i) determining the level of at least w1 Table 1 biomarkers in a first sample taken at a first time from the subject; and (ii) determining the level of at least w2 Table 1 biomarkers in a second sample taken at a second time from the subject, wherein: (a) the second time is later than the first time; (b) at least one biomarker is common to both the w1 and w2 biomarkers; (c) the level of at least one biomarker common to both the w1 and w2 biomarkers is different in the first and second samples, thereby indicating that the lupus is progressing or regressing. Thus the method monitors the range of biomarkers over time, with a broadening in the number of detected biomarkers indicating that the disease is getting worse. As mentioned above, this method may be used to monitor disease development in various ways.


The value of w1 is 1 or more e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 (e.g. up to 50). The value of w2 is 2 or more e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 (e.g. up to 50). The values of w1 and w2 may be the same or different. If they are different, it is usual that w2>w1, as the later analysis should focus on a biomarker panel that is at least as wide as the number already detected in the earlier analysis. There will usually be an overlap between the w1 and w2 biomarkers (including situations where they are the same, such that the same biomarkers are measured at two time points) but it is also possible for w1 and w2 to have no biomarkers in common.


Where the methods involve a first time and a second time, these times may differ by at least 1 day, 1 week, 1 month or 1 year. Samples may be taken regularly. The methods may involve measuring biomarkers in more than 2 samples taken at more than 2 time points i.e. there may be a 3rd sample, a 4th sample, a 5th sample, etc.


The invention also provides a diagnostic device for use in diagnosis of lupus, wherein the device permits determination of the level(s) of y Table 1 biomarkers. The value of y is defined above. The device may also permit determination of whether a sample contains one or more of the known lupus biomarkers mentioned above e.g. ANA and/or anti-DNA antibodies.


The invention also provides a kit comprising (i) a diagnostic device of the invention and (ii) instructions for using the device to detect y of the Table 1 biomarkers. The value of y is defined above. The kit is useful in the diagnosis of lupus.


The invention also provides a kit comprising reagents for measuring the levels of x different Table 1 biomarkers. The kit may also include reagents for determining whether a sample contains one or more of the known lupus biomarkers mentioned above e.g. ANA and/or anti-DNA antibodies. The value of x is defined above. The kit is useful in the diagnosis of lupus.


The invention also provides a kit comprising components for preparing a diagnostic device of the invention. For instance, the kit may comprise individual detection reagents for x different biomarkers, such that an array of those x biomarkers can be prepared.


The invention also provides a product comprising (i) one or more detection reagents which permit measurement of x different Table 1 biomarkers, and (ii) a sample from a subject.


The invention also provides a software product comprising (i) code that accesses data attributed to a sample, the data comprising measurement of y Table 1 biomarkers, and (ii) code that executes an algorithm for assessing the data to represent a level of y of the biomarkers in the sample. The software product may also comprise (iii) code that executes an algorithm for assessing the result of step (ii) to provide a diagnostic indicator of whether the subject has lupus. As discussed below, suitable algorithms for use in part (iii) include support vector machine algorithms, artificial neural networks, tree-based methods, genetic programming, etc. The algorithm can preferably classify the data of part (ii) to distinguish between subjects with lupus and subjects without based on measured biomarker levels in samples taken from such subjects. The invention also provides methods for training such algorithms.


The invention also provides a computer which is loaded with and/or is running a software product of the invention.


The invention also extends to methods for communicating the results of a method of the invention. This method may involve communicating assay results and/or diagnostic results. Such communication may be to, for example, technicians, physicians or patients. In some embodiments, detection methods of the invention will be performed in one country and the results will be communicated to a recipient in a different country.


The invention also provides an isolated antibody (preferably a human antibody) which recognises one of the antigens listed in Table 1. The invention also provides an isolated nucleic acid encoding the heavy and/or light chain of the antibody. The invention also provides a vector comprising this nucleic acid, and a host cell comprising this vector. The invention also provides a method for expressing the antibody comprising culturing the host cell under conditions which permit production of the antibody. The invention also provides derivatives of the human antibody e.g. F(ab′)2 and F(ab) fragments, Fv fragments, single-chain antibodies such as single chain Fv molecules (scFv), minibodies, dAbs, etc.


The invention also provides the use of a Table 1 biomarker as a biomarker for lupus.


The invention also provides the use of x different Table 1 biomarkers as biomarkers for lupus. The value of x is defined above. These may include (i) any specific one of the 50 biomarkers in Table 1 in combination with (ii) any of the other 49 biomarkers in Table 1.


The invention also provides the use as combined biomarkers for lupus of (a) at least y Table 1 biomarker(s) and (b) biomarkers including autoantibodies including ANA, anti-Smith, anti-dsDNA, anti-phospholipid, anti-ssDNA, anti-histone, false positive test for serological test for syphilis, indicators of serositis, oral ulcers, arthritis, photosensitivity haematological disorder, renal disorder, antinuclear antibody, immunologic disorder, neurologic disorder, malar rash, discoid rash (and optionally, any other known biomarkers e.g. see above). The value of y is defined above. When y>1 the invention uses a panel of biomarkers of the invention.


In all embodiments of the invention, the biomarker(s) from Table 1 is/are preferably those in Table 18. Table 18 is a preferred subset of 44 of the 50 biomarkers in Table 1. Even more preferably, the biomarker(s) from Table 1 is/are also in Table 20. Table 20 is a preferred subset of 17 of the 50 biomarkers in Table 1.


Biomarkers of the Invention

Auto-antibodies against 145 different human antigens have been identified and these can be used as lupus biomarkers. Details of the 145 antigens are given in Table 17. Within the 145 antigens, 50 human antigens are particularly useful for distinguishing between samples from subjects with lupus and from subjects without lupus. Details of these 50 antigens are given in Table 1. A preferred subset of antigens are the 44 antigens given in Table 18. An even more preferred subset of antigens is the 17 antigens given in Table 20. Further auto-antibody biomarkers can be used in addition to these 50 (e.g. any of the other biomarkers listed in Table 17). The sequence listing provides an example of a natural coding sequence for each of these antigens. These specific coding sequences are not limiting on the invention, however, and auto-antibody biomarkers may recognise variants of polypeptides encoded by these natural sequences (e.g. allelic variants, polymorphic forms, mutants, splice variants, or gene fusions), provided that the variant has an epitope recognised by the auto-antibody. Details on allelic variants of or mutations in human genes are available from various sources, such as the ALFRED database [6] or, in relation to disease associations, the OMIM [7] and HGMD [8] databases. Details of splice variants of human genes are available from various sources, such as ASD [9].


As mentioned above, detection of a single Table 1 biomarker can provide useful diagnostic information, but each biomarker might not individually provide information which is useful i.e. auto-antibodies against a Table 1 antigen may be present in some, but not all, subjects with lupus. An inability of a single biomarker to provide universal diagnostic results for all subjects does not mean that this biomarker has no diagnostic utility, however, or else ANA also would not be useful; rather, any such inability means that the test results (as in all diagnostic tests) have to be properly understood and interpreted.


To address the possibility that a single biomarker might not provide universal diagnostic results, and to increase the overall confidence that an assay is giving sensitive and specific results across a disease population, it is advantageous to analyse a plurality of the Table 1 biomarkers (i.e. a panel). For instance, a negative signal for a particular Table 1 antigen is not necessarily indicative of the absence of lupus (just as absence of antibodies to DNA is not), confidence that a subject does not have lupus increases as the number of negative results increases. For example, if all 50 biomarkers are tested and are negative then the result provides a higher degree of confidence than if only 1 biomarker is tested and is negative. Thus biomarker panels are most useful for enhancing the distinction seen between diseased and non-diseased samples. As mentioned above, though, preferred panels have from 2 to 15 biomarkers as the burden of measuring a higher number of markers is usually not rewarded by better sensitivity or specificity. Preferred panels are given below.


Where a biomarker or panel provides a strong distinction between lupus and non-lupus subjects then a method for analysing a subject sample can function as a method for diagnosing if a subject has lupus. As with many diagnostic tests, however, and as is already known for other diagnostics tests e.g. the PSA test used of prostate cancer, a method may not always provide a definitive diagnosis and so a method for analysing a subject sample can sometimes function only as a method for aiding in the diagnosis of lupus, or as a method for contributing to a diagnosis of lupus, where the method's result may imply that the subject has lupus (e.g. the disease is more likely than not) and/or may confirm other diagnostic indicators (e.g. passed on clinical symptoms). The test may therefore function as an adjunct to, or be integrated into, the SLEDAI analysis, or similar methodologies e.g. adjusted mean SLEDAI, European League Against Rheumatism (EULAR). Dealing with these considerations of certainty/uncertainty is well known in the diagnostic field.


The Subject

The invention is used for diagnosing disease in a subject. The subject will usually be female and at least 10 years old (e.g. >15, >20, >25, >30, >35, >40, >45, >50, >55, >60, >65, >70). They will usually be at least of child-bearing age as the risk of lupus increases in this age group, and for these subjects it may be appropriate to offer a screening service for Table 1 biomarkers. The subject may be a post-menopausal female.


The subject may be pre-symptomatic for lupus or may already be displaying clinical symptoms. For pre-symptomatic subjects the invention is useful for predicting that symptoms may develop in the future if no preventative action is taken. For subjects already displaying clinical symptoms, the invention may be used to confirm or resolve another diagnosis. The subject may already have begun treatment for lupus.


In some embodiments the subject may already be known to be predisposed to development of lupus e.g. due to family or genetic links. In other embodiments, the subject may have no such predisposition, and may develop the disease as a result of environmental factors e.g. as a result of exposure to particular chemicals (such as toxins or pharmaceuticals), as a result of diet [10], of infection, of oral contraceptive use, of postmenopausal use of hormones, etc. [11].


Because the invention can be implemented relative easily and cheaply it is not restricted to being used in patients who are already suspected of having lupus. Rather, it can be used to screen the general population or a high risk population e.g. subjects at least 10 years old, as listed above.


The subject will typically be a human being. In some embodiments, however, the invention is useful in non-human organisms e.g. mouse, rat, rabbit, guinea pig, cat, dog, horse, pig, cow, or non-human primate (monkeys or apes, such as macaques or chimpanzees). In non-human embodiments, any detection antigens used with the invention will typically be based on the relevant non-human ortholog of the human antigens disclosed herein. In some embodiments animals can be used experimentally to monitor the impact of a therapeutic on a particular biomarker.


The Sample

The invention analyses samples from subjects. Many types of sample can include auto-antibodies and/or antigens suitable for detection by the invention, but the sample will typically be a body fluid. Suitable body fluids include, but are not limited to, blood, serum, plasma, saliva, lymphatic fluid, a wound secretion, urine, faeces, mucus, sweat, tears and/or cerebrospinal fluid. The sample is typically serum or plasma.


In some embodiments, a method of the invention involves an initial step of obtaining the sample from the subject. In other embodiments, however, the sample is obtained separately from and prior to performing a method of the invention. After a sample has been obtained then methods of the invention are generally performed in vitro.


Detection of biomarkers may be performed directly on a sample taken from a subject, or the sample may be treated between being taken from a subject and being analysed. For example, a blood sample may be treated to remove cells, leaving antibody-containing plasma for analysis, or to remove cells and various clotting factors, leaving antibody-containing serum for analysis. Faeces samples usually require physical treatment prior to protein detection e.g. suspension, homogenisation and centrifugation. For some body fluids, though, such separation treatments are not usually required (e.g. tears or saliva) but other treatments may be used. For example, various types of sample may be subjected to treatments such as dilution, aliquoting, sub-sampling, heating, freezing, irradiation, etc. between being taken from the body and being analysed e.g. serum is usually diluted prior to analysis. Also, addition of processing reagents is typical for various sample types e.g. addition of anticoagulants to blood samples.


Biomarker Detection

The invention involves determining the level of Table 1 biomarker(s) in a sample. Immunochemical techniques for detecting antibodies against specific antigens are well known in the art, as are techniques for detecting specific antigens themselves. Detection of an antibody will typically involve contacting a sample with a detection antigen, wherein a binding reaction between the sample and the detection antigen indicates the presence of the antibody of interest. Detection of an antigen will typically involve contacting a sample with a detection antibody, wherein a binding reaction between the sample and the detection antibody indicates the presence of the antigen of interest. Detection of an antigen can also be determined by non-immunological methods, depending on the nature of the antigen e.g. if the antigen is an enzyme then its enzymatic activity can be assayed, or if the antigen is a receptor then its binding activity can be assayed, etc. For example, the MAP2K5 kinase can be assayed using methods known in the art.


A detection antigen for a biomarker antibody can be a natural antigen recognised by the auto-antibody (e.g. a mature human protein disclosed in Table 1), or it may be an antigen comprising an epitope which is recognized by the auto-antibody. It may be a recombinant protein or synthetic peptide. Where a detection antigen is a polypeptide its amino acid sequence can vary from the natural sequences disclosed above, provided that it has the ability to specifically bind to an auto-antibody of the invention (i.e. the binding is not non-specific and so the detection antigen will not arbitrarily bind to antibodies in a sample). It may even have little in common with the natural sequence (e.g. a mimotope, an aptamer, etc.). Typically, though, a detection antigen will comprise an amino acid sequence (i) having at least 90% (e.g. ≧91%, ≧92%, ≧93%, ≧94%, ≧95%, ≧96%, ≧97%, ≧98%, ≧99%) sequence identity to the relevant SEQ ID NO disclosed herein across the length of the detection antigen, and/or (ii) comprising at least one epitope from the relevant SEQ ID NO disclosed herein. Thus the detection antigen may be one of the variants discussed above.


Epitopes are the parts of an antigen that are recognised by and bind to the antigen binding sites of antibodies and are also known as “antigenic determinants”. An epitope-containing fragment may contain a linear epitope from within a SEQ ID NO and so may comprise a fragment of at least n consecutive amino acids of the SEQ ID NO:, wherein n may be 7 or more (e.g. 8, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250 or more). B-cell epitopes can be identified empirically (e.g. using PEPSCAN [12,13] or similar methods), or they can be predicted e.g. using the Jameson-Wolf antigenic index [14], ADEPT [15], hydrophilicity [16], antigenic index [17], MAPITOPE [18], SEPPA [19], matrix-based approaches [20], the amino acid pair antigenicity scale [21], or any other suitable method e.g. see ref. 22. Predicted epitopes can readily be tested for actual immunochemical reactivity with samples.


Detection antigens can be purified from human sources but it is more typical to use recombinant antigens (particularly where the detection antigen uses sequences which are not present in the natural antigen e.g. for attachment). Various systems are available for recombinant expression, and the choice of system may depend on the auto-antibody to be detected. For example, prokaryotic expression (e.g. using E. coli) is useful for detecting many auto-antibodies, but if an auto-antibody recognises a glycoprotein then eukaryotic expression may be required. Similarly, if an auto-antibody recognises a specific discontinuous epitope then a recombinant expression system which provides correct protein folding may be required.


The detection antigen may be a fusion polypeptide with a first region and a second region, wherein the first region can react with an auto-antibody in a sample and the second region can react with a substrate to immobilise the fusion polypeptide thereon.


A detection antibody for a biomarker antigen can be a monoclonal antibody or a polyclonal antibody. Typically it will be a monoclonal antibody. The detection antibody should have the ability to specifically bind to a Table 1 antigen (i.e. the binding is not non-specific and so the detection antibody will not arbitrarily bind to other antigens in a sample).


Various assay formats can be used for detecting biomarkers in samples. For example, the invention may use one or more of western blot, immunoprecipitation, silver staining, mass spectrometry (e.g. MALDI-MS), conductivity-based methods, dot blot, slot blot, colorimetric methods, fluorescence-based detection methods, or any form of immunoassay, etc. The binding of antibodies to antigens can be detected by any means, including enzyme-linked assays such as ELISA, radioimmunoassays (RIA), immunoradiometric assays (IRMA), immunoenzymatic assays (IEMA), DELFIA™ assays, surface plasmon resonance or other evanescent light techniques (e.g. using planar waveguide technology), label-free electrochemical sensors, etc. Sandwich assays are typical for immunological methods.


In embodiments where multiple biomarkers are to be detected an array-based assay format is preferable, in which a sample that potentially contains the biomarkers is simultaneously contacted with multiple detection reagents (antibodies and/or antigens) in a single reaction compartment. Antigen and antibody arrays are well known in the art e.g. see references 23-29, including arrays for detecting auto-antibodies. Such arrays may be prepared by various techniques, such as those disclosed in references 30-34, which are particularly useful for preparing microarrays of correctly-folded polypeptides to facilitate binding interactions with auto-antibodies. It has been estimated that most B-cell epitopes are discontinuous and such epitopes are known to be important in diseases with an autoimmune component. For example, in autoimmune thyroid diseases, auto-antibodies arise to discontinuous epitopes on the immunodominant region on the surface of thyroid peroxidase and in Goodpasture disease auto-antibodies arise to two major conformational epitopes. Protein arrays which have been developed to present correctly-folded polypeptides displaying native structures and discontinuous epitopes are therefore particularly well suited to studies of diseases where auto-antibody responses occur [27].


Methods and apparatuses for detecting binding reactions on protein arrays are now standard in the art. Preferred detection methods are fluorescence-based detection methods. To detect biomarkers which have bound to immobilised proteins a sandwich assay is typical e.g. in which the primary antibody is an auto-antibody from the sample and the secondary antibody is a labelled anti-sample antibody (e.g. an anti-human antibody).


Where a biomarker is an auto-antibody the invention will generally detect IgG antibodies, but detection of auto-antibodies with other subtypes is also possible e.g. by using a detection reagent which recognises the appropriate class of auto-antibody (IgA, IgM, IgE or IgD rather than Ig). The assay format may be able to distinguish between different antibody subtypes and/or isotypes. Different subtypes [35] and isotypes [36] can influence auto-antibody repertoires. For instance, a sandwich assay can distinguish between different subtypes by using differentially-labelled secondary antibodies e.g. different labels for anti-IgG and anti-IgM.


As mentioned above, the invention provides a diagnostic device which permits determination of whether a sample contains Table 1 biomarkers. Such devices will typically comprise one or more antigen(s) and/or antibodies immobilised on a solid substrate (e.g. on glass, plastic, nylon, etc.). Immobilisation may be by covalent or non-covalent bonding (e.g. non-covalent bonding of a fusion polypeptide, as discussed above, to an immobilised functional group such as an avidin [32] or a bleomycin-family antibiotic [34]). Antigen arrays are a preferred format, with detection antigens being individually addressable. The immobilised antigens will be able to react with auto-antibodies which recognise a Table 1 antigen.


In some embodiments, the solid substrate may comprise a strip, a slide, a bead, a well of a microtitre plate, a conductive surface suitable for performing mass spectrometry analysis [37], a semiconductive surface [38,39], a surface plasmon resonance support, a planar waveguide technology support, a microfluidic devices, or any other device or technology suitable for detection of antibody-antigen binding.


Where the invention provides or uses an antigen array for detecting a panel of auto-antibodies as disclosed herein, in some embodiments the array may include only antigens for detecting these auto-antibodies. In other embodiments, however, the array may include polypeptides in addition to those useful for detecting the auto-antibodies. For example, an array may include one or more control polypeptides. Suitable positive control polypeptides include an anti-human immunoglobulin antibody, such as an anti-IgM antibody, an anti-IgG antibody, an anti-IgA antibody, an anti-IgE antibody or combinations thereof. Other suitable positive control polypeptides which can bind to sample antibodies include protein A or protein G, typically in recombinant form. Suitable negative control polypeptides include, but are not limited to, β-galactosidase, serum albumins (e.g. BSA or HSA), protein tags, bacterial proteins, yeast proteins, citrullinated polypeptides, etc. Negative control features on an array can also be polypeptide-free e.g. buffer alone, DNA, etc. An array's control features are used during performance of a method of the invention to check that the method has performed as expected e.g. to ensure that expected proteins are present (e.g. a positive signal from serum proteins in a serum sample) and that unexpected substances are not present (e.g. a positive signal from an array spot of buffer alone would be unexpected).


In an antigen array of the invention, at least 10% (e.g. ≧20%, ≧30%, ≧40%, ≧50%, ≧60%, ≧70%, ≧80%, ≧90%, ≧95%, or more) of the total number of different proteins present on the array may be for detecting auto-antibodies as disclosed herein.


An antigen array of the invention may include one or more replicates of a detection antigen and/or control feature e.g. duplicates, triplicates or quadruplicates. Replicates provide redundancy, provide intra-array controls, and facilitate inter-array comparisons.


An antigen array of the invention may include detection antigens for more than just the 44 different auto-antibodies described here, but preferably it can detect antibodies against fewer than 10000 antigens (e.g. <5000, <4000, <3000, <2000, <1000, <500, <250, <100, etc.).


An array is advantageous because it allows simultaneous detection of multiple biomarkers in a sample. Such simultaneous detection is not mandatory, however, and a panel of biomarkers can also be evaluated in series. Thus, for instance, a sample could be split into sub-samples and the sub-samples could be assayed in series. In this embodiment it may not be necessary to complete analysis of the whole panel e.g. the diagnostic indicators obtained on a subset of the panel may indicate that a patient has lupus without requiring analysis of any further members of the panel. Such incomplete analysis of the panel is encompassed by the invention because of the intention or potential of the method to analyse the complete panel.


As mentioned above, some embodiments of the invention can include a contribution from known tests for lupus, such as ANA and/or anti-DNA tests. Any known tests can be used e.g. Farr test, Crithidia, etc.


Thus an array of the invention (or any other assay format) may also provide an assay for one or more of these additional markers e.g. an array may include a DNA spot.


Data Interpretation

The invention involves a step of determining the level of Table 1 biomarker(s). In some embodiments of the invention this determination for a particular marker can be a simple yes/no determination, whereas other embodiments may require a quantitative or semi-quantitative determination, still other embodiments may involve a relative determination (e.g. a ratio relative to another marker, or a measurement relative to the same marker in a control sample), and other embodiments may involve a threshold determination (e.g. a yes/no determination whether a level is above or below a threshold). Usually biomarkers will be measured to provide quantitative or semi-quantitative results (whether as relative concentration, absolute concentration, titre, etc.) as this gives more data for use with classifier algorithms.


Usually the raw data obtained from an assay for determining the presence, absence, or level (absolute or relative) require some sort of manipulation prior to their use. For instance, the nature of most detection techniques means that some signal will sometimes be seen even if no antigen/antibody is actually present and so this noise may be removed before the results are interpreted. Similarly, there may be a background level of the antigen/antibody in the general population which needs to be compensated for. Data may need scaling or standardising to facilitate inter-experiments comparisons. These and similar issues, and techniques for dealing with them, are well known in the immunodiagnostic area.


Various techniques are available to compensate for background signal in a particular experiment. For example, replicate measurements will usually be performed (e.g. using multiple features of the same detection antigen on a single array) to determine intra-assay variation, and average values from the replicates can be compared (e.g. the median value of binding to quadruplicate array features). Furthermore, standard markers can be used to determine inter-assay variation and to permit calibration and/or normalisation e.g. an array can include one or more standards for indicating whether measured signals should be proportionally increased or decreased. For example, an assay might include a step of analysing the level of one or more control marker(s) in a sample e.g. levels of an antigen or antibody unrelated to lupus. Signal may be adjusted according to distribution in a single experiment. For instance, signals in a single array experiment may be expressed as a percentage of interquartile differences e.g. as [observed signal−25th percentile]/[75th percentile−25th percentile]. This percentage may then be normalised e.g. using a standard quantile normalization matrix, such as disclosed in reference 40, in which all percentage values on a single array are ranked and replaced by the average of percentages for antigens with the same rank on all arrays. Overall, this process gives data distributions with identical median and quartile values. Data transformations of this type are standard in the art for permitting valid inter-array comparisons despite variation between different experiments.


The level of a biomarker relative to a single baseline level may be defined as a fold difference. Normally it is desirable to use techniques that can indicate a change of at least 1.5-fold e.g. ≧1.75-fold, ≧2-fold, ≧2.5-fold, ≧5-fold, etc.


As well as compensating for variation which is inherent between different experiments, it can also be important to compensate for background levels of a biomarker which are present in the general population. Again, suitable techniques are well known. For example, levels of a particular antigen or auto-antibody in a sample will usually be measured quantitatively or semi-quantitatively to permit comparison to the background level of that biomarker. Various controls can be used to provide a suitable baseline for comparison, and choosing suitable controls is routine in the diagnostic field. Further details of suitable controls are given below.


The measured level(s) of Table 1 biomarker(s), after any compensation/normalisation/etc., can be transformed into a diagnostic result in various ways. This transformation may involve an algorithm which provides a diagnostic result as a function of the measured level(s). Where a panel is used then each individual biomarker may make a different contribution to the overall diagnostic result and so two biomarkers may be weighted differently.


The creation of algorithms for converting measured levels or raw data into scores or results is well known in the art. For example, linear or non-linear classifier algorithms can be used. These algorithms can be trained using data from any particular technique for measuring the marker(s). Suitable training data will have been obtained by measuring the biomarkers in “case” and “control” samples i.e. samples from subjects known to suffer from lupus and from subjects known not to suffer from lupus. Most usefully the control samples will also include samples from subjects with a related disease which is to be distinguished from the disease of interest e.g. it is useful to train the algorithm with data from rheumatoid arthritis subjects and/or with data from subjects with connective tissue diseases other than lupus. The classifier algorithm is modified until it can distinguish between the case and control samples e.g. by adding or removing markers from the analysis, by changes in weighting, etc. Thus a method of the invention may include a step of analysing biomarker levels in a subject's sample by using a classifier algorithm which distinguishes between lupus subjects and non-lupus subjects based on measured biomarker levels in samples taken from such subjects.


Various suitable classifier algorithms are available e.g. linear discriminant analysis, naïve Bayes classifiers, perceptrons, support vector machines (SVM) [41] and genetic programming (GP) [42]. GP is particularly useful as it generally selects relatively small numbers of biomarkers and overcomes the problem of trapping in a local maximum which is inherent in many other classification methods. SVM-based approaches have previously been applied to lupus datasets [43]. The inventors have previously confirmed that both SVM and GP approaches can be trained on the same biomarker panels to distinguish the auto-antibody/antigen biomarker profiles of case and control cohorts with similar sensitivity and specificity i.e. autoantibody biomarkers are not dependent on a single method of analysis. Moreover, these approaches can potentially distinguish lupus subjects from subjects with (i) other forms of autoimmune disease and (ii) rheumatoid arthritis. The 50 biomarkers in Table 1 can be used to train such algorithms to reliably make such distinctions.


It will be appreciated that, although there may be some biomarkers in Table 1 which always give a negative absolute signal when contacted with negative control samples (and thus any positive signal is immediately indicative of lupus), it is more common that a biomarker will give at least a low absolute signal (and thus that a disease-indicating positive signal requires detection of auto-antibody levels above that background level). Thus references herein detecting a biomarker may not be references to absolute detection but rather (as is standard in the art) to a level above the levels seen in an appropriate negative control. Such controls may be assayed in parallel to a test sample but it can be more convenient to use an absolute control level based on empirical data, or to analyse data using an algorithm which can (e.g. by previous training) use biomarker levels to distinguish samples from disease patients vs. non-disease patients.


The level of a particular biomarker in a sample from a lupus-diseased subject may be above or below the level seen in a negative control sample. Antibodies that react with self-antigens occur naturally in healthy individuals and it is believed that these are necessary for survival of T- and B-cells in the peripheral immune system [44]. In a control population of healthy individuals there may thus be significant levels of circulating auto-antibodies against some of the antigens disclosed in Table 1 and these may occur at a significant frequency in the population. The level and frequency of these biomarkers may be altered in a disease cohort, compared with the control cohort. An analysis of the level and frequency of these biomarkers in the case and control populations may identify differences which provide diagnostic information. The level of auto-antibodies directed against a specific antigen may increase or decrease in a lupus sample, compared with a healthy sample.


In general, therefore, a method of the invention will involve determining whether a sample contains a biomarker level which is associated with lupus. Thus a method of the invention can include a step of comparing biomarker levels in a subject's sample to levels in (i) a sample from a patient with lupus and/or (ii) a sample from a patient without lupus. The comparison provides a diagnostic indicator of whether the subject has lupus. An aberrant level of one or more biomarker(s), as compared to known or standard expression levels of those biomarker(s) in a sample from a patient without lupus, indicates that the subject has lupus.


The level of a biomarker should be significantly different from that seen in a negative control. Advanced statistical tools can be used to determine whether two levels are the same or different. For example, an in vitro diagnosis will rarely be based on comparing a single determination. Rather, an appropriate number of determinations will be made with an appropriate level of accuracy to give a desired statistical certainty with an acceptable sensitivity and/or specificity. Antigen and/or antibody levels can be measured quantitatively to permit proper comparison, and enough determinations will be made to ensure that any difference in levels can be assigned a statistical significance to a level of p<0.05 or better. The number of determinations will vary according to various criteria (e.g. the degree of variation in the baseline, the degree of up-regulation in disease states, the degree of noise, etc.) but, again, this falls within the normal design capabilities of a person of ordinary skill in this field. For example, interquartile differences of normalised data can be assessed, and the threshold for a positive signal (i.e. indicating the presence of a particular auto-antibody) can be defined as requiring that antibodies in a sample react with a diagnostic antigen at least 2.5-fold more strongly that the interquartile difference above the 75th percentile. Other criteria are familiar to those skilled in the art and, depending on the assays being used, they may be more appropriate than quantile normalisation. Other methods to normalise data include data transformation strategies known in the art e.g. scaling, log normalisation, median normalisation, etc.


The underlying aim of these data interpretation techniques is to distinguish between the presence of a Table 1 biomarker and of an arbitrary control biomarker, and also to distinguish between the response of sample from a lupus subject from a control subject. Methods of the invention may have sensitivity of at least 70% (e.g. >70%, >75%, >80%, >85%, >90%, >95%, >96%, >97%, >98%, >99%). Methods of the invention may have specificity of at least 70% (e.g. >70%, >75%, >80%, >85%, >90%, >95%, >96%, >97%, >98%, >99%). Advantageously, methods of the invention may have both specificity and sensitivity of at least 70% (e.g. >70%, >75%, >80%, >85%, >90%, >95%, >96%, >97%, >98%, >99%). As shown in Tables 9-16, the invention can consistently provide specificities above 90% and sensitivities greater than 80%.


Data obtained from methods of the invention, and/or diagnostic information based on those data, may be stored in a computer medium (e.g. in RAM, in non-volatile computer memory, on CD-ROM) and/or may be transmitted between computers e.g. over the internet.


If a method of the invention indicates that a subject has lupus, further steps may then follow. For instance, the subject may undergo confirmatory diagnostic procedures, such as those involving physical inspection of the subject, and/or may be treated with therapeutic agent(s) suitable for treating lupus.


Monitoring the Efficacy of Therapy

As mentioned above, some methods of the invention involve testing samples from the same subject at two or more different points in time. In general, where the above text refers to the presence or absence of biomarker(s), the invention also includes an increasing or decreasing level of the biomarker(s) over time. An increasing level of an auto-antibody biomarker includes a spread of antibodies in which additional antibodies or antibody classes are raised against a single antigen. Methods which determine changes in biomarker(s) over time can be used, for instance, to monitor the efficacy of a therapy being administered to the subject (e.g. in theranostics). The therapy may be administered before the first sample is taken, at the same time as the first sample is taken, or after the first sample is taken.


The invention can be used to monitor a subject who is receiving lupus therapy. There is presently no cure for lupus. Current therapies for lupus include therapeutic drugs, alternative medicines or life-style changes. Approved drugs include non-steroidal and steroidal anti-inflammatory drugs (e.g. prednisolone), anti-malarials (e.g. hydroxychloriquine) and immunosupressants (e.g. cyclosporin A). A series of new drugs are being developed, many of which target B-cells, such as Rituximab which targets CD20 and Belimumab which is directed against B-lymphocyte stimulator (BlyS). The appropriate treatment regime will depend on the severity of the disease, and the responsiveness of the patient. Disease-modifying antirheumatic drugs can be used preventively to reduce the incidence of flares. When flares occur, they are often treated with corticosteroids. Given the similarities between rheumatic diseases, discussed below, it is not surprising that many of the therapeutics developed for one disease may have efficacy in another. In particular, the success of cytokine inhibitors in treating RA has advanced our understanding of these diseases and has opened up the possibility that some of these new classes of therapeutics will be of use in multiple disease areas. For example, Belimumab failed to meet its target in RA but has demonstrated efficacy in a phase III trial for lupus. Another anti-CD20 antibody, Ocrelizumab, is being investigated for use in RA and lupus and Imatinib which targets kit, abl and PDGFR kinases is in Phase II for RA and scleroderma. Other representative molecules which are directed towards rheumatic diseases are (target in parentheses): Tocilizumab (IL-6 receptor), AMG714 mAb (IL-15), AlN457 mAb (IL-17), Ustekinumab (IL-23/IL-12), Belimumab (BLyS/BAFF), Atacicept (BLyS/BAFF and APRIL), Baminercept (LTα/LTβ/LIGHT), Ocrelizumab (CD20), Ofatumumab (CD20), TRU-015/SMIP (CD20), Epratuzumab (CD22), Abatacept (CD80/CD86), Denosumab (RANKL), INCB018424 (JAK1/JAK2/Tyk2), CP-690,550 (JAK3), Fostamatinib (Syk), multiple compounds (p38), Imatinib (PDGF-R, c-kit, c-abl), ARRY-162 (ERK/MEK), AS-605240 (PI3Kγ), Maraviroc (CCR5), IB-MECA/CF101 (Adenosine A3 receptor agonist) and CE-224,535 (P2X7 antagonist).


In related embodiments of the invention, the results of monitoring a therapy are used for future therapy prediction. For example, if treatment with a particular therapy is effective in reducing or eliminating disease symptoms in a subject, and is also shown to decrease levels of a particular biomarker in that subject, detection of that biomarker in another subject may indicate that this other subject will respond to the same therapy. Conversely, if a particular therapy was not effective in reducing or eliminating disease symptoms in a subject who had a particular biomarker or biomarker profile, detection of that biomarker or profile in another subject may indicate that this other subject will also fail to respond to the same therapy.


In other embodiments, the presence of a particular biomarker can be used as the basis of proposing or initiating a particular therapy (patient stratification). For instance, if it is known that levels of a particular auto-antibody can be reduced by administering a particular therapy then that auto-antibody's detection may suggest that the therapy should begin. Thus the invention is useful in a theranostic setting.


Normally at least one sample will be taken from a subject before a therapy begins.


Immunotherapy

Where the development of auto-antibodies to a newly-exposed auto-antigen is causative for a disease, early priming of the immune response can prepare the body to remove antigen-exposing cells when they arise, thereby removing the cause of disease before auto-antibodies develop dangerously. For example, one antigen known to be recognised by auto-antibodies is p53, and this protein is considered to be both a vaccine target and a therapeutic target for the modulation of cancer [45-47]. The antigens listed in Tables 1 and 17 are thus therapeutic targets for treating lupus.


Thus the invention provides a method for raising an antibody response in a subject, comprising eliciting to the subject an immunogen which elicits antibodies which recognise an antigen listed in Table 1. The method is suitable for immunoprophylaxis of lupus.


The invention also provides an immunogen for use in medicine, wherein the immunogen can elicit antibodies which recognise an antigen listed in Table 1. Similarly, the invention also provides the use of an immunogen in the manufacture of a medicament for immunoprophylaxis of lupus, wherein the immunogen can elicit antibodies which recognise an antigen listed in Table 1.


As discussed above for detection antigens, the immunogen may be the antigen itself or may comprise an amino acid sequence having identity and/or comprising an epitope from the antigen. Thus the immunogen may comprise an amino acid sequence (i) having at least 90% (e.g. ≧91%, ≧92%, ≧93%, ≧94%, ≧95%, ≧96%, ≧97%, ≧98%, ≧99%) sequence identity to the relevant SEQ ID NO disclosed herein, and/or (ii) comprising at least one epitope from the relevant SEQ ID NO disclosed herein. Other immunogens may also be used, provided that they can elicit antibodies which recognise the antigen of interest.


As an alternative to immunising a subject with a polypeptide immunogen, it is possible to administer a nucleic acid (e.g. DNA or RNA) immunogen encoding the polypeptide, for in situ expression in the subject, thereby leading to the development of an antibody response.


The immunogen may be delivered in conjunction (e.g. in admixture) with an immunological adjuvant. Such adjuvants include, but are not limited to, insoluble aluminium salts, water-in-oil emusions, oil-in-water emulsions such as MF59 and AS03, saponins, ISCOMs, 3-O-deacylated MPL, immunostimulatory oligonucleotides (e.g. including one or more CpG motifs), bacterial ADP-ribosylating toxins and detoxified derivatives thereof, cytokines, chitosan, biodegradable microparticles, liposomes, imidazoquinolones, phosphazenes (e.g. PCPP), aminoalkyl glucosaminide phosphates, gamma inulins, etc. Combinations of such adjuvants can also be used. The adjuvant(s) may be selected to elicit an immune response involving CD4 or CD8 T cells. The adjuvant(s) may be selected to bias an immune response towards a TH1 phenotype or a TH2 phenotype.


The immunogen may be delivered by any suitable route. For example, it may be delivered by parenteral injection (e.g. subcutaneously, intraperitoneally, intravenously, intramuscularly), or mucosally, such as by oral (e.g. tablet, spray), topical, transdermal, transcutaneous, intranasal, ocular, aural, pulmonary or other mucosal administration.


The immunogen may be administered in a liquid or solid form. For example, the immunogen may be formulated for topical administration (e.g. as an ointment, cream or powder), for oral administration (e.g. as a tablet or capsule, as a spray, or as a syrup), for pulmonary administration (e.g. as an inhaler, using a fine powder or a spray), as a suppository or pessary, as drops, or as an injectable solution or suspension.


Imaging and Staining

The antigens listed in Tables 1 and 17 can be useful for imaging. A labelled antibody against the antigen can be injected in vivo and the distribution of the antigen can then be detected. This method may identify the source of the antigen (e.g. an area in the body where there is a high concentration of the antigen), potentially offering early identification of lupus. Imaging techniques can also be used to monitor the progress or remission of disease, or the impact of a therapy.


The antigens listed in Table 1 can be useful for analysing tissue samples by staining e.g. using standard immunocytochemistry. A labelled antibody against a Table 1 antigen can be contacted with a tissue sample to visualise the location of the antigen. A single sample could be stained with different antibodies against multiple different antigens, and these different antibodies may be differentially labelled to enable them to be distinguished. As an alternative, a plurality of different samples can each be stained with a single antibody.


Thus the invention provides a labelled antibody which recognises an antigen listed in Table 1. The antibody may be a human antibody, as discussed above. Any suitable label can be used e.g. quantum dots, spin labels, fluorescent labels, dyes, etc.


Alternative Biomarkers

The invention has been described above by reference to auto-antibody and antigen biomarkers, with assays of auto-antibodies against an antigen being used in preference to assays of the antigen itself. In addition to these biomarkers, however, the invention can be used with other biological manifestations of the Table 1 antigens. For example, the level of mRNA transcripts encoding a Table 1 antigencan be measured, particularly in tissues where that gene is not normally transcribed (such as in the potential disease tissue). Similarly, the chromosomal copy number of a gene encoding a Table 1 antigen can be measured e.g. to check for a gene duplication event. The level of a regulator of a Table 1 antigen can be measured e.g. to look at a microRNA regulator of a gene encoding the antigen. Furthermore, things which are regulated by or respond to a Table 1 antigen can be assessed e.g. if an antigen is a regulator of a metabolic pathway then disturbances in that pathway can be measured. Further possibilities will be apparent to the skilled reader.


Preferred Panels

Preferred embodiments of the invention are based on a panel of biomarkers. Panels of particular interest consist of or comprise the combinations of biomarkers listed in Tables 3 to 16 (which show ten panels of 2, 3, 4, . . . , 14 and 15 biomarkers). Table 19 shows 13 further preferred panels.


The ten different panels listed in each of Tables 3 to 16 can be expanded by adding further biomarker(s) to create a larger panel. The further biomarkers can usefully be selected from known biomarkers (such as ANA, anti-DNA antibodies, etc.; see above), from Table 17, or from Table 1. In general the addition does not decrease the sensitivity or specificity of the panel shown in the Tables. Such panels include, but are not limited to:

    • A panel comprising or consisting of 2 different biomarkers, namely: (i) a biomarker selected from Table 2 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 2 different biomarkers, namely: (i) a biomarker selected from Table 2 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 2 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 3 different biomarkers, namely: (i) a group of 2 biomarkers selected from Table 3 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 3 different biomarkers, namely: (i) a group of 2 biomarkers selected from Table 3 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 3 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 4 different biomarkers, namely: (i) a group of 3 biomarkers selected from Table 4 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 4 different biomarkers, namely: (i) a group of 3 biomarkers selected from Table 4 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 4 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 5 different biomarkers, namely: (i) a group of 4 biomarkers selected from Table 5 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 5 different biomarkers, namely: (i) a group of 4 biomarkers selected from Table 5 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 5 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 6 different biomarkers, namely: (i) a group of 5 biomarkers selected from Table 6 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 6 different biomarkers, namely: (i) a group of 5 biomarkers selected from Table 6 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 6 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 7 different biomarkers, namely: (i) a group of 6 biomarkers selected from Table 7 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 7 different biomarkers, namely: (i) a group of 6 biomarkers selected from Table 7 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 7 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 8 different biomarkers, namely: (i) a group of 7 biomarkers selected from Table 8 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 8 different biomarkers, namely: (i) a group of 7 biomarkers selected from Table 8 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 8 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 9 different biomarkers, namely: (i) a group of 8 biomarkers selected from Table 9 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 9 different biomarkers, namely: (i) a group of 8 biomarkers selected from Table 9 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 9 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 10 different biomarkers, namely: (i) a group of 9 biomarkers selected from Table 10 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 10 different biomarkers, namely: (i) a group of 9 biomarkers selected from Table 10 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 10 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 11 different biomarkers, namely: (i) a group of 10 biomarkers selected from Table 11 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 11 different biomarkers, namely: (i) a group of 10 biomarkers selected from Table 11 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 11 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 12 different biomarkers, namely: (i) a group of 11 biomarkers selected from Table 12 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 12 different biomarkers, namely: (i) a group of 11 biomarkers selected from Table 12 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 12 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 13 different biomarkers, namely: (i) a group of 12 biomarkers selected from Table 13 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 13 different biomarkers, namely: (i) a group of 12 biomarkers selected from Table 13 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 13 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 14 different biomarkers, namely: (i) a group of 13 biomarkers selected from Table 14 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 14 different biomarkers, namely: (i) a group of 13 biomarkers selected from Table 14 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of 14 different biomarkers selected from Table 20.
    • A panel comprising or consisting of 15 different biomarkers, namely: (i) a group of 14 biomarkers selected from Table 15 and (ii) a further biomarker selected from Table 17.
    • A panel comprising or consisting of 15 different biomarkers, namely: (i) a group of 14 biomarkers selected from Table 15 and (ii) a further biomarker selected from Table 1 or preferably from Table 18.
    • A panel comprising or consisting of a group of 15 different biomarkers selected from Table 16.
    • A panel comprising or consisting of 15 different biomarkers selected from Table 20.


Preferred panels have between 2 and 15 biomarkers in total.


Table 21

All definitions herein which refer to biomarkers of Table 1 are also disclosed by reference to Table 21 instead. Thus, for instance, the invention provides a method for analysing a subject sample, comprising a step of determining the level of a Table 21 biomarker in the sample, wherein the level of the biomarker provides a diagnostic indicator of whether the subject has lupus.


General

The term “comprising” encompasses “including” as well as “consisting” e.g. a composition “comprising” X may consist exclusively of X or may include something additional e.g. X+Y.


References to an antibody's ability to “bind” an antigen mean that the antibody and antigen interact strongly enough to withstand standard washing procedures in the assay in question. Thus non-specific binding will be minimised or eliminated.


References to a “level” of a biomarker mean the amount of an analyte measured in a sample and this encompasses relative and absolute concentrations of the analyte, analyte titres, relationships to a threshold, rankings, percentiles, etc.


An assay's “sensitivity” is the proportion of true positives which are correctly identified i.e. the proportion of lupus subjects who test positive by a method of the invention. This can apply to individual biomarkers, panels of biomarkers, single assays or assays which combine data integrated from multiple sources e.g. ANA, anti-DNA and/or other clinical test such as those included in the SLEDAI index. It can relate to the ability of a method to identify samples containing a specific analyte (e.g. antibodies) or to the ability of a method to correctly identify samples from subjects with lupus.


An assay's “specificity” is the proportion of true negatives which are correctly identified i.e. the proportion of subjects without lupus who test negative by a method of the invention. This can apply to individual biomarkers, panels of biomarkers, single assays or assays which combine data integrated from multiple sources e.g. ANA, anti-DNA and/or other clinical tests such as those included for consideration in the SLEDAI index. It can relate to the ability of a method to identify samples containing a specific analyte (e.g. antibodies) or to the ability of a method to correctly identify samples from subjects with lupus.


Unless specifically stated, a method comprising a step of mixing two or more components does not require any specific order of mixing. Thus components can be mixed in any order. Where there are three components then two components can be combined with each other, and then the combination may be combined with the third component, etc.


References to a percentage sequence identity between two amino acid sequences means that, when aligned, that percentage of amino acids are the same in comparing the two sequences. This alignment and the percent homology or sequence identity can be determined using software programs known in the art, for example those described in section 7.7.18 of ref. 48. A preferred alignment is determined by the Smith-Waterman homology search algorithm using an affine gap search with a gap open penalty of 12 and a gap extension penalty of 2, BLOSUM matrix of 62. The Smith-Waterman homology search algorithm is disclosed in ref. 49.


Table 17 lists 145 biomarkers. From within these 145, a preferred subset is SEQ ID NOs:1-139.


Table 1 lists 50 biomarkers. From within these 50, a preferred subset is the 44 listed in Table 18.


In all embodiments of the invention, where only one biomarker is used, the biomarker is preferably not PIAS2 or PABPC1. In all embodiments of the invention, where only two biomarkers are used, these two biomarkers are preferably not PIAS2 and PABPC1.





BRIEF DESCRIPTION OF DRAWINGS


FIG. 1 shows a receiver operating characteristic (ROC) curve for t-Test feature ranking: AUC=0.74873, and S+S=1.4131. Y-axis shows sensitivity, x-axis shows 1-specificity.





MODES FOR CARRYING OUT THE INVENTION
Array Preparation

Three separate protein arrays were developed which were enriched for proteins associated with transcription (TRN array), kinases and kinase-associated proteins (KIN array) and cancer associated antigens (CAG array) described in sources such as the cancer immunome and SEREX databases. Full-length open reading frames for target genes encoding the 999 proteins present on the arrays were cloned in-frame with a sequence encoding a C-terminal E. coli BCCP-myc tag [23, 33] in a baculovirus transfer vector and sequence-verified. Several of the kinases which were integral membrane proteins were cloned as N- or C-terminal truncations representing the extracellular or cytoplasmic domains. Recombinant baculoviruses were generated, amplified and expressed in Sf9 cells using standard methods adapted for 24-well deep well plates. Recombinant protein expression was analyzed for protein integrity and biotinylation by Western blotting. Cells harbouring recombinant protein were lysed and lysates were spotted in quadruplicate using a QArray2 Microarrayer equipped with 300 μm solid pins on to streptavidin-coated glass slides. Spotted proteins project into an aqueous environment and orient away from the surface of the slide, exposing them for binding by auto-antibodies. In addition to the proteins on each array, four control proteins for the BCCP-myc tag (BCCP, BCCP-myc, β-galactosidase-BCCP-myc and β-galactosidase-BCCP) were arrayed, along with Cy3/Cy5-labeled biotin-BSA, dilution series of biotinylated-IgG and biotinylated IgM, a biotinylated-myc peptide dilution series and buffer-only spots.


Biomarker Confirmation

Serum samples were obtained from two groups of subjects:

    • 1. “disease”: serum samples from subjects diagnosed with lupus (n=160).
    • 2. “healthy and confounding disease”: serum samples from age-matched healthy donors (n=156).


Serum samples from both groups were individually analysed using each of the three types of arrays. Serum samples were incubated with each of the three array types separately. Serum samples were clarified by centrifugation at 10-13K rpm for 2 minutes at 4° C. to remove particulates, including lipids. The samples were then diluted 200-fold in 0.1% v/v Triton/0.1% v/v BSA in 1×PBS (Triton-BSA buffer) and then applied to the arrays. Diluted serum (4 mL) sample was added to each array housed in a separate compartment of a plastic dish. All arrays were incubated for 2 hours at room temperature (RT, 20° C.) with gentle orbital shaking (˜50 rpm). Arrays were removed carefully from the dish and any excess probing solution was removed by blotting the sides of the array onto lint-free tissue. Probed arrays were washed three times in fresh Triton-BSA buffer at RT for 20 minutes with gentle orbital shaking. The washed slides were then blotted onto lint-free tissue to remove excess wash buffer and were incubated in a secondary staining solution (prepared just prior to use) at RT for 2 hours, with gentle orbital shaking and protected from light using aluminium foil. The secondary staining solution was a labelled anti-human IgG antibody. Slides were washed three times in Triton-BSA buffer for 5 minutes at RT with gentle orbital shaking, rinsed briefly (5-10 seconds) in distilled water, and centrifuged for 2 minutes at 240 g in a container suitable for centrifugation. To help wick away excess liquid on the arrays, a lint-free tissue was placed at the bottom of the arrays during centrifugation.


The probed and dried arrays were then scanned using a microarray scanner capable of using an excitation wavelength suitable for the detection of the secondary staining solution, to detect auto-antibodies bound by the array and to determine magnitude of auto-antibody binding. The microarray scans produced images for each array that were used to determine the intensity of fluorescence bound to each protein spot which were used to normalize and score array data.


Raw median signal intensity (also referred to as the relative fluorescent unit, RFU) of each protein feature (also referred to as a spot or antigen) on the array was subtracted from the local median background intensity. Alternative analyses use other measures of spot intensity such as the mean fluorescence, total fluorescence, as known in the art.


The resulting net fluorescent intensities of all protein features on each array were then normalized to reduce the influence of technical bias (e.g. laser power variation, surface variation, binding to BCCP, etc.) by a multiscaling procedure. Other methods for data normalization suitable for the data include, amongst others, quantile normalization [40], multiplication of net fluorescent intensities by a normalisation factor consisting of the product of the 1st quartile of all intensities of a sample and the mean of the 1st quartiles of all samples and the “VSN” method [50]. Such normalization methods are known in the art of microarray analysis. The normalized fluorescent intensities were then averaged for each protein feature.


The multiscaling method was applied to all 3996 quadruplicate signals from 326 protein arrays. Data were arbitrarily split in test and training sets and the data from the training set was then used with GP to identify classifiers which would successfully distinguish case from control samples. Classifiers were then assessed for performance by referring to the combined sensitivity and specificity (S+S score) using the test set. Data were repeatedly split into test and training sets and analysis cycles repeated until a stable set of classifiers (“panel”) was identified.


The number of biomarkers in each panel was limited to n where n=1-15. Multiple combinations of putative biomarkers were derived and the performance of the derived panels was then ranked by combined S+S score. The top 6000 panels for each n-mer panel were taken and the frequency of appearance of each protein in these panels was used to rank the predictive power of each protein for that specific n-mer. The top 10 biomarkers for each n-mer, as judged by frequency of appearance were also identified and then combined into a single list (Table 18). These represent biomarkers of particular interest as they represent the subset of biomarkers with the greatest predictive properties.


For each n-mer, the 25 panels which provide the highest combined S+S score are presented in Tables 2-16. The biomarkers frequently appearing in the top 25 panels for all the presented n-mers were combined to produce the set of 44 markers in Table 18. The top panels in Tables 5-16 each have a S+S score higher than the value of 1.5 (i.e. above the typical value for ANA [1]).


Overall, Tables 2-16 produced the biomarkers of SEQ ID NOs:1-139 in Table 17, a subset of 44 of which are presented in Table 18. Many of these 44 biomarkers has significant predictive power across multiple n-mers. For example, IGHG1 has the greatest combined S+S score for a single marker but is not a significant contributor to panels above 2-mers in size. In contrast, KIT is important for all sizes of panels from n=1 to n=15 Thus the contribution that a particular biomarker provides to the discriminatory power of a panel can depend on the number of markers in that panel as well as on their identity.


Some markers have previously been identified in association with lupus in particular or more generally with diseases with an autoimmune component. In particular, STAT1 has been previously linked with active pathways in lupus [51] and SSX2 and SSX4 were originally identified as antigens to which autoantibodies were raised in cancer.


The presence of antibodies to the Table 18 antigens was confirmed to be significantly different between the two groups. A back propagation algorithm was used to confirm biomarkers that can distinguish between the two groups. The data analysis was validated by two permutation assays. These assays confirmed that the chosen biomarkers are related to the disease status of the sera. The core biomarker set was successfully validated by depleting the set of 999 proteins of the 44 identified biomarkers and repeating the analysis. With the data from these biomarkers removed, it was no longer possible to derive a panel which could distinguish between healthy and diseased serum samples with comparable performance.


In a second analysis, the identical raw data as described previously was used. The identification of biomarkers was performed essentially as described above with the following changes. The raw array data was normalized by consolidating the replicates (median consolidation), followed by normal transformation and then median normalisation. Outliers were identified and removed. There is no method of normalisation which is universally appropriate and factors such as study design and sample properties must be considered. For the current study median normalisation was used. Other normalisation methods include, amongst others, quantile normalisation, multiplication of net fluorescent intensities by a normalisation factor consisting of the product of the 1st quartile of all intensities of a sample and the mean of the 1st quartiles of all samples and the “VSN” method. Such normalisation methods are known in the art of microarray analysis.


This normalised data was then used for the identification of biomarker panels. It is not possible to predict a priori which classifier will perform best with a given dataset, therefore data analysis was performed with 5 different feature ranking methods (1-5) plus forward feature selection:

    • 1. Entropy
    • 2. Bhattacharyya
    • 3. T-test
    • 4. Wilcoxon
    • 5. ROC
    • 6. Forward selection


Other classification methods as known in the art could be used. Classifiers were then assessed for performance by referring to the combined sensitivity and specificity (S+S score) and area under the curve (AUC). Data were repeatedly split and analysis cycles repeated until a stable set of classifiers (“panels”) was identified. Nested cross validation was applied to the classification procedures in order to avoid overfitting of the study data. The performance of the classification was compared to a randomized set of case-control status samples (permutation assay) which should give no predictive performance and provides an indication of the background in the analysis. A FIGURE close to 1.0 is expected for the null assay (equivalent to a sensitivity+specificity (S+S) score of 0.5+0.5, respectively) whereas an S+S score of 2.0 would indicate 100% sensitivity and 100% specificity. The difference between the values for the permutation analysis and the classifier performance indicates the relative strength of the classifier. For each analysis, multiple combinations of putative biomarkers were derived and the performance of the derived panels was then ranked by combined S+S score. The top 13 panels for the best performing n-mer panel (containing 3 biomarkers; shown in Table 19) were taken and the frequency of appearance of each protein in these panels was used to rank the predictive power of each protein included in these panels. The biomarkers with the greatest diagnostic power, as judged by frequency of appearance in the panels derived were identified and combined into a single list (Table 20). These represent biomarkers of particular interest as they correspond to the subset of biomarkers with the greatest predictive properties.


The maximum S+S score was obtained with the forward feature selection method (S+S=1.41; sensitivity=0.54, specificity=0.87) which gave an AUC value of 0.75 and corresponding to panels consisting of 3 biomarkers. The sensitivity reached 0.54 and the specificity was 0.87. The biomarkers which showed greatest diagnostic power include KIT, PIAS2, RPL15, ACTL7B, EEF1G and TCEB3, many of which were also identified in the previous analysis.


The performance of biomarker panels containing 3 proteins, identified by forward selection is shown below:


















Feature








ranking
S + S
Sensitivity
Specificity
AUC
S * S
Panel size







Forward
1.41
0.54
0.87
0.75
0.47
3


Selection










FIG. 1 shows the ROC curve for Forward Feature Selection. Curve (i) shows the performance of the original data and curve (ii) shows the performance of the permutated data. The sensitivity is 0.54 and the specificity is 0.87 (circled) and the overall sum of sensitivity and specificity is 1.41.


It will be understood that the invention has been described by way of example only and modifications may be made whilst remaining within the scope and spirit of the invention.









TABLE 1







Biomarkers useful with the invention











Symbol(i)
No.(ii)
HGNC(iii)















ACTL7B
1
162



BAG3
6
939



C6orf93
13
21173



CCNI
18
1595



CCT3
19
1616



CDK3
21
1772



CKS1B
24
19083



COPG2
25
2237



DNCLI2
33
2966



DOM3Z
34
2992



EEF1D
36
3211



FBXO9
37
13588



GTF2H2
43
4656



IGHG1
49
5525



KATNB1
54
6217



KIAA0643
55
19009



KIT
57
6342



MAP2K5
64
6845



MAP2K7
65
6847



MARK4
69
13538



MGC42105
71



MLF1
73
7125



MTO1
74
19261



NFE2L2
76
7782



NME6
77
20567



NTRK3
79
8033



PFKFB3
85
8874



PIAS2
89
17311



POLR2E
90
9192



PRKCBP1
92
9397



RALBP1
94
9841



RPL15
101
10306



RPL18A
103
10311



RPL34
107
10340



RPL37A
108
10348



RPS6KA1
110
10430



RRP41
111
18189



SSX4
117
11338



STK4
124
11408



SUCLA2
125
11448



TCEB3
127
11620



TRIM37
134
7523



TUBA1
135
12407



WDR45L
138
25072



EEF1G
140
3213



RNF38
141
18052



PHLDA2
142
12385



KCMF1
143
20589



NUBP2
144
8042



VPS45A
145
14579







Columns




(i)The “Symbol” column gives the gene symbol which has been approved by the HGNC. The symbol thus identifies a unique human gene. This symbol can be related via Table 17 to the gene's Official Full Name provided by NCBI.





(ii)This number is the SEQ ID NO: for the coding sequence for the auto-antigen biomarker, as shown in Table 17.





(iii)The HUGO Gene Nomenclature Committee aims to give unique and meaningful names to every human gene. The HGNC number thus identifies a unique human gene.







Table 1 lists biomarkers useful with the invention. The measured biomarker can be (i) presence of auto-antibody which binds to an antigen listed in Table 1 and/or (ii) the presence of an antigen listed in Table 1, but is preferably the former.














TABLE 2







Biomarker(i)
S + S(ii)
Sensitivity
Specificity





















IGHG1
1.344
0.672
0.672



COPG2
1.214
0.623
0.591



MAP2K7
1.208
0.706
0.502



TUBA1
1.206
0.616
0.591



KIT
1.206
0.706
0.5



PRKCBP1
1.199
0.562
0.637



TCEB3
1.199
0.58
0.618



TRIM37
1.196
0.572
0.624



MLF1
1.189
0.567
0.622



MTO1
1.188
0.563
0.625



P4HB
1.185
0.584
0.601



AP2M1
1.183
0.573
0.61



RPL10
1.181
0.62
0.561



UTP14
1.18
0.585
0.594



NRIP1
1.179
0.592
0.586



RNF38
1.177
0.573
0.604



PHIP
1.174
0.579
0.595



BAT8
1.173
0.584
0.588



RPL18A
1.172
0.563
0.609



ME2
1.172
0.593
0.579



BRD2
1.172
0.584
0.588



RPL15
1.169
0.573
0.597



C6orf93
1.167
0.588
0.579



RNF12
1.167
0.559
0.607



RPL13A
1.166
0.575
0.591







Columns (Tables 2 to 16)




(i)This is the symbol for the relevant biomarker (or, for Tables 3-16, biomarkers in the panel).





(ii)S + S is the sum of the sensitivity and specificity columns. These final two columns show the sensitivity and specificity of a test based solely on the relevant biomarker (or, for Tables 3-16, panel) shown in the left-hand column when applied to the samples used in the examples.



















TABLE 3







Panel
S + S
Sensitivity
Specificity





















CCT3, CCNI,
1.434
0.794
0.64



PIAS2, MARK4,
1.431
0.824
0.607



PIAS2, C6orf93,
1.421
0.803
0.618



PIAS2, BAT8,
1.419
0.789
0.63



PIAS2, MLF1,
1.413
0.826
0.588



P4HB, BAG3,
1.412
0.787
0.625



RPL15, CCT3,
1.41
0.752
0.658



RPL37A, CCT3,
1.409
0.761
0.647



ME2, BAG3,
1.408
0.775
0.633



BAT8, BAG3,
1.407
0.784
0.623



TUBA1, BAG3,
1.406
0.779
0.628



RPL30, RPL15,
1.406
0.805
0.601



RUVBL1, ACTL7B,
1.404
0.806
0.599



RPL30, AP2M1,
1.402
0.749
0.654



PELO, MARK4,
1.4
0.765
0.635



FBXO9, BAT8,
1.4
0.728
0.672



MARK4, CCT3,
1.398
0.759
0.639



RRP41, PELO,
1.398
0.782
0.616



PIAS2, CCNI,
1.398
0.805
0.592



YARS, DOM3Z,
1.397
0.761
0.637



RPL13A, CCT3,
1.397
0.754
0.643



MLF1, BAG3,
1.396
0.789
0.608



RPL18A, PELO,
1.394
0.736
0.659



MLF1, IHPK2,
1.394
0.77
0.624



PHIP, FBXO9,
1.394
0.725
0.669




















TABLE 4





Panel
S + S
Sensitivity
Specificity


















MLF1, BAG3, D6S2654E,
1.499
0.844
0.655


PIAS2, MLF1, LIMS1,
1.487
0.823
0.664


PIAS2, MARK4, BAG3,
1.478
0.848
0.63


PHIP, FBXO9, PFKFB3,
1.477
0.764
0.714


PIAS2, MARK4, KIT,
1.472
0.814
0.658


PIAS2, MARK4, THUMPD1,
1.471
0.855
0.616


MARK4, DOM3Z, FBXO9,
1.469
0.793
0.676


WDR45L, PIAS2, KIT,
1.468
0.831
0.637


STK4, KIT, RPL18A,
1.468
0.794
0.673


TRIM37, FBXO9, UTP14,
1.468
0.762
0.705


PIAS2, MARK4, LIMS1,
1.466
0.819
0.647


RPL13A, CCT3, BAG3,
1.466
0.789
0.677


PHIP, FBXO9, MAP2K7,
1.464
0.768
0.697


BAG3, ACTL7B, CDH19,
1.463
0.812
0.652


TCEB3, PIAS2, MAP2K7,
1.463
0.809
0.654


PHIP, FBXO9, PFKFB4,
1.463
0.75
0.713


STK17B, PRKAA1, MAP4K5,
1.463
0.773
0.69


TUBA1, PIAS2, KIT,
1.462
0.82
0.642


RPL18A, PIAS2, PAK7,
1.462
0.812
0.65


MLF1, BAG3, RPL30,
1.459
0.806
0.654


BAG3, ACTL7B, HAGHL,
1.459
0.799
0.66


RPL15, DOM3Z, FBXO9,
1.459
0.792
0.667


RRP41, PELO, FBXO9,
1.458
0.793
0.664


PHIP, FBXO9, MAP3K7,
1.457
0.756
0.701


RPL15, DOM3Z, RPL34
1.457
0.785
0.672



















TABLE 5





Panel
S + S
Sensitivity
Specificity


















PIAS2, MLF1, KIT, NME6,
1.557
0.87
0.686


PIAS2, MLF1, KIT, MGC42105,
1.557
0.882
0.675


PIAS2, MLF1, KIT, STK11,
1.555
0.881
0.674


PIAS2, MLF1, KIT, PACE-1,
1.555
0.871
0.684


TUBA1, PIAS2, KIT, CKS1B,
1.553
0.872
0.681


PIAS2, MLF1, KIT, SNARK,
1.553
0.868
0.684


PIAS2, MLF1, KIT, CDK3,
1.552
0.871
0.681


PIAS2, ACTL7B, KIT, FLJ20574,
1.551
0.843
0.708


STK4, KIT, CCT5, DOM3Z,
1.55
0.825
0.725


PIAS2, MLF1, KIT, IRAK1,
1.549
0.877
0.672


PIAS2, MLF1, KIT, CDC2,
1.549
0.874
0.675


RPL15, PIAS2, KIT, STK4,
1.549
0.862
0.687


PIAS2, MLF1, KIT, FGFR4_aa 25-369,
1.549
0.879
0.67


PIAS2, MLF1, KIT, ITPK1,
1.549
0.867
0.682


PIAS2, MLF1, KIT, STK24,
1.549
0.884
0.665


STK4, KIT, CCNI, CCT3,
1.547
0.816
0.731


TUBA1, PIAS2, KIT, CDK3,
1.546
0.874
0.671


PIAS2, MLF1, KIT, PTK2,
1.545
0.852
0.693


TUBA1, PIAS2, KIT, CDKN2D,
1.545
0.87
0.675


PIAS2, MLF1, KIT, STK38,
1.545
0.872
0.673


TUBA1, PIAS2, KIT, PDK3,
1.544
0.868
0.677


PIAS2, ACTL7B, KIT, STK17B,
1.544
0.833
0.712


PIAS2, IFI16, KIT, NME6,
1.544
0.869
0.676


PIAS2, MLF1, KIT, TOPK,
1.544
0.868
0.675


PIAS2, MLF1, KIT, FGFR2,
1.544
0.872
0.671



















TABLE 6





Panel
S + S
Sensitivity
Specificity


















PIAS2, CCNI, KIT, ITPK1, RPL34,
1.598
0.868
0.73


PIAS2, MLF1, KIT, ITPK1, BAG3,
1.593
0.879
0.714


PIAS2, MLF1, KIT, NME6, FLJ13081,
1.588
0.889
0.699


PIAS2, MLF1, KIT, PIM1, CCT3,
1.587
0.867
0.72


PIAS2, MLF1, KIT, STK4, MAPK7,
1.586
0.878
0.708


PIAS2, CCNI, KIT, MAP2K5, RPL34,
1.586
0.872
0.713


PIAS2, CCNI, KIT, CDK3, RPL34,
1.585
0.874
0.711


PIAS2, MLF1, KIT, SNARK, BAG3,
1.585
0.878
0.707


PIAS2, MLF1, KIT, NME6, PITRM1,
1.583
0.878
0.705


PIAS2, ACTL7B, KIT, CDK3, MIF,
1.582
0.857
0.726


STK4, KIT, CCT5, DOM3Z, PIAS2,
1.582
0.846
0.736


RPL15, PIAS2, KIT, MGC42105,
1.581
0.882
0.699


KIAA0643,


PIAS2, MLF1, KIT, MGC42105, BAG3,
1.581
0.886
0.695


RPL15, PIAS2, KIT, NTRK3, KATNB1,
1.581
0.885
0.696


PIAS2, CCNI, KIT, LOC91461, GRK5,
1.581
0.87
0.711


RPL15, PIAS2, KIT, STK4, MAPK7,
1.581
0.883
0.697


PIAS2, MLF1, KIT, STK11, PAPSS2,
1.58
0.888
0.692


RPL15, PIAS2, KIT, CDKN2B,
1.58
0.885
0.695


KIAA0643,


PIAS2, MLF1, KIT, NME6, BAG3,
1.58
0.88
0.7


PIAS2, MLF1, KIT, MGC42105, STK16,
1.58
0.894
0.686


PIAS2, MLF1, KIT, PDK4, RFK,
1.579
0.875
0.704


PIAS2, MLF1, KIT, NME6, HSPD1,
1.579
0.877
0.702


PIAS2, MLF1, KIT, AKT2, KIAA0643,
1.579
0.871
0.708


PIAS2, MLF1, KIT, PDPK1, BAG3,
1.579
0.887
0.691


RPL15, PIAS2, KIT, STK4, SDCCAG10,
1.578
0.882
0.696



















TABLE 7





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41,
1.633
0.898
0.734


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, RRP41,
1.626
0.897
0.729


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, DNAJA1,
1.626
0.899
0.727


TUBA1, PIAS2, KIT, CKS1B, STAT1, NR1I2,
1.62
0.893
0.726


TUBA1, PIAS2, KIT, CKS1B, STAT1, ZNFN1A3,
1.619
0.887
0.732


RPL15, PIAS2, KIT, RIPK1, KIAA0643, RRP41,
1.618
0.896
0.722


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL10,
1.617
0.887
0.731


PIAS2, ACTL7B, KIT, STK33, GTF2H2, KIT_aa 23-520,
1.616
0.887
0.729


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, TUBA1,
1.616
0.891
0.725


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A,
1.616
0.881
0.734


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41,
1.615
0.896
0.72


RPL15, PIAS2, KIT, STK4, MAPK7, KIAA0643,
1.614
0.893
0.72


TUBA1, PIAS2, KIT, CKS1B, STAT1, TFEC,
1.613
0.892
0.72


PIAS2, CCNI, KIT, STK17B, RPL34, PDGFRA_aa 24-524,
1.613
0.884
0.729


PIAS2, CCNI, KIT, PKE, RPL34, PDGFRA_aa 24-524,
1.613
0.883
0.73


TUBA1, PIAS2, KIT, CKS1B, STAT1, PITX2,
1.613
0.888
0.724


RPL15, PIAS2, KIT, STK4, DYRK4, KIAA0643,
1.612
0.907
0.704


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RNF38,
1.612
0.888
0.724


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, UTP14,
1.611
0.881
0.731


PIAS2, MLF1, KIT, AKT2, KIAA0643, IFI16,
1.611
0.893
0.718


PIAS2, CCNI, KIT, STK38, RPL34, PDGFRA_aa 24-524,
1.611
0.894
0.717


PIAS2, CCNI, KIT, ITPK1, RPL34, MLF1,
1.611
0.875
0.735


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, FGFR2_aa 22-377,
1.611
0.898
0.713


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4,
1.61
0.904
0.706


RPL15, PIAS2, KIT, STK17B, KIAA0643, RRP41,
1.61
0.883
0.727



















TABLE 8





Panel
S + S
Sensitivity
Specificity


















TUBA1, PIAS2, KIT, CKS1B, STAT1, NR1I2, KLF7,
1.652
0.892
0.76


RPL15, PIAS2, KIT, STK4, MAPK7, KIAA0643, KIF9,
1.65
0.9
0.75


PIAS2, CCNI, KIT, ITPK1, RPL34, FOXI1, STAT4,
1.648
0.885
0.764


PIAS2, ACTL7B, KIT, FGFR4_aa 25-369, MIF, SUCLA2,
1.648
0.9
0.748


DNAJA1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, RALBP1,
1.646
0.907
0.738


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, NEDD9,
1.644
0.912
0.732


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL32,
1.644
0.881
0.763


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18,
1.644
0.881
0.763


RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41, DDR1_aa 444-913,
1.643
0.898
0.746


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, DDIT3,
1.642
0.907
0.735


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, DNCLI2,
1.642
0.882
0.76


RPL15, PIAS2, KIT, STK17B, KIAA0643, STK4, HK1,
1.641
0.908
0.734


RPL15, PIAS2, KIT, STK4, STK38L, KIAA0643, PKE,
1.641
0.911
0.73


PIAS2, CCNI, KIT, CDK3, RPL34, FOXI1, STAT4,
1.641
0.885
0.756


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, HIPK1,
1.64
0.917
0.724


TCEB3, PIAS2, KIT, CKS1B, RPL18, ACTL7B, FOXI1,
1.64
0.879
0.761


PIAS2, CCNI, KIT, NTRK3, RPL34, C20orf97, FOXI1,
1.64
0.888
0.752


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D,
1.64
0.923
0.717


RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41, RHOT2,
1.639
0.902
0.737


RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41, PPP1R2P9,
1.639
0.902
0.737


PIAS2, CCNI, KIT, SNARK, RPL34, DYRK2_1, CSNK2A2,
1.638
0.877
0.761


RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41, PHF7,
1.638
0.9
0.738


RPL15, PIAS2, KIT, NTRK3, KATNB1, RRP41, GMEB1,
1.637
0.901
0.736


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, CDK3,
1.637
0.881
0.756


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, MAPK7,
1.637
0.898
0.739



















TABLE 9





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.695
0.912
0.783


TCEB3,


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, RRP41,
1.676
0.935
0.741


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, KIAA0643, PFN2,
1.674
0.898
0.776


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, CTAG2,
1.672
0.929
0.743


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, KRT15,
1.671
0.936
0.735


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL32, DNCLI2,
1.671
0.889
0.782


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, GRK5,
1.67
0.917
0.753


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, DYRK4,
1.668
0.881
0.787


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18,
1.667
0.879
0.788


MGC16169,


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, RNF38,
1.667
0.927
0.74


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, DDR1,
1.667
0.884
0.782


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, DNCLI2,
1.666
0.88
0.786


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D,
1.666
0.932
0.734


POLR2E,


RPL15, PIAS2, KIT, STK4, STK33, KIAA0643, RRP41, PFKFB3,
1.665
0.924
0.741


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, DNCLI2, MAPK7,
1.665
0.89
0.775


RPL15, PIAS2, KIT, CDKN2B, KIAA0643, STK4, CDKN2D, ACTL7B,
1.665
0.929
0.736


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, PFN2,
1.664
0.894
0.771


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, DNCLI2, CDK4,
1.664
0.892
0.772


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, RIOK2,
1.664
0.886
0.778


RPL15, PIAS2, KIT, STK4, STK33, KIAA0643, RRP41, CTBP2,
1.664
0.918
0.746


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, DNCLI2, MATK,
1.663
0.889
0.774


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, RPL18, CAMK2G,
1.663
0.886
0.777


PIAS2, ACTL7B, KIT, NTRK3, SUCLA2, RPL37A, DNCLI2, CDK3,
1.663
0.893
0.77


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, MAPK1,
1.663
0.914
0.749


HGRG8,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, MAPK1, AKT1,
1.663
0.908
0.755



















TABLE 10





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.712
0.922
0.79


TCEB3, AF5Q31,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.711
0.912
0.8


TCEB3, GSTT1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.709
0.918
0.791


TCEB3, RPLP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.708
0.914
0.795


TCEB3, KIF9,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.707
0.922
0.784


TCEB3, RALBP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.706
0.905
0.801


TCEB3, DNAJB1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.706
0.909
0.797


TCEB3, HGRG8,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.705
0.921
0.784


TCEB3, ELF2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.705
0.908
0.797


TCEB3, NRIP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.705
0.907
0.798


TCEB3, CARHSP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.705
0.916
0.789


TCEB3, HK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.705
0.912
0.792


TCEB3, JIK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.704
0.912
0.792


TCEB3, MAPK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.704
0.927
0.777


TCEB3, NFE2L2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.704
0.913
0.791


TCEB3, KRT8,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.703
0.919
0.785


TCEB3, COTL1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.703
0.917
0.787


TCEB3, GPRK6,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.703
0.915
0.788


TCEB3, ACAT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.703
0.918
0.784


TCEB3, POLR2E,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.703
0.911
0.791


TCEB3, CLK4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.702
0.916
0.786


TCEB3, TDRKH,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.702
0.909
0.793


TCEB3, CSNK1G1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.702
0.914
0.788


TCEB3, VCL,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.702
0.911
0.791


TCEB3, DDX55,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.702
0.922
0.78


TCEB3, TPD52,



















TABLE 11





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.726
0.913
0.813


TCEB3, POLR2E, RUVBL1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.722
0.923
0.799


TCEB3, POLR2E, SFRS5,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.722
0.918
0.804


TCEB3, KIF9, PRKD2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.721
0.923
0.798


TCEB3, NFE2L2, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.72
0.92
0.8


TCEB3, POLR2E, SSX4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.72
0.928
0.792


TCEB3, BATF, ZNF19,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.72
0.913
0.807


TCEB3, HGRG8, PRKAG3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.719
0.92
0.799


TCEB3, NRIP1, MAPK7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.719
0.916
0.803


TCEB3, HGRG8, MAPK7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.719
0.92
0.799


TCEB3, KIF9, AAK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.719
0.916
0.803


TCEB3, DNAJB1, TPD52,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.719
0.91
0.809


TCEB3, BOP1, ZMAT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.916
0.802


TCEB3, KIF9, PCTK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.913
0.805


TCEB3, CHEK1, LOC91461,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.915
0.803


TCEB3, KIF9, KLK3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.904
0.814


TCEB3, KIF9, ZMAT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.91
0.808


TCEB3, DNAJB1, RGS19IP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.921
0.797


TCEB3, SFRS5, RPS6KL1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.916
0.802


TCEB3, HGRG8, SRPK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.911
0.807


TCEB3, CALM1, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.917
0.801


TCEB3, ACAT2, LMNA,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.718
0.926
0.791


TCEB3, POLR2E, SSX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.717
0.907
0.81


TCEB3, STK11, RPL18,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.717
0.92
0.797


TCEB3, RPLP1, JIK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.717
0.919
0.798


TCEB3, KIF9, TPM3,



















TABLE 12





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.735
0.932
0.803


TCEB3, POLR2E, GTF2H2, RPS6KA1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.733
0.919
0.814


TCEB3, POLR2E, RUVBL1, TTK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.732
0.92
0.813


TCEB3, POLR2E, SFRS5, BOP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.732
0.923
0.809


TCEB3, POLR2E, SSX4, MKNK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.924
0.807


TCEB3, POLR2E, SSX4, ACAT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.923
0.808


TCEB3, POLR2E, SSX4, CAMK2D,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.93
0.802


TCEB3, POLR2E, SSX4, EGFR_aa 669-1210,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.92
0.811


TCEB3, POLR2E, SSX4, VIM,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.92
0.811


TCEB3, POLR2E, SSX4, CSK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.921
0.81


TCEB3, POLR2E, SSX4, ALDOA,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.923
0.808


TCEB3, POLR2E, SSX4, HK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.923
0.807


TCEB3, POLR2E, SSX4, PDK3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.731
0.922
0.808


TCEB3, POLR2E, SSX4, CSNK2A1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.924
0.807


TCEB3, POLR2E, SSX4, C20orf97,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.921
0.809


TCEB3, POLR2E, SSX4, PTK6,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.925
0.805


TCEB3, POLR2E, SFRS5, PCTK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.92
0.81


TCEB3, POLR2E, SSX4, EMS1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.924
0.805


TCEB3, POLR2E, SSX4, CABC1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.921
0.809


TCEB3, POLR2E, SSX4, RPS6KL1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.73
0.917
0.813


TCEB3, POLR2E, RUVBL1, RPLP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.729
0.917
0.813


TCEB3, POLR2E, SSX4, APEG1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.729
0.919
0.811


TCEB3, POLR2E, PHKG2, LRRFIP2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.729
0.92
0.809


TCEB3, EEF1A1, APEG1, TDRD3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.729
0.924
0.805


TCEB3, RPLP1, ACTL7B, ZMAT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.729
0.921
0.808


TCEB3, POLR2E, SSX4, BMX,



















TABLE 13





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.747
0.932
0.814


TCEB3, POLR2E, GTF2H2, RPS6KA1, MAPK14,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.746
0.931
0.816


TCEB3, POLR2E, GTF2H2, RPS6KA1, BUB1B,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.746
0.926
0.819


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK32A,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.745
0.928
0.817


TCEB3, POLR2E, GTF2H2, RPS6KA1, PRKD2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.745
0.93
0.814


TCEB3, POLR2E, GTF2H2, RPS6KA1, DYRK4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.744
0.929
0.815


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.743
0.936
0.807


TCEB3, POLR2E, GTF2H2, RPS6KA1, CAMK4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.743
0.932
0.812


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.743
0.933
0.81


TCEB3, POLR2E, GTF2H2, RPS6KA1, SPG20,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.743
0.929
0.814


TCEB3, POLR2E, GTF2H2, RPS6KA1, PACE-1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.932
0.811


TCEB3, POLR2E, GTF2H2, RPS6KA1, H11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.925
0.817


TCEB3, POLR2E, GTF2H2, RPS6KA1, CAMKK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.929
0.813


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK16,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.919
0.823


TCEB3, POLR2E, GTF2H2, RPS6KA1, AHCY,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.928
0.813


TCEB3, POLR2E, GTF2H2, RPS6KA1, RPS6KL1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.931
0.811


TCEB3, POLR2E, GTF2H2, RPS6KA1, BCKDK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.742
0.929
0.812


TCEB3, POLR2E, GTF2H2, RPS6KA1, NFIB,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.741
0.93
0.81


TCEB3, POLR2E, SSX4, PTK6, NME7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.741
0.932
0.809


TCEB3, POLR2E, GTF2H2, RPS6KA1, UQCRC1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.924
0.816


TCEB3, POLR2E, SSX4, CSK, LDHB,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.935
0.805


TCEB3, POLR2E, GTF2H2, RPS6KA1, TK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.918
0.822


TCEB3, STK11, RPL18, BANK1, CALM1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.922
0.818


TCEB3, POLR2E, SFRS5, BOP1, LDHB,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.923
0.816


TCEB3, POLR2E, SSX4, LDHB, PCTK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.74
0.923
0.817


TCEB3, POLR2E, SSX4, ALDOA, HK1,



















TABLE 14





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.758
0.928
0.831


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.756
0.93
0.826


TCEB3, POLR2E, GTF2H2, RPS6KA1, DYRK4, HRB2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.755
0.922
0.834


TCEB3, POLR2E, GTF2H2, RPS6KA1, AHCY, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.754
0.935
0.818


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3, SOX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.928
0.826


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, CTBP2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.932
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, BUB1B, PHKG2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.923
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, PACE-1, AHCY,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.93
0.822


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3, KIF9,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.93
0.822


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3, BMPR1B,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.753
0.923
0.829


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.931
0.822


TCEB3, POLR2E, GTF2H2, RPS6KA1, H11, NLK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.93
0.823


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK32A, CSNK2A1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.928
0.824


TCEB3, POLR2E, GTF2H2, RPS6KA1, DYRK4, BIRC3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.931
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TRB2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.928
0.824


TCEB3, POLR2E, GTF2H2, RPS6KA1, BUB1B, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.928
0.824


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK32A, SOX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.93
0.822


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK32A, PHKG2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.931
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, H11, TRB2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.931
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3, CKM,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.917
0.835


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, PRKAA1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.93
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, FLJ10377,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.752
0.929
0.822


TCEB3, POLR2E, GTF2H2, RPS6KA1, DDR1, RARA,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.751
0.931
0.82


TCEB3, POLR2E, GTF2H2, RPS6KA1, SOX2, ADCK4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.751
0.93
0.821


TCEB3, POLR2E, GTF2H2, RPS6KA1, DYRK4, SNX6,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.751
0.933
0.818


TCEB3, POLR2E, GTF2H2, RPS6KA1, SPG20, MAPK11,



















TABLE 15





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.764
0.932
0.832


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, MAPK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.763
0.917
0.846


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, HGRG8,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.763
0.922
0.841


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.926
0.836


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, CDC2L1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.932
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, H11, TLK2, NME7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.928
0.834


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, TLK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.932
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, CSNK1G1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.933
0.829


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK3, SOX2, CSK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.762
0.925
0.836


TCEB3, POLR2E, GTF2H2, RPS6KA1, AHCY, STK11, TDRD3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.933
0.829


TCEB3, POLR2E, GTF2H2, RPS6KA1, H11, HRB2, NDUFV3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.929
0.833


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, RBM6,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.931
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TRB2, C1orf33,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.926
0.835


TCEB3, POLR2E, GTF2H2, RPS6KA1, RPS6KL1, STK11, KIT_aa


544-976,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.93
0.831


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, RHOT2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.931
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, ADCK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.93
0.831


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK32A, SOX2, STK11,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.925
0.836


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, MAPK12,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.933
0.828


TCEB3, POLR2E, GTF2H2, RPS6KA1, DYRK4, SNX6, SOX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.761
0.931
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, RPLP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.931
0.829


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, MST4,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.93
0.83


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, CDK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.928
0.832


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, PRKCBP1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.932
0.828


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, KRT8,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.925
0.835


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, AHCY, RAB11FIP3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.76
0.934
0.826


TCEB3, POLR2E, GTF2H2, RPS6KA1, DDR1, STK11, EGFR_aa


669-1210,



















TABLE 16





Panel
S + S
Sensitivity
Specificity


















RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.774
0.931
0.842


TCEB3, POLR2E, GTF2H2, RPS6KA1, AHCY, STK11, TDRD3,


STK24,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.771
0.936
0.834


TCEB3, POLR2E, GTF2H2, RPS6KA1, AHCY, STK11, TDRD3,


MAPK7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.77
0.93
0.84


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1, JIK,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.769
0.932
0.837


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, PRKACG,


NME7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.769
0.935
0.834


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TRB2, SSX2, BMX,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.768
0.927
0.842


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, CDC2L1,


SOX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.768
0.931
0.837


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, NME7,


RNASEL,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.768
0.93
0.839


TCEB3, POLR2E, GTF2H2, RPS6KA1, RPS6KL1, NDUFV3, PIM1,


GFAP,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.768
0.924
0.844


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, HGRG8,


NME7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.768
0.935
0.833


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, SSX2, TRB2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.93
0.838


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


P4HB,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.929
0.838


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, DNCLI2, NLK,


PRKAA1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.934
0.833


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, NME7, LIMK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.929
0.838


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1, TK1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.928
0.839


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


TPM1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.926
0.84


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, PRKACG,


SOX2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.923
0.844


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


MEF2A,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.935
0.832


TCEB3, POLR2E, GTF2H2, RPS6KA1, NEK11, BANK1, STK11,


NTRK2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.936
0.831


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


MAPK7,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.934
0.832


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, NME7,


MAP3K6,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.931
0.836


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, PRKACG,


PCTK3,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.931
0.836


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TRB2, C1orf33,


TARDBP,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.767
0.924
0.842


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, TLK2, CDC2L1,


TBC1D2,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.766
0.937
0.829


TCEB3, POLR2E, GTF2H2, RPS6KA1, PDK4, STK11, BANK1,


PTK2_1,


RPL15, PIAS2, KIT, MAP2K5, KIAA0643, RRP41, WDR45L,
1.766
0.932
0.835


TCEB3, POLR2E, GTF2H2, RPS6KA1, STK24, STK11, BANK1,


NTRK2,




















TABLE 17





No:(i)
Symbol(ii)
Name(iii)
GI(iv)
ID(v)



















1
ACTL7B
actin-like 7B
21707461
10880


2
AF5Q31
AF4/FMR2 family member 4
38614473
27125


3
AHCY
S-adenosylhomocysteine hydrolase
33869587
191


4
ALDOA
aldolase A fructose-bisphosphate transcript variant 1
13279256
226


5
AP2M1
adaptor-related protein complex 2, mu 1 subunit,
13436451
1173


6
BAG3
BCL2-associated athanogene 3
13623600
9531


7
BANK1
B-cell scaffold protein with ankyrin repeats 1
21619549
55024


8
BAT8
HLA-B associated transcript 8
12803700
10919


9
BCKDK
branched chain alpha-ketoacid dehydrogenase kinase
33873582
10295


10
BMX
BMX non-receptor tyrosine kinase
34189854
660


11
BRD2
bromodomain containing 2, mRNA (cDNA clone
39645316
6046




MGC:74927)


12
BUB1B
BUB1 budding uninhibited by benzimidazoles 1
17511776
701




homolog beta (yeast)


13
C6orf93
chromosome 6 open reading frame 93
33872922
84946


14
C9orf86
chromosome 9 open reading frame 86
18089263
55684


15
CALM1
calmodulin 1 (phosphorylase kinase delta)
33869376
801


16
CAMK4
calcium/calmodulin-dependent protein kinase IV
16876820
814


17
CAMKK2
calcium/calmodulin-dependent protein kinase kinase 2
33991300
10645




beta transcript varia


18
CCNI
cyclin I
38197480
10983


19
CCT3
chaperonin containing TCP1 subunit 3 (gamma)
14124983
7203


20
CDC2
cell division cycle 2 G1 to S and G2 to M transcript
15778966
983




variant 1


21
CDK3
cDNA clone MGC: 54300 complete cds
28839544
1018


22
CDKN2B
cyclin-dependent kinase inhibitor 2B (p15 inhibits
15680230
1030




CDK4) transcript varian


23
CDKN2D
cyclin-dependent kinase inhibitor 2D (p19 inhibits
38114834
1032




CDK4) transcript varian


24
CKS1B
CDC28 protein kinase regulatory subunit 1B
40226240
1163


25
COPG2
coatomer protein complex, subunit gamma 2
16924304
26958


26
CRYAB
crystallin alpha B
13937812
1410


27
CSK
c-src tyrosine kinase (CSK)
187475371
1445


28
CSNK2A1
casein kinase 2 alpha 1 polypeptide transcript variant 2
33991298
1457


29
D6S2654E
DNA segment on chromosome 6(unique) 2654
12654834
26240




expressed sequence


30
DDX55
DEAD (Asp-Glu-Ala-Asp) box polypeptide 55
34190861
57696


31
DNAJA1
DnaJ (Hsp40) homolog subfamily A member 1
14198244
3301


32
DNAJB1
DnaJ (Hsp40) homolog subfamily B member 1
38197192
3337


33
DNCLI2
dynein cytoplasmic light intermediate polypeptide 2
19684162
1783


34
DOM3Z
dom-3 homolog Z (C. elegans)
33878616
1797


35
DYRK4
dual-specificity tyrosine-(Y)-phosphorylation regulated
21411487
8798




kinase 4


36
EEF1D
eukaryotic translation elongation factor 1 delta
33988346
1936




(guanine nucleotide exchange protein)


37
FBXO9
F-box only protein 9
33875682
26268


38
FGFR4_aa
fibroblast growth factor receptor 4, transcript variant 3
33873872
2264



25-369


39
FOXI1
forkhead box I1 transcript variant 2
20987405
2299


40
GCN5L2
GCN5 general control of amino-acid synthesis 5-like 2
21618599
2648




(yeast)


41
GRK5
G protein-coupled receptor kinase 5 mRNA (cDNA clone
40352898
2869




MGC: 71228)


42
GSTT1
glutathione S-transferase theta 1
13937910
2952


43
GTF2H2
general transcription factor IIH polypeptide 2 44 kDa
40674449
2966


44
H11
protein kinase H11
33877008
26353


45
H2AFY
H2A histone family member Y
15426457
9555


46
HGRG8
high-glucose-regulated protein 8
33990650
51441


47
HK1
hexokinase 1 transcript variant 1
33869444
3098


48
IFI16
interferon gamma-inducible protein 16
16877621
3428


49
IGHG1
immunoglobulin heavy constant gamma 1 (G1m
15779221
3500




marker)


50
IHPK2
inositol hexaphosphate kinase 2
18043110
51447


51
IRAK1
interleukin-1 receptor-associated kinase 1
15929004
3654


52
ITPK1
inositol 134-triphosphate 5/6 kinase
33869549
3705


53
JIK
STE20-like kinase
33877128
51347


54
KATNB1
katanin p80 (WD repeat containing) subunit B 1
38197184
10300


55
KIAA0643
KIAA0643 protein,
34190884
23059


56
KIF9
kinesin family member 9
34193691
64147


57
KIT
v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene
47938801
3815




homolog


58
KIT_aa 23-
v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene
47938801
3815



520
homolog, mRNA (cDNA clone MGC: 87427)


59
KRT15
keratin 15
33876966
3866


60
LDHB
lactate dehydrogenase B
12803116
3945


61
LIMS1
LIM and senescent cell antigen-like domains 1
13529136
3987


62
LMNA
lamin A/C transcript variant 2
33991068
4000


63
LYK5
protein kinase LYK5, mRNA (cDNA clone MGC: 10181)
27696779
92335


64
MAP2K5
mitogen-activated protein kinase kinase 5, transcript
33871775
5607




variant A


65
MAP2K7
mitogen-activated protein kinase kinase 7
34192881
5609


66
MAPK14
mitogen-activated protein kinase 14 transcript variant 2
12652686
1432


67
MAPK7
mitogen-activated protein kinase 7 transcript variant 4
20988367
5598


68
MARK2
MAP/microtubule affinity-regulating kinase 2 mRNA
54261524
2011




(cDNA clone MGC: 99619)


69
MARK4
cDNA clone MGC: 88635 complete cds
47940615
57787


70
ME2
malic enzyme 2 NAD(+)-dependent mitochondrial
12652790
4200


71
MGC42105
hypothetical protein MGC42105
34783729
167359


72
MIF
macrophage migration inhibitory factor (glycosylation-
33875452
4282




inhibiting factor)


73
MLF1
myeloid leukemia factor 1
13937875
4291


74
MTO1
mitochondrial translation optimization 1 homolog (S. cerevisiae)
15029678
25821


75
NDUFV3
NADH dehydrogenase (ubiquinone) flavoprotein 3
33871569
4731




10 kDa


76
NFE2L2
nuclear factor (erythroid-derived 2)-like 2
15079436
4780


77
NME6
non-metastatic cells 6 protein expressed in (nucleoside-
38197001
10201




diphosphate kinase)


78
NRIP1
nuclear receptor interacting protein 1
25955638
8204


79
NTRK3
neurotrophic tyrosine kinase receptor type 3 transcript
15489167
4916




variant 3


80
P4HB
procollagen-proline 2-oxoglutarate 4-dioxygenase
14790032
5034




(proline 4-hydroxylase) b


81
PDGFRA_aa
platelet-derived growth factor receptor, alpha
39645304
5156



24-524
polypeptide,


82
PDK3
pyruvate dehydrogenase kinase isoenzyme 3
16198532
5165


83
PDK4
pyruvate dehydrogenase kinase isoenzyme 4
25955470
5166


84
PELO
pelota homolog (Drosophila)
33870521
53918


85
PFKFB3
6-phosphofructo-2-kinase/fructose-26-biphosphatase 3
26251768
5209


86
PFN2
profilin 2 transcript variant 1
17390097
5217


87
PHIP
pleckstrin homology domain interacting protein
14286225
55023


88
PHKG2
phosphorylase kinase gamma 2 (testis)
33876835
5261


89
PIAS2
Msx-interacting-zinc finger transcript variant alpha
15929521
9063


90
POLR2E
polymerase (RNA) II (DNA directed) polypeptide E
13325243
5434




25 kDa


91
PPP2R5C
protein phosphatase 2 regulatory subunit B (B56)
16740598
5527




gamma isoform transcript


92
PRKCBP1
protein kinase C binding protein 1
21315038
23613


93
PSMD4
proteasome (prosome macropain) 26S subunit non-
38197196
5710




ATPase 4 transcript varia


94
RALBP1
ralA binding protein 1
15341886
10928


95
RGS19IP1
regulator of G-protein signalling 19 interacting protein 1
33988493
10755


96
RHOT2
ras homolog gene family member T2
15928946
89941


97
RNF12
ring finger protein 12, transcript variant 1
33872118
51132


98
RNF38
ring finger protein 38
21707089
152006


99
RPL10
ribosomal protein L10
13097176
6134


100
RPL13A
ribosomal protein L13a
38197177
23521


101
RPL15
ribosomal protein L15
15928752
6138


102
RPL18
ribosomal protein L18
38197133
6141


103
RPL18A
ribosomal protein L18a
38196939
6142


104
RPL27A
ribosomal protein L27a
13529097
6157


105
RPL30
ribosomal protein L30
34783378
6156


106
RPL32
ribosomal protein L32
15079341
6161


107
RPL34
ribosomal protein L34 transcript variant 2
12804692
6164


108
RPL37A
ribosomal protein L37a
34783289
6168


109
RPLP1
ribosomal protein large P1
13097206
6176


110
RPS6KA1
ribosomal protein S6 kinase 90 kDa polypeptide 1
15929012
6195


111
RRP41
exosome complex exonuclease RRP41
38114779
54512


112
RUVBL1
RuvB-like 1 (E. coli)
12804268
8607


113
SFRS5
splicing factor arginine/serine-rich 5
33869323
6430


114
SNARK
likely ortholog of rat SNF1/AMP-activated protein
33878200
81788




kinase


115
SOX2
SRY (sex determining region Y)-box 2
33869633
6657


116
SSX2
synovial sarcoma X breakpoint 2 transcript variant 2
33872900
6757


117
SSX4
synovial sarcoma X breakpoint 4 transcript variant 1
13529094
6759


118
STAT1
signal transducer and activator of transcription 1 91 kDa
33877045
6772




transcript varian


119
STK11
serine/threonine kinase 11 (Peutz-Jeghers syndrome)
33872385
6794


120
STK24
serine/threonine kinase 24 (STE20 homolog yeast)
23274190
8428


121
STK3
serine/threonine kinase 3 (STE20 homolog yeast)
34189966
6788


122
STK32A
hypothetical protein MGC22688
18203872
202374


123
STK33
serine/threonine kinase 33
22658391
65975


124
STK4
serine/threonine kinase 4 (STK4)
38327560
6789


125
SUCLA2
succinate-CoA ligase ADP-forming beta subunit
34783884
8803


126
TADA3L
transcriptional adaptor 3 (NGG1 homolog yeast)-like
38114820
10474




transcript variant 2


127
TCEB3
transcription elongation factor B (SIII) polypeptide 3
38197222
6924




(110 kDa elongin A)


128
TCF4
transcription factor 4
21410271
6925


129
TDRD3
tudor domain containing 3
20987778
81550


130
TK1
thymidine kinase 1 soluble
39644822
7083


131
TLK2
tousled-like kinase 2 mRNA (cDNA clone MGC: 44450)
27924134
11011


132
TPM3
tropomyosin 3
15929958
7170


133
TRB2
tribbles homolog 2
33990940
28951


134
TRIM37
tripartite motif-containing 37
23271191
4591


135
TUBA1
tubulin alpha 1 (testis specific)
37589861
7277


136
UTP14
serologically defined colon cancer antigen 16,
12654624
10813


137
VCL
vinculin
24657578
7414


138
WDR45L
hypothetical protein 628
12803025
56270


139
ZMAT2
zinc finger matrin type 2
34785080
153527


140
EEF1G
Eukaryotic translation elongation factor 1 gamma
38197136
1937


141
RNF38
ring finger protein 38
21707089
152006


142
PHLDA2
pleckstrin homology-like domain, family A, member 2
13477152
7262


143
KCMF1
Potassium channel modulatory factor 1
13111812
56888


144
NUBP2
Nucleotide binding protein 2 (MinD homolog, E. coli)
33990898
10101


145
VPS45A
Vacuolar protein sorting 45A (yeast)
15277874
11311





Columns



(i)This number is the SEQ ID NO: for the coding sequence for the auto-antigen biomarker, as shown in the sequence listing.




(ii)The “Symbol” column is as described for Table 1.




(iii)This name is taken from the Official Full Name provided by NCBI. An antigen may have been referred to by one or more pseudonyms in the prior art. The invention relates to these antigens regardless of their nomenclature.




(iv)A “GI” number, “GenInfo Identifier”, is a series of digits assigned consecutively to each sequence record processed by NCBI when sequences are added to its databases. The GI number bears no resemblance to the accession number of the sequence record. When a sequence is updated (e.g. for correction, or to add more annotation or information) it receives a new GI number. Thus the sequence associated with a given GI number is never changed.




(v)The “ID” column shows the Entrez GeneID number for the antigen marker. An Entrez GeneID value is unique across all taxa.


















TABLE 18







Symbol(i)
No.(ii)
HGNC(iii)




















ACTL7B
1
162



BAG3
6
939



C6orf93
13
21173



CCNI
18
1595



CCT3
19
1616



CDK3
21
1772



CKS1B
24
19083



COPG2
25
2237



DNCLI2
33
2966



DOM3Z
34
2992



EEF1D
36
3211



FBXO9
37
13588



GTF2H2
43
4656



IGHG1
49
5525



KATNB1
54
6217



KIAA0643
55
19009



KIT
57
6342



MAP2K5
64
6845



MAP2K7
65
6847



MARK4
69
13538



MGC42105
71



MLF1
73
7125



MTO1
74
19261



NFE2L2
76
7782



NME6
77
20567



NTRK3
79
8033



PFKFB3
85
8874



PIAS2
89
17311



POLR2E
90
9192



PRKCBP1
92
9397



RALBP1
94
9841



RPL15
101
10306



RPL18A
103
10311



RPL34
107
10340



RPL37A
108
10348



RPS6KA1
110
10430



RRP41
111
18189



SSX4
117
11338



STK4
124
11408



SUCLA2
125
11448



TCEB3
127
11620



TRIM37
134
7523



TUBA1
135
12407



WDR45L
138
25072







Columns




(i)The “Symbol” column gives the gene symbol which has been approved by the HGNC. The symbol thus identifies a unique human gene. This symbol can be related via Table 17 to the gene's Official Full Name provided by NCBI.





(ii)This number is the SEQ ID NO: for the coding sequence for the auto-antigen biomarker, as shown in Table 17.





(iii)The HUGO Gene Nomenclature Committee aims to give unique and meaningful names to every human gene. The HGNC number thus identifies a unique human gene.















TABLE 19





Panel
Biomarker
















1
ACTL7B, KIT, EEF1G


2
RPL15, KIT, PABPC1


3
PIAS2, KIT, RPL15


4
RPL15, KIT, PHLDA2


5
PIAS2, KIT, TCEB3


6
KIT, KCMF1, KIF9


7
ACTL7B, KIT, TCEB3


8
RNF38, KIT, CALM1


9
RRP41, KIT, NUBP2


10
KIT, RNF38, VPS45A


11
RPL15, KIT, PIAS2


12
TCF4, KIT, CALM1


13
RNF38, KIT, MAPK1



















TABLE 20





Symbol(i)
Name(ii)
GI(iii)
ID(iv)


















KIT
v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene
47938801
3815



homolog


PIAS2
Msx-interacting-zinc finger transcript variant alpha
15929521
9063


RPL15
ribosomal protein L15,
15928752
6138


ACTL7B
actin-like 7B,
21707461
10880


EEF1G
Eukaryotic translation elongation factor 1 gamma
38197136
1937


TCEB3
transcription elongation factor B (SIII) polypeptide 3
38197222
6924



(110 kDa elongin A)


RNF38
ring finger protein 38,
21707089
152006


CALM1
calmodulin 1 (phosphorylase kinase delta)
33869376
801


PHLDA2
pleckstrin homology-like domain, family A, member 2
13477152
7262


KCMF1
Potassium channel modulatory factor 1
13111812
56888


KIF9
kinesin family member 9
34193691
64147


MAPK1
mitogen-activated protein kinase 1, transcript variant 2
17389605
5594


NUBP2
Nucleotide binding protein 2 (MinD homolog, E. coli)
33990898
10101


PABPC1
Poly(A) binding protein, cytoplasmic 1
33872187
26986


RRP41
exosome complex exonuclease RRP41
38114779
54512


TCF4
transcription factor 4
21410271
6925


VPS45A
Vacuolar protein sorting 45A (yeast)
15277874
11311





Columns



(i)The “Symbol” column is as described for Table 1.




(ii)This name is taken from the Official Full Name provided by NCBI. An antigen may have been referred to by one or more pseudonyms in the prior art. The invention relates to these antigens regardless of their nomenclature.




(iii)A “GI” number, “GenInfo Identifier”, is a series of digits assigned consecutively to each sequence record processed by NCBI when sequences are added to its databases. The GI number bears no resemblance to the accession number of the sequence record. When a sequence is updated (e.g. for correction, or to add more annotation or information) it receives a new GI number. Thus the sequence associated with a given GI number is never changed.




(iv)The “ID” column shows the Entrez GeneID number for the antigen marker. An Entrez GeneID value is unique across all taxa.


















TABLE 21





No:(i)
Symbol(ii)
Name(iii)
GI(iv)
ID(v)



















140
EEF1G
Eukaryotic translation elongation factor 1 gamma
38197136
1937


141
RNF38
ring finger protein 38,
21707089
152006


142
PHLDA2
pleckstrin homology-like domain, family A, member 2
13477152
7262


143
KCMF1
Potassium channel modulatory factor 1
13111812
56888


144
NUBP2
Nucleotide binding protein 2 (MinD homolog, E. coli)
33990898
10101


145
VPS45A
Vacuolar protein sorting 45A (yeast)
15277874
11311





Columns



(i)This number is the SEQ ID NO: for the coding sequence for the auto-antigen biomarker, as shown in the sequence listing.




(ii)The “Symbol” column is as described for Table 1.




(iii)This name is taken from the Official Full Name provided by NCBI. An antigen may have been referred to by one or more pseudonyms in the prior art. The invention relates to these antigens regardless of their nomenclature.




(iv)A “GI” number, “GenInfo Identifier”, is a series of digits assigned consecutively to each sequence record processed by NCBI when sequences are added to its databases. The GI number bears no resemblance to the accession number of the sequence record. When a sequence is updated (e.g. for correction, or to add more annotation or information) it receives a new GI number. Thus the sequence associated with a given GI number is never changed.




(v)The “ID” column shows the Entrez GeneID number for the antigen marker. An Entrez GeneID value is unique across all taxa.







REFERENCES



  • [1] Habash-Bseiso (2005) Clin Med. Res. 3(3): 190-3.

  • [2] Antico et al. (2010) Lupus doi: 10.1177/0961203310362995.

  • [3] Sherer et al. (2004) Arthritis Rheum. 34(2):501-37.

  • [4] Pa ppworth et al. (2009) Mol Immunol 46:1042-9.

  • [5] Vanderlugt & Miller (1996) Curr Opin Immunol. 8:831-6.

  • [6] Cheung et al. (2000) Nucleic Acids Res. 28(1):361-3. http://alfred.med.yale.edu/a/fred/[7] McKusick (1998) Mendelian Inheritance in Man. A Catalog of Human Genes and Genetic Disorders. Baltimore: Johns Hopkins University Press, 1998 (12th edition). See also http://www.ncbi.nlm.nih.gov/omim/.

  • [8] Stenson et al. (2009) Genome Med 1:13.

  • [9] Stamm et al. (2006) Nucleic Acids Res 34: D46-D55.

  • [10] Sonn et al. (2005) Lupus Prostatic Dis 8:304-10.

  • [11] Costenbader et al. (2007) Arthritis Rheum. 56(4):1251-62.

  • [12] Geysen et al. (1984) PNAS USA 81:3998-4002.

  • [13] Carter (1994) Methods Mol Biol 36:207-23.

  • [14] Jameson, B A et al. 1988, CABIOS 4(1):181-186.

  • [15] Maksyutov & Zagrebelnaya (1993) Comput Appl Biosci 9(3):291-7.

  • [16] Hopp (1993) Peptide Research 6:183-190.

  • [17] Welling et al. (1985) FEBS Lett. 188:215-218.

  • [18] Bublil et al. (2007) Proteins 68(1):294-304.

  • [19] Sun et al. (2009) Nucleic Acids Res 37:W612-6.

  • [20] Raddrizzani & Hammer (2000) Brief Bioinform 1(2):179-89.

  • [21] Chen et al. (2007) Amino Acids 33(3):423-8.

  • [22] Reimer (2009) Methods Mol Biol 524:335-44.

  • [23] Boutell et al. (2004) Proteomics 4:1950-8.

  • [24] Tassinari et al. (2008) Curr Opin Mol Ther 10:107-15.

  • [25] Stoevesandt et al. (2009) Expert Rev Proteomics 6:145-57.

  • [26] Tao et al. (2007) Comb Chem High Throughput Screen 10:706-18.

  • [27] Gnjatic et al. (2009) J Immunol Methods 341:50-8.

  • [28] Hartmann et al. (2009) Anal Bioanal Chem 393:1407-16.

  • [29] Fall & Niessner (2009) Methods Mol Biol 509:107-22.

  • [30] WO01/57198.

  • [31] WO02/27327.

  • [32] Blackburn & Hart (2005) Methods Mol. Biol. 310:197-216

  • [33] WO03/064656.

  • [34] WO2004/046730.

  • [35] Stahl et al. (2006) Immunol Lett 102:50-9.

  • [36] Quintana (2008) PNAS USA 105:18889-94.

  • [37] Koopmann & Blackburn (2003) Rapid Commun Mass Spectrom. 17:455-62.

  • [38] WO01/61040.

  • [39] Oleinikov et al. (2003) J Proteome Res. 2:313-9.

  • [40] Bolstad et al. (2003) Bioinformatics 19:185-93.

  • [41] Meyer et al. (2003) Neurocomputing 55:169-86.

  • [42] Koza (1992), Genetic Programming: On the Programming of Computers by Means of Natural Selection, MIT Press.

  • [43] Wang & Japkowicz (2008) Lecture Notes in Computer Science 4994/2008, 38-47.

  • [44] Elkon & Casali (2008) Nat Clin Pract Rheumatol. 4(9):491-8.

  • [45] Chada et al. (2003) Curr Opin Drug Discov Devel. 6(2):169-73.

  • [46] Chene (2003) Nature Reviews Cancer 3, 102-109.

  • [47] Wang & EI-Deiry (2008) Curr Opin Oncol. 20(1):90-6.

  • [48] Current Protocols in Molecular Biology (F. M. Ausubel et al., eds., 1987) Supplement 30

  • [49] Smith & Waterman (1981) Adv. Appl. Math. 2: 482-489.

  • [50] Huber et al. (2002) Bioinformatics 18 suppl. 1 S96-S104.

  • [51] Martinez-Lostao et al. (2007) Lupus. 16(7):483-8.


Claims
  • 1. A method for analysing a subject sample, comprising a step of determining the levels of x different biomarkers in the sample, wherein the levels of the biomarkers provide a diagnostic indicator of whether the subject has lupus; wherein x is 1 or more and wherein the x different biomarkers are selected from auto-antibodies against (i) KIT, (ii) C6orf93, (iii) RPL34, (iv) DOM3Z, (v) COPG2, (vi) DNCL12, (vii) RRP41, (viii) FBXO9, (ix) RALBP1, (x) PIAS2, (xi) EEF1D, (xii) CONI, (xiii) KATNB1, (xiv) POLR2E, (xv) CCT3, (xvi) KIAA0643, (xvii) RPL37A, (xviii) GTF2H2, (xix) MAP2K5, (xx) CDK3, (xxi) RPS6KA1, (xxii) MARK4, (xxiii) MTO1, (xxiv) MGC42105, (xxv) NFE2L2, (xxvi) WDR45L, (xxvii) STK4, (xxviii) PFKFB3, (xxix) NTRK3, (xxx) MLF1, (xxxi) TRIM37, (xxxii) ACTL7B, (xxxiii) RPL18A, (xxxiv) CKS1B, (xxxv) TUBA1, (xxxvi) NME6, (xxxvii) SUCLA2, (xxxviii) IGHG1, (xxxix) PRKCBP1, (xl) BAG3, (xli) TCEB3, (xlii) RPL15, (xliii) SSX4, (xliv) MAP2K7, (xlv) EEF1G, (xlvi) RNF38, (xlvii) PHLDA2, (xlviii) KCMF1, (xlix) NUBP2, (I) VPS45A.
  • 2. The method of claim 1, wherein x is 2 or more.
  • 3. The method of claim 2, wherein x is 10 or more.
  • 4. The method of claim 1, wherein x is 50 or fewer.
  • 5. The method of claim 4, wherein x is 15 or fewer.
  • 6. The method of claim 1, wherein the method also includes a step of determining if a sample from the subject contains ANA and/or anti-DNA antibodies.
  • 7. The method of claim 1, wherein the sample is a body fluid.
  • 8. The method of claim 7, wherein the sample is blood, serum or plasma.
  • 9. The method of claim 1, wherein the subject is (i) pre-symptomatic for lupus or (ii) already displaying clinical symptoms of lupus.
  • 10. The method of claim 1, wherein the presence of auto-antibodies is determined using an immunoassay.
  • 11. The method of claim 10, wherein the immunoassay utilises an antigen comprising an amino acid sequence (i) having at least 90% sequence identity to an amino acid sequence encoded by a SEQ ID NO listed in Table 1, and/or (ii) comprising at least one epitope from an amino acid sequence encoded by a SEQ ID NO listed in Table 1.
  • 12. The method of claim 10, wherein the immunoassay utilises a fusion polypeptide with a first region and a second region, wherein the first region can react with an auto-antibody in a sample and the second region can react with a substrate to immobilise the fusion polypeptide thereon.
  • 13. The method of claim 1, wherein the subject is a human male.
  • 14. The method of claim 1, wherein the method involves comparing levels of the biomarkers in the subject sample to levels in (i) a sample from a patient with lupus and/or (ii) a sample from a patient without lupus.
  • 15. The method of claim 1, wherein the method involves analysing levels of the biomarkers in the sample with a classifier algorithm which uses the measured levels of to distinguish between patients with lupus and patients without lupus.
  • 16. The method of claim 2, wherein the 2 or more different biomarkers are: A panel comprising or consisting of 2 different biomarkers, namely: (i) a biomarker selected from Table 2 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 2 different biomarkers, namely: (i) a biomarker selected from Table 2 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 2 different biomarkers selected from Table 20.A panel comprising or consisting of 3 different biomarkers, namely: (i) a group of 2 biomarkers selected from Table 3 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 3 different biomarkers, namely: (i) a group of 2 biomarkers selected from Table 3 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 3 different biomarkers selected from Table 20.A panel comprising or consisting of 4 different biomarkers, namely: (i) a group of 3 biomarkers selected from Table 4 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 4 different biomarkers, namely: (i) a group of 3 biomarkers selected from Table 4 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 4 different biomarkers selected from Table 20.A panel comprising or consisting of 5 different biomarkers, namely: (i) a group of 4 biomarkers selected from Table 5 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 5 different biomarkers, namely: (i) a group of 4 biomarkers selected from Table 5 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 5 different biomarkers selected from Table 20.A panel comprising or consisting of 6 different biomarkers, namely: (i) a group of 5 biomarkers selected from Table 6 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 6 different biomarkers, namely: (i) a group of 5 biomarkers selected from Table 6 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 6 different biomarkers selected from Table 20.A panel comprising or consisting of 7 different biomarkers, namely: (i) a group of 6 biomarkers selected from Table 7 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 7 different biomarkers, namely: (i) a group of 6 biomarkers selected from Table 7 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 7 different biomarkers selected from Table 20.A panel comprising or consisting of 8 different biomarkers, namely: (i) a group of 7 biomarkers selected from Table 8 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 8 different biomarkers, namely: (i) a group of 7 biomarkers selected from Table 8 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 8 different biomarkers selected from Table 20.A panel comprising or consisting of 9 different biomarkers, namely: (i) a group of 8 biomarkers selected from Table 9 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 9 different biomarkers, namely: (i) a group of 8 biomarkers selected from Table 9 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 9 different biomarkers selected from Table 20.A panel comprising or consisting of 10 different biomarkers, namely: (i) a group of 9 biomarkers selected from Table 10 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 10 different biomarkers, namely: (i) a group of 9 biomarkers selected from Table 10 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 10 different biomarkers selected from Table 20.A panel comprising or consisting of 11 different biomarkers, namely: (i) a group of 10 biomarkers selected from Table 11 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 11 different biomarkers, namely: (i) a group of 10 biomarkers selected from Table 11 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 11 different biomarkers selected from Table 20.A panel comprising or consisting of 12 different biomarkers, namely: (i) a group of 11 biomarkers selected from Table 12 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 12 different biomarkers, namely: (i) a group of 11 biomarkers selected from Table 12 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 12 different biomarkers selected from Table 20.A panel comprising or consisting of 13 different biomarkers, namely: (i) a group of 12 biomarkers selected from Table 13 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 13 different biomarkers, namely: (i) a group of 12 biomarkers selected from Table 13 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 13 different biomarkers selected from Table 20.A panel comprising or consisting of 14 different biomarkers, namely: (i) a group of 13 biomarkers selected from Table 14 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 14 different biomarkers, namely: (i) a group of 13 biomarkers selected from Table 14 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of 14 different biomarkers selected from Table 20.A panel comprising or consisting of 15 different biomarkers, namely: (i) a group of 14 biomarkers selected from Table 15 and (ii) a further biomarker selected from Table 17.A panel comprising or consisting of 15 different biomarkers, namely: (i) a group of 14 biomarkers selected from Table 15 and (ii) a further biomarker selected from Table 1 or preferably Table 18.A panel comprising or consisting of a group of 15 different biomarkers selected from Table 16.A panel comprising or consisting of 15 different biomarkers selected from Table 20.
  • 17. A diagnostic device for use in diagnosis of lupus, wherein the device permits determination of the level(s) of 1 or more Table 1 biomarkers.
  • 18. The device of claim 17, wherein the device comprises a plurality of antigens immobilised on a solid substrate as an array.
  • 19. The device of claim 18, wherein the device contains antigens for detecting auto-antibodies against all of the antigens listed in Table 1.
  • 20. The device of claim 19, wherein the device contains antigens for detecting auto-antibodies against all of the antigens listed in Table 17.
  • 21. The device of claim 18, wherein the array includes one or more control polypeptides.
  • 22. The device of claim 21, comprising one or more an anti-human immunoglobulin antibody(s).
  • 23. The device of claim 16, including one or more replicates of an antigen.
  • 24. A method for analysing a subject sample, comprising a step of determining the levels of x different biomarkers in the sample, wherein the levels of the biomarkers provide a diagnostic indicator of whether the subject has lupus; wherein x is 1 or more and wherein the x different biomarkers are selected from auto-antibodies against (i) KIT, (ii) C6orf93, (iii) RPL34, (iv) DOM3Z, (v) COPG2, (vi) DNCL12, (vii) RRP41, (viii) FBXO9, (ix) RALBP1, (x) PIAS2, (xi) EEF1D, (xii) CONI, (xiii) KATNB1, (xiv) POLR2E, (xv) CCT3, (xvi) KIAA0643, (xvii) RPL37A, (xviii) GTF2H2, (xix) MAP2K5, (xx) CDK3, (xxi) RPS6KA1, (xxii) MARK4, (xxiii) MTO1, (xxiv) MGC42105, (xxv) NFE2L2, (xxvi) WDR45L, (xxvii) STK4, (xxviii) PFKFB3, (xxix) NTRK3, (xxx) MLF1, (xxxi) TRIM37, (xxxii) ACTL7B, (xxxiii) RPL18A, (xxxiv) CKS1B, (xxxv) TUBA1, (xxxvi) NME6, (xxxvii) SUCLA2, (xxxviii) IGHG1, (xxxix) PRKCBP1, (xl) BAG3, (xli) TCEB3, (xlii) RPL15, (xliii) SSX4, (xliv) MAP2K7, (xlv) EEF1G, (xlvi) RNF38, (xlvii) PHLDA2, (xlviii) KCMF1, (xlix) NUBP2, (I) VPS45A, using the device of claim 17.
  • 25. In a method for diagnosing if a subject has lupus, an improvement consisting of determining in a sample from the subject the level(s) of y biomarker(s) of Table 1, wherein y is 1 or more and the level(s) of the biomarker(s) provide a diagnostic indicator of whether the subject has lupus.
  • 26. A human antibody which recognises an antigen listed in Table 17 (preferably in Table 1).
Priority Claims (1)
Number Date Country Kind
1017520.6 Oct 2010 GB national
PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/IB2011/054572 10/14/2011 WO 00 8/22/2013