BIOMARKERS AND COMBINATIONS THEREOF FOR DIAGNOSING TUBERCULOSIS

FIELD OF THE INVENTION

This invention relates to the detection and diagnosis of tuberculosis. More specifically, the invention relates to new biomarkers and combinations thereof that enable the accurate detection and diagnosis of tuberculosis.

BACKGROUND OF THE INVENTION

Tuberculosis (TB) is a progressive, often fatal, infectious disease, caused by the bacterial pathogen Mycobacterium tuberculosis (M. tuberculosis, MTB). This is a significant cause of mortality worldwide, being the eighth largest leading cause of death globally, and is primarily a disease of poverty, particularly in developing countries. Latent TB infection is believed to affect as much as one third of the world's population.

Tuberculosis is a notifiable disease and is a major concern for many governmental and other health bodies including the World Health Organisation (WHO), who have initiated numerous control and treatment programmes like the “Stop TB Partnership”.

The WHO estimates that nearly nine million new cases of TB, and nearly two million deaths, occur globally each year. The largest number of new TB cases in 2005 occurred in South-East Asia (34% of incident cases globally), and the estimated incidence rate in sub-Saharan Africa is nearly 350 cases per 100,000 population. However, TB infection is not limited to the developing world: the UK has seen a resurgence of tuberculosis since the late 1980s and there are currently over 8000 new cases each year—a rate of 14.0 per 100,000 population. About 40% of these new cases occur in the London region, where the rate of infection is 44.8 per 100,000 population.

M. tuberculosis is capable of forming intracellular infections. These infections may be exclusively intracellular, or may contain both intracellular and extracellular components. Generally, M. tuberculosis bacilli do not circulate freely in the body, for example, in the bloodstream, and as such are often difficult to detect. They are also less amenable to drug treatment regimes. Intracellular survival and multiplication of mycobacteria is suspected to be a main contributory factor for mycobacterial disease progression.

The term “latency” is synonymous with “persistence”, and describes a reversible state of low metabolic activity in which mycobacterial cells can survive for extended periods with limited or no cell division. During latency (i.e. latent infection), the clinical symptoms associated with a mycobacterial infection do not become manifest.

The presence of a large reservoir of asymptomatic individuals latently-infected with mycobacteria is a major problem for the control of M. tuberculosis infections. In addition, conventional methods for the detection of a latent mycobacterial infection by skin testing may be compromised by BCG vaccination and by exposure to environmental mycobacteria.

Timely, accurate and sensitive diagnosis is imperative for disease control. This is a key priority for many health and immigration authorities, particularly at “point of entry” for developed countries where the majority of TB cases are imported. Optimal patient management requires early initiation of drug therapy and isolation of infectious individuals as soon as possible. Left untreated, each person with active TB disease will infect on average between 10 and 15 people every year. TB infection can normally be treated by a 6 month course of antibiotics; however, patient compliance to long-term drug treatment is varied, with patients often stopping therapy when their symptoms cease. Failure to complete the treatment regime can promote the development of multiple drug-resistant mycobacteria.

Despite considerable investment in surveillance, control and treatment programmes, as well as in research and development for new diagnostics and therapeutics, TB control and eradication has proved challenging. The standard methods used for TB diagnosis have not changed significantly in recent years in many routine diagnostic laboratories, and there is substantial evidence that TB diagnosis is subject to significant error, with up to 52% under-diagnosis reported in some studies using comparative indices between TB diagnosis methods as measured against autopsy observations.

Early detection of a disease condition typically allows for a more effective therapeutic treatment with a correspondingly more favourable clinical outcome. In view of the increasing threat and global prevalence of TB, new strategies are required for more effective prevention, treatment, and diagnosis of TB and M. tuberculosis infection. Ideally, diagnosis would be made by a technique that accurately, rapidly, and simultaneously measures a plurality of biomarkers at a single point in time, thereby minimizing disease progression during the time required for diagnosis.

SUMMARY OF THE INVENTION

Previous attempts to develop new diagnostic methods for TB have proved problematic. In particular, earlier work attempting to enable the accurate and timely diagnosis of early stage or latent infection TB, where symptoms may not be apparent and where detection of M. tuberculosis by culture or specific polymerase chain reaction (PCR) is not achieved, has faced challenges.

Other groups have investigated host biomarkers in active and latent TB. However, these methods were unable to maintain the required level of specificity for TB across different subgroups, such as different ethnic groups.

The present inventors have conducted a temporal differential gene expression study in peripheral blood leukocytes (PBLs) in an aerosol Macaca fascicularis non-human primate model of TB. Using this method, the inventors have identified host biomarkers associated with early exposure to TB. Microarray hybridisation analyses to human whole genome arrays have revealed many significant gene expression changes, showing substantial temporal changes in PBL gene expression in response to M. tuberculosis challenge across the time-course of the study. Using parametric and non-parametric tools for data analysis, including artificial neural network analysis, the inventors have identified highly-significant host biomarkers associated with TB and M. tuberculosis infections. The biomarkers identified by the present invention have improved specificity for TB across different subgroups, such as different ethnic groups.

Therefore, the present invention allows for accurate, rapid, and sensitive prediction and diagnosis of TB through a measurement of one or more biomarker taken from a biological sample at a single point in time.

Accordingly, the present invention provides the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8 as a biomarker for tuberculosis.

The invention also provides a method for diagnosing tuberculosis in an individual comprising determining the presence and/or amount of one or more biomarker for tuberculosis in a sample obtained from the individual, wherein the one or more biomarker for tuberculosis is selected from SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8.

The tuberculosis detected and/or diagnosed by the method or use of the present invention may be an active tuberculosis infection and the one or more biomarker a biomarker for an active tuberculosis infection.

Typically the one or more biomarker is selected from SNX10, CPVL, PF4V1 and HERC2, or any combination thereof. In a preferred embodiment, the one or more biomarker is selected from: (i) SNX10 and CREG1; and/or (ii) PF4V1 and HERC2.

The tuberculosis detected and/or diagnosed by the method or use of the present invention may be a latent tuberculosis infection and the one or more biomarker a biomarker for a latent tuberculosis infection. Typically the one or more biomarker for a latent tuberculosis infection is selected from PF4V1, LYN, CD52, HERC2, KLRAP1, DEFB128, LGALS3BP and IL8.

A use of the invention may comprise determining the presence and/or amount of the one or more biomarker for tuberculosis in a sample obtained from an individual.

The present invention also provides a use or method as defined herein, wherein said one or more biomarker is able to identify an individual with an active tuberculosis infection and/or an individual with a latent tuberculosis infection.

One or more additional biomarker for tuberculosis may be used in the method or use of the invention. The one or more additional biomarker may be (a) a biomarker for an active tuberculosis infection selected from: (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8; and/or (ii) a biomarker listed in Table 3; and/or (b) a biomarker for a latent tuberculosis infection selected from: (i) a biomarker listed in Table 4; and/or (ii) a biomarker listed in Table 5. In a preferred embodiment, the one or more additional biomarker for an active tuberculosis infection is selected from LOC400759/GBP1P1, CREG1, PSMB9, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9 and CD96. In a more preferred embodiment, the one or more biomarkers for tuberculosis are SNX10 and CPVL and the one or more additional biomarkers for tuberculosis are LOC400759/GBP1P1 and CREG1; and/or the one or more biomarkers for tuberculosis are PF4V1 and HERC2 and the one or more additional biomarkers for tuberculosis are LOC400759/GBP1P1 and ALPK1.

One or more further additional biomarkers may be used in the methods and/or uses of the invention. In one embodiment, the one or more further additional biomarker is PSMB9 and/or PF4V1. Alternatively and/or in addition, the one or more additional biomarker for an active tuberculosis infection, or the one or more further additional biomarker is: (i) GBP1, IRF1 and HLA-B; (ii) GBP1, IRF1, IFITM3 and S100A11; and/or (iii) GBP1, IRF1, MMP9 and CD96.

The presence and/or amount of the one or more biomarker for tuberculosis may be compared with the presence and/or amount of the one or more biomarker for tuberculosis in a control sample. The specificity of the comparison of the presence and/or amount of the one or more biomarker for tuberculosis in the sample and the presence and/or absence of the one or more biomarker for tuberculosis in the control diagnoses tuberculosis may be at least about 80%.

The presence and/or amount of the one or more biomarker for tuberculosis may be determined using an antibody and/or an oligonucleotide specific for said one or more biomarker. Typically, an oligonucleotide specific for said one or more biomarker is used. Preferably: (i) the one or more biomarker for tuberculosis is LOC400759/GBP1P1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 1, 2 or 3; (ii) the one or more biomarker for tuberculosis is PF4V1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 4 or 5; (iii) the one or more biomarker for tuberculosis is ALPK1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 6 or 7; (iv) the one or more biomarker for tuberculosis is HERC2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 8, 9 or 168 to 171; (v) the one or more biomarker for tuberculosis is LGALS3BP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 10 or 11; (vi) the one or more biomarker for tuberculosis is BST1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 12 or 13; (vii) the one or more biomarker for tuberculosis is SNX10 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 14 or 15; (viii) the one or more biomarker for tuberculosis is CREG1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 16 or 17; (ix) the one or more biomarker for tuberculosis is BAZ1A and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 18 or 19; (x) the one or more biomarker for tuberculosis is LYN and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 20 or 21; (xi) the one or more biomarker for tuberculosis is TAPBP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 22 or 23; (xii) the one or more biomarker for tuberculosis is SERPINB1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 24 or 25; (xiii) the one or more biomarker for tuberculosis is PSMB9 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 26 or 27; (xiv) the one or more biomarker for tuberculosis is WSB1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 28 or 29; (xv) the one or more biomarker for tuberculosis is MVP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 30 or 31; (xvi) the one or more biomarker for tuberculosis is APBB1IP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 32 or 33; (xvii) the one or more biomarker for tuberculosis is FYB and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 34 or 35; (xviii) the one or more biomarker for tuberculosis is MB21D1/C6orf150 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 36 or 37; (xix) the one or more biomarker for tuberculosis is CPVL and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 38 or 39; (xx) the one or more biomarker for tuberculosis is TICAM2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 40 or 41; (xxi) the one or more biomarker for tuberculosis is CD52 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 42 or 43; (xxii) the one or more biomarker for tuberculosis is KLRAP1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 44 or 45; (xxiii) the one or more biomarker for tuberculosis is DEFB128 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 46 or 47; (xxiv) the one or more biomarker for tuberculosis is IL8 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 48 or 49; (xxv) the one or more biomarker for tuberculosis is GBP1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 50 or 51; (xxvi) the one or more biomarker for tuberculosis is IRF1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 52 or 53; (xxvii) the one or more biomarker for tuberculosis is MMP9 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 54 or 55; (xxviii) the one or more biomarker for tuberculosis is CD96 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 56 or 57; (xxix) the one or more biomarker for tuberculosis is AIM2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 58 or 59; (xxx) the one or more biomarker for tuberculosis is CD274 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 60 or 61; (xxxi) the one or more biomarker for tuberculosis is CDH23 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 62 or 63; (xxxii) the one or more biomarker for tuberculosis is IFIT3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 64 or 65; (xxxiii) the one or more biomarker for tuberculosis is IFITM3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 66 or 67; (xxxiv) the one or more biomarker for tuberculosis is GK and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 68 or 69; (xxxv) the one or more biomarker for tuberculosis is NELL2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 70 or 71; (xxxvi) the one or more biomarker for tuberculosis is S100A11 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 72 or 73; (xxxvii) the one or more biomarker for tuberculosis is SAMD9L and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 74 or 75; (xxxviii) the one or more biomarker for tuberculosis is STAT1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 76 or 77; (xxxix) the one or more biomarker for tuberculosis is TLR6 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 78 or 79; (xl) the one or more biomarker for tuberculosis is WARS and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 80 or 81; (xli) the one or more biomarker for tuberculosis is DOCKS and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 82 or 83; (xlii) the one or more biomarker for tuberculosis is SIRPB2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 84 or 85; (xliii) the one or more biomarker for tuberculosis is ANKRD22 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 86 or 87; (xliv) the one or more biomarker for tuberculosis is ABCF2 (NM 005692.3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 88 or 89; (xlv) the one or more biomarker for tuberculosis is FNBP1L and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 90 or 91; (xlvi) the one or more biomarker for tuberculosis is NCF1C and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 92 or 93; (xlvii) the one or more biomarker for tuberculosis is TBC1D3B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 94 or 95; (xlviii) the one or more biomarker for tuberculosis is SLC14A1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 96 or 97; (xlix) the one or more biomarker for tuberculosis is CALCOCO2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 98 or 99; (l) the one or more biomarker for tuberculosis is GTF2B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 100 or 101; (li) the one or more biomarker for tuberculosis is HLA-B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 102 or 103; (lii) the one or more biomarker for tuberculosis is HLA-F and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 104 or 105; (liii) the one or more biomarker for tuberculosis is MGST2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 106 or 107; (liv) the one or more biomarker for tuberculosis is SPAST and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 108 or 109; and/or (lv) the one or more biomarker for tuberculosis is WAC and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 110 or 111 or 168 to 171.

The presence and/or absence of the at least one biomarker for tuberculosis in the individual may be determined at least twice using a separate sample taken each time the presence and/or absence of the at least one biomarker for tuberculosis is determined. The samples from the individual may be taken prior to, during and/or after treatment initiation.

The invention further provides a device for carrying out the use of the invention, or for use in a method of the invention, which comprises (i) one or more antibody specific for the one or more biomarker for tuberculosis; or (ii) one or more oligonucleotide specific for the one or more biomarker for tuberculosis. In a preferred embodiment, the one or more oligonucleotide specific for the one or more biomarker for tuberculosis comprised in the device is an oligonucleotide of the invention as defined herein.

DESCRIPTION OF FIGURES

FIG. 1: shows a box plot of LOC400759 normalised gene expression in Caucasian controls (CC); Controls of Asian descent recruited from Hindu temples in London who tested negative for TB in skin and/or IFNγ tests and originate from high-incidence areas of TB (NMRL CNTRL); individuals of Asian descent recruited from Hindu temples in London and test positive for TB in Mantoux skin and/or IFNγ tests and diagnosed with latent TB (NMRL LTNT); individuals with early stage active TB recruited at St. Thomas's and Royal Free hospitals in London (EATB); and individuals of Asian descent recruited at the Jawaharlal Institute of Postgraduate Medical Education and Research (JIPMER), India, diagnosed with active TB (ATB). The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 2: shows a box plot of GBP1 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 3: shows a box plot of IRF1 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 4: shows a box plot of S100A11 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 5: shows a box plot of CPVL normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 6: shows a box plot of IFITM3 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 7: shows a box plot of NCF1C normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 8: shows a box plot of SNX10 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 9: shows a box plot of CREG1 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 10: shows a box plot of PSMB9 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 11: shows a box plot of PF4V1 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

FIG. 12: shows a box plot of ALPK1 normalised gene expression in CC, NMRL CNTRL, NMRL LTNT, EATB and ATB. The box represents highest and lowest gene expression interquartile range and median gene expression. The error bars represent minimum and maximum values. Grey bars represent outlier values.

DETAILED DESCRIPTION OF THE INVENTION

The present invention allows for the rapid, sensitive, and accurate diagnosis or prediction of TB using one or more biological samples obtained from an individual at a single time point (“snapshot”) or during the course of disease progression. TB may be diagnosed or predicted prior to the onset of clinical symptoms, and/or as subsequent confirmation after the onset of clinical symptoms. Accordingly, the present invention allows for more effective therapeutic intervention and/or diagnosis in the pre-symptomatic stage of the disease.

Tuberculosis and Mycobacterium tuberculosis

Tuberculosis (TB) is a progressive, often fatal, infectious disease, caused by the bacterial pathogen Mycobacterium tuberculosis (M. tuberculosis, MTB). Pulmonary symptoms of TB include a productive, prolonged cough of three or more weeks, chest pain, and hemoptysis. Systemic symptoms include low grade remittent fever, chills, night sweats, appetite loss, weight loss, easy fatigability, and production of sputum that starts out mucoid but changes to purulent. A reference herein to the detection or diagnosis of TB is equivalent to the detection or diagnosis of M. tuberculosis infection. When the M. tuberculosis cells are metabolically active and/or undergoing cell division, this results in the symptoms of TB becoming overt, and is described as an active TB/M. tuberculosis infection.

In latent TB, an individual is infected with M. tuberculosis, but the individual does not display the symptoms of active TB disease. In latent TB, the mycobacterial cells survive for extended periods in a state of low metabolic activity and with limited or no cell division. Thus, during latency (i.e. latent infection), the clinical symptoms associated with a mycobacterial infection do not become manifest. This can make it difficult to distinguish between a latent TB infection and the absence of a TB infection using conventional methods and techniques. A reference herein to the detection or diagnosis of latent TB is equivalent to the detection or diagnosis of latent M. tuberculosis infection.

The present inventors have also found that there is a temporal aspect to the expression of some biomarkers for TB during the active phase of an infection. Specifically, some biomarkers for active TB are expressed at relatively low levels at an early stage in active TB, but become expressed at higher levels as the active stage of the infection progresses. In this context, the term “low level of expression” is relative. For example, the expression of these active TB biomarkers during the early active phase may be low relative to the expression level later in the active phase, and similar to (or slightly greater than) the expression level of the same biomarkers in an uninfected individual and/or an individual with latent TB. Typically the expression of these active TB biomarkers during the early active phase is low relative to the expression level later in the active phase, but still higher than the expression level of the same biomarkers in an uninfected individual and/or an individual with latent TB.

The present invention provides biomarkers for the detection and/or diagnosis of TB infection. In particular, the present invention provides biomarkers for the detection and/or diagnosis of an active TB infection, including an early stage active TB infection and/or a later stage active TB infection. The present invention also provides biomarkers for the detection and/or diagnosis of a latent TB infection. The present invention further provides biomarkers for distinguishing between active and latent TB infections. The present invention also provides biomarkers for distinguishing between a latent TB infection and an absence/lack of TB infection (active or latent). The present invention also provides biomarkers for distinguishing between early stage active TB and later stage active TB. The present invention also provides biomarkers for distinguishing between an individual who has no symptomatic TB infection (active or latent) and has not been exposed to TB (e.g. because they are from a non/low-TB endemic region) and an individual who has no symptomatic TB infection (active or latent) but has been exposed to TB (e.g. because they are from a high-TB endemic region).

Any appropriate technique may be used to confirm the diagnosis of active and/or latent TB according to the present invention. Standard techniques are known in the art. For example, chest x-ray, microbiological culture of M. tuberculosis in a sample (sputum, pus, cerebrospinal fluid, biopsied tissue, etc.) from the individual, CT scan, MMR, antibodies from lymphocyte secretion (ALS) assay, IFNγ assay and tuberculin skin tests (e.g. Mantoux and Heaf tests).

Biomarkers for Tuberculosis

A “biomarker” is virtually any biological compound, such as a protein and a fragment thereof, a peptide, a polypeptide, a proteoglycan, a glycoprotein, a lipoprotein, a carbohydrate, a lipid, a nucleic acid, an organic on inorganic chemical, a natural polymer, and a small molecule, that is present in the biological sample and that may be isolated from, or measured in, the biological sample. Furthermore, a biomarker can be the entire intact molecule, or it can be a portion thereof that may be partially functional or recognized, for example, by an antibody or other specific binding protein. A biomarker is considered to be informative if a measurable aspect or characteristic of the biomarker is associated with a given state of an individual, such as infection with TB. Such a measurable aspect or characteristic may include, for example, the presence, absence, or concentration of the biomarker in the biological sample from the individual and/or its presence as part of a profile of biomarkers. Such a measurable aspect of a biomarker is defined herein as a “feature.” For example, the presence of a biomarker may be a feature. As another example, the amount of a biomarker in a sample, or the amount of a biomarker in a sample compared with a control or reference sample may be a feature. A feature may also be a ratio of two or more measurable aspects of biomarkers, which biomarkers may or may not be of known identity, for example. A “biomarker profile” comprises at least two such features, where the features can correspond to the same or different classes of biomarkers such as, for example, two nucleic acids or a nucleic acid and a carbohydrate. A biomarker profile may also comprise at least three, four, five, 10, 20, 30 or more features. In one embodiment, a biomarker profile comprises hundreds, or even thousands, of features. In another embodiment, the biomarker profile comprises at least one measurable aspect of at least one internal standard.

The new biomarkers for TB identified by the present inventors are listed in Table 2 herein (together with corresponding sequence identifiers (SEQ ID NOs). In particular, the present inventors have identified LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8 as biomarkers for TB. Therefore, the present invention provides the use of one or more of LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8 as a biomarker for tuberculosis. Each of these biomarkers may be used alone, in combination with any of the other biomarkers, and/or in combination with one or more additional biomarker for tuberculosis as disclosed herein. For example, the invention may relate to the use of LOC400759/GBP1P1, SNX10, CPVL and/or CREG1 (alone or in any combination thereof), optionally in combination with PF4V1 and/or PSMB9 and/or in combination with any of the other biomarkers disclosed herein.

Typically the present invention provides the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8 as a biomarker for tuberculosis.

Any combination of these biomarkers may be used according to the present invention. For example, any two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, up to and including all of these biomarkers may be used to diagnose TB according to the present invention.

The one or more biomarker of the invention may be a hormone, a growth factor, a transcription factor, a cell surface marker or a soluble protein derived from cells. The one or more biomarker of the invention may be a nucleic acid encoding for one of said proteins.

The one or more of biomarker of the invention may be used in the detection and/or diagnosis of an active TB infection. The one or more biomarker of the invention may be used in the detection and/or diagnosis of a latent TB infection. The one or more biomarker of the invention may be used to diagnose the absence of a TB infection (active or latent). The one or more biomarker of the invention may be used to identify an individual with an active TB infection and/or an individual with a latent TB infection. The one or more biomarker of the invention may be used to identify an individual with an active TB infection and/or an individual with a latent TB infection and/or an individual uninfected with TB. The one or more biomarker of the invention may be used in the detection and/or diagnosis of an early stage active TB infection or a late/later stage active TB infection. The one or more biomarker of the invention may be used to determine exposure of an individual to TB, even in the absence of a symptomatic active or asymptomatic latent TB infection. Thus, the one or more biomarker of the invention may be used to distinguish between one or more individual with an active (early or later stage) TB infection and/or one or more individual with a latent TB infection, and/or one or more individual uninfected with TB. The one or more biomarker of the invention may also be used to distinguish between one or more individual with an early stage active TB infection and one or more individual with a late/later stage active TB infection.

Typically, the present invention relates to the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN as a biomarker for TB. One or more of these biomarkers may be used in the detection and/or diagnosis of an active TB infection (early or late/later stage), or to distinguish between an early stage active TB infection and a late/later stage active TB infection. Alternatively, one or more of these biomarkers may be used in the detection and/or diagnosis of a latent TB infection, or to diagnose the absence of a TB infection (active or latent). Any combination of SNX10, CPVL, PF4V1, HERC2 CD52 and LYN may be used as biomarkers for TB according to the present invention. As a non-limiting example: (i) SNX10 and CPVL; (ii) SNX10 and PF4V1; (iii) SNX10 and HERC2; (iv) CPVL and PF4V1; (v) CPVL and HERC2; (vi) PF4V1 and HERC2; (vii) SNX10, CPVL and PF4V1; (viii) SNX10, CPVL and HERC2; (ix) SNX10, PF4V1 and HERC2; (x) CPVL, PF4V1 and HERC2; and/or (xi) SNX10, CPVL, PF4V1 and HERC2 may be used in combination as biomarkers in the detection and/or diagnosis of TB according to the present invention. Any of these combinations may be used with CD52 and/or LYN. Similarly, CD52 and/or LYN may be used in combination with one or more of SNX10, CPVL, PF4V1 and HERC2, or with any combination of SNX10, CPVL, PF4V1 and HERC2. Thus, in one embodiment, the invention relates to the use of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN.

Typically the invention relates to the use of (i) SNX10 and CPVL; and/or (ii) PF4V1 and HERC2 as biomarkers for tuberculosis. In a preferred embodiment, SNX10 and CPVL are used in combination with LOC400759/GBP1P1 and/or CREG1 as biomarkers in the diagnosis of TB according to the present invention. In another preferred embodiment, PF4V1 and HERC2 are used in combination with LOC400759/GBP1P1 and/or ALPK1 as biomarkers in the diagnosis of TB according to the present invention. Any of these combinations may be used with CD52 and/or LYN.

One or more additional biomarker for TB (or further additional biomarker for TB) may also be used in the detection and/or diagnosis of TB according to the present invention. Any combination of the one or more additional biomarker (or further additional biomarker) may be used in combination with the one or more biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the one or more biomarker of the invention. As a non-limiting example, in the cases where the one or more biomarker is selected from SNX10 and/or CPVL, the one or more additional biomarker may be selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128, HERC2 and IL8. As another non-limiting example, in the case where the one or biomarker is selected from PF4V1 and/or HERC2, the one or more additional biomarker may be selected from LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, CPVL, TICAM2, CD52, KLRAP1, DEFB128 and IL8. Again, any of these combinations may be used with CD52 and/or LYN.

Typically, the one or more additional biomarker is selected from the biomarkers listed in Tables 2, 3, 4 and/or 5 herein (corresponding sequence identifiers (SEQ ID NOs) are also given in Tables 2 to 5).

In a preferred embodiment, the one or more biomarker of the invention is selected from SNX10 and CPVL and the one or more additional biomarker is selected from the biomarkers in Tables 2 and 3 or 5. In a more preferred embodiment, the one or more biomarker of the invention is selected from SNX10 and CPVL and the one or more additional biomarker is selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, GBP1, IRF1, HLA-B, IFITM3 and S100A11. In a more preferred embodiment, the present invention provides the use SNX10 and CPVL in combination with PF4V1 and/or PSMB9, and optionally in combination with one or more additional biomarker for TB as disclosed herein. Said one or more additional biomarker is preferably selected from LOC400759/GBP1P1, CREG1, GBP1, IRF1, HLA-B, IFITM3 and S100A11. Any of these combinations may be used with CD52 and/or LYN.

In a particularly preferred embodiment, the present invention relates to the use of SNX10, CPVL, LOC400759/GBP1P1 and CREG1, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, GBP1, IRF1 and HLA-B or the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11 as biomarkers for TB. Most preferably the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, GBP1, IRF1 and HLA-B or the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11 is used. Any of these combinations may be used with CD52 and/or LYN.

In another preferred embodiment, the one or more biomarker of the invention is selected from PF4V1 and HERC2 and the one or more additional biomarker is selected from the biomarkers in Tables 2 and 3 or 5. In a preferred embodiment, the one or more biomarker of the invention is selected from PF4V1 and HERC2 and the one or more additional biomarker is selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, GBP1, IRF1, HLA-B, IFITM3 and S100A11, MMP9 and CD96. In a more preferred embodiment, the invention relates to the use of PF4V1 and HERC2 in combination with one or more additional biomarker for TB as disclosed herein. Said one or more additional biomarker is preferably selected from LOC400759/GBP1P1, CREG1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, KLRA1, DEFB128 and IL8 and CD96. Thus, in one preferred embodiment, the present invention provides the use of PF4V1 and HERC2 in combination with one or more additional biomarker selected from LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9 and CD96; or in combination with one or more additional biomarker selected from of LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8. In a more preferred embodiment, the present invention provides the use of the combination of PF4V1, HERC2, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9 and CD96, or the combination of PF4V1, HERC2, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 as biomarkers for TB. Any of these combinations may be used with CD52 and/or LYN.

Combinations of one or more of LOC400759/GBP1P1, SNX10, CPVL and CREG1 are particularly preferred. Such combinations include: (i) LOC400759/GBP1P1 and SNX10; (ii) LOC400759/GBP1P1 and CPVL; (iii) LOC400759/GBP1P1 and CREG1; (iv) SNX10 and CPVL; (v) SNX10 and CREG1; (vi) CPVL and CREG1; (vii) LOC400759/GBP1P1, SNX10 and CPVL; (viii) LOC400759/GBP1P1, SNX10 and CREG1; (ix) LOC400759/GBP1P, CPVL and CREG1; (x) SNX10, CPVL and CREG1; and/or (xi) LOC400759/GBP1P1, SNX10, CPVL and CREG1. These combinations may be used in combination with one or more further additional biomarker as disclosed herein, with one or more of GBP1, IRF1, HLA-B, IFITM3 and/or S100A11 being particularly preferred as disclosed herein. Any of these combinations may be used with CD52 and/or LYN.

Alternatively or in addition, combinations of one or more of LOC400759/GBP1P1, PF4V1, ALPK1 and HERC2 are preferred. Such combinations include: (i) LOC400759/GBP1P1 and PF4V1; (ii) LOC400759/GBP1P1 and ALPK1; (iii) LOC400759/GBP1P1 and HERC2; (iv) PF4V1 and ALPK1; (v) PF4V1 and HERC2; (vi) ALPK1 and HERC2; (vii) LOC400759/GBP1P1, PF4V1 and ALPK1; (viii) LOC400759/GBP1P1, PF4V1 and HERC2; (ix) LOC400759/GBP1P1, ALPK1 and HERC2; (x) PF4V1, ALPK1 and HERC2; and (xi) LOC400759/GBP1P1, PF4V1, ALPK1 and HERC2. These combinations may be used in combination with one or more further additional biomarker as disclosed herein, with one or more of GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 being particularly preferred as disclosed herein. Any of these combinations may be used with CD52 and/or LYN.

The combination of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN, optionally including one or more additional biomarker for TB, preferably selected from CREG1, PSMB9, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, CD96, KLRA1, DEFB128 and/or IL8, or any combination thereof, is also preferred. Similarly, the combination of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1, optionally including one or more additional biomarker for TB, preferably selected from CREG1, PSMB9, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, CD96, KLRA1, DEFB128 and/or IL8, or any combination thereof, is also preferred.

The present inventors have also identified biomarkers for latent TB, and which can be used to distinguish between latent and active forms of TB, i.e. between latent and active forms of M. tuberculosis infection. These biomarkers for latent TB can also be used according to the present invention to distinguish between latent TB infection and the absence of TB infection. In particular, the present inventors have identified PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 as biomarkers for latent TB. These biomarkers may be used to distinguish between active TB and/or latent TB and/or the absence of TB. In a preferred embodiment, these biomarkers are used to distinguish between latent TB and the absence of TB infection, i.e. to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB.

Accordingly, the present invention provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between latent and active M. tuberculosis infection, and hence latent and active TB. The present invention also provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between active TB and/or latent TB and/or the absence of TB. In a preferred embodiment, the present invention provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between one or more individual with a latent TB infection, and one or more individual uninfected with TB.

Any combination of these biomarkers may be used according to the present invention. For example, any two, three or four, or all five of these biomarkers may be used to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. For example, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. In a preferred embodiment, the combination of the biomarkers PF4V1, LYN, CD52, HERC2 is used to distinguish between latent TB and the absence of TB, and/or to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB. In another preferred embodiment, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between latent TB and the absence of TB. Thus, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128 and IL8 may be used to identify an individual with a latent TB infection and/or an individual uninfected with TB. In a preferred embodiment, the combination of biomarkers PF4V1, LYN, CD52, HERC2, the combination of biomarkers, HERC2, KLRAP1, PF4V1, DEFB128, IL8 or the combination of biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between one or more individual with a latent TB infection, and one or more individual uninfected with TB.

One or more additional biomarker for latent TB may also be used in combination with the one or more biomarker selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8. In a preferred embodiment, the one or more additional biomarker is selected from the biomarkers listed in Tables 4 and 5.

One or more additional biomarker for TB may also be used in to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. Any combination of the one or more additional biomarker may be used in combination with the one or more biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the one or more biomarker of the invention. The one or more additional biomarker for use in distinguishing between latent TB and/or active TB and/or the absence of TB can be any biomarker disclosed herein.

Other biomarkers for distinguishing between latent TB and/or active TB and/or the absence of TB, particularly for distinguishing between latent TB and the absence of TB (i.e. to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB) include HLA-B, NCF1C, ABCF2, FNBP1L, TBC1D3B, SLC14A1, CALCOCO2, GTF2B, HLA-F, MGST2, SPAST and WAC. These biomarkers are listed in Tables 4 and 5 herein.

The present inventors have also identified biomarkers which can be used to distinguish between early stage active TB and late/later stage active TB, i.e. between early stage active and late/later stage active forms of M. tuberculosis infection. In particular, the present inventors have identified GBP1 as such a biomarker. The GBP1 biomarker may be used to distinguish between early stage active TB and late/later stage active TB. As used herein, the term “early stage active TB” refers to patients on first presentation with low to moderate symptoms, such as persistent cough and/or fever, and/or suspected pulmonary tuberculosis which is subsequently confirmed using conventional methods such as smear positivity (graded 1-4 in terms of severity of bacterial load), M. tuberculosis culture or M. tuberculosis PCR positivity (such as using the Cepheid GeneXpert™), As used herein, the term “later or later stage active TB” refers to patients with fully symptomatic active pulmonary tuberculosis, such as persistent cough of some duration, prolonged fever, weight loss, subsequently confirmed using conventional methods as above.

Accordingly, the present invention provides the use of the GBP1 biomarker for distinguishing between early stage active TB and late/later stage active TB. The present invention also provides the use of the GBP1 biomarker for distinguishing between active (early or late active stage) TB and/or latent TB and/or the absence of TB.

One or more additional biomarker for TB may also be used to distinguish between early stage active TB and late/later stage active TB according to the present invention. Any combination of the one or more additional biomarkers may be used in combination with the GBP1 biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the GBP1 biomarker of the invention. The one or more additional biomarker for use in distinguishing between early stage active TB and late/later stage active TB can be any biomarker disclosed herein.

The present inventors have also identified biomarkers which can be used to determine exposure of an individual to TB, even in the absence of an active or latent TB infection. In particular, the present inventors have identified IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B as such biomarkers for exposure to TB. One or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers, or any combination thereof, may be used to determine exposure to TB. As used herein, the term “exposure to TB” is defined by comparison to non-exposed controls from regions of non/low-TB endemic regions. As an example, the Caucasian control group used in Example 2 below are an example of non-exposed individuals.

Accordingly, the present invention provides the use of one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers for determining exposure to TB. Any combination of these biomarkers may be used according to the present invention. For example, any one, two, or all three of these biomarkers may be used to determine exposure to TB according to the present invention. Typically, the combination of the biomarkers IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B or the combination of IRF1, S100A11, CD52, LYN, IFITM3 and NCF1C is used to determine exposure to TB according to the present invention.

One or more additional biomarker for TB may also be used to determine exposure to TB according to the present invention. Any combination of the one or more additional biomarkers may be used in combination with one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers of the invention. The one or more additional biomarker for use in determining exposure to TB can be any biomarker disclosed herein.

The one or more biomarker of the invention as described herein may have a nucleic acid sequence as shown in the sequences in the Sequence Information section herein. The relevant sequence identifiers are also shown in Tables 2 to 5. The one or more biomarker of the invention may have a sequence identity of at least 80% with the corresponding nucleic acid sequence shown in the Sequence Information section. Sequence identity may be calculated as described herein. A sequence identity of at least 80% includes at least 82%, at least 84%, at least 86%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and 100% sequence identity (to each and every nucleic acid sequence presented herein and/or to each and every SEQ ID NO presented herein).

Thus, as described herein, by studying both human and non-human primate biomarkers for TB, the present inventors have identified a robust set of biomarkers for TB that are mutually compatible (i.e. retain accurate binding specificity) within a single set of assay conditions (i.e. a singleplex format). Similarly, the present inventors have also identified robust sets of mutually compatible biomarkers for distinguishing between latent and active TB, for distinguishing between early active and late/later active TB and for determining exposure to TB. Combinations of biomarkers for use according to the present invention are discussed in detail herein. As discussed above, preferably, the present invention provides the use of the combination of: (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (ii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; and/or (iii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 as biomarkers for TB. These combinations (and the other combinations of biomarkers disclosed herein) may be used not only as biomarkers for TB, but also to distinguish between latent TB and/or active TB and/or the absence of TB, to distinguish between early active and late/later stage active TB and/or to determine exposure to TB.

The one or more biomarkers of the invention may be used in a decision tree process. For example, the present invention may first provide one or more biomarkers for the detection and/or diagnosis of active TB (an active TB infection) in an individual. Any suitable biomarker or combination of biomarkers disclosed herein may be used for the detection and/or diagnosis of active TB. In a preferred embodiment, the one or more biomarker for the detection and/or diagnosis of active TB is selected from (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (ii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; and/or (iii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96; optionally in combination with one or more additional biomarker as disclosed herein. If the individual tests positive for active TB using this method, they may be treated appropriately.

If, however, the individual tests negative for active TB, they may then be tested for latent TB (a latent TB infection) according to the present invention. This is the next “branch” of the decision tree. Any suitable biomarker or combination of biomarkers disclosed herein may be used for the detection and/or diagnosis of latent TB. In a preferred embodiment, the one or more biomarker for the detection and/or diagnosis of latent TB is selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128 and IL8, optionally in combination with one or more additional biomarker as disclosed herein.

The present invention enables the rapid detection of TB, and also to rapidly distinguish between latent TB and/or active TB and/or the absence of TB. By way of example, the method of the invention is typically completed within 2.5 hours, preferably within 2 or 1.5 hours. In contrast, existing multiplex assays typically take at least 4-5 hours, typically at least 5 hours.

Biomarker Profiles

A “phenotypic change” is a detectable change in a parameter associated with a given state of the individual. For instance, a phenotypic change may include an increase or decrease of a biomarker in a bodily fluid, where the change is associated with TB or distinguishing between active and latent TB. The presence and/or amount of each of the one or more biomarkers of the invention is a feature or phenotypic change according to the present invention.

A phenotypic change may further include a change in a detectable aspect of a given state of the individual that is not a change in a measurable aspect of a biomarker. For example, a change in phenotype may include a detectable change in body temperature, weight loss, fatigue, respiration rate or other physiological parameter. Such changes can be determined via clinical observation and measurement using conventional techniques that are well-known to the skilled artisan. As used herein, “conventional techniques” are those techniques that classify an individual based on phenotypic changes without obtaining a biomarker profile according to the present invention.

A “decision rule” or a “decision tree” is a method used to classify individuals. This rule can take on one or more forms that are known in the art, as exemplified in Hastie et al., in “The Elements of Statistical Learning” Springer-Nerlag (Springer, New York (2001)). Analysis of biomarkers in the complex mixture of molecules within the sample generates features in a data set. A decision rule or a decision tree may be used to act on a data set of features to detect and/or diagnose, or to distinguish between active TB and/or latent TB and/or the absence of TB (for example uninfected control(s)).

The application of the decision rule or the decision tree does not require perfect classification. A classification may be made with at least about 90% certainty, or even more, in one embodiment. In other embodiments, the certainty is at least about 80%, at least about 70%, or at least about 60%. The useful degree of certainty may vary, depending on the particular method of the present invention. “Certainty” is defined as the total number of accurately classified individuals divided by the total number of individuals subjected to classification. As used herein, “certainty” means “accuracy”.

Classification may also be characterized by its “sensitivity”. The “sensitivity” of classification relates to the percentage of individuals with TB who were correctly identified as having TB, or in the case of distinguishing between active and latent TB, the percentage of individuals correctly identified as having active TB, or latent TB, or as uninfected with TB. “Sensitivity” is defined in the art as the number of true positives divided by the sum of true positives and false negatives.

The “specificity” of a method is defined as the percentage of patients who were correctly identified as not having TB, or in the case of distinguishing between active and latent TB, the percentage of individuals correctly identified as not having active or latent TB compared with an uninfected control(s). That is, “specificity” relates to the number of true negatives divided by the sum of true negatives and false positives.

Typically, the accuracy, sensitivity and/or specificity is at least about 90%, at least about 80%, at least about 70% or at least about 60%.

Diagnosing TB in an individual means to identify or detect TB in the individual. Distinguishing between active and latent TB in an individual means to identify or detect TB in the individual and to determine whether the TB is active or latent as described herein. Distinguishing between early stage active and late/later stage active TB in an individual means to identify or detect TB in the individual and to determine whether the TB is early stage active or late/later stage active as described herein. Distinguishing between latent TB and the absence of TB in an individual means to identify or detect latent TB in the individual compared with an uninfected control. Determining exposure of an individual to TB means to determine whether an individual has been exposed to TB, but is not themselves infected with active or latent TB.

Because of the sensitivity of the present invention to detect TB before an overtly observable clinical manifestation, the diagnosis, identification or detection of TB includes the detection of the onset of TB, as defined above.

According to the present invention, TB may be diagnosed or detected, or active and latent TB distinguished, by obtaining a profile of biomarkers from a sample obtained from an individual. As used herein, “obtain” means “to come into possession of”. The present invention is particularly useful in predicting and diagnosing TB in an individual, who is suspected of having TB, or who is at risk of TB infection. In the same manner, the present invention may be used to distinguish between active TB and/or latent TB and/or the absence of TB in an individual. That is, the present invention may be used to confirm a clinical suspicion of TB.

The presence and/or amount of the one or more biomarker of the invention in an individual or the profile of biomarkers in an individual may be measured relative to a control or reference population, for example relative to the corresponding biomarker profile of a reference population. Similarly, the biomarker profile of an individual may be measured relative to a biomarker profile from a control or reference population. Herein the terms “control” and “reference population” are used interchangeably. The actual amount of the one or more biomarkers, such as the mass, molar amount, concentration or molarity of the one or more biomarker of the invention may be assessed and compared with the corresponding value from the control or reference population. Alternatively, the amount of one or more biomarker of the invention may be compared with that of the control or reference population without quantifying the mass, molar amount, concentration or molarity of the one or more biomarker.

The control or reference biomarker profile can be generated from one individual or a population of two or more individuals. The control or reference population, for example, may comprise three, four, five, ten, 15, 20, 30, 40, 50 or more individuals. Furthermore, the control or reference biomarker profile and the individual's (test) biomarker profile that are compared in the methods of the present invention may be generated from the same individual, provided that the test and reference biomarker profiles are generated from biological samples taken at different time points and compared to one another. For example, a sample may be obtained from an individual at the start of a study period. A control or reference biomarker profile taken from that sample may then be compared to biomarker profiles generated from subsequent samples from the same individual. Such a comparison may be used, for example, to determine the progression of TB in the individual by repeated classifications over time.

The control or reference may be obtained, for example, from a population of TB-negative individuals, TB-positive individuals, individuals with active TB and individuals with latent TB. In the Examples herein, the Caucasian control group consists of professional individuals recruited locally to the project team who constitute a low risk group, coming from non/low-TB endemic regions, such that their risk of having been exposed to TB is extremely low. Typically this is the preferred control group. The second control group in the Examples consists of individuals of Asian descent who tested negative for TB using the standard Mantoux skin test and IFNγ test and who come from regions where TB is endemic. The likelihood is that these individuals have been exposed to TB, even if they are not themselves (currently) infected. Thus, without being bound by theory, any differences in the detection of biomarkers of the invention between this control group and the Caucasian controls may result from the likely exposure of this Asian control group to TB.

Typically the control or reference population does not have TB and/or is not infected with M. tuberculosis (i.e. is TB-negative). The control or reference population may be TB-positive and are then subsequently diagnosed with TB using conventional techniques. For example, a population of TB-positive individuals used to generate the reference profile may be diagnosed with TB about 24, 48, 72, 96 or more hours after biological samples were taken from them for the purposes of generating a reference biomarker profile. In one embodiment, the population of TB-positive individuals is diagnosed with TB using conventional techniques about 0-36 hours, about 36-60 hours, about 60-84 hours, or about 84-108 hours after the biological samples were taken. If the biomarker profile is indicative of TB, a clinician may begin treatment prior to the manifestation of clinical symptoms of TB.

The amount of the one or more biomarker of the invention, for example in a biomarker profile, may differ by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200% or more compared with a control or reference population.

For example, if the amount of the one or more biomarker of the invention, typically in a biomarker profile, is reduced compared with a control or reference population, the expression may be reduced partially or totally compared with the control or reference population. Typically the amount is reduced by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90%, at least 95%, at least 99%, up to total elimination of the one or more biomarker.

If the amount of one or more biomarker of the invention, typically in a biomarker profile, is increased compared with a control or reference population, the amount may be increased by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90&, at least 100%, at least 150%, at least 200% compared with the control or reference population.

The amount of the one or more biomarker of the invention may be increased or decreased compared with a control or reference population as shown in Tables 2 to 5 herein (where ↑ means the one or more biomarker is upregulated/an increased amount of the one or more biomarker and ↓ means the one or more biomarker is downregulated/a decreased amount of the one or more biomarker). In instances where more than one indication of up or downregulation is given in Tables 2 to 5, the first recited statement is preferred. For example, Table 2 discloses that ALPK1 is increased in monocytes, neutrophils and CD4 positive T cells compared with a control or reference population. In this example, the amount of ALPK1 may be increased in CD4 positive T cells, preferably increased in neutrophils and most preferably increased in monocytes. The amount of ALPK1 may be increased in CD4 positive T cells, neutrophils and monocytes, and may also be increased in other cell types not listed in Tables 2 to 5.

The amount of the one or more biomarker may be increased in some cell types and/or decreased in other cell types. For example, as shown in Table 2 herein, PF4V1 is upregulated (increased amount) in monocytes of individuals with TB, whereas PF4V1 is downregulated (decreased amount) in neutrophils of individuals with TB.

The presence and/or amount of the one or more biomarker of the invention may be determined by quantitative and/or qualitative analysis. The amount of the one or more biomarker of the invention encompasses the mass of the one or more biomarker, the molar amount of the one or more biomarker, the concentration of the one or biomarker and the molarity of the one or more biomarker. This amount may be given in any appropriate units. For example, the concentration of the one or more biomarker may be given in pg/ml, ng/ml or μg/ml.

The presence and/or amount of the one or more biomarker of the invention may be measured directly or indirectly. The relative presence and/or amount of the one or more biomarker of the invention relative to a control or reference population may be determined using any appropriate technique. Suitable standard techniques are known in the art, for example Western blotting and enzyme-linked immunosorbent assays (ELISAs). Preferred methods include microarray analysis (as used in Example 1) and quantitative real-time PCR (qPCR) (as used in Example 2). Different one or more biomarkers may be used with different detection methods according to the present invention. For example, in one embodiment, the one or more biomarker is selected from PF4V1/or HERC2, preferably in combination with LOC400759/GBP1P1 and/or ALPK1 as disclosed herein, for use with microarray analysis. Typically, the one or more biomarker is selected from SNX10 and/or CPVL, preferably in combination with LOC400759/GBP1P1 and/or CREG1, for use with qPCR analysis. Again, additional one or more biomarkers as disclosed herein can be selected dependent on the preferred detection method.

As used herein, “comparison” includes any means to discern at least one difference in the presence and/or amount of the one or more biomarker in the individual and the control or reference population, or at least one difference in the individual's and the control or reference profiles. Thus, a comparison may include a visual inspection of chromatographic spectra, and a comparison may include arithmetical or statistical comparisons of values assigned to the features of the profiles. Such statistical comparisons include, but are not limited to, applying a decision rule. If the biomarker profiles comprise at least one internal standard, the comparison to discern a difference in the biomarker profiles may also include features of these internal standards, such that features of the biomarker are correlated to features of the internal standards. The comparison can confirm the presence or absence of TB, and thus to detect or diagnose TB; or the comparison can distinguish between active and latent TB.

The presence and/or amount level of the one or more biomarker may be altered compared with a control or reference population for at least 12 hours, at least 24 hours, at least 30 hours, at least 48 hours, at least 72 hours, at least 96 hours, at least 120 hours, at least 144 hours, at least 1 week, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, at least 8 weeks, at least 9 weeks, at least 10 weeks, at least 11 weeks, at least 12 weeks, at least 13 weeks, at least 14 weeks, at least 15 weeks or more.

Although the invention does not require a monitoring period to classify an individual, it will be understood that repeated classifications of the individual, i.e., repeated snapshots, may be taken over time until the individual is no longer at risk. Alternatively, a profile of biomarkers obtained from the individual may be compared to one or more profiles of biomarkers obtained from the same individual at different points in time.

As used herein, an “individual” is an animal, preferably a mammal, more preferably a human or non-human primate. The terms “individual,” “subject” and “patient” are used interchangeably herein. The individual can be normal, suspected of having TB or at risk of a TB infection. In a preferred embodiment, the present invention relates to the detection and/or diagnosis of TB in adult humans (over the age of 16 years).

The progression of an individual from normalcy (i.e., a condition characterized by not having TB) to latent or active TB, and vice versa, will be characterized by changes in biomarker profiles, as certain biomarkers are expressed at increasingly higher levels and the expression of other biomarkers becomes down regulated. These changes in biomarker profiles may reflect the progressive establishment of a physiological response in the reference population to infection. The biomarker profile of the control or reference population also will change as a physiological response subsides. As stated above, one of the advantages of the present is the capability of classifying an individual, using a biomarker profile from a single biological sample, as having membership in a particular population. The determination of whether a particular physiological response is becoming established or is subsiding may be facilitated by a subsequent classification of the individual. To this end, the present invention provides numerous biomarkers that both increase and decrease in level of expression as a physiological response to TB is established or subsides. For example, a feature of an individual's biomarker profile that is known to change in intensity as a physiological response to TB becomes established may be selected. A comparison of the same feature in a profile from a subsequent biological sample from the individual can establish whether the individual is progressing toward more severe TB or is progressing toward normalcy.

Detection and Quantification of Biomarkers and Determination of Biomarker Profiles

A feature as defined herein for the diagnosis of TB, a TB infection and/or a M. tuberculosis infection may be detected, quantified or determined by any appropriate means. For example, the one or more biomarker of the invention, a measurable aspect or characteristic of the one or more biomarker or a biomarker profile of the invention may be detected by any appropriate means. The presence and/or amount of the one or more biomarkers of the invention may be considered together as a “biomarker profile” of the invention. The presence and/or amount of the individual biomarkers within any of the biomarker combinations disclosed herein may be considered together as a “biomarker profile” of the invention. For example, in a preferred embodiment of the invention, the combination of biomarkers: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96 is used to detect and or diagnose TB. Thus, the presence and/or amount of: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96 may be considered as a biomarker profile according to the present invention. The presence and/or amount of any other combination of biomarkers according to the present invention may also be considered as a biomarker profile. A biomarker profile of the invention may comprise: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8.

The presence and/or amount of the one or more biomarker of the invention may be determined in a sample obtained from an individual. The sample may be any suitable biological material, for example blood, plasma, saliva, serum, sputum, urine, cerebral spinal fluid, cells, a cellular extract, a tissue sample, a tissue biopsy, a stool sample and the like. Typically the sample is blood sample. The precise biological sample that is taken from the individual may vary, but the sampling preferably is minimally invasive and is easily performed by conventional techniques. In a preferred embodiment, the sample is a whole blood sample, a purified peripheral blood leukocyte sample or a cell type sorted leukocyte sample, such as a sample of the individual's neutrophils. The biological sample may be taken from the individual before, during, and/or after treatment for TB infection. In one embodiment, the sample is taken after treatment for TB infection has been initiated.

Measurement of a phenotypic change may be carried out by any conventional technique. Measurement of body temperature, respiration rate, pulse, blood pressure, or other physiological parameters can be achieved via clinical observation and measurement. Measurements of biomarker molecules may include, for example, measurements that indicate the presence, concentration, expression level, or any other value associated with a biomarker molecule. The form of detection of biomarker molecules typically depends on the method used to form a profile of these biomarkers from a biological sample. For instance, biomarkers separated by 2D-PAGE are detected by Coomassie Blue staining or by silver staining, which are well-established in the art.

The biomarkers of the invention may be detected at the nucleic acid or protein level. Thus, the biomarkers of the invention may be DNA, RNA or protein and may be detected using any appropriate technique. The presence and/or amount of the one or more biomarker of the invention may be measured directly or indirectly. Any appropriate agent may be used to determine the presence and/or amount of the one or more biomarker of the invention. For example, the presence and/or amount of the one or more biomarker of the invention may be determined using an agent selected from peptides and peptidomimetics, antibodies, small molecules and single-stranded DNA or RNA molecules, as described herein. The relative presence and/or amount of the one or more biomarker of the invention relative to a control or reference population (see above) may be determined using any appropriate technique. Suitable standard techniques are known in the art.

For example, when the one or more biomarker is detected at the nucleic acid level this may be carried out using: (i) biomarker-specific oligonucleotide DNA or RNA or any other nucleic acid derivative probes bound to a solid surface; (ii) purified RNA (labelled by any method, for example using reverse transcription and amplification) hybridised to probes; (iii) whole lysed blood, from which the RNA is labelled by any method and hybridised to probes; (iv) purified RNA hybridised to probes and a second probe (labelled by any method) hybridised to the purified RNA; (v) whole lysed blood from which the RNA is hybridised to probes, and a second probe (labelled by any method) which is hybridised to the RNA; (vi) purified peripheral blood leukocytes, obtaining purified RNA (labelled by any method), and hybridising the purified labelled RNA to probes; (vii) purified peripheral blood leukocytes, obtaining purified RNA and hybridising the RNA to probes, then using a second probe (labelled by any method) which hybridises to the RNA; (viii) RT-PCR using any primer/probe combination or inter-chelating fluorescent label, for example SyberGreen; (ix) end-point PCR; (x) digital PCT; (xi) sequencing; (xii) array cards (RT-PCT); (xiii) lateral flow devices/methodology; and/or (xiv) digital microfluidics.

In a preferred embodiment, RNA from a sample (either purified or unpurified) is labelled via any method (typically amplification) and used to interrogate one or more probe immobilised on a surface. Typically the one or more probes are 50 to 100 nucleotides in length.

In another preferred embodiment, one or more probe is immobilised on a surface and the RNA from a sample is hybridised to one or more second probe (labelled by any method). The RNA hybridised with the second (labelled) probe is then used to interrogate the one or more probe immobilised on the surface. Examples of such methodology are known in the art, including the Vantix™ system.

For example, when the one or more biomarker is detected at the protein acid level this may be carried out using: (i) biomarker-specific primary antibodies or antibody fragments bound to a solid surface; (ii) whole lysed blood biomarker antigen bound to antibodies or antibody fragments; (iii) secondary biomarker-specific antibodies or antibody fragments used to detect biomarker antigen bound to primary antibody (labelled using any method); (iv) biomarker-specific primary aptamers bound to a solid surface; (v) whole lysed blood—biomarker antigen bound to aptamers; (vi) secondary biomarker-specific aptamer used to detect biomarker antigen bound to primary aptamer (labelled using any method); (vii) any antibody derivative i.e. phage display etc. used as above; (viii) lateral flow devices/methodology; (ix) chromatography; (x) mass spectrometry; (xi) nuclear magnetic resonance (NMR); (xii) protein gels/transfers to filter; and/or (xiii) immunoprecipitation.

Any agent for the detection of or for the determination of the amount of the one or more biomarker of the invention may be used to determine the presence of and/or amount of the one or more biomarker. Similarly, any method that allows for the detecting of the one or more biomarker, the quantification, or relative quantification of the one or more biomarker may be used.

Agents for the detection of or for the determination of the amount of one or more biomarker may be used to determine the amount of the one or more biomarker in a sample obtained from the individual. Such agents typically bind to the one or more biomarker. Such agents may bind specifically to the one or more biomarker. The agent for the detection of or for the determination of the amount of the one or more biomarker may be an antibody or other binding agent specific for the one or more biomarker. By specific, it will be understood that the agent or antibody binds to the molecule of interest, in this case the one or more biomarker, with no significant cross-reactivity to any other molecule, particularly any other protein. For example, an agent or antibody that is specific for LOC400759/GBP1P1 will show no significant cross-reactivity with human neutrophil elastase. Cross-reactivity may be assessed by any suitable method. Cross-reactivity of an agent or antibody for the one or more biomarker with a molecule other than the one or more biomarker may be considered significant if the agent or antibody binds to the other molecule at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 100% as strongly as it binds to the one or more biomarker. An agent or antibody that is specific for the one or more biomarker may bind to another molecule such as human neutrophil elastase at less than 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25% or 20% the strength that it binds to the one or more biomarker. Preferably, the agent or antibody binds to the other molecule at less than 20%, less than 15%, less than 10% or less than 5%, less than 2% or less than 1% the strength that it binds to the one or more biomarker.

As described herein, the presence and/or amount of the one or more biomarker, and hence the biomarker profile may be determined immunologically by reacting antibodies, or functional fragments thereof, specific to the biomarkers. A functional fragment of an antibody is a portion of an antibody that retains at least some ability to bind to the antigen to which the complete antibody binds. The fragments, which include, but are not limited to, scFv fragments, Fab fragments, F(ab) fragments and F(ab)2 fragments, can be recombinantly produced or enzymatically produced. Specific binding molecules other than antibodies, such as aptamers, may be used to bind the biomarkers.

The antibody may be monoclonal or polyclonal. The antibody may be produced by any suitable method known in the art. For example, polyclonal antibodies may be obtained by immunizing a mammal, typically a rabbit or a mouse, with the one or more biomarker under suitable conditions and isolating antibody molecules from, for example, the serum of said mammal. Monoclonal antibodies may be obtained by hybridoma or recombinant methods.

Hybridoma methods may involve immunizing a mammal, typically a rabbit or a mouse, with the one or more biomarker under suitable conditions, then harvesting the spleen cells of said mammal and fusing them with myeloma cells. The mixture of fused cells is then diluted and clones are grown from single parent cells. The antibodies secreted by the different clones are then tested for their ability to bind to the one or more biomarker, and the most productive and stable clone is then grown in culture medium to a high volume. The secreted antibody is collected and purified.

Recombinant methods may involve the cloning into phage or yeast of different immunoglobulin gene segments to create libraries of antibodies with slightly different amino acid sequences. Those sequences which give rise to antibodies which bind to the one or more biomarker may be selected and the sequences cloned into, for example, a bacterial cell line, for production.

Typically the antibody is a mammalian antibody, such as a primate, human, rodent (e.g. mouse or rat), rabbit, ovine, porcine, equine or camel antibody. The antibody may be a camelid antibody or shark antibody. The antibody may be a nanobody. The antibody can be any class or isotype of antibody, for example IgM, but is preferably IgG. The antibody may be a humanised antibody.

The antibody or fragment may be associated with other moieties, such as linkers which may be used to join together 2 or more fragments or antibodies. Such linkers may be chemical linkers or can be present in the form of a fusion protein with the fragment or whole antibody. The linkers may thus be used to join together whole antibodies or fragments which have the same or different binding specificities, e.g. that can bind the same or different polymorphisms. The antibody may be a bispecific antibody which is able to bind to two different antigens, typically any two of the polymorphisms mentioned herein. The antibody may be a ‘diabody’ formed by joining two variable domains back to back. In the case where the antibodies used in the method are present in any of the above forms which have different antigen binding sites of different specificities then these different specificities are typically to polymorphisms at different positions or on different proteins. In one embodiment the antibody is a chimeric antibody comprising sequence from different natural antibodies, for example a humanised antibody.

Methods to assess an amount of the one or more biomarker may involve contacting a sample with an agent or antibody capable of binding specifically to the one or more biomarker. Such methods may include dipstick assays and Enzyme-linked Immunosorbant Assay (ELISA), or similar assays, such as those using a lateral flow device. Other immunoassay types may also be used to assess the one or more biomarker amounts. Typically dipsticks comprise one or more antibodies or proteins that specifically bind to the one or more biomarker. If more than one antibody is present, the antibodies preferably have different non-overlapping determinants such that they may bind to the one or more biomarker simultaneously.

ELISA is a heterogeneous, solid phase assay that requires the separation of reagents. ELISA is typically carried out using the sandwich technique or the competitive technique. The sandwich technique requires two antibodies. The first specifically binds the one or more biomarker and is bound to a solid support. The second antibody is bound to a marker, typically an enzyme conjugate. A substrate for the enzyme is used to quantify the one or more biomarker-antibody complex and hence the amount of the one or more biomarker in a sample. The antigen competitive inhibition assay also typically requires a one or more biomarker-specific antibody bound to a support. A biomarker-enzyme conjugate is added to the sample (containing the one or more biomarker) to be assayed. Competitive inhibition between the biomarker-enzyme conjugate and unlabelled biomarker allows quantification of the amount of the one or more biomarker in a sample. The solid supports for ELISA reactions preferably contain wells.

Antibodies capable of binding specifically to the one or more biomarker may be used in methods of immunofluorescence to detect the presence of the one or more biomarker and hence in methods of diagnosing TB, a TB infection, infection with M. tuberculosis, or to distinguish between active and latent TB according to the present invention.

The present invention may also employ methods of determining the amount of the one or more biomarker that do not comprise antibodies. High Performance Liquid Chromatography (HPLC) separation and fluorescence detection is preferably used as a method of determining the amount of the one or more biomarker. HPLC apparatus and methods as described previously may be used (Tsikas D et al. J Chromatogr B Biomed Sci Appl 1998; 705: 174-6) Separation during HPLC is typically carried out on the basis of size or charge. Prior to HPLC, endogenous amino acids and an internal standard L-homoarginine are typically added to assay samples and these are phase extracted on CBA cartridges (Varian, Harbor City, Calif.). Amino acids within the samples are preferably derivatized with o-phthalaldehyde (OPA). The accuracy and precision of the assay is preferably determined within quality control samples for all amino acids.

Other methods of determining the amount the one or more biomarker that do not comprise antibodies include mass spectrometry. Mass spectrometric methods may include, for example, matrix-assisted laser desorption/ionization mass spectrometry (MALDI MS), surface-enhanced laser desorption/ionization mass spectrometry (SELDI MS), time of flight mass spectrometry (TOF MS) and liquid chromatography mass spectrometry (LC MS).

A separation method may be used to determine the presence and/or amount of the one or more biomarker and hence to create a profile of biomarkers, such that only a subset of biomarkers within the sample is analysed. For example, the biomarkers that are analysed in a sample may consist of mRNA species from a cellular extract, which has been fractionated to obtain only the nucleic acid biomarkers within the sample, or the biomarkers may consist of a fraction of the total complement of proteins within the sample, which have been fractionated by chromatographic techniques. One or more, two or more, three or more, four or more, or five or more separation methods may be used according to the present invention.

Determination of the presence and/or amount of the one or more biomarker, and hence the creation of a profile of biomarkers may be carried out without employing a separation method. For example, a biological sample may be interrogated with a labelled compound that forms a specific complex with a biomarker in the sample, where the intensity of the label in the specific complex is a measurable characteristic of the biomarker. A suitable compound for forming such a specific complex is a labelled antibody. A biomarker may be measured using an antibody with an amplifiable nucleic acid as a label. The nucleic acid label may become amplifiable when two antibodies, each conjugated to one strand of a nucleic acid label, interact with the biomarker, such that the two nucleic acid strands form an amplifiable nucleic acid.

The presence and/or amount of the one or more biomarker, and hence the biomarker profile may be derived from an assay, such as an array, of nucleic acids, where the biomarkers are the nucleic acids or complements thereof. For example, the biomarkers may be ribonucleic acids. The presence and/or amount of the one or more biomarker, and hence the biomarker profile may be obtained using a method selected from nuclear magnetic resonance, nucleic acid arrays, dot blotting, slot blotting, reverse transcription amplification and Northern analysis.

The biomarker profile may comprise any measurable aspect of M. tuberculosis or a component thereof. For example, the biomarker profile may comprise measurable aspects of small molecules, which may include fragments of proteins or nucleic acids, or which may include metabolites.

The determination of the presence and/or amount of the one or more biomarker, and hence a biomarker profile may be generated by the use of one or more separation methods. For example, suitable separation methods may include a mass spectrometry method, such as electrospray ionization mass spectrometry (ESI-MS), ESI-MS/MS, ESI-MS/(MS)n (n is an integer greater than zero), matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF-MS), surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS), desorption/ionization on silicon (DIOS), secondary ion mass spectrometry (SLMS), quadrupole time-of-flight (Q-TOF), atmospheric pressure chemical ionization mass spectrometry (APCI-MS), APCI-MS/MS, APCI-(MS)n, atmospheric pressure photoionization mass spectrometry (APPI-MS), APPI-MS/MS, and APPI-(MS)n. Other mass spectrometry methods may include, inter alia, quadrupole, fourier transform mass spectrometry (FTMS) and ion trap. Other suitable separation methods may include chemical extraction partitioning, column chromatography, ion exchange chromatography, hydrophobic (reverse phase) liquid chromatography, isoelectric focusing, one-dimensional polyacrylamide gel electrophoresis (PAGE), two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) or other chromatography, such as thin-layer, gas or liquid chromatography, or any combination thereof. The sample may be fractionated prior to application of the separation method.

The determination of the presence and/or amount of the one or more biomarker, and hence a biomarker profile may be generated by methods that do not require physical separation of the biomarkers themselves. For example, nuclear magnetic resonance (NMR) spectroscopy may be used to resolve a profile of biomarkers from a complex mixture of molecules. An analogous use of NMR to classify tumours is disclosed in Hagberg, NMR Biomed. 11: 148-56 (1998), for example. Additional procedures include nucleic acid amplification technologies, which may be used to generate a profile of biomarkers without physical separation of individual biomarkers. (See Stordeur et al, J. Immunol. Methods 259: 55-64 (2002) and Tan et al, Proc. Nat'l Acad. Sci. USA 99: 11387-11392 (2002), for example.)

In one embodiment, laser desorption/ionization time-of-flight mass spectrometry is used to determine the presence and/or amount of the one or more biomarker, and hence create a biomarker profile where the biomarkers are proteins or protein fragments that have been ionized and vaporized off an immobilizing support by incident laser radiation. A profile is then created by the characteristic time-of-flight for each protein, which depends on its mass-to-charge (“m/z”) ratio. A variety of laser desorption/ionization techniques are known in the art. (See, e.g., Guttman et al, Anal Chem. 73: 1252-62 (2001) and Wei et al, Nature 399: 243-46 (1999).)

Laser desorption/ionization time-of-flight mass spectrometry allows the generation of large amounts of information in a relatively short period of time. A sample is applied to one of several varieties of a support that binds all of the biomarkers, or a subset thereof, in the sample. Cell lysates or samples are directly applied to these surfaces in volumes as small as 0.5 μL, with or without prior purification or fractionation. The lysates or sample can be concentrated or diluted prior to application onto the support surface. Laser desorption/ionization is then used to generate mass spectra of the sample, or samples, in as little as three hours.

In a preferred embodiment, the total mRNA from a cellular extract of the individual is assayed, and the various mRNA species that are obtained from the sample are used as biomarkers. Biomarker profiles may be obtained, for example, by hybridizing these mRNAs to an array of probes, which may comprise oligonucleotides or cDNAs, using standard methods known in the art. Alternatively, the mRNAs may be subjected to gel electrophoresis or blotting methods such as dot blots, slot blots or Northern analysis, all of which are known in the art. (See, e.g., Sambrook et al. in “Molecular Cloning, 3rd ed.,” Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2001).) mRNA profiles also may be obtained by reverse transcription followed by amplification and detection of the resulting cDNAs, as disclosed by Stordeur et al, supra, for example. In another embodiment, the profile may be obtained by using a combination of methods, such as a nucleic acid array combined with mass spectroscopy.

Different methods have different advantages and may be preferred depending on numerous factors, such as the particular circumstances of the individuals to be tested and/or the availability of reagents/equipment in the diagnostics laboratory. For example, qPCR using probe/quencher hydrolysis probes as described herein is highly specific and stringent. As another example, microarray analysis can resolve subtle differences in expression of transcript variants, which may be important in disease pathology and diagnosis.

Probes

Any appropriate detection means can be used to detect or quantify the one or more biomarker of the invention, as described herein.

Typically when the one or more biomarker of the invention is a nucleic acid, the presence of the one or more biomarker may be detected, and/or the amount of the one or more biomarker determined using an oligonucleotide probe.

An oligonucleotide probe of the invention may have at least 80% sequence identity to the one or more biomarker of the invention, or a target region within said biomarker, measured over any appropriate length of sequence. Typically the % sequence identity is determined over a length of contiguous nucleic acid residues. An oligonucleotide probe of the invention may, for example, have at least 80% sequence identity to the one or more biomarker of the invention, or target region thereof, measured over at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, or more nucleic acid residues, up to the oligonucleotide probe having at least 80% sequence identity with the one or more biomarker of the invention, or target region thereof, over the entire length of the oligonucleotide probe.

An oligonucleotide probe of the invention may be complementary to the one or more nucleic acid biomarker of the invention, or a target region thereof. Typically the oligonucleotide probe of the invention is complementary over a length of contiguous nucleic acid residues. An oligonucleotide probe of the invention may, for example, be complementary to the one or more biomarker of the invention, or target region thereof, measured over at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, or more nucleic acid residues, up to the oligonucleotide probe having being complementary to the one or more biomarker of the invention, or target region thereof, over the entire length of the oligonucleotide probe.

An oligonucleotide probe of the invention may be complementary to a variant of the one or more biomarker of the invention, or a variant of a target region of said biomarker. Typically the oligonucleotide probe is complementary to a variant having at least 80% sequence identity to the one or more biomarker of the invention, or a variant having at least 80% sequence identity to the target region of said biomarker. The % sequence identity of the variant to the one or more biomarker of the invention, or a variant of a target region of said biomarker may be calculated over any appropriate length of sequence in the one or more biomarker, as described herein.

A sequence identity of at least 80% includes at least 82%, at least 84%, at least 86%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and 100% sequence identity (to each and every nucleic acid sequence presented herein and/or to each and every SEQ ID NO presented herein).

Any of a variety of sequence alignment methods can be used to determine percent identity, including, without limitation, global methods, local methods and hybrid methods, such as, e.g., segment approach methods. Protocols to determine percent identity are routine procedures within the scope of one skilled in the art. Global methods align sequences from the beginning to the end of the molecule and determine the best alignment by adding up scores of individual residue pairs and by imposing gap penalties. Non-limiting methods include, e.g., CLUSTAL W, see, e.g., Julie D. Thompson et al., CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice, 22 (22) Nucleic Acids Research 4673-4680 (1994); and iterative refinement, see, e.g., Osamu Gotoh, Significant Improvement in Accuracy of Multiple Protein. Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments, 264(4) J. MoI. Biol. 823-838 (1996). Local methods align sequences by identifying one or more conserved motifs shared by all of the input sequences. Non-limiting methods include, e.g., Match-box, see, e.g., Eric Depiereux and Ernest Feytmans, Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences, 8(5) CABIOS 501-509 (1992); Gibbs sampling, see, e.g., C. E. Lawrence et al., Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment, 262 (5131) Science 208-214 (1993); Align-M, see, e.g., Ivo Van WaIIe et al., Align-M—A New Algorithm for Multiple Alignment of Highly Divergent Sequences, 20 (9) Bioinformatics:1428-1435 (2004). Thus, percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48: 603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-19, 1992.

Variants of the specific sequences provided above may alternatively be defined by reciting the number of nucleotides that differ between the variant sequences and the specific reference sequences provided above. Thus, in one embodiment, the sequence may comprise (or consist of) a nucleotide sequence that differs from the specific sequences provided above at no more than 2 nucleotide positions, for example at no more than 1 nucleotide position. Conservative substitutions are preferred. The term variants as defined herein also encompasses splice variants.

An oligonucleotide probe of the invention may be at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, or more nucleotides in length. In a preferred embodiment, the oligonucleotide probe is 40 to 100 nucleotides in length, more preferably 50 to 100 nucleotides in length, even more preferably 50 to 80 nucleotides in length and most preferably 50 to 70 nucleotides in length.

The probes of the invention are typically designed to hybridise to their target nucleic acid sequence present in the one or more biomarker of the invention.

A probe may comprise or be complementary to a nucleic acid sequence within a target nucleic acid sequence from the one or more biomarker of the invention, or to a nucleic acid sequence having at least 80% identity to said target nucleic acid sequence. Any suitable probe which comprises or is complementary (as defined herein) to a nucleic acid sequence within a target nucleic acid sequence of one or more biomarker of the invention may be used. Preferred target sequences within the one or more biomarkers of the invention are underlined in the nucleic acid sequences shown in the Sequence Information section.

In embodiments wherein the one or more biomarker for TB is LOC400759/GBP1P1, a target nucleic acid sequence may comprise bases 91 to 640 of SEQ ID NO: 112 or bases 13751 to 13950 of SEQ ID NO: 113, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is PF4V1, a target nucleic acid sequence may comprise bases 21 to 450 of SEQ ID NO: 134, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is ALPK1, a target nucleic acid sequence may comprise bases 511 to 3220 of SEQ ID NO: 117, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is HERC2, a target nucleic acid sequence may comprise bases 2411 to 5641, 8141 to 9630 and/or 13651 to 14930 of SEQ ID NO: 132, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is LGALS3BP, a target nucleic acid sequence may comprise bases 1431 to 1850 of SEQ ID NO: 114, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is BST1, a target nucleic acid sequence may comprise bases 361 to 840 of SEQ ID NO: 115, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SNX10, a target nucleic acid sequence may comprise bases 1901 to 2480 of SEQ ID NO: 116, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CREG1, a target nucleic acid sequence may comprise bases 961 to 1620 of SEQ ID NO: 118, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is BAZ1A, a target nucleic acid sequence may comprise bases 4561 to 5270 of SEQ ID NO: 119, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is LYN, a target nucleic acid sequence may comprise bases 1681 to 2520 of SEQ ID NO: 120, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is TAPBP, a target nucleic acid sequence may comprise bases 171 to 1820 of SEQ ID NO: 121, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SERPINB1, a target nucleic acid sequence may comprise bases 1201 to 2050 of SEQ ID NO: 122, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is PSMB9, a target nucleic acid sequence may comprise bases 241 to 870 of SEQ ID NO: 123, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is WSB1, a target nucleic acid sequence may comprise bases 851 to 2250 of SEQ ID NO: 124, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is MVP, a target nucleic acid sequence may comprise bases 1901 to 2880 of SEQ ID NO: 125, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is APBB1IP, a target nucleic acid sequence may comprise bases 301 to 1830 of SEQ ID NO: 126, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is FYB, a target nucleic acid sequence may comprise bases 1621 to 2690 of SEQ ID NO: 127, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is MB21D1/C6orf150, a target nucleic acid sequence may comprise bases 1051 to 1570 of SEQ ID NO: 128, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CPVL, a target nucleic acid sequence may comprise bases 381 to 1140 of SEQ ID NO: 129, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is TICAM2, a target nucleic acid sequence may comprise bases 2671 to 3020 of SEQ ID NO: 130, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CD52, a target nucleic acid sequence may comprise bases 51 to 450 of SEQ ID NO: 131, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is KLRA1, a target nucleic acid sequence may comprise bases 801 to 1310 of SEQ ID NO: 133, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is DEFB128, a target nucleic acid sequence may comprise bases 11 to 270 of SEQ ID NO: 135, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is IL8, a target nucleic acid sequence may comprise bases 241 to 1460 of SEQ ID NO: 136, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is GBP1, a target nucleic acid sequence may comprise bases 2171 to 2800 of SEQ ID NO: 142, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is IRF1, a target nucleic acid sequence may comprise bases 1411 to 2050 of SEQ ID NO: 141, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is MMP9, a target nucleic acid sequence may comprise bases 1091 to 2190 of SEQ ID NO: 152, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CD96, a target nucleic acid sequence may comprise bases 641 to 3760 of SEQ ID NO: 138, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is AIM2, a target nucleic acid sequence may comprise bases 541 to 1060 of SEQ ID NO: 137, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CD274, a target nucleic acid sequence may comprise bases 541 to 1930 of SEQ ID NO: 138, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CDH23, a target nucleic acid sequence may comprise bases 9681 to 10990 of SEQ ID NO: 140, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is IFIT3, a target nucleic acid sequence may comprise bases 1041 to 1830 of SEQ ID NO: 143, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is IFITM3, a target nucleic acid sequence may comprise bases 211 to 580 of SEQ ID NO: 144, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is GK, a target nucleic acid sequence may comprise bases 1251 to 1970 of SEQ ID NO: 145, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is NELL2, a target nucleic acid sequence may comprise bases 2401 to 3110 of SEQ ID NO: 146, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is S100A11, a target nucleic acid sequence may comprise bases 291 to 580 of SEQ ID NO: 147, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SAMD9L, a target nucleic acid sequence may comprise bases 461 to 3260 of SEQ ID NO: 148, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is STAT1, a target nucleic acid sequence may comprise bases 2261 to 3170 of SEQ ID NO: 149, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is TLR6, a target nucleic acid sequence may comprise bases 1751 to 2430 of SEQ ID NO: 150, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is WARS, a target nucleic acid sequence may comprise bases 1801 to 2860 of SEQ ID NO: 151, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is DOCKS, a target nucleic acid sequence may comprise bases 5791 to 6460 of SEQ ID NO: 153, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SIRPB2, a target nucleic acid sequence may comprise bases 741 to 1950 of SEQ ID NO: 154, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is ANKRD22, a target nucleic acid sequence may comprise bases 981 to 1320 of SEQ ID NO: 155, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is ABCF2, a target nucleic acid sequence may comprise bases 1741 to 2370 of SEQ ID NO: 156, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is FNBP1L, a target nucleic acid sequence may comprise bases 4591 to 5220 of SEQ ID NO: 157, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is NCF1C, a target nucleic acid sequence may comprise bases 461 to 940 of SEQ ID NO: 158, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is TBC1D3B, a target nucleic acid sequence may comprise bases 1421 to 2090 of SEQ ID NO: 159, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SLC14A1, a target nucleic acid sequence may comprise bases 2031 to 2950 of SEQ ID NO: 160, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is CALCOCO2, a target nucleic acid sequence may comprise bases 2601 to 3600 of SEQ ID NO: 161, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is GTF2B, a target nucleic acid sequence may comprise bases 661 to 1160 of SEQ ID NO: 162, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is HLA-B, a target nucleic acid sequence may comprise bases 961 to 1430 of SEQ ID NO: 163, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is HLA-F, a target nucleic acid sequence may comprise bases 461 to 1520 of SEQ ID NO: 164, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is MGST2, a target nucleic acid sequence may comprise bases 161 to 760 of SEQ ID NO: 165, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is SPAST, a target nucleic acid sequence may comprise bases 701 to 1770 of SEQ ID NO: 166, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

In embodiments wherein the one or more biomarker for TB is WAC, a target nucleic acid sequence may comprise bases 2011 to 3590 of SEQ ID NO: 167, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.

It is preferred that the binding conditions for a probe hybridising to its target sequence are such that a high level of specificity is provided—i.e. hybridisation of the probe occurs under “stringent conditions”. In general, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target (or complement) sequence hybridises to a perfectly matched probe. In this regard, the Tm of probes of the present invention, at a salt concentration of about 0.02M or less at pH 7, is for example above 60° C., such as about 70° C.

Premixed buffer solutions are commercially available (e.g. EXPRESSHYB Hybridisation Solution from CLONTECH Laboratories, Inc.), and hybridisation can be performed according to the manufacturer's instructions.

Probes of the present invention may be screened to minimise self-complementarity and dimer formation (probe-probe binding).

Any of the probes described herein may comprise a tag and/or label. The tag and/or label may, for example, be located (independently of one another) towards the middle or towards or at the 5′ or 3′ end of the herein described probes, for example at the 5′ end.

Hence, following hybridisation of tagged/labelled probe to target nucleic acid, the tag/label is associated with the target nucleic acid in the one or more biomarker. Alternatively, if an amplification step is employed, the probes may act as primers during the method of the invention and the tag/label may therefore become incorporated into the amplification product as the primer is extended.

Examples of suitable labels include detectable labels such as radiolabels or fluorescent or coloured molecules, enzymatic markers or chromogenic markers—e.g. dyes that produce a visible colour change upon hybridisation of the probe. By way of example, the label may be digoxygenin, fluorescein-isothiocyanate (FITC), R-phycoerythrin, Alexa 532 or Cy3. The probes preferably contain a Fam label (e.g. a 5′ Fam label), and/or a minor groove binder (MGB). The label may be a reporter molecule, which is detected directly, such as by exposure to photographic or X-ray film. Alternatively, the label is not directly detectable, but may be detected indirectly, for example, in a two-phase system. An example of indirect label detection is binding of an antibody to the label.

Examples of suitable tags include “complement/anti-complement pairs”. The term “complement/anti-complement pair” denotes non-identical moieties that form a non-covalently associated, stable pair under appropriate conditions. Examples of suitable tags include biotin and streptavidin (or avidin). By way of example, a biotin tag may be captured using streptavidin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Likewise, a streptavidin tag may be captured using biotin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Other exemplary complement/anti-complement pairs include receptor/ligand pairs, antibody/antigen (or hapten or epitope) pairs, and the like. Another example is a nucleic acid sequence tag that binds to a complementary sequence. The latter may itself be pre-labelled, or may be attached to a surface (e.g. a bead) which is separately labelled. An example of the latter embodiment is the well-known LuminexR bead system. Other exemplary pairs of tags and capture molecules include receptor/ligand pairs and antibody/antigen (or hapten or epitope) pairs. Where subsequent dissociation of the complement/anti-complement pair is desirable, the complement/anti-complement pair has a binding affinity of, for example, less than 10⁹M⁻¹. One exemplary tagged probe is a biotin-labelled probe, which may be detected using horse-radish peroxidase conjugated streptavidin.

The probes of the invention may be labelled with different labels or tags, thereby allowing separate identification of each probe when used in the method of the present invention.

Any conventional method may be employed to attach nucleic acid tags to a probe of the present invention (e.g. to the 5′ end of the defined binding region of the probe). Alternatively, nucleic acid probes of the invention (with pre-attached nucleic acid tags) may be constructed by commercial providers.

If an amplification step is employed, this step may be carried out using methods and platforms known in the art, for example PCR (for example, with the use of “Fast DNA Polymerase”, Life Technologies), such as real-time PCR, block-based PCR, ligase chain reaction, glass capillaries, isothermal amplification methods including loop-mediated isothermal amplification, rolling circle amplification transcription mediated amplification, nucleic acid sequence-based amplification, signal mediated amplification of RNA technology, strand displacement amplification, isothermal multiple displacement amplification, helicase-dependent amplification, single primer isothermal amplification, and circular helicase-dependent amplification. If employed, amplification may be carried using any amplification platform.

A general amplification step (e.g. pre-detection) may be employed to increase the amount of the one or more biomarker of the invention present in the sample. PCR amplification primers are typically employed to amplify approximately 100-400 base pair regions of the target/complementary nucleic acid that contain the nucleotide targets of the present invention. In the presence of a suitable polymerase and DNA precursors (dATP, dCTP, dGTP and dTTP), forward and reverse primers are extended in a 5′ to 3′ direction, thereby initiating the synthesis of new nucleic acid strands that are complementary to the individual strands of the target nucleic acid. The primers thereby drive amplification of target nucleic acid sequences in the one or more biomarker, thereby generating amplification products comprising said target nucleic acid sequences.

An amplification step may be employed in which the probes of the present invention act as primers. In this embodiment, the probes (acting as primers) are extended from their 3′ ends (i.e. in a 5′-to-′3′) direction. Such an amplification step may be employed in conjunction with a general amplification step, such as the one described above.

The detection step may be carried out by any known means. In this regard, the probe or amplification product may be tagged and/or labelled, and the detection method may therefore comprise detecting said tag and/or label.

In one embodiment, the probe(s) may comprise a tag and/or label. Thus, in one embodiment, following hybridisation of tagged/labelled probe to target nucleic acid in the one or more biomarker, the tag/label becomes associated with the target nucleic acid. Thus, in one embodiment, the assay may comprise detecting the tag/label and correlating presence of tag/label with presence of the one or more nucleic acid biomarker of the invention.

In one embodiment, tag and/or label may be incorporated during extension of the probe(s). In doing so, the amplification product(s) become tagged/labelled, and the assay may therefore comprise detecting the tag/label and correlating presence of tag/label with presence of amplification product, and hence the presence of one or more nucleic acid biomarker of the invention.

By way of example, in one embodiment, the amplification product may incorporate a tag/label (e.g. via a tagged/labelled dNTP such as biotin-dNTP) as part of the amplification process, and the assay may further comprise the use of a binding partner complementary to said tag (e.g. streptavidin) that includes a detectable tag/label (e.g. a fluorescent label, such as R-phycoerythrin). In this way, the amplified product incorporates a detectable tag/label (e.g. a fluorescent label, such as R-phycoerythrin).

In one embodiment, the probe(s) and/or the amplification product(s) may include a further tag/label (as the complement component) to allow capture of the amplification product(s).

By way of example, a “complement/anti-complement” pairing may be employed in which an anti-complement capture component binds to said further tag/label (complement component) and thereby permits capture of the probe(s) and/or amplification product(s). Examples of suitable “complement/anti-complement” partners have been described earlier in this specification, such as a complementary pair of nucleic acid sequences, a complementary antibody-antigen pair, etc. The anti-complement capture component may be attached (e.g. coated) on to a substrate or solid support—examples of suitable substrates/supports include membranes and/or beads (e.g. a magnetic or fluorescent bead). Capture methods are well known in the art. For example, LuminexR beads may be employed. Alternatively, the use of magnetic beads may be advantageous because the beads (plus captured, tagged/labelled amplification product) can easily be concentrated and separated from the sample, using conventional techniques known in the art.

Immobilisation provides a physical location for the anti-complement capture component (or probes), and may serve to fix the capture component/probe at a desired location and/or facilitate recovery or separation of probe. The support may be a rigid solid support made from, for example, glass, plastic or silica, such as a bead (for example a fluorescent or magnetic bead). Alternatively, the support may be a membrane, such as nylon or nitrocellulose membrane. 3D matrices are also suitable supports for use with the present invention—e.g. polyacrylamide or PEG gels. Immobilisation to a support/platform may be achieved by a variety of conventional means. By way of example, immobilisation onto a support such as a nylon membrane may be achieved by UV cross-linking. Alternatively, biotin-labelled molecules may be bound to streptavidin-coated substrates (and vice-versa), and molecules prepared with amino linkers may be immobilised on to silanised surfaces. Another means of immobilisation is via a poly-T tail or a poly-C tail, for example at the 3′ or 5′ end. Said immobilisation techniques apply equally to the probe component (and primer pair component, if present) of the present invention.

In one embodiment, the probes of the invention comprise a nucleic acid sequence tag/label (e.g. attached to each probe at the 5′ end of the defined sequence of the probe that binds to target/complement nucleic acid). In more detail, each of the probes is provided with a different nucleic acid sequence tag/label, wherein each of said tags/labels (specifically) binds to a complementary nucleic acid sequence present on the surface of a bead. Each of the different tags/labels binds to its complementary sequence counterpart (and not to any of the complementary sequence counterparts of the other tags), which is located on a uniquely identifiable bead. In this regard, the beads are uniquely identifiable, for example by means of fluorescence at a specific wavelength. Thus, in use, probes of the invention bind to target nucleic acid (if present in the sample). Thereafter, (only) the bound probes may be extended (in the 3′ direction) in the presence of one or more labelled dNTP (e.g. biotin labelled dNTPs, such as biotin-dCTPs).

The extended primers may be contacted with a binding partner counterpart to the labelled dNTPs (e.g. a streptavidin labelled fluorophore, such as streptavidin labelled R-phycoerythrin), which binds to those labelled dNTPs that have become incorporated into the extended primers. Thereafter, the labelled extended primers may be identified by allowing them to bind to their nucleic acid counterparts present on the uniquely identifiable beads. The latter may then be “called” (e.g. to determine the type of bead present by wavelength emission) and the nature of the primer extension (and thus the type of target/complement nucleic acid present) may be determined.

Typically, probes of the invention are oligonucleotides having sequence identity with a region of the one or more biomarker of the invention as disclosed herein. One or more probe may be immobilised on a solid support, and used to interrogate mRNA obtained from a test sample. If the mRNA from the test sample contains the one or more biomarker targeted by the immobilised probe, it will bind to the probe, and may then be detected. The biomarkers of the invention may also be detected using PCR, such as real time PCR.

Any oligonucleotide with the appropriate level of sequence identity with the one or more biomarker of the invention, or with one or more target sequences within said one or more biomarker of the invention may be used as a probe as described herein. Any oligonucleotide with the appropriate level of complementarity with the one or more biomarker of the invention, or with one or more target sequences within said one or more biomarker of the invention may be used as a probe as described herein. Exemplary sequences of the one or more biomarkers of the invention are given in SEQ ID NOs: 112 to 167 (see Tables 2 to 5 herein). Sequences of exemplary target regions within the one or more biomarkers of the invention are shown as underlined in the sequences of the Sequence Information section (as discussed herein). Exemplary probe nucleic acid sequences for the biomarkers disclosed herein are set out in Table 6 (SEQ ID NOs: 1 to 111 and 168 to 171) and are shown as double-underlined in the sequences of the Sequence Information section.

In embodiments wherein the one or more biomarker for TB is LOC400759/GBP1P1, the oligonucleotide probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NO: 1, 2 or 3.

In embodiments wherein the one or more biomarker for TB is PF4V1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 4 or 5.

In embodiments wherein the one or more biomarker for TB is ALPK1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 6 or 7.

In embodiments wherein the one or more biomarker for TB is HERC2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 8, 9 or 168 to 171.

In embodiments wherein the one or more biomarker for TB is LGALS3BP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 10 or 11.

In embodiments wherein the one or more biomarker for TB is BST1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 12 or 13.

In embodiments wherein the one or more biomarker for TB is SNX10, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 14 or 15.

In embodiments wherein the one or more biomarker for TB is CREG1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 16 or 17.

In embodiments wherein the one or more biomarker for TB is BAZ1A, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 18 or 19.

In embodiments wherein the one or more biomarker for TB is LYN, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 20 or 21.

In embodiments wherein the one or more biomarker for TB is TAPBP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 22 or 23.

In embodiments wherein the one or more biomarker for TB is SERPINB1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 24 or 25.

In embodiments wherein the one or more biomarker for TB is PSMB9, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 26 or 27.

In embodiments wherein the one or more biomarker for TB is WSB1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 28 or 29.

In embodiments wherein the one or more biomarker for TB is MVP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 30 or 31.

In embodiments wherein the one or more biomarker for TB is APBB1IP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 32 or 33.

In embodiments wherein the one or more biomarker for TB is FYB, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 34 or 35.

In embodiments wherein the one or more biomarker for TB is MB21D1/C6orf150, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 36 or 37.

In embodiments wherein the one or more biomarker for TB is CPVL, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 38 or 39.

In embodiments wherein the one or more biomarker for TB is TICAM2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 40 or 41.

In embodiments wherein the one or more biomarker for TB is CD52, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 42 or 43.

In embodiments wherein the one or more biomarker for TB is KLRA1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 44 or 45.

In embodiments wherein the one or more biomarker for TB is DEFB128, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 46 or 47.

In embodiments wherein the one or more biomarker for TB is IL8, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 48 or 49.

In embodiments wherein the one or more biomarker for TB is GBP1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 50 or 51.

In embodiments wherein the one or more biomarker for TB is IRF1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 52 or 53.

In embodiments wherein the one or more biomarker for TB is MMP9, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 54 or 55.

In embodiments wherein the one or more biomarker for TB is CD96, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 56 or 57.

In embodiments wherein the one or more biomarker for TB is AIM2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 58 or 59.

In embodiments wherein the one or more biomarker for TB is CD274, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 60 or 61.

In embodiments wherein the one or more biomarker for TB is CDH23, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 62 or 63.

In embodiments wherein the one or more biomarker for TB is IFIT3, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 64 or 65.

In embodiments wherein the one or more biomarker for TB is IFITM3, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 66 or 67.

In embodiments wherein the one or more biomarker for TB is GK, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 68 or 69.

In embodiments wherein the one or more biomarker for TB is NELL2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 70 or 71.

In embodiments wherein the one or more biomarker for TB is S100A11, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 72 or 73.

In embodiments wherein the one or more biomarker for TB is SAMD9L, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 74 or 75.

In embodiments wherein the one or more biomarker for TB is STAT1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 76 or 77.

In embodiments wherein the one or more biomarker for TB is TLR6, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 78 or 79.

In embodiments wherein the one or more biomarker for TB is WARS, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 80 or 81.

In embodiments wherein the one or more biomarker for TB is DOCKS, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 82 or 83.

In embodiments wherein the one or more biomarker for TB is SIRPB2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 84 or 85.

In embodiments wherein the one or more biomarker for TB is ANKRD22, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 86 or 87.

In embodiments wherein the one or more biomarker for TB is ABCF2 (NM_005692.3), the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 88 or 89.

In embodiments wherein the one or more biomarker for TB is FNBP1L, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 90 or 91.

In embodiments wherein the one or more biomarker for TB is NCF1C, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 92 or 93.

In embodiments wherein the one or more biomarker for TB is TBC1D3B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 94 or 95.

In embodiments wherein the one or more biomarker for TB is SLC14A1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 96 or 97.

In embodiments wherein the one or more biomarker for TB is CALCOCO2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 98 or 99.

In embodiments wherein the one or more biomarker for TB is GTF2B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 100 or 101.

In embodiments wherein the one or more biomarker for TB is HLA-B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 102 or 103.

In embodiments wherein the one or more biomarker for TB is HLA-F, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 104 or 105.

In embodiments wherein the one or more biomarker for TB is MGST2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 106 or 107.

In embodiments wherein the one or more biomarker for TB is SPAST, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 108 or 109.

In embodiments wherein the one or more biomarker for TB is WAC, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 110 or 111.

Use of a Data Analysis Algorithm

In one embodiment, comparison of the one or more biomarker or the biomarker profile to a reference or control comprises applying a decision rule, or using a decision tree, as described herein. The decision rule or decision tree can comprise a data analysis algorithm, such as a computer pattern recognition algorithm. Other suitable algorithms include, but are not limited to, logistic regression or a nonparametric algorithm that detects differences in the distribution of feature values (e.g., a Wilcoxon Signed Rank Test). The decision rule may be based upon one, two, three, four, five, 10, 20 or more features. In one embodiment, the decision rule or decision tree is based on hundreds or more of features. Applying the decision rule or decision tree may also comprise using a classification tree algorithm. For example, the control or reference biomarker profile may comprise at least three features or biomarkers, where the features are predictors in a classification tree algorithm. The data analysis algorithm predicts membership within a population (or class) with an accuracy of at least about 60%, at least about 70%, at least about 80% and at least about 90%.

Suitable algorithms are known in the art, some of which are reviewed in Hastie et al, supra. Such algorithms classify complex spectra from biological materials, such as a blood sample, to distinguish individuals as normal or as possessing biomarker expression levels characteristic of a particular disease state. While such algorithms may be used to increase the speed and efficiency of the application of the decision rule and to avoid investigator bias, one of ordinary skill in the art will realize that computer-based algorithms are not required to carry out the methods of the present invention.

Algorithms may be applied to the comparison of the one or more biomarker or the biomarker profiles, regardless of the method that was used to generate the data for the one or more biomarker or the biomarker profile. For example, suitable algorithms can be applied to biomarker profiles generated using gas chromatography, as discussed in Harper, “Pyrolysis and GC in Polymer Analysis” Dekker, New York (1985). Further, Wagner et al, Anal Chem 74: 1824-35 (2002) disclose an algorithm that improves the ability to classify individuals based on spectra obtained by static time-of-flight secondary ion mass spectrometry (TOF-SIMS). Additionally, Bright et al, J. Microbiol Methods 48: 127-38 (2002) disclose a method of distinguishing between bacterial strains with high certainty (79-89% correct classification rates) by analysis of MALDI-TOF-MS spectra. Dalluge, Fresenius J. Anal. Chem. 366: 701-11 (2000) discusses the use of MALDI-TOF-MS and liquid chromatography-electrospray ionization mass spectrometry (LC/ESI-MS) to classify profiles of biomarkers in complex biological samples.

Methods of Diagnosis

As described herein, the present invention provides a method for diagnosing TB in an individual, comprising determining the presence and/or amount of one or more biomarker for TB in a sample obtained from the individual, wherein the one or more biomarkers is selected from SNX10, CPVL, PF4V1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, SERPINB1, WSB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, CD52, KLRA1, DEFB128 and IL8. Any combination of biomarkers as disclosed herein may be used in a method according to the present invention.

The method may comprising obtaining a first biomarker profile from a first sample taken from the individual at a single initial point in time and multiple time points thereafter to monitor the efficacy of treatment and disease resolution; and comparing said individual's first biomarker profile to a reference or control biomarker profile, wherein said comparison determines the status of TB infection in the individual with an accuracy, sensitivity and/or specificity of at least about 90%, at least about 80%, at least about 70% or at least about 60%; and wherein the biomarker profiles comprise determining the presence and/or amount of one or more biomarker of the invention. Typically the accuracy, sensitivity and/or specificity is of at least about 80% or at least about 90%.

The method may comprise obtaining a first biomarker profile from a first sample from the individual; and comparing the individual's first biomarker profile to a reference or control biomarker profile obtained from a reference or control population, said comparison being capable of classifying the individual as belonging to or not belonging to the reference or control population, wherein the comparison determines the status of TB infection in the individual, and wherein the biomarker profiles comprise determining the presence and/or amount of one or more biomarker of the invention.

The method may comprise comparing a measurable characteristic of at least three biomarkers of the invention between (i) a first biomarker profile obtained from a first sample from the individual and (ii) a biomarker profile obtained from samples from a control or reference population; and classifying the individual as belonging to or not belonging to the control or reference population, wherein the comparison determines the status of TB infection in the individual, and wherein the measurable characteristic optionally comprises the presence and/or amount of the biomarker.

The method may comprise selecting at least two features from a set of biomarkers of the invention in a first biomarker profile generated from a first sample of the individual; and comparing the at least two features to a set of the same biomarkers in a biomarker profile generated from samples from a control or reference population, wherein the comparison is capable of classifying the individual as belonging to or not belonging to the control reference population with an accuracy, sensitivity and/or specificity of at least about 90%, at least about 80%, at least about 70% or at least about 60%, wherein the comparison determines the status of TB in the individual, and wherein the feature optionally comprises the presence and/or amount of the biomarker. Typically the accuracy, sensitivity and/or specificity is of at least about 80% or at least about 90%.

The method may comprise determining an abundance or a change in an abundance of at least three biomarkers contained in a first biomarker profile obtained from a first biological sample of the individual; and (b) comparing the abundance or the change in the abundance to an abundance or change in an abundance of said at least three biomarkers contained in biological samples from a control or reference population, wherein the comparison is capable of classifying the individual as belonging to or not belonging to the control or reference population; and wherein the comparison determines the status of TB in the individual.

The method may further comprise obtaining a second biomarker profile from a second sample taken from the individual; and comparing the individual's second biomarker profile to the control or reference biomarker profile; wherein the individual's second biomarker profile and the control or reference biomarker profile comprise features that are measurable characteristics of a biomarker of the invention, wherein the second comparison is capable of classifying the individual as belonging to or not belonging to the control or reference population, and wherein the second comparison determines the status TB infection in the individual. The biomarker profiles optionally comprise one or more of the biomarkers of the present invention, and the measurable characteristic optionally comprises the presence and/or amount of one or more biomarker of the invention.

The methods of the invention may be repeated at least once, at least twice, at least three times, at least four times, at least five times, or more. A separate biomarker profile can be obtained from the individual from a separate sample taken each time the method is repeated.

The methods of the invention may be used to diagnose, detect and/or predict TB, TB infection and/or infection with M. tuberculosis. The methods of the invention may be used to distinguish between active and latent TB, a TB infection and/or infection with M. tuberculosis. The methods of the invention may be used to distinguish between latent TB and the absence of TB. The methods of the invention may be used to identify an individual with an active TB infection and/or a latent TB infection. The methods of the invention may be used to identify an individual with an active TB infection and/or a latent TB infection and/or an individual uninfected with TB. The methods of the invention may be used to identify an individual with an early stage active TB infection and/or a late/later stage active TB infection. The methods of the invention may be used to distinguish between an early stage active TB infection and/or a late/later stage active TB infection. The methods of the invention may also be used to determine the status of TB, a TB infection and/or infection with M. tuberculosis in an individual. Determining the status of TB, a TB infection and/or infection with M. tuberculosis in an individual may comprise determining the progression or resolution of TB, a TB infection and/or infection with M. tuberculosis in the individual. Determining the status of TB, a TB infection and/or infection with M. tuberculosis in an individual may comprise determining the presence of active or latent TB, a TB infection and/or infection with M. tuberculosis in an individual. The methods of the invention may be used to determine whether an individual has been exposed to TB.

The methods of the invention may comprise applying a decision rule as described herein. Applying the decision rule may comprise using a data analysis algorithm, also as described herein. The data analysis algorithm may comprise at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 15, at least 20, at least 25, at least 50 or more input parameters. The data analysis algorithm may use any of the biomarkers of the invention, or combination of biomarkers of the invention as input parameters. Typically, the data analysis algorithm uses at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 15, at least 20, at least 25, at least 50 of the biomarkers of the invention (e.g. as listed in any one of Tables 2 to 5) as input parameters.

In a preferred embodiment, the features and/or biomarkers profile used in the methods of the invention are the one or more biomarkers of the present invention, as described herein, and preferably the methods relate to determining the presence and/or amount of the one or more biomarker. Similarly, the “measurable characteristic” in a method of the invention may be any quantitative or qualitative characteristic associated with one or more biomarker of the invention, and is preferably the presence and/or amount of said biomarker.

In a more preferred embodiment, the one or more biomarker of the invention is nucleic acids, selected from DNA or RNA, typically mRNA. The biomarker profile may comprise any measurable aspect of said nucleic acid biomarker, and is typically a measurable characteristic of an mRNA biomarker, such as the presence and/or amount of said mRNA biomarker. The one or more biomarker of the invention and/or the biomarker may comprise a measurable aspect of a nucleic acid biomarker that encodes a protein that is informative of the state of the immune system in response to TB, a TB infection and/or infection with M. tuberculosis in an individual.

As described herein, a method of the invention may comprise fractionation of the sample prior to determining the presence and/or amount of the one or more biomarker of the invention, or obtaining a biomarker profile. Typically, the method comprises at least one, at least two, at least three, at least four, at least five, or more separation methods as described herein. The at least one separation method may be selected from inflammatory cell separation, chemical extraction partitioning, ion exchange chromatography, gel electrophoresis, and any combination thereof.

The invention also provides the use of one or more biomarker for TB as defined herein in the manufacture of a diagnostic for TB. Said diagnostic may be for diagnosing active TB and/or latent TB and/or the absence of TB.

Kits and Devices

The invention also provides kits and devices that are useful in determining the status of TB, diagnosing or detecting TB, distinguishing between active and latent TB in an individual, distinguishing between early stage active TB and late/later stage active TB and/or to determine whether an individual has been exposed to TB. The kits and devices of the present invention comprise at least one biomarker of the invention and/or one or more agent for the detection of or for the determination of the amount of the one or more biomarker of the invention. Specific biomarkers and agents for the detection of said biomarkers useful in the present invention are set forth herein. The biomarkers of the kit or device can be used to generate biomarker profiles according to the present invention.

Generally, the biomarkers of the kit or biomarker will bind, with at least some specificity, to the biomarker molecules contained in the sample from which the biomarker profile is generated. Examples of classes of compounds of the kit or device include, but are not to, proteins (including antibodies of the invention), and fragments thereof, peptides, polypeptides, proteoglycans, glycoproteins, lipoproteins, carbohydrates, lipids, nucleic acids, organic and inorganic chemicals, and natural and synthetic polymers. The biomarker(s) and/or agent(s) for the detection of the one or more biomarker may be part of an array, or the biomarker(s) and/or agent(s) may be packaged separately and/or individually. The biomarker(s) and/or agent(s) may be immobilised on an inert support.

The kit or device may also comprise at least one internal standard to be used in generating the biomarker profiles of the present invention. Likewise, the internal standards can be any of the classes of compounds described above.

The kits and devices of the present invention also may contain reagents that can be used to detectably label biomarkers contained in the biological samples from which the biomarker profiles are generated. For this purpose, the kit or device may comprise a set of antibodies or functional fragments thereof that specifically bind at least two, three, four, five, 10, 20, 30, 40, 50 or more, up to all 55 of the biomarkers set forth in any one of Tables 2 to 6 that list biomarkers for use in the invention. The antibodies themselves may be detectably labelled. The kit or device also may comprise a specific biomarker binding component, such as an aptamer.

In a preferred embodiment, a kit or device of the invention comprises (i) one or more antibody specific for the one or more biomarker for tuberculosis; or (ii) one or more oligonucleotide specific for the one or more biomarker for tuberculosis. In a more preferred embodiment, the one or more oligonucleotide specific for the one or more biomarker for tuberculosis is an oligonucleotide is an oligonucleotide of the invention, more preferably one or more of SEQ ID NOs: 1 to 111 or 168 to 171.

If the biomarkers comprise a nucleic acid, the kit or device may provide one or more oligonucleotide probe that is capable of forming a duplex with the one or more biomarker or with a complementary strand of said one or more biomarker. The one or more oligonucleotide probe may be detectably labelled. Typically, the one or more oligonucleotide probe used in the methods of the invention is selected from one or more of the oligonucleotide described herein. In a preferred embodiment, the one or more oligonucleotide probe is selected from an oligonucleotide probe that comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of any one or more of SEQ ID NOs: 1 to 111 or 168 to 171.

The kits and devices of the present invention may also include pharmaceutical excipients, diluents and/or adjuvants when the biomarker is to be used to raise an antibody. Examples of pharmaceutical adjuvants include, but are not limited to, preservatives, wetting agents, emulsifying agents, and dispersing agents. Prevention of the action of microorganisms can be ensured by the inclusion of various antibacterial and antifungal agents, for example, paraben, chlorobutanol, phenol sorbic acid, and the like. It may also be desirable to include isotonic agents such as sugars, sodium chloride, and the like.

The Following Examples Illustrate the Invention
EXAMPLES
Example 1
TB-Specific Biomarker Identification

Naive Cynomologous macaques (Macaca fascicularis) aged between 2-4 yrs from two separate breeding colonies i.e. of Mauritian or Chinese origin, from established United Kingdom or Chinese breeding facilities were challenged with were challenged with live M. tuberculosis using aerosol challenge Erdman strain K 01. They were confirmed as naïve in terms of prior exposure to mycobacterial antigens (M. tuberculosis infection or environmental mycobacteria), by negative tuberculin test while in their original breeding colony and prior to the start of the study using the gamma interferon (IFN-γ)-based Primagam test kit (Biocor; CSL). All procedures involving animals were approved by the Ethical Review Committee of the Centre for Emergency Preparedness and Response, Salisbury, United Kingdom.

Mono-dispersed bacteria in particles were generated using a three-jet Collison nebulizer (BGI) and in conjunction with a modified Henderson apparatus, delivered to the nose of each sedated primate via a modified veterinary anaesthesia mask. The challenge was performed on sedated animals placed within a “head-out” plethysmography chamber (Buxco, Wilmington, N.C.), to enable the aerosol to be delivered simultaneously with the measurement of the respiration rate. None of the animals had been used previously for experimental procedures.

Whole heparinised blood was obtained at three independent time points prior to challenge and at one, two, four and six weeks post M. tuberculosis challenge. Within 1 hour of collection, 1 ml of blood from each animal was mixed with 5 ml of Erythrocyte Lysis (EL) Buffer (Qiagen) followed by incubation on ice for 10-15 min. Peripheral blood leukocytes (PBLs) were recovered from erythrocyte-lysed blood by centrifugation at 400×g for 10 min at 4° C. and re-suspended in a further 2 ml of EL buffer. PBLS were again recovered by centrifugation as described above and processed for recovery of total RNA.

One ml of TRIzol® was added to the PBL pellet and then total RNA was extracted from the lysed PBL pellet according to the manufacturer's instructions, using aqueous-phase separation with chloroform isoamyl alcohol and the precipitation using 2-isopropanol. Recovered, dried RNA pellets were re-suspended in 10 μl of diethylpyrocarbonate (DECP) water (Invitrogen), then concentration and purity (A260/A280 ratio≧1.8) assessed by spectrophotometry using a NanoDrop ND-1000 spectrophotometer (Thermo Scientific). Genomic DNA was removed prior to its use in further procedures using the DNase I kit (Qiagen), according to the manufacturer's instructions. The GeniSphere SenseAmp RNA amplification kit according to manufactures instructions (http://genisphere.com/). The resulting amplified cRNA was purified using RNeasy Min-Elute Cleanup kit (Qiagen), again according to the manufacturer's protocol. The cRNA concentration and purity (A260/A280 ratio≧1.8) was then assessed by spectrophotometry using a NanoDrop ND-1000 spectrophotometer.

Total amplified cRNAs were then labelled with Cy3 and hybridised to replicate Operon Human Genome AROS V4.0 slides (n=3/sample/time point (http://www.microarrays.com/dna-arrays.php)), using established protocols. The slides were air-dried and scanned using an Affymetrix 480 microarray scanner, at a gain threshold of 65. Feature extraction was then conducted using the microarray quantification package BlueFuse (BlueGnome ltd.). Raw data were then exported and hybridisation fluorescence intensities quantified using the software analysis program Bluefuse™, using default background subtraction and normalisation methods, to remove data generated from poor-quality spots, hybridisation artifacts. All raw data were then processed further using the microarray analysis package Genespring 12.5.

Data output files from BlueFuse were imported into GeneSpring 12.5 (GX12) for differential gene expression and statistical analysis. Raw data was normalized to the 50th percentile followed by median baseline transformed to the corresponding animal pre-bleed. This was conducted to normalise data across all time points and assess differential gene expression of each gene entity, relative to a baseline i.e. pre-bleed level of expression prior to M. tuberculosis challenge. The mean value across three replicate samples slides for each feature was used for further analysis. Data were assessed for quality, then filtered on gene expression where entities in at least 100 percent of samples and in any one out of one conditions had normalised expression values within the cut-off-10.699 to 7.037. Statistically significant features were identified using one-way ANOVA analysis across all entities and time points, using the Benjamini-Hochberg False Discovery Rate (BH-FDR) at a cut-off p<0.05. To identify temporally, differentially expressed entities between time-points post-infection, fold-change cut-off analysis was conducted, all against the pre-bleed condition and where the minimum number of pairs was equal to 1 out of the 4 condition pairs i.e. weeks 1, 2, 4 or 6 and using the default cut-off setting >2.0.

Data outputs were also analysed using Artificial Neural Network Analysis (ANN). Normalised expression data was analysed using ANN based data mining approach (Lancashire L J et al (2010), Breast Cancer Res Treat. Feb; 120(1):83-93). This approach comprised a supervised learning approach where the data for a given single probe was used to classify a known sample. The classifier consisted of a multi-layer perceptron ANN, where weights were updated by a back propagation algorithm (Rumelhart D E et al (1986) Nature 323: 533-536). The ANN architecture utilised a constrained architecture to prevent over-fitting, having only 2 hidden nodes in the hidden layer. ANN training incorporated Monte Carlo Cross Validation, where, the data was randomly divided into three subsets; 60% for training the classifier, 20% for testing (to assess model performance during the training process) and 20% for validation (to independently test the model on data completely blind to the model). This process of random sample cross validation was utilised to prevent over-fitting of the data and assess how well the model would perform on a blind data set. This random re-sampling and training process was repeated 50 times to generate predictions and associated error values for each sample with respect to the validation (blind) data. Probes were ranked in ascending order based on predictive error for test data from the Monte Carlo Cross validation. Significant hits were identified by cross-comparison between ANOVA p value-based (lowest to highest) and ANN test error-based ranked order lists (lowest to highest) and further filtered using the heat map and cluster functions in Genespring 12.0, using default settings. Highly significant biomarker datasets were refined by cross comparison of entity lists obtained using either one way ANOVA (P≧0.05) or ANN analysis (top one thousand entities ranked on average test error). Fifty-five biomarkers were selected for further progression from these gene lists.

All fifty-five biomarkers and individual smaller panels of up to ten biomarkers each were used to interrogate previously published human datasets using the cluster algorithm of GeneSpring 12.5, using the unsupervised hierarchical Euclidean clustering setting on conditions and entities. Small, select panels of biomarkers more amenable to use on point of care diagnostic platforms were identified which exhibited the best sensitivity and specificity in discriminating active Tuberculosis patients from Latent Tuberculosis and controls in one analysis and also in discrimination Latent Tuberculosis from uninfected controls in a second tier analysis. These are given below in Table 1.

All TB 55 Panel; all Biomarkers combined from Tables 2-5

Active TB 8 Panel; LOC400759, PF4V1, ALPK1, HERC2, IRF1, MMP9, GBP1, CD96
Latent TB 5 Panel; HERC2, KLRAP1, PF4V1, DEFB128, IL8

TABLE 1

Select Biomarkers for TB

Biomarker
No. True
No. False
No. True
No. False
Sensitivity
Specificity

Data Set
Panel
Negatives
Negatives
Positives
Positives
(%)
(%)

Human
All TB
82
21
53
10
84.1
79.6

Dataset
55 panel

(9 Latent

1^a

TB, 1

uninfected)

Human
All TB
60
14
32
4
88.9
81.1

Dataset
55 panel

(all Latent

2^b

TB)

Human
Active TB
88
18
56
10
84.8
83

Dataset
8 panel

(9 Latent

1^a

TB, 1

uninfected)

Human
Active TB
62
16
30
2
93.75
79.5

Dataset
8 panel

(all Latent

2^b

TB)

Human
Latent TB
20
17
52
3
94.5
54.1

Dataset
5 panel

1^a

^aBerry MPR. et al (2010) Nature 466(7309): 973-977

^bMaertzdorf J et al (2011) PLoS One 6(10): e26938

Table 2 lists the genes newly identified as biomarkers for TB using the above methods. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types (all white blood cells in a sample, monocytes, neutrophils, CD4 positive T cells, CD8 positive T cells, etc.) the change was observed. Where more than one indication is given, the first listed is the most preferred.

TABLE 2

Biomarkers for TB

Corrected

P value

Probe
Gene
SEQ
NHP Dataset

Number
Symbol
ID NO:
BH FDR
Cell Type

AA1
LOC400759/
112/113
1.95E−20
Mainly

GBP1P1

Monocytes↑/

(RP4-644F6.3)

Neutrophils↑

AA2
LGALS3BP
114
7.46E−09
All↑

AB1
BST1
115
2.05E−09
Monocytes↑/

Neutrophils↑/

CD8 positive

T cells (CD8↑

AB2
SNX10
116
8.02E−12
Monocytes↑/

Neutrophils↑/

CD4 positive

T cells(CD4)↑

AC1
ALPK1
117
1.33E−03
Monocytes↑/

Neutrophils↑/

CD4↑

AC2
CREG1
118
1.52E−09
Monocytes↑/

Neutrophils↑/

CD4↑

AD1
BAZ1A
119
5.46E−04
Monocytes↑/

Neutrophils↑

AD2
LYN
120
3.14E−11
Monocytes↑/

Neutrophils↑

AD3
TAPBP
121
1.63E−04
Monocytes↑/

Neutrophils↑

AE1
SERPINB1
122
1.61E−14
Monocytes↑/

Neutrophils↑

AE2
PSMB9
123
9.98E−17
All↑

AE3
WSB1
124
3.82E−04
Monocytes↑/

Neutrophils↑

AF1
MVP
125
3.49E−11
Monocytes↑/

Neutrophils↑

AF2
APBB1IP
126
1.21E−08
Monocytes↑/

Neutrophils↑

AF3
FYB
127
3.09E−10
All↑

AG1
MB21D1/
128
1.76E−02
All↑

C6orf150

AG2
CPVL
129
2.30E−15
Monocytes↑/

CD4↑

AG3
TICAM2
130
5.98E−10
Neutrophils↑

AH1
CD52
131
9.66E−04
Monocytes↓/

Neutrophils↓

AI1
HERC2
132
2.07E−09
Neutrophils↓

AJ2
KLRAP1
133
7.81E−07
CD4 & CD8

(KLRA1)

positive T cells

(CD4, CD8)↑

AK1
PF4V1
134
4.37E−02
Monocytes↑/

Neutrophils↓

AL1
DEFB128
135
7.22E−03
CD8↑/

Monocytes↑

AM1
IL8
136

Neutrophils↓

Table 3 lists further biomarkers for TB. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.

TABLE 3

Further Biomarkers for TB

Corrected

P value

Probe
Gene
SEQ
NHP Dataset

Number
Symbol
ID NO:
BH FDR

B1
AIM2
137
4.63E−02
Monocytes↑/

Neutrophil↑/

CD4↑

B2
CD274
138
8.31E−04
All↑

B3
CD96
139
4.02E−04
CD4 & CD8

positive T cells

(CD4, CD8) ↑

B4
CDH23
140
8.26E−10
CD8↑/

Neutrophils↓

B5
IRF1
141
2.36E−19
Monocytes↑/

Neutrophil↑

B6
GBP1
142
7.95E−06
All↑

B7
IFIT3
143
7.13E−04
All ↑

B8
IFITM3
144
8.38E−12
Monocytes↑/

Neutrophil↑/

CD4↑

B9
GK
145
3.50E−02
Monocytes↑/

Neutrophils↑/

CD4↑

B10
NELL2
146
4.63E−04
CD8 positive

T cells

(CD8)↓

B11
S100A11
147
7.31E−03
CD4↑/CD8↑

B12
SAMD9L
148
1.60E−04
All↑

B13
STAT1
149
3.42E−05
All↑

B14
TLR6
150
2.13E−03
Monocytes↑/

Neutrophils↑

B15
WARS
151
2.43E−06
Monocytes↑/

Neutrophils↑/

CD4↑

B16
MMP9
152
3.39E−01
Monocytes↓/

Neutrophils↓/

CD8

positive T cells

(CD8)↓

B17
DOCK9
153
1.43E−04
CD4 & CD8

positive T cells

(CD4, CD8)↓/

Neutrophils↑

B18
SIRPB2
154
3.03E−01
Monocytes↑

B19
ANKRD22
155

Monocytes↑/

Neutrophils↑/

CD4↑

Table 4 lists the genes identified as biomarkers for latent TB using the above methods. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.

TABLE 4

Biomarkers for Latent TB

Corrected

P value

Probe
Gene
SEQ
NHP Dataset

Number
Symbol
ID NO:
BH FDR

C1
ABCF2
156
5.10E−03
CD4 & CD8

(NM_005692.3)

positive T cells

(CD4, CD8)↑/

Monocytes↓/

Neutrophils↓

C2
FNBP1L
157
1.92E−04
Neutrophils↑

C3
NCF1C
158
3.60E−04
Monocytes↑/

Neutrophils↑/

CD4↑

C4
TBC1D3B
159
1.83E−03
Neutrophils↓↑

(differing splice

variants)

C5
SLC14A1
160

Neutrophils↑

Table 5 lists further biomarkers for latent TB. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.

TABLE 5

Further Biomarkers for TB

Corrected

P value

Probe
Gene
SEQ
NHP Dataset

Number
Symbol
ID NO:
BH FDR

D1
CALCOCO2
161
1.77E−03
Monocytes↑/

Neutrophils ↑

D2
GTF2B
162
2.68E−09
Neutrophils↑

D3
HLA-B
163
4.02E−11
Neutrophils↑

D4
HLA-F
164
3.23E−09
Neutrophils↑

D5
MGST2
165
4.59E−08
Neutrophils↑

D6
SPAST
166
1.28E−02
CD8 positive

T cells (CD8)↓/

Neutrophils↑

D7
WAC
167
6.57E−10
All↑/

Neutrophils↑

Table 6 lists the various probes used to detect the various biomarkers of the invention

TABLE 6

Oligonucleotide probes

SEQ

SEQ

Probe
Gene

ID

ID

No.
Symbol
Probes 1a & 1b
NO:
Probe 2
NO:

AA1
LOC40075/
CAGGCCCAATGTGCCTCATTGAGAACACTAATGGGCG
1
CTTCTTCCCAGACTTTGTGTTGACACTGAGAGATTT
3

GBP1P1
ACTGATGGCGAATCCAGAAGCTCTGAAGATCCT

CTAGCATTACAGAAAGCGCTTTTGGACAAAACTGT

GAACAGCACCAAGTGGAACGTGTGAAAGCTGAGTCTG
2

CACAGGCTTCAGCAAAAATGTTGCAGCAAATGC

AA2
LGALS3BP
GCCTTTGGTCAAATATTCTTCTGATTACTTCCAAGCC
10
CACCATTGCCTACGAAAACAAAGCCCTGATGCTCTG
11

CCCTCTGACTACAGATACTACCCCTACCAGTCC

CGAAGGGCTCTTCGTGGCAGACGTCACCGATTTC

AB1
BST1
TGGGAAAATAGCCACCTCCTTGTTAACAGCTTTGCAG
12
TAGTTCTGGGGTGATCCACGTCATGCTGAATGGTTC
13

ACAACACCCGTCGTTTTATGCCCCTGAGCGATG

AGAGCCAACAGGAGCCTATCCCATCAAAGGTTTT

AB2
SNX10
AGTTCATGCCATCCAGGCATTTAAGAGCGATCCTCAT
14
GATAACTAGGATAACTTGTTGCTTTGTTACCCAGCC
15

CCCTTCAGCAATATGTATTTGAGTTCACACTA

TAATTGAAGAGTGGCAGAGGCTACTACAAAAAGC

AC1
ALPK1
TTCCAGTGGGAGTTCTTGGGTTTCATTGCCGGGAAAG
6
GTTCCTGTATGGGCTCGACGTCTCTGGAAAACTTCT
7

ATGAGGAAAGAGATCCTTGAGGCTCGCACCTTG

GCAGGTCGCCAAAGGTCTCCACAAGTTGCAGCCA

AC2
CREG1
CCTGGTATTCTTTTATAAGTAAAGTTTACCCAGGCAT
16
TGGTGCTTCTGAATAAATCTTGCCAAGATAGACAAA
17

GGACCAGCTTCAGCCAGGGACAAAATCCCCTC

CAATGATGAAACTCAGATGGAGCTTCCTACTCAC

AD1
BAZ1A
CACCCAGTAATGTGGACCAAGTTAGCACACCACCGGC
18
GAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGAT
19

TGCGAAAAAGTCACGAATCTGACTTTGTCCTTC

CTGTAAATATTGCTTCAAAACTTTCTCTCCAAGA

AD2
LYN
AAAAGTAACCATCACTGGTTGCACTTATGATTTCATG
20
TCTTCTATGAACACTGCTCAGACCTGCTAGACATGC
21

TGCGGGGATCATCTGCCGTGCCTGGATCCTGAA

CATAGGAGTGGCGTGCACATCTCTCTCTCTTCCA

AD3
TAPBP
TCCACCGCCCCTCATGCCGCCCTTTGGAGGAAAGTGA
22
ACCTGCAAGGATTCAAAGAAGAAAGCAGAGTGAGGG
23

AAGTGAAAGGAGGAAGAGGAGGCTTCATGGCTG

CACTCACTGCCATCCTGTGGAAGCCACCATCATC

AE1
SERPINB1
ACAGCAGGCATCGCAACTTTCTGCATGTTGATGCCCG
24
ACATCCGATGCGTAGATTCTTGACCATGTAGTAATC
25

AAGAAAATTTCACTGCCGACCATCCATTCCTTT

TATAAAATTGCTATATCCTCCTGATAGCCATGGG

AE2
PSMB9
TGCCGGTGTGGACCATCGAGTCATCTTGGGCAATGAA
26
AATAAACTCTCTAGGGCCAAAACCTGGTATGGTCAT
27

CTGCCAAAATTCTATGATGAGTGAACCTTCCCC

TGGGAAATGAGTGCTCAGGGAGATGGAGCTTAGG

AE3
WSB1
AGATGGTAAATACTGACTTACGAAAGTTGAATTGGGT
28
CGTATCGTATTTAGAAGATTCTGCCTTCCCTAGTAG
29

GAGGCGGGCAAATCACCTGAGGTCAGCAGTTT

TAGGGACTGACAGAATACACTTAACACAAACCTC

AF1
MW
CTCAAGCTCCTGGAGACAACCACGTGGTGCCTGTACT
30
CTGGCTGAGGTGGAGGTGAAGAAGTTCAAGCAGATG
31

GCGCTAACTCCTGATTAATACAATGGAAGTTTC

ACAGAGGCCATAGGCCCCAGCACCATCAGGGACC

AF2
APBB1IP
TGTGGCAAAGGCTGGACTTGCCTCTCGGTGGACAAAC
32
ATGAATGATAACAGCACAAAGTCACTGATGGTGGAT
33

TTGGGGACAGTCAATGCAGCTGCACCAGCTCAG

GAGCGGCAGCTGGCCCGAGATGTTCTGGACAACC

AF3
FYB
AAATGGTTGGGCAGAACAGCAAGGGGTTCATATGGCT
34
ATGGCTGCATCTATGACAATGACTAGCACTCAACTT
35

ATATTAAAACAACTGCTGTAGAGATTGACTATG

TGGTCATTCTGCTGTGTTCATTAGGTGCCAATGT

AG1
MB21D1/
CGTATGTACCCAGAACCCTCAAGACAGTCAGTGGGAC
36
CCAAGAAGGCCTGCGCATTCAAAACTGGCTTTCAGC
37

C6orf150
CGCAAAGACCTGGGCCTCTGCTTTGATAACTGC

AAAAGTTAGGAAGCAACTACGACTAAAGCCATTT

AG2
CPVL
ATATTCTGATCCCGAATCAATTATAGGGGGCTATGCA
38
TGTCACAAGTAACATGACCTTGCGTGACAGAGACTT
39

GAATTCCTGTACCAAATTGGCTTGTTGGATGAG

CCCCTGGACCACAACGCTCTCCATGCTTTACATT

AG3
TICAM2
TATATACTAATAAAACATGAACTGCCCACTCTTCATG
40
TTGTATATCCCCTACCAGTACCGGGATCTGCACACA
41

CCTGCCAAACTTGGGGCAATTGATGCTAAATGG

TCTTTTTGCAGTTACCTCTTCATAGCCATGAACC

AH1
CD52
GTTGATGCCAGACATCACCAGGTTGTAGAAGTTGACA
42
CAATGCCATAATCCACCTCTTCTGCTTCAGTTGAGG
43

GGCAGTGCCATGGGGGCAACAGCCAAAATAGGG

TGACACGTCTCAGCCTTAGCCCTGTGCCCCCTGA

AI1
HERC2
GATGTCGACTCCTTTGCTTCGGACTCTACACAAGATT
8
GGTTGATAAGGATTTTATTCCTGGACTCATGTACAT
9

ATTTAACAGGACACTAAGATGGGGAAACGTCCT

CCGAGACAATGAAGCCACCTCAGAGGAGTTTGAA

CTGTGCAGTATGCGATGTTTTGTGGATGGCAAAGACT
168
TGAGGAAGTGACACTTATACGCAAAGCTGATTTGGA
169

TATTCCTGAGGGAATCGATATAGGGGAACCTCT

GAACCATAATAAAGATGGAGGCTTCTGGACTGTG

CTGCTCAATGACTTTTGAGCAGCTGGATCTCCTGCTT
170
ACCAAAAAACACAATACCAGGCATACATTTGGCAGA

CGGCAGGTGAGTGAGGGGATGGATGGTTCCGCG

ATAAATGAACCAGGTCAGTCTGCGGTATTTTGTG
171

AJ1
KLRAP1
GCATTCAAACGTACAATTGTATCTGTGGGAAGAGAAT
44
ACTCTGTTTCTCAATGTTGGACCTAAGATATTGAAG
45

(KLRA1)
AGACTCTATTTTCTCTGATTCGGTGTGCGCCAA

ACAGGCTGGAGCCCAGAGCCTTCATTCAATCTCA

AK1
PF4V1
AGGAGATGCTGTTCTTGGCGTTGCTGCTCCTGCCAGT
4
AGCTACTAGCTGCCTAAGTGTGCACTTTCAATCTAA
5

TGTGGTCGCCTTCGCCAGAGCTGAAGCTGAAGA

CTGTGAAAGAATCTTCTGATGTTTGTATTATCCT

AL1
DEFB128
TGCTTCAATAAAGTAACAGGCTATTGCAGGAAGAAAT
46
TGTGTCATTTAAGAAGCCACATCAACATTCTGGTGA
47

GCAAGGTAGGAGAAAGATATGAAATAGGATGTC

GAAGCTGAGTGTGCTGCAGGATTACATCATCTTA

AM1
IL8
ATTTTAATTGAACTAACAATCCTAGTTTGATACTCCC
48
AAAGAACTGAGAGTGATTGAGAGTGGACCACACTGC
49

AGTCTTGTCATTGCCAGCTGTGTTGGTAGTGCT

GCCAACACAGAAATTATTGTAAAGCTTTCTGATG

B1
AIM2
TGTCCCGCTGAACATTATCAGAAAAGCTGGTGAAACC
58
TAGCAAGATATTATCGGCACAGTGGTTTCTTAGAGG
59

CCGAAGATCAACACGCTTCAAACTCAGCCCCTT

TAAATAGCGCCTCACGTGTGTTAGATGCTGAATC

B2
CD274
AGACCACCACCACCAATTCCAAGAGAGAGGAGAAGCT
60
TAACCCATTAATACTCTGGTTGACCTAATCTTATTC
61

TTTCAATGTGACCAGCACACTGAGAATCAACAC

TCAGACCTCAAGTGTCTGTGCAGTATCTGTTCCA

B3
CD96
TGCATGGTCGGTGGAAAACAGCAGCACGGATTCTTGG
56
GGAGGTATTCACACTCAGGGTCATGCACTTGCACAA
57

GTCCTTCTTTCTAAGGGTATAAAGGAGGATAAT

TGTTGAGAATGAGTACCACTCTCACCATTGGTAT

B4
CDH23
ATCCCACTTTTGCCAGACGCTCATTCAGCATCTGACC
62
TGCTGAAGGTGGTCCTGGAGGATTACCTGCGGCTCA
63

TCTACCTTCATAAGATCTGTTATTTTTATAAGA

AAAAGCTCTTTGCACAGCGGATGGTGCAAAAAGC

B5
IRF1
ATCCCAGGGCTGGCTCTGCACTAAGAGAAAATTGCAC
52
AGCCCTCAACAGGCCCAGGGAGGGAAGTGTGAGCGC
53

TAAATGAATCTCGTTCCCAAAGAACTACCCCCT

CTTGGTATGACTTAAAATTGGAAATGTCATCTAA

B6
GBP1
AGCTGGTACCACTCAGGAGAAGTTTATTCTTCCAGAT
50
TCTCCAGAGGAAGGTGGAAGAAACCATGGGCAGGAG
51

GACCAGCAGTAGACAAATGGATACTGAGCAGAG

TAGGAATTGAGTGATAAACAATTGGGCTAATGAA

B7
IFIT3
GGGACTGAATCCTCTGAATGCATACTCCGATCTCGCT
64
GAGACAGAGGAGGAAAACAGAGCATCAGAAGCCTGC
65

GAGTTCCTGGAGACGGAATGTTATCAGACACCA

AGTGGTGGTTGTGACGGGTAGGACGATAGGAAGA

B8
IFITM3
AGGCCTATGCCTCCACCGCCAAGTGCCTGAACATCTG
66
TGATCTTCCAGGCCTATGGATAGATCAGGAGGCATC
67

GGCCCTGATTCTGGGCATCCTCATGACCATTCT

ACTGAGGCCAGGAGCTCTGCCCATGACCTGTATC

B9
GK
GACCAGCAACAAAATTCTTATGCAGCTACAAGCAGAC
68
AACTCATGGATTCCCAAGATGTGAGCTTTTTACATA
69

ATTCTGTATATACCAGTAGTGAAGCCCTCAATG

ATGAAAGAACCCAGCAATTCTGTCTCTTAATGCA

B10
NELL2
TTGATTGTTGGCCCCTGCCTTGCCCAGATGTGGAGTG
70
TACCGTGACATCCTGAACCCTGGATAGAAAGCCTGA
71

TGAATTCAGCATTCTCCCAGAGAATGAGTGCTG

GCCCATTGGATCTGTGAAAGCCTCTAGCTTCACT

B11
S100A11
CAGCCTTTCTGTCATCATCTCCACAGCCCACCCATCC
72
TTGGTGGCCTAGCTATGGCTTGCCATGACTCCTTCC
73

CCTGAGCACACTAACCACCTCATGCAGGCCCCA

TCAAGGCTGTCCCTTCCCAGAAGCGGACCTGAGG

B12
SAMD9L
ATAACAGCAAGAGGGAACCTGGCAAGGAAGCTATTCC
74
ACTGGAAATCCTCTGTGAAAATGAGTGTACAGAGAC
75

TATAATCCAGGAAAGAGATGAGGAAGGCTTGGA

AGACATCGAGAAAGACAAATCTAAATTCCTGGAG

B13
STAT1
CCTGACATCATTCGCAATTACAAAGTCATGGCTGCTG
76
GATACACCCAAAGTATCAGGACGAGAATGAGGGTCC
77

AGAATATTCCTGAGAATCCCCTGAAGTATCTGT

TTTGGGAAAGGAGAAGTTAAGCAACATCTAGCAA

B14
TLR6
GACTGTGACCTCCCTCTGCATCTACTTGGATCTGCCC
78
ATTCCCAACAAGTACCACAAGCTGAAGGCTCTCATG
79

TGGTATCTCAGGATGGTGTGCCAGTGGACCCAG

ACGCAGCGGACTTATTTGCAGTGGCCCAAGGAGA

B15
WARS
CCAAGGAGTCCTGGCCTCCGCAGATGCTTCATTTTGA
80
CCTGGCCTCTGTAAGCCTGTGTATGTTATCAATACT
81

CCCTTGGCTGCAGTGGAAGTCAGCACAGAGCAG

GTTTCTTCCTGTGAGTTCCATTATTTCTATCTCT

B16
MMP9
TACCACCTCGAACTTTGACAGCGACAAGAAGTGGGGC
54
TTCTACTGGCGCGTGAGTTCCCGGAGTGAGTTGAAC
55

TTCTGCCCGGACCAAGGATACAGTTTGTTCCTC

CAGGTGGACCAAGTGGGCTACGTGACCTATGACA

B17
DOCK9
GGAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCC
82
ACATCTTCAACGCCATCAGTGGGACTCCAACAAGCA
83

ATACACTGCTTCCCTTATGTGAAGAAGCGCATC

CAATGGTTCACGGGATGACCAGCTCGTCTTCGGT

B18
SIRPB2
TTCTGCAAAACGTCTCCAGTGAGGATGCAGGCACCTA
84
TATTAGAATGCAGGTTCAGCAACTATAACAAAGCTC
85

TTACTGTGTAAAGTTTCAGAGGAAACCCAACAG

TTAAATAACAGTGGCTTAAACCAGTGGAAATCAA

B19
ANKRD22
AGACTTTTGGTCTGTGGGCCATTTAACCTGGATGCCA
86
TCAAGTTCACCATGGCCGTAATCCTTCTAAGGGAAA
87

CCATTTTATGGGGATAATGATGCTTACCATGGT

CACTAAAGTTGTTGTAGTCTCCACTTCAGTCAGA

C1
ABCF2
CATCATGAACTCGTTTGTAAACGACGTGTTTGAGCAG
88
CAGCCATGACTTCAGACTCATTCAGCAGGTTGCACA
89

CTGGCGTGTGAGGCTGCCCGGCTGGCCCAGTAC

GGAAATTTGGGTCTGTGAGAAGCAGACAATCACC

C2
FNBP1L
TACTGCCTTCATAAGATCAAGTCACCACTGTTACACA
90
GGAGGAAATGTGATCTGGCTGTGTTTGTCTTCTGTA
91

GCTGACATATAGTGTATTACCTTTGCAGCTAGT

CAAAGCCTGAAGTGCTTATGGTTTTTTGGCTAAC

C3
NCF1C
GGTGGTTCTGTCAGATGAAAGCAAAGCGAGGCTGGAT
92
GACGTCACAGGCTACTTTCCGTCCATGTACCTGCAA
93

CCCAGCATCCTTCCTCGAGCCCCTGGACAGTCC

AAGTCGGGGCAAGACGTGTCCCAGGCCCAACGCC

C4
TBC1D3B
ACTGATTCCGACCAGGGCACCCCCTTCAGAGCTAGGG
94
AGGCTTCTAGAAGCATCTGGGCCAGGGCTCATGGCT
95

ACGAACAGCAGTATGCTCCCACCTCAGGGCCTT

GGATAATTTCCCTAGGCTTAACAACCCAAGCAAG

C5
SLC14A1
TGACATTCTCTCATGGGACAATGTTGGGGTTTTTCAG
96
TCACAATATTCTCTCTCAGAAATCAATGGCATTTGA
97

ACTGACAGGACTGCAAGAGGGAGAAAGGAATTT

ACCACCAAAAAGAAATAAAGGGCTGAGTGCGGTG

D1
CALCOCO2
CATTTTCTATCCCCTCAGGGACTGAACAAATGGAAAT
98
CTGGGCTTTCCCTAATGTGGTTGGGAGTTATGCCCT
99

AACTCCCAGGCAGTATCAGGTGGTCACTACAGA

AGACTAACTGTATTGTCCTAGTCACAGCTCCTTG

D2
GTF2B
CGCTAGAAACCAGTGTGGATTTGATTACAACTGGGGA
100
TCTCTGTGGCAGCGGCAGCTATTTACATGGCCTCAC
101

CTTCATGTCCAGGTTCTGTTCCAACCTTTGTCT

AGGCATCAGCTGAAAAGAGGACCCAAAAAGAAAT

D3
HLA-B
AGCTACTCTCAGGCTGCGTGCAGCGACAGTGCCCAGG
102
GCATAATGTGAGGAGGTGGAGAGACAGCCCACCCTT
103

GCTCTGATGTGTCTCTCACAGCTTGAAAAGCCT

GTGTCCACTGTGACCCCTGTTCCCATGCTGACCT

D4
HLA-F
ATCACCCAGCGCTTCTATGAGGCAGAGGAATATGCAG
104
GAGATCACGCTGACCTGGCAGCGGGATGGGGAGGAA
105

AGGAGTTCAGGACCTACCTGGAGGGCGAGTGCC

CAGACCCAGGACACAGAGCTTGTGGAGACCAGGC

D5
MGST2
TCACTGGGTCACCAGAGTTTGAGAGAGTATTTCGGGC
106
ACGGATCACCGGTTTCCGACTGAGTCTGGGGATTTT
107

ACAACAAAACTGTGTGGAGTTTTATCCTATATT

GGCCTTGTTGACCCTCCTAGGTGCCCTGGGAATT

D6
SPAST
CACAAACGGACGTCTATAATGACAGTACTAACTTGGC
108
TTAGGAATGTGGACAGCAACCTTGCTAACCTTATAA
109

ATGCCGCAATGGACATCTCCAGTCAGAAAGTGG

TGAATGAAATTGTGGACAATGGAACAGCTGTTAA

D7
WAC
GTTTTGTAGAGTGAAGCCATGGGAAGCCATGTGTAAC
110
GGTGCTGACTGCTGTTCTTAGCCATCACAAAACGCT
111

AGAGCTTAGACATCCAAAACTAATCAATGCTGA

AAATTTGTGTAATTGGAGCTTCCTGCTGTTATCT

Example 2
TB-Specific Biomarker Identification in a Cohort of UK Controls, TB Test Negative Controls, TB Test Positive Suspected Latent TB, Early Active and Established Active TB Volunteers

Whole blood samples were obtained from the following cohorts: (1) Caucasian controls-professional individuals recruited locally to the project team who constitute a low risk group, coming from non/low-TB endemic regions, such that their risk of having been exposed to TB is extremely low (CC); (2) Controls of Asian descent recruited from Hindu temples in London who tested negative for TB in skin and/or IFNγ tests and originate from high-incidence areas of TB (LC or NMRL CNTRL); (3) individuals of Asian descent recruited from Hindu temples in London and test positive for TB in Mantoux skin and/or IFNγ tests and diagnosed with latent TB (LTB or NMRL LTNT); (4) individuals with early stage active TB recruited at St. Thomas's and Royal Free hospitals in London (EATB); and (5) individuals of Asian descent recruited at the Jawaharlal Institute of Postgraduate Medical Education and Research (JIPMER), India, diagnosed with active TB (ATB).

Whole blood was obtained at a single time point in PaxGene or Tempus RNA stabilization tubes. Control and Latent TB Blood were collected using PaxGene tubes. Early Active and Active TB blood were collected using Tempus tubes. Blood collected in PaxGene tubes was mixed by inversion and incubated at Room Temperature (˜25° C.) for 2 hours before storing at −80° C. Blood collected in Tempus tubes was vortexed at full speed for 10 seconds and then stored at −20/−80° C. RNA was extracted from these respective tubes according to the manufacturer's instructions. Concentration and purity (A260/A280 ratio>1.8) were assessed by spectrophotometry using a NanoDrop ND-1000 spectrophotometer. The purity of the extracted RNA was analysed using Agilent Bioanalyzer according to the manufacturer's instructions. cDNA was synthesized from the extracted RNA using the Roche Transcriptor First Strand cDNA Synthesis Kit according to the manufacturer's instructions.

Quantitative real-time PCR analyses were performed using the Roche Lightcycler (LC) 480 in 384 well plate format. The LC 480 is a plate-based, highly adaptable and versatile real-time PCR system used for gene expression analysis and has been designed for enhanced throughput and efficiency without compromising sensitivity and specificity. Roche provide an online ‘target to assay’ design and build system, the ‘Realtime Ready configurator’, which can be used to generate quantitative real-time PCR (qPCR) assays in a number of plate formats. Assays for the biomarker targets of interest were designed using this system and the assay plates configured, tested and quality assured by Roche. It uses a dual-colour assay, 165-FAM labelled Universal Probe Library (UPL) probe system and Roche provide all platform-specific dedicated reagents. Assay plates are dispatched in desiccated format, each well containing a target-specific primer pair and assay-specific human LC 480 Universal Probe Library (UPL) probe, coated in the bottom of each of the 384 wells.

All assays were performed using default protocols according to the manufacturer's instructions. Four human reference genes were used throughout, for quantifying the expression of genes of interest. In short, synthesized cDNA was mixed with Roche LC480 probes master mix at a constant dilution and pipetted into the 384 well assay plates using a Qiagility™ robotic pipetter. This reduced manual handling minimised pipetting errors and ensured reagent distribution uniformity throughout the plate wells. Data outputs were quantified using the Lightcycler 480 software and then expressed as a numeric figure of the ratio of the fold-change difference of the target vs the mean of all four reference genes. All raw data were then exported and processed further using the microarray analysis package Genespring GX 12.5 ((GX 12.5).

Data output files from BlueFuse were imported into GX 12.5 for differential gene expression and statistical analysis. Averaged data were imported without further normalisation and then baseline transformed to the median of all samples. Data were assessed for quality, then filtered on gene expression where entities in at least 100 percent of samples and in any one out of one conditions had normalised expression values within the cut-off 0 to 329.0 where at least 1 feature out of all samples had values within range. Statistically significant features were identified using one-way ANOVA analysis across all entities and time points at a cut-off p<0.05. To identify differentially expressed entities between the groups T-tests (unpaired, unequal variance) were performed on the samples arranged by group at a p<0.05 using the cut-off fold-change setting >1.2. The outputs were visualised using the boxplot graphical output facility.

Table 7 lists the genes newly identified as biomarkers for TB in Example 1 (see Table 2 above). Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.

Table 8 provides the results of the qPCR analysis of the human cohort samples using the further biomarkers for TB listed in Table 3 above. Also given are the results oft-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.

Table 9 provides the results of the qPCR analysis of the human cohort samples using the genes identified as biomarkers for latent TB in Example 1 (see Table 4 above). Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.

Table 10 provides the results of the qPCR analysis of the human cohort samples using the further biomarkers for latent TB listed in Table 5 above. Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.

In Tables 7 to 10 and FIGS. 1 to 12, the terms CC, LC (or NMRL CNTRL), LTB (or NMRL LTNT), EATB and ATB are as defined above. The term ND stands for not detected.

TABLE 7

Biomarkers for TB - qPCR Validation Data on New Cohort of TB infected and Uninfected Donors

Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P

value
value
value
value
value
value
value

Probe Number
Gene Symbol
CC vs LC
CC vs LTB
CC vs EATB
CC vs ATB
LC VS LTB
LC VS EATB
LC VS ATB

AA1
LOC400759/
ND
ND
3.76E−06
1.47E−02
ND
1.82E−05
2.51E−02

GBP1P1

(RP4-644F6.3)

AA2
LGALS3BP
ND
ND
ND
ND
ND
ND
ND

AB1
BST1
ND
ND
ND
ND
ND
ND
ND

AB2
SNX10
ND
ND
4.99E−13
2.44E−06
ND
8.17E−10
6.11E−05

AC1
ALPK1
ND
ND
ND
ND
ND
ND
ND

AC2
CREG1
ND
ND
ND
4.05E−03
ND
ND
1.07E−02

AD2
LYN
ND
3.64E−12
2.15E−06
3.66E−05
ND
ND
7.57E−03

AD3
TAPBP
ND
ND
3.57E−04
4.32E−02
ND
7.37E−04
ND

AE1
SERPINB1
ND
ND
ND
9.12E−04
ND
ND
4.03E−03

AE2
PSMB9
ND
ND
2.94E−07
1.02E−06
ND
ND
9.44E−06

AE3
WSB1
ND
ND
ND
ND
ND
ND
ND

AF1
MVP
ND
ND
ND
ND
ND
ND
ND

AF2
APBB1IP
ND
ND
ND
ND
ND
ND
ND

AF3
FYB
ND
ND
7.26E−05
7.02E−03
ND
ND
1.45E−02

AG1
MB21D1/
ND
ND
ND
ND
ND
ND
ND

C6orf150

AG2
CPVL
ND
ND
6.64E−10
1.02E−04
ND
1.21E−08
1.27E−04

AH1
CD52
3.84E−05
1.84E−05
ND
ND
ND
9.08E−04
ND

AJ2
KLRAP1
ND
ND
ND
ND
ND
ND
ND

(KLRA1)

AK1
PF4V1
4.67E−07
ND
0.016029
ND
ND
1.74E−05
1.30E−02

AL1
DEFB128
ND
ND
ND
ND
ND
ND
ND

AM1
IL8
ND
ND
ND
ND
ND
ND
ND

Corrected P
Corrected P
Corrected P

value
value
value
Corrected p
Uncorrected p

Probe Number
Gene Symbol
LTB VS EATB
LTB VS ATB
EATB VS ATB
value ANOVA
value ANOVA

AA1
LOC400759/
6.51E−06
0.018
ND
5.74E−12
1.99E−12

GBP1P1

(RP4-644F6.3)

AA2
LGALS3BP
ND
ND
ND
5.45E−17
1.21E−17

AB1
BST1
ND
ND
ND
1.67E−10
6.71E−11

AB2
SNX10
2.09E−11
1.34E−05
ND
3.86E−32
1.61E−33

AC1
ALPK1
ND
ND
ND
2.15E−09
9.87E−10

AC2
CREG1
ND
0.006925
ND
5.04E−20
6.30E−21

AD2
LYN
ND
ND
ND
2.78E−10
1.16E−10

AD3
TAPBP
9.60E−04
ND
ND
6.55E−06
4.00E−06

AE1
SERPINB1
ND
0.002271
ND
6.22E−06
3.71E−06

AE2
PSMB9
ND
3.95E−05
ND
1.47E−17
3.06E−18

AE3
WSB1
ND
ND
ND
1.58E−08
7.92E−09

AF1
MVP
ND
ND
ND
6.74E−09
3.18E−09

AF2
APBB1IP
ND
ND
ND
0.027901
0.024026

AF3
FYB
1.13E−04
0.006217
ND
1.48E−09
6.38E−10

AG1
MB21D1/
ND
ND
ND
4.59E−12
1.53E−12

C6orf150

AG2
CPVL
4.75E−12
3.42E−05
ND
1.06E−24
7.35E−26

AH1
CD52
1.03E−03
ND
ND
1.83E−04
1.40E−04

AJ2
KLRAP1
ND
ND
ND
3.51E−06
2.05E−06

(KLRA1)

AK1
PF4V1
4.66E−04
ND
ND
1.49E−07
8.06E−08

AL1
DEFB128
ND
ND
ND
0.028874
0.025666

AM1
IL8
ND
ND
ND
2.84E−07
1.58E−07

TABLE 8

Biomarkers for TB - qPCR Validation Data on New Cohort of TB infected and Uninfected Donors

Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P

value
value
value
value
value
value
value

Probe Number
Gene Symbol
CC vs LC
CC vs LTB
CC vs EATB
CC vs ATB
LC VS LTB
LC VS EATB
LC VS ATB

B2
CD274
ND
ND
ND
ND
ND
ND
ND

B3
CD96
ND
ND
ND
ND
ND
ND
ND

B4
CDH23
ND
ND
ND
ND
ND
ND
ND

B5
IRF1
1.03E−09
4.40E−08
5.78E−10
5.08E−07
ND
5.34E−07
5.31E−05

B6
GBP1
ND
ND
9.63E−04
4.63E−04
ND
8.76E−04
4.47E−04

B7
IFIT3
ND
ND
2.09E−06
1.16E−03
ND
3.35E−05
4.08E−03

B8
IFITM3
2.33E−08
5.38E−07
9.03E−05
6.71E−04
ND
9.70E−03
3.53E−03

B9
GK
ND
ND
ND
ND
ND
ND
ND

B10
NELL2
ND
ND
ND
ND
ND
ND
ND

B11
S100A11
9.55E−10
5.65E−11
5.64E−09
9.54E−06
ND
4.83E−05
1.15E−03

B12
SAMD9L
ND
ND
ND
ND
ND
ND
2.94E−02

B14
TLR6
ND
ND
ND
ND
ND
ND
ND

B16
MMP9
ND
ND
0.001241
ND
ND
9.03E−03
ND

B17
DOCK9
ND
ND
ND
ND
ND
ND
ND

B18
SIRPB2
ND
ND
ND
ND
ND
ND
ND

B19
ANKRD22
ND
ND
ND
ND
ND
ND
ND

Corrected P
Corrected P
Corrected P

value
value
value
Corrected p
Uncorrected p

Probe Number
Gene Symbol
LTB VS EATB
LTB VS ATB
EATB VS ATB
value ANOVA
value ANOVA

B2
CD274
ND
ND
ND
2.58E−18
4.31E−19

B3
CD96
ND
ND
ND
8.66E−05
6.38E−05

B4
CDH23
ND
ND
ND
8.31E−06
5.43E−06

B5
IRF1
6.08E−08
1.13E−05
ND
4.08E−23
3.40E−24

B6
GBP1
1.48E−02
6.65E−04
0.001967
4.51E−23
4.38E−24

B7
IFIT3
2.63E−05
0.003847
ND
1.98E−12
5.77E−13

B8
IFITM3
3.62E−02
0.006008
ND
4.85E−11
1.89E−11

B9
GK
ND
ND
ND
0.002047
0.001706

B10
NELL2
ND
ND
ND
6.81E−21
7.57E−22

B11
S100A11
5.12E−06
3.35E−04
ND
2.73E−18
4.93E−19

B12
SAMD9L
ND
ND
ND
7.26E−07
4.13E−07

B14
TLR6
ND
ND
ND
4.31E−04
3.35E−04

B16
MMP9
3.15E−03
ND
ND
7.83E−06
4.89E−06

B17
DOCK9
ND
ND
ND
9.70E−19
1.48E−19

B18
SIRPB2
ND
ND
ND
0.001946
0.001595

B19
ANKRD22
ND
ND
ND
1.54E−09
6.86E−10

TABLE 9

Biomarkers for TB - qPCR Validation Data on New Cohort of TB infected and Uninfected Donors

Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P

value
value
value
value
value
value
value

Probe Number
Gene Symbol
CC vs LC
CC vs LTB
CC vs EATB
CC vs ATB
LC VS LTB
LC VS EATB
LC VS ATB

C2
FNBP1L
ND
ND
ND
ND
ND
ND
ND

C3
NCF1C
0.006398
ND
1.71E−06
5.51E−04
ND
2.21E−03
3.89E−03

Ifit3
TBC1D3B
ND
ND
ND
ND
ND
ND
ND

C4

C5
SLC14A1
ND
ND
ND
ND
ND
ND
ND

Corrected P
Corrected P
Corrected P

value
value
value
Corrected p
Uncorrected p

Probe Number
Gene Symbol
LTB VS EATB
LTB VS ATB
EATB VS ATB
value ANOVA
value ANOVA

C2
FNBP1L
ND
ND
ND
4.63E−05
3.35E−05

C3
NCF1C
2.25E−04
0.002172
ND
2.47E−12
7.55E−13

Ifit3
TBC1D3B
ND
ND
ND
0.044403
0.040086

C4

C5
SLC14A1
ND
ND
ND
1.86E−05
1.27E−05

TABLE 10

Biomarkers for TB - qPCR Validation Data on New Cohort of TB infected and Uninfected Donors

Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P
Corrected P

value
value
value
value
value
value
value

Probe Number
Gene Symbol
CC vs LC
CC vs LTB
CC vs EATB
CC vs ATB
LC VS LTB
LC VS EATB
LC VS ATB

D1
CALCOCO2
ND
ND
ND
ND
ND
ND
ND

D2
GTF2B
ND
ND
ND
ND
ND
ND
ND

D3
HLA-B
ND
5.27E−04
5.27E−04
ND
4.87E−03
4.35E−02
ND

D4
HLA-F
ND
ND
ND
ND
ND
ND
ND

D5
MGST2
ND
ND
ND
ND
ND
ND
ND

D6
SPAST
ND
ND
ND
ND
ND
ND
ND

D7
WAC
ND
ND
ND
ND
ND
ND
ND

Corrected P
Corrected P
Corrected P

value
value
value
Corrected p
Uncorrected p

Probe Number
Gene Symbol
LTB VS EATB
LTB VS ATB
EATB VS ATB
value ANOVA
value ANOVA

D1
CALCOCO2
ND
ND
ND
9.07E−15
2.14E−15

D2
GTF2B
ND
ND
ND
8.31E−06
5.39E−06

D3
HLA-B
9.79E−04
ND
ND
9.05E−04
7.17E−04

D4
HLA-F
ND
ND
ND
3.35E−12
1.07E−12

D5
MGST2
ND
ND
ND
8.92E−06
5.95E−06

D6
SPAST
ND
ND
ND
5.88E−08
3.11E−08

D7
WAC
ND
ND
ND
0.00189
0.001523

SEQUENCE INFORMATION

Set out below are the nucleotide sequences of the TB biomarkers disclosed herein. Exemplary target regions within the biomarker sequences are underlined, and exemplary probe sequences are double underlined.

A1 LOC400759 GBP1P1-guanylate binding protein 1, interferon-inducible

pseudogene 1 GBP1P1, mRNA-NR 003133.2

(SEQ ID NO: 112)

1
aaaatattag tccaaggatc cagtgagaga cacagaagtg ctagaagcca ctcctcatga

61
actaaggaga aaaagaacag acaagggaac accccagaca tggtatcaga gatccacatg

121

a

caggcccaa tgtgcctcat tgagaacact aatgggcgac tgatggcgaa tccagaagct

181

ctgaagatcc t

ttctgccat tacgcagcct gtggtggtgg tggcgactgt gggccgctag

241

cgcacaggaa aatcctacct gattaacaag ctggctcaga agaaaaaggg cttctctctg

301

ggctccacag tgcagtctca cactaaagga atctggatgt ggtgtatgcc ccatcccaag

361

aagccaggcc acatcctagt tctgctggac accgagggtc tgggagatgt agagaagggt

421

gacaaccaga atgactcctg gatcttcgcc ctggccgtcc tcctgaacag cacttccatg

481

tacaatagca taggaaccat taaccagcag gccatggacc aactgcagta tcctttgtga

541

ccca

gaacag caccaagtgg aacgtgtgaa agctgagtct gcacaggctt cagcaaaaat

601

gttgcagcaa atgc

aaagaa agaatgagca gatgatggaa cagaaggaga ggagttatca

661
ggaacacttg aaacaactga ctgagaagat ggagagcgac agggtccagt tgctggaaga

721
gcaagagagg accctcgctc ttaaacttca ggtgtctaat tgcatcacct tgaggtttct

781
gtttttctgt tttctctcca ttctccccga tcacaggctt actgtggcag agagaacatg

841
aagcccaggg gaagaaccct gcttgcttac ttgtactttt caattcctgt ctgtccagcc

901
tgaactggct actgccaagt ctggtcacta aactgcaaat attgcagttg tgtcacattc

961
agtgctttat ctatatatcc ttcatttcaa ggcaggtatt atctgctagc catcattaaa

1021
gtatctgtat ctcttgctta ataccatgtg aagcaagaac tatattctta ttacttagga

1081
gaagaaacaa agtttccaaa aataataaat aaatagagtc acacagctag taaatgtatc

1141
aaagctgtct tcatcactta gtggaatcca caatgattat ttttttctgt gacacctagt

1201
atgaaattaa acttaagaaa acctttgtga gcag

GBP1P1 Genomic Sequence AL691464

(SEQ ID NO: 113)

1
aaaatattag tccaaggatc cagtgagaga cacagaagtg ctagaagcca ctcctcatga

61
actaaggaga aaaagaacag gtaagaactt ttactacttc tcattaagca gctttctctt

121
tagctccaaa ggatctcagc tcagggatat ggaacccata aggttgtggc agggatggga

181
aggaatttat aaaggtcagt tcattttctt aaacatcgtc agaaccaaat taggctgcag

241
atgagcctga agtgggacgc aggtcagatg aaatcctggt gttatcaggg acagcatggc

301
cttaagtgac actacagtgt ttgtgttgaa ttaggcacca gtagacaggg gctaaactga

361
gagtttgaaa tgcactagga tagtctttct ctttgttgtc attctctgtg ttgactggag

421
acatgattac atttccatta tcagtgatgg agcttgctga atcctgctcc tatgcagcta

481
agaaatggaa agagctacaa atgggttctt ttcataagga aagaacagca aatgagaagc

541
agagtaatca gcccactgac atggttaaag acaaagaaaa aactgaaact cagcctgaaa

601
gatgaagatt catgaaaaca gtatactttt tattacactt gtgaacttcg tttcagaatg

661
gattactttc tttagaaata gtccagactc actaatttcc tagtgcccac tgatccctgt

721
cccttagaag tgaaattcaa ccccactgct tcactaaagt gcttccaatt ttgtcttctt

781
ttagtagaga ctggggccta aagtgtttcc tctttaattc tcctgtaatg catctctaga

841
gaaaatatct ttgcttattt taacctctct atgcaatcag actactttaa tcctgccttc

901
tggaaagtcc tcgcctgaat tccttgtcaa gacactagcc tcattcatca ccttctggtc

961
tgatagctct ttctcgctct ctctctctct ctctctctct ctctctctct ctctctctca

1021
cacacacaca cacacacaca caaacacaca cacaccttcc cagccccttc tcctcctcct

1081
ctccactcct actattcctc ctctatttct ctttctccta ccctaccaaa tggaaaacag

1141
aacaaaacag aaatcctaaa gctgtatcgc tggaaatata tttcatttga acaacttcct

1201
aggtagctct ctttatctgc ctcttgatct tttaatcctt atttcattat ctgggggaaa

1261
cattcagcat ttgcaatttt gcattcatac cctcactgag ttagaggcta ccttattgtg

1321
actctaacgc agcttaagtt tcagggccct tctcttggac atgacccttc agagtccttg

1381
gaaggttcct cagccatgtg tttcacatgt ttgtatagtt ttctaaaatt tgcaaaagta

1441
tgttacgttg tttccacttc cagaatggca ccgtgaaaag ctttattgat cctcattgcg

1501
gtgaaataag cataaccact tgaagatgga ggaaggaaca catttaaaaa tctttggaaa

1561
ttgttctaag ggtaaacatc aaactatgaa tactgattca ctgtattatt cactgtaaga

1621
attaataagt aaatcaatat tgaattcctt atacacagcc aggatgcaaa ataaatcctc

1681
taaatcctgt tctatcttcc attaattgac tagtgaaaat atttaaaaag aatccaaaaa

1741
gaaagttttc cactataaat taaatgaaat atttgagttg tgagcatagt tgagtgttga

1801
tgaggcagga aattaaagaa aaatacaatt aaaaataaaa agaaataagt tttcctgtat

1861
taggctgact tgtcccagag gcagcaacag gcacagacca gacccaggaa aagtcttgat

1921
aatactatct aaggtgctct ggagactctc ccagcactcc ctcaacatag gaagaagaaa

1981
aataaatttt cctttgtttt atggaaaagt ttgtagattc ctgttctctg taactagtga

2041
cttcaagtat tctgttttat ctaagaagta gagtgaaggt catgagaagc ctgaataggc

2101
ctgaactaca gctgcctggg caccatagtg aaggttataa tataaaccag tgcaaggctc

2161
tttagagcaa aacgtagata acagacatct gggttgcttg gcaatggtca tgtgtaatcc

2221
tgagtttgtc ctgcctctat atccctgctt tcatgccact gtaagcttgc ttcaagctag

2281
cccacctgct tttgtgaagt gtgtataaaa gtcaagtgct gtctttgcat gtgagtgtgc

2341
tggggcctga gtgtactcaa taaaaattct cctgttttaa cccgaggtct ctctctcgtc

2401
ctcctggatc ccacaacatt gataagtcac tgtcatgcta acattttttg taatgagcta

2461
ctttgacaat acttccataa ttttctcaaa tgatacagat tttgcctcat ctctctcgct

2521
gtcacacaca aaacttttgg ccataatggg gaatcttatg attctccctt ataattgcaa

2581
aacactagaa actctcattt atttcacctc ttctcttgca gacaagggaa caccccagac

2641
atggtatcag agatccacat gacaggccca atgtgcctca ttgagaacac taatgggcga

2701
ctgatggcga atccagaagc tctgaagatc ctttctgcca ttacgcagcc tgtggtggtg

2761
gtggcgactg tgggccgcta gcgcacagga aaatcctacc tgattaacaa gctggctcag

2821
aagaaaaagg gtgagtggca tgagcaaagc tctgccaagt cccttctgtc catctacaca

2881
gtcagcctcc atcatgagga tgtgaagaga gaaagagatg aggatgaata tggaaagcta

2941
actttccatt cacagtcggg ctccttatct tcacgctgct ctaagggata attttaaatt

3001
cattaattat tcccatgata cattagtttc cctttcaaaa gcacaaactg tgcctttcct

3061
aaaaggagta agactgtaat aaaaataatt aatgtacata ataataacta taataaacta

3121
caatttttat gccataacag cgagtttaca gtgatcttta aggttgaaaa aatgtttgtc

3181
tgtattggat attctttttt attgttgtaa aaaaataaca taaaatttaa cctcgtaacc

3241
atttttaagt gtacagctct gtggcattaa ttaaattcac attgttatac agctgtcacc

3301
tccatccata tccagaaatc tttcatcttg cctaacagaa actctgtact aactaaacaa

3361
aagctccaca taaacccatt gcctgccatc attctacatt ctatctctat gaattttact

3421
actacagaaa cctcatataa gtggaattat tcaatatttg tccttttatg actggcttat

3481
ttcaattcat atgtcttcaa ggttcatcag tgtgttagca tgtgtcaaaa ctcttccttt

3541
ttaaggctta gtaattgtac atgtatacta atttgtttat ctcttcatct gtcaatggac

3601
aattggattg cttccacctt ttgtctatta taaataatgc taataggaac atgggtgtct

3661
gaatatctgt tcaagtccct gctttcactt cttttgggaa tatacccaga agggtaattg

3721
ctggctcatg cagtcattcc atgttaactt tttttttttt taagaaatca ctatactatt

3781
ttccacagtg actgtactgt tttacattcc caacagaaat gcacaagggc tctaatttct

3841
ctacctcctc accaacattt tttattttca gtgttttttt tgatagtggc catatgaatg

3901
gatgttaagt agtatctcac tatggttttg attttcattt tcctaatgac tggtgatact

3961
gggcatcttt tcatgtgctt attggccacc tgtatatctt ctttgggaaa atgtctattc

4021
aagtcctttc ttcatttatt tttttttttt aaattcagaa aaattttctt ccacttgaaa

4081
atgttaaaac tcttcattaa acaactatta gatcaagtag aaaatacaaa tcaaaatagg

4141
tgaacatata aacatcaaaa tagtgatata tatatcagaa tctatggaat ataatgaaaa

4201
taattatcaa aaacaatttg tagtctttaa atcatatatc aatagaatta taaattaaaa

4261
tacataaatc taatgtgaaa cacagaaagc tagaaaaaag atcaagaaaa taaagcaaaa

4321
ggaaacataa cagacataaa gagataagag cagaacttca gttagttctg attgtaagct

4381
atgttataaa ttaatcaaca tgctagttct ttgaatagaa cataaacaaa atatgcaaac

4441
caacatccaa cctaatcaag aaaaacaggg agaaaacaaa gttacacaga ataatacacg

4501
aaacaagagg aaatcatact aaaactaagg acatttttaa acttttgagt taaatcattt

4561
tacacagctc taagcaggta gatataaaag cctacataaa atgggtaatc ccatagggaa

4621
attattaaca gtgataccaa tggagacaga aagtttaaga aaatgagaaa gttattaaat

4681
tactactccc acccaaaagt acaagacaca gatgacttta taatggaatt ttatgaattt

4741
ttcaaatatt agataatacc aatgctacat agactgttct taaaaagaga ttgccaagct

4801
atggccagtg ggacaaatat ggcctgccgg ttattatttt attttatttt attttatttt

4861
ttaaatcaat gtatatttta ttttatttta ttatttattt atttatttat ttttattata

4921
ctttaagttt tagggtacat gtgcacattg tgcaggttag ttacatatgt atacatgtgc

4981
catgctgctg cgctgcaccc actaacttgt catctagcat taggtatatc ccccaatgct

5041
atccctcccc cctcccccca ccccacaaca gtccccagag tgtgatattc cccttcctgt

5101
gtccatgtga tctcattgtt caattcccac ctatgagtga gaatatgcgg tgtttggttt

5161
tttgttcttg tgatagttta ctgagaatga tgatttccaa tttcatccat gtccctacaa

5221
aggacatgaa ctcatcattt tttatggctg catagtattc catggtgtat atgtgccaca

5281
ttttcttaat ccagtctatc attgttggac atttgggttg gttccaagtc tttgctattg

5341
tgaataatgc cacaataaac atacgtgtgc atgtgtcttt atagcagcat gatttatagt

5401
cctttggata tatacccagt agtgggatgg ctgggtcaaa tggtatttct agttctagat

5461
tcctgaggaa tcgccacact gacttccgca atggttgaac tagtttacag tcccaccaac

5521
agtgtaaaag tgttcctatt tctccacatc ctctccagca cctgttgttt cctgactttt

5581
taatgatcgc cattctaact ggtgtgagat gatatctcat tgtggttttg atttgcattt

5641
ctctgatggc cagtgatgat gagcattttt tcaagtgttt tttggctgca taaaggtctt

5701
cttttgagaa gtgtctgttc atgtcctttg cccacttttt gatggggttg tttgtttttt

5761
tcttgtaaat ttgttggagt tcattgtaga ttctggatat tagccctttt tcagatgaat

5821
aggttgcaaa aattttctcc cattttatag gttgcctgtt cactctgatg gtagtttctt

5881
ttgctgtgca gaagctcttc agttcaatta gatcccattt gtcaattttg gcttttgttg

5941
ccattgcttt tggtgtttta gacatgaaat ccttgcccat gcctatgtcc tgaatggtaa

6001
tgcctagatt ttcttctagg gtttttatgg ttttaggtct aacttttaag tctttaatcc

6061
accttgaatt aatttttgta taaggtgtaa ggaagggatc cagtttcagc tatctacata

6121
tggctagcca gttttcccag caccatttat taaataggga atcctctccc cattgcttgt

6181
ttttctcagg tttgtcaaag atcagatagt tgtagatatg cggcgttatt tctgagggct

6241
ctgttctgtt ccattgatct atatctctgt tttggtacca gtaccatgct gttttggtta

6301
ctgtagcctt gtagtatagt ttgaagttag gtagtgtgat gcctccagct ttgttctttt

6361
ggcttaggat tgacttggtg atgtgggctc ttttttggtt ccatatgaac tttaaagtag

6421
ttttttccaa ttctgtgaag aaagtcattg gtagcttgat ggggatggca ttgaatctgt

6481
aaattacctt gggcagtacg gccattttca cgatattgag tcttcctact catgagcatg

6541
gaatgttctt ccatttgttt gtatcctctt ttatttcctt gagcagtggt ttgtagttct

6601
ccttgaagag gtccttcaca tcccttgtaa gttgtattcc taggtatttt attctctttg

6661
aagcaattgt gaatgggagt tcactcatga tttggctctc tgtttgtctg ttgttggtgt

6721
acaagaatgc ttgtgatttt ggtacattga ttttgtatcc tgagactttg ctaaagttgc

6781
ttatcagctt aaggagattt tgggctgaga cgatggggtt ttctagatat acaatcatgt

6841
cgtctgcaaa cagggacaat ttgacttcct cttttcctaa ttgaataccc tttatttcct

6901
tctcctgcct aattgccctg gccagaactt ccaacactat gttgaatagg agtggtgaga

6961
gagggcatcc ctgtcttgtg ccagttttca aagggaatgc ttccagtttt tgcccattca

7021
gtatgatatt ggctgtgggt ttgtcataga tagctcttat tattttgaaa tatgtcccat

7081
caatacctaa tttattgaga gtttttagca tgaagggttg ttgaattttg tcaaaggctt

7141
tttctgcatc tattgagata atcatgtggt ttttgtcttt ggctctgttt atatgctgga

7201
ttacatttat tgatttgcat atattgaacc agccttgcat cccagggatg aagcccactt

7261
gatcatggtg gataagcttt ttgatgtgct gctggattcg gtttgccagt attttattga

7321
ggatttttgc atcaatgttc atcaaggata ttggtctaaa attctctttt ttggttgtgt

7381
ctctgcccgg ctttggtatc agaatgatgc tggcctcata aaatgagtta gggaggattc

7441
cctctttttc tattgattgg aatagtttca gaaggaatgg taccatttcc tccttgtacc

7501
tctggtagaa ttcggctgtg aatccatctg gtcctggact ctttttggtt ggtaagctat

7561
tgattattgc cacaatttca gagcctgtta ttggtctatt cagagattca acttcttcct

7621
ggtttagtct tgggagagtg tatgtgtcca ggaatttatc catttcttct agatgttcta

7681
gtttatttgc atagaggtgt ttgtagtata ctctgatggt agtttgtatt tctgtgggat

7741
cgctggtgat atccccttta tcatttttta ttgcgtctat ttgattcttc tctctttttt

7801
tctttattag tcttgctagc ggtctatcaa ttttgttgat cctttcaaaa aaccagctcc

7861
tggattcatt gattttttga agggtttttt gtgtctctat ttccttcagt tctgctctga

7921
ttttagttat ttcttgcctt ctgctagctt ttgaatgtgt ttgctcttgc ttttctagtt

7981
cttttaattg tgatgttagg gtgtcaattt tggatctttc ctgcttttct tgtgggcatt

8041
tagtgctata aatttccctt tacacactgc tttgaatgcg tcccagagat tctggtatgt

8101
tgtgtcgttg ttctcgttgg tttcaaagaa catctttatt tctgccttca tttcattatg

8161
tacccagtag tcattcaggt gcaggttgtt cagtttccat gtagttgagc cgttttgagt

8221
gagattctta atcctgagtc ctagtttgat tgcactgtgg tctgagaaat agtttgttat

8281
aatctctgtt cttttacatt tgctgaggag agctttactt ccaagtatgt ggtcaatttt

8341
ggaataggtg tggtgtggtg ctgaaaaata tgtatattct gttgatttgg ggtggagagt

8401
tctgtagatg tctattaggt ctgcttggtg cagagctgag ttcaattcct gggtatcctt

8461
gttgactttc tgtctcgttg atctgtctaa tgttgacagt ggggtgttaa agtctcccat

8521
tattaatgtg tgggagtcta agtctctttg taggtcactc aggacttgct ttatgaatct

8581
gggtgctcct gtattgggtg catatatatt taggatagtt agctcttctt gttgaattga

8641
tccctttacc attatgtaat ggccttcttt gtctcttttg atcttttttg ttttgacatc

8701
tgttttatca gagactagga ttgcaacccc tgcctttttt tgttttccat ttgcttggta

8761
gatcttcctc catcctttta ttttgagcct atgtgtgtct ctgcacgtga gatgggtttc

8821
ctgggtacag cacactgatg ggtcttgact ctttatccaa tttgccagtc tgtgtctttt

8881
aattggagca tttagtccat ttacatttaa agttaatatt gttatgtgtg aatttgatcc

8941
tgttgttatg atgttagctg gttattttgc tcattagttg atgcaatttc ttcctagact

9001
tgatgatcat gcaaaatttt ggcatgattt tgcagcggct ggtaccggtt gttcctttcc

9061
atgtttagcg tttccttcag gagctctttt agggcaggcc tggtggtgac aaaaatctct

9121
cagcatttga ttgtctgtaa agtattttat ttctccttca cttatgaagc ttagtttggc

9181
tggatatgaa attctgggtt gaaaattctt ttctttaaga atgttgaata ttggccccca

9241
ctctcttctg gcttgtaggg tttctgccga gagatctgct gttagtctga tgggcttccc

9301
tttgtgggta acccgacctt tctctctggc tgcccttaac attttttcct tcatttcaac

9361
tttggtgaat ctgacaatta tgtgtcttgg agttgctctt cttgaggagt atctttgtgg

9421
cgttctctgt atttcctgaa tctgaacgtt ggcctgcctt gctagattgg ggaagttctc

9481
ctggataata tcctgcagag tgttttccaa cttggttcca ttctccccat cactttcagg

9541
tacaccaatc agacgtagat ttggtctttt cacatagtcc catatttctt ggaggctttg

9601
ctcatttctt tttattcttt tgtctctaaa cttcccttct cacttcattt cattcatttc

9661
atcttccatt gctgataccc tttcttccag ttgatcgcat cggctcctca ggcttctgca

9721
ttcttcacgt agttctcgag ccttggcttt cagctccatc agctccttta agcacttctc

9781
tgtattggtt attctagtta tacattcttc taaatttttt ttcaaagttt tcaacttctt

9841
tgcctttggt ttgaatgtcc tcctgtagtt cagtgtaatt tgatagtctg aagccttctt

9901
ctctcagctc gtcaaagtca ttctccatcc agctttgttc cgttgctggt gaggaactgc

9961
gttcctttgg aggaggagag gcgctctgct ttttagagtt tccagttttt ctgttctgtt

10021
ttttccccat ctttgtggtt ttatctactt ttggtctttg atgatggtga tgtacggatg

10081
ggtttttggt gtggatgtcc tttctgtttg ttagtcttcc ttctaacaga caggaccctc

10141
agctgcaggt ctgttggaat accctgccgt gtgaggtgtc agtgtgcccc tgctgggggg

10201
tgcctcccag ttaggctgct cgggggtcag gggtcaggga accacttgag gaggcagtct

10261
gcccgttctc agatctccag ctgcgtgctg ggagaaccac tgctctcttc aaagctgtca

10321
gacagggaca tttaagtctg cagaggttgc tgctgtcttt ttgtttgtct gtgccctgcc

10381
cccagaggtg gagcctacag aggcaggcag gcctccttga gctgtggtgg gctccaccca

10441
gttcgagctt cctggctgct ttgtttacct aagcaagcct gggcaatggt gggcgcccct

10501
cccccagcct cgctgccgcc ttgcagtttg atctcagact gctgtgctag caatcagcga

10561
gactccgtgg gcgtaggacc ctccgagcca ggtgcgggat ataatctcgt ggtgcaccgt

10621
ttttttaagc ccgtcggaaa agcgcagtat tcgggtggga gtgacccgat tttccaggtg

10681
cgtccgtcac tcctttcttt gactgggaaa gggaactccc tgaccccttg cacttcccaa

10741
gtgaggcaat gcctcgccct gcttcggctc gcgcacggtg cgcgcaccca ctgacctgtg

10801
cccactgtct ggcactccct agtgagatga acccgttacc tcagatggaa atgcagaaat

10861
cacccgtctt ctgcgtcgct caggctggga gctgtagacc ccagctgttc ctattcggcc

10921
atcttggctc ctcctccttt attcatttat taatctggtt gtttatctgt gttgctttgt

10981
aaattttttt tatattttct agatataaat cccttatcat atacatgttt aacaaatatt

11041
ttcttacatt ctgtgtgttg ctttttttta actctgttga tagtgtctgt taatacacaa

11101
aagttttaaa tgttgatgaa gtcaactaat ctattttttc ttttattgtc tatacttttg

11161
gtttcttatt aaaaaaaatc attgccaaat ccaatattat ataactttta cccttgtttt

11221
cttctacaaa ttttatagtt ttaactctaa tgtttggttc tttgatccat tttgagttca

11281
tatttgtaag ttataaggta agagcccaac tttttttaag gagatatctc atttcctcaa

11341
catcatttgt taaagagact cttctttctt aattaaatga tcttgacacc catcctggaa

11401
atcactgacc atatatgtca gagtatattc atgggctgtc ttttctattc cattggttta

11461
tatgtcagtc tttataccag taccacacat ggttttgtaa taagtttcag aactcagaaa

11521
ctgtaagact ccaactttgc tcttcctctc ttccttttta agattatttt cataattagg

11581
ggatctctgg aaatttcata tgaaagttat ggtagatttt tctatttata caaagtaatt

11641
ggaatgttgg tagaattacc tcaaacctgt acatcacttt gggtagtagt gacatcttaa

11701
gaatattaag tcttccaatt cataaacgca ggatgttttt ctaatgattt atgtcatcta

11761
caatttcttt aaagggtgtt tgaagttttc actcgacaac tcttgcgcct gcttggttaa

11821
gcttattcct aagtgattta ttcttttgat gctgttaaat gggattgttt tcaaaatttc

11881
cttttctttt tgtttaggaa ggaaattaga attcctgttc taattctatt tttatacact

11941
agcaactgat ttctccatat tggctttgca tcctgcaact ttgatgaatt cttttaatag

12001
ttctaatagt tttttgtgga atatttaatg ttttccacaa attagatact accgtcggca

12061
gacagagata attttacttt ttcattacca attaggatgc ctcctttatg cttttcttgt

12121
ctaattactc tggataggac ttgcagtgtt ctgtggaatg caattggcaa aagtaggcat

12181
ccttttcttg ttcctgatgt tataggaaaa gctatgacac ttcatcatta aatgtgaagt

12241
gagctgtggg tttttaatat atggccttta ttatgttgag gtattgtctt tctatttcta

12301
ctttgttgat tatttttatc gtgaaagcct cctgaatttt tccatgcatt ggatattcaa

12361
atttcctatt tctttgattg taagtaagta aggtgaatac tgattgttgt ttcagttctc

12421
tctcttaacc ttaaaatacg ctaccttttc cttcagttgc tagaactgcc tgttactaaa

12481
tccccatctc tggtctcctc tctcccttgc aggcttctct ctgggctcca cagtgcagtc

12541
tcacactaaa ggaatctgga tgtggtgtat gccccatccc aagaagccag gccacatcct

12601
agttctgctg gacaccgagg gtctgggaga tgtagagaag gtgagactca aggatccaat

12661
tgtggagtga gcccctcttc tctgaatatt ttatgcactg tttaattgtt tattaaccat

12721
taactacagg ctgtaatatg tgtgggttaa cacagatgca taaagggagc acaaataatc

12781
ccagtgtcat gagtcttatc ctgcacagaa ctttagttaa gaatttgggt gctaaagccc

12841
cgtgactttg tatttaaatt taaattctgt cactaattca gtagcctgag aaaattgacc

12901
tattttagcc tcagcattct aagctttaaa atgtatggaa aagacctatg ttggccacat

12961
agtataattt tgaatattta ataagaaaat acgtgtcagg tgtatattaa ttattcgata

13021
aaacagcaac taatagtacc aatcttatat gtgaattctg ttattgaaaa aagaagagaa

13081
aattttaaat ttacattgta ctcaggcctt aaaatgccca ccaaccttga attttaattt

13141
tacaattatc tgttgatgat cattagaaga cctagtaagg atcattgtaa cccaaaacat

13201
tcatcgaaaa actacccaaa agccacatcc tactgagaat actttttgta tttggcttct

13261
ttgataggtc attagtgcta aagaacaaac aaacaaaaaa ccaaaaaccc tctaacatat

13321
gaacatagtt ttactactct tactctggaa agtgctgtga ctacgaaacg tgctcctgac

13381
tccagtgtgt cttgacttcc agggtgacaa ccagaatgac tcctggatct tcgccctggc

13441
cgtcctcctg aacagcactt ccatgtacaa tagcatagga accattaacc agcaggccat

13501
ggaccaactg cagtatcctt tgtgacccag aacagcacca aggtcagagg gcacctgtgt

13561
tcataaacca gctgcctgac tgtgaatcct gatgaatcaa gctcaaaagg agaaaacata

13621
aaatacataa agtacagagg agtgatccca tatatccact ttagacttga cacttaggtt

13681
aagaacaaag gaaaatggaa ggtttgggaa tgtgttgaac taatatggga tgaggtccat

13741
gttcattttg tcacatttct ttagttagct actcagctat gtgacagagc tgacacatcg

13801

agtccaacca aaatcttcac ctgatgagaa tgagaatgag gattcagctg actttgagag

13861

c

ttcttccca gactttgtgt tgacactgag agatttctag cattacagaa agcgcttttg

13921

gacaaaactg t

gataaaata aactaaatgg aggacttttt tttattggaa tagtttcatt

13981
tgtttcatac atattgatca aatgcttact atgaatagac tgaagataca gatataaatg

14041
aaatagatat ggttcctgtc ctaatgttgc ttggggttaa aatggatgca aacattcaat

14101
taacacaggt ctatgtaatc tagtacaaga actctcaaat gttggtatgc atacgtatct

14161
tctgaggatc ttaccaaaga caaaagtcta attcaataga tcttggacag ggtctaagag

14221
tctgtatttc tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa agaattacag ggaatagaga

14281
ctttctttac tacaagaaac attagcattt gttctccata ggcaggtcta gagagcctgg

14341
tgctgaccta tgtcaatgcc atcagcagtg gggatctacc ctgcatggag aacgcagtcc

14401
tggccttggc ccagatagag aactcagccg cagtgcaaaa ggctattgcc cactatgaaa

14461
agcagatggg ccagaaggtg cagctgccca cagaaaccct ccaggagctg ctggacctgc

14521
acagggacag tgagagcaag gccactgaag ttttcatcag gagttccttc aaagatgtgg

14581
accatctatt tcaaaaggag ttagcggtaa tttttgtctc aaatttatat ggtttagggt

14641
catggaagac aaagtactac aaagaaagaa aacgagtatt attttgatag aagtaattct

14701
tcctagcttt cataatggtg acaacaacag atttgtaatc acatcaatca agaggaccaa

14761
ctgtattatt acagactcaa agttttaaaa cattttttcc tgaataattt tccctttacc

14821
taaatgcata caactgataa ccagagcttc taataaaatt acctgcccac tcttctcaga

14881
ctgatttgat attctagcca aacacaaaga aaactttcat cctgcttatc ttgagcatgc

14941
ttctgttcag ccacatttat tccatatgaa atcattagtc caatatgcaa aaccagagtt

15001
ttcctctaac ggttgacata aagctatcaa tctcggtcct gaacctcacc tccaaaaaga

15061
aagcgacttc agtagaaagt ggggtcagaa ggaagagtgt ggtcctggtg aggagtctgt

15121
caatttctcc agcatcattg acttttattt tcagaagtca ttcccgaaat tctgaggtca

15181
agctaacatc ctttccctgt tactcttttt acttcctatt tttacattaa aggcccagct

15241
agacaaaaag cgggatgact tttgtaaaca gaatcaggaa gcatcatcag atcgttgctc

15301
agctttactt caggtcattt tcagtcctct agaagaagaa gtgaaggcgg gaatttattc

15361
gaaaccaggg ggctatcgtc tctttattca gaagttacaa gacctggaga aaaagtacta

15421
tgaggaaccg aggaagggga tacaggtaac caaaattcat ctgtcgatta tggaaacctg

15481
ctgacctgcc tcctacaaac accaaggtga ccaagcttca ctgcacacag atgtgctttt

15541
ttgtttgcat aacattcatg ctttcattca ataacatatg caaagaggct gttattccag

15601
atatgcccta ggtgctcatc aaggagagtg caattaacta ctgagttaca aattcaacta

15661
gcagatgaag caccaagttg tcattatcat cattacagct gggcttttcc tgttgcaaga

15721
gacaggaacc taatgatggc tgcttaaaca aaaatagtaa ttcattgatt taggctgcaa

15781
actggcttta caggacaggt gggtctaagg tttcaagtaa catcatgaac tgtctctctg

15841
tccagtgttc ccccatcccc acagttttct ccctcccacc ctccctctat ctgagtcact

15901
ctgatatttg ctttctcaat gatggcttca ttccccagca ggctcagccc atatgggacc

15961
acagcattaa cagttccaaa cttagagccc ttaatctaaa caggacagag actctccttt

16021
tcccagtatc tatattagcc tactaaaaat gactgctaag taggatgctc tgatcagctt

16081
gccaggagca tgtgcctctc gctctgacag ggaatttcac caccaaggac aacagggcaa

16141
aggaaagaat tcccagagga aaggatggca ggaagacaaa caggaacacc tgcttacagc

16201
cgtctcctac ttctcacttt gtgttctctg ggtcctaagg ctgaagagat tctgcagaca

16261
tacttgaaat ccaaggagtc tatgactgat gcaattctcc agacagacca gactctcaca

16321
gaaaaagaaa aggagattga aggtgaggag tgagttaaga gattagatgg cctcaaaagc

16381
tccaaaaatt gaaataactt gactggataa acatgggacc ctttaactag agcaagatcc

16441
acaaaggtgt gtcttacttg ccgaggtcat ctctgagtag ggcatatgca gtcagcaaca

16501
acgacaggta agtgtataag gaacaatgag gcaacaagat aaccccacac aaattttcct

16561
tctttctttt cctccacagt ggaacgtgtg aaagctgagt ctgcacaggc ttcagcaaaa

16621
atgttgcagc aaatgcaaag aaagaatgag cagatgatgg aacagaagga gaggagttat

16681
caggaacact tgaaacaact gactgagaag atggagagcg acagggtcca gttgctggaa

16741
gagcaagaga ggaccctcgc tcttaaactt caggtgtcta attgcatcac cttgaggttt

16801
ctgtttttct gttttctctc cattctcccc gatcacaggc ttactgtggc agagagaaca

16861
tgaagcccag gggaagaacc ctgcttgctt acttgtactt ttcaattcct gtctgtccag

16921
cctgaactgg ctactgccaa gtctggtcac taaactgcaa atattgcagt tgtgtcacat

16981
tcagtgcttt atctatatat ccttcatttc aaggcaggta ttatctgcta gccatcatta

17041
aagtatctgt atctcttgct taataccatg tgaagcaaga actatattct tattacttag

17101
gagaagaaac aaagtttcca aaaataataa ataaatagag tcacacagct agtaaatgta

17161
tcaaagctgt cttcatcact tagtggaatc cacaatgatt atttttttct gtgacaccta

17221
gtatgaaatt aaacttaaga aaacctttgt gagcaga

AA2 LGALS3BP-galectin-3-binding protein precursor, mRNA-NM_005567.3

(SEQ ID NO: 114)

1
aatcgaaagt agactctttt ctgaagcatt tcctgggatc agcctgacca cgctccatac

61
tgggagaggc ttctgggtca aaggaccagt ctgcagaggg atcctgtggc tggaagcgag

121
gaggctccac acggccgttg cagctaccgc agccaggatc tgggcatcca ggcacggcca

181
tgacccctcc gaggctcttc tgggtgtggc tgctggttgc aggaacccaa ggcgtgaacg

241
atggtgacat gcggctggcc gatgggggcg ccaccaacca gggccgcgtg gagatcttct

301
acagaggcca gtggggcact gtgtgtgaca acctgtggga cctgactgat gccagcgtcg

361
tctgccgggc cctgggcttc gagaacgcca cccaggctct gggcagagct gccttcgggc

421
aaggatcagg ccccatcatg ctggatgagg tccagtgcac gggaaccgag gcctcactgg

481
ccgactgcaa gtccctgggc tggctgaaga gcaactgcag gcacgagaga gacgctggtg

541
tggtctgcac caatgaaacc aggagcaccc acaccctgga cctctccagg gagctctcgg

601
aggcccttgg ccagatcttt gacagccagc ggggctgcga cctgtccatc agcgtgaatg

661
tgcagggcga ggacgccctg ggcttctgtg gccacacggt catcctgact gccaacctgg

721
aggcccaggc cctgtggaag gagccgggca gcaatgtcac catgagtgtg gatgctgagt

781
gtgtgcccat ggtcagggac cttctcaggt acttctactc ccgaaggatt gacatcaccc

841
tgtcgtcagt caagtgcttc cacaagctgg cctctgccta tggggccagg cagctgcagg

901
gctactgcgc aagcctcttt gccatcctcc tcccccagga cccctcgttc cagatgcccc

961
tggacctgta tgcctatgca gtggccacag gggacgccct gctggagaag ctctgcctac

1021
agttcctggc ctggaacttc gaggccttga cgcaggccga ggcctggccc agtgtcccca

1081
cagacctgct ccaactgctg ctgcccagga gcgacctggc ggtgcccagc gagctggccc

1141
tactgaaggc cgtggacacc tggagctggg gggagcgtgc ctcccatgag gaggtggagg

1201
gcttggtgga gaagatccgc ttccccatga tgctccctga ggagctcttt gagctgcagt

1261
tcaacctgtc cctgtactgg agccacgagg ccctgttcca gaagaagact ctgcaggccc

1321
tggaattcca cactgtgccc ttccagttgc tggcccggta caaaggcctg aacctcaccg

1381
aggataccta caagccccgg atttacacct cgcccacctg gagtgccttt gtgacagaca

1441

gttcctggag tgcacggaag tcacaactgg tctatcagtc cagacgggg

g cctttggtca

1501

aatattcttc tgattacttc caagccccct ctgactacag atactacccc taccagt
cct

1561

tccagactcc acaacacccc agcttcctct tccaggacaa gagggtgtcc tggtccctgg

1621

tctacctccc caccatccag agctgctgga actacggctt ctcctgctcc tcggacgagc

1681

tccctgtcct gggcctcacc aagtctggcg gctcagatcg

caccattgcc tacgaaaaca

1741

aagccctgat gctctgcgaa gggctcttcg tggcagacgt caccgatttc gagggctgga

1801

aggctgcgat tcccagtgcc ctggacacca acagctcgaa gagcacctcc tccttcccct

1861
gcccggcagg gcacttcaac ggcttccgca cggtcatccg ccccttctac ctgaccaact

1921
cctcaggtgt ggactagacg gcgtggccca agggtggtga gaaccggaga accccaggac

1981
gccctcactg caggctcccc tcctcggctt ccttcctctc tgcaatgacc ttcaacaacc

2041
ggccaccaga tgtcgcccta ctcacctgag cgctcagctt caagaaatta ctggaaggct

2101
tccactaggg tccaccagga gttctcccac cacctcacca gtttccaggt ggtaagcacc

2161
aggacgccct cgaggttgct ctgggatccc cccacagccc ctggtcagtc tgcccttgtc

2221
actggtctga ggtcattaaa attacattga ggttcctaca aaaaaaaaaa aaaaaaa

AB1 BST1-bone marrow stromal cell antigen 1, mRNA NM_004334.2

(SEQ ID NO: 115)

1
aaagtgctgg gattacaggc atgagccgcc gcgccccgcc ccacgctcag tcttgaaatt

61
gtctggaacg ggaaacggca aacagcgaga tatccgagcg agagtcccgc cctgcatcag

121
tttgcggaac cgccttggta gaaggagaga aggggagtgg aggaagcacg ggactggagg

181
gaccaaagtt ccccgatggc ggcccagggg tgcgcggcat cgcggctgct ccagctgctg

241
ctgcagcttc tgcttctact gttgctgctg gcggcgggcg gggcgcgcgc gcggtggcgc

301
ggggagggca ccagcgcaca cttgcgggac atcttcctgg gccgctgcgc cgagtaccgc

361

gcactgctga gtcccgagca gcggaacaag aactgcacag ccatctggga agcctttaaa

421

gtggcgctgg acaaggatcc ctgctccgtg ctgccctcag actatgacct ttttattaac

481

ttgtccaggc actctattcc cagagataag tccctgttc

t gggaaaatag ccacctcctt

541

gttaacagct ttgcagacaa cacccgtcgt tttatgcccc tgagcgatg
t tctgtatggc

601

agggttgcag atttcttgag ctggtgtcga cagaaaaatg actctggact cgattaccaa

661

tcctgcccta catcagaaga ctgtgaaaat aatcctgtgg attccttttg gaaaagggca

721

tccatccagt attccaagga

tagttctggg gtgatccacg tcatgctgaa tggttcagag

781

ccaacaggag cctatcccat caaaggtttt
tttgcagatt atgaaattcc aaacctccag

841
aaggaaaaaa ttacacgaat cgagatctgg gttatgcatg aaattggggg acccaatgtg

901
gaatcctgcg gggaaggcag catgaaagtc ctggaaaaga ggctgaagga catggggttc

961
cagtacagct gtattaatga ttaccgacca gtgaagctct tacagtgcgt ggaccacagc

1021
acccatcctg actgtgcctt aaagtcggca gcagccgcta ctcaaagaaa agccccaagt

1081
ctttatacag aacaaagggc gggtcttatc attcccctct ttctggtgct ggcttccagg

1141
actcaactgt aactggaaac tgtgttgctc taaccctcct ccagccctgc agcctcccct

1201
tgcagtcatc attcgtgttc tgtgtatacc aaatgattct gttatctaaa gaagcttttt

1261
gctgggaaaa cgatgtcctg aaaatggtat ttcaatgagg catatgttca ggatttcaga

1321
aacaagaagt tagttctatt tagcaggtta aaaaatgctg cattagaatt aaagcaagtt

1381
attttcttat ttgtataatg acacaaagca ttgggagtca gactgcttgt atattatcaa

1441
acattttaag agaattctaa taaagctgta ttttacatca aaaaaaaaaa aaaaaaa

AB2 SNX10-sorting nexin-10, mRNA NM_001199835.1

(SEQ ID NO: 116)

1
gctgagcgcg ggcgcggggc cgctacgtgc gcggggagcg cggggagcgc ggggagcgcg

61
gggctgcgct cgtgtgcgct cctgggcgct cgccgccgcc gctgccgccg cgcgcctttg

121
agtcagcaaa ctccgcggcc cgcaagcccg gctcggcccg gccctgctct gttctgcccg

181
gaggagccgc ccgtaagtga caagagaccc gctgaggggg cctcccctgc accgcccaga

241
ttgatcgtgt cctgtgctga agatgtttcc ggaacaacag aaagaggaat ttgtaagtgt

301
ctgggttcga gatcctagga ttcagaagga ggacttctgg cattcttaca ttgactatga

361
gatatgtatt catactaata gcatgtgttt tacaatgaaa acatcctgtg tacgaagaag

421
atatagagaa ttcgtgtggc tgaggcagag actccaaagt aatgcgttgc tggtacaact

481
gccagaactt ccatctaaaa acctgttttt caacatgaac aatcgccagc acgtggatca

541
gcgtcgccag ggtctggaag atttcctcag aaaagtccta cagaatgcac ttttgctttc

601
agatagcagc cttcacctct tcttacagag ccatctgaat tcagaagaca ttgaggcgtg

661
tgtttctggg cagactaagt actctgtgga agaagcaatt cacaagtttg ccttaatgaa

721
tagacgtttc cctgaagaag atgaagaagg aaaaaaagaa aatgatatag attatgattc

781
agaaagttca tcctctgggc ttggacacag tagtgatgac agcagttcac atggatgtaa

841
agtaaataca gctccgcagg aatcctgaaa aataattcta atgttactat cttaggaata

901
gcaaattatg tccagtcata gagaagaaag cttcataata atacattctt acctaaagct

961
cactgtcatg atgttaggta tttaaattct taaagatgtt gggttgttta ttagtggtat

1021
ttttatgttg tcttatttta ggtaagcttc tgtgtaaagc taaaaatcct gtgaatacaa

1081
tactatcctt tacaggcaga cattattggt aaacaagatc ttgccctcca atgaaatgac

1141
ttacatgttt taaaaaaccg agttggtttt attgaattta aaaagatagg taactaagta

1201
gcatttaaaa tcaagataga gcattccttc ttgtatcagt ggggcagtgt taccataaac

1261
acggtgtata tgttgttaaa ccctatgaag agtaacagtg tagaccagac tgcctctctc

1321
agatatgtgc ctgatatttt gtggatacct cccctgcact ggcaaaacac tatgcttttg

1381
ggtgttagac tgaaatattt taagagtatt taacctttcc agtattctgt ttcacgctta

1441
gatggaaatg tatcttatga atagagacat attaaaataa tgtttacatc ttagaaaaaa

1501
catagatagt gctagtaata ttacttataa ctgtaatata tagattcaga aatacatttt

1561
cattatccaa aatcagcttc aacaaatggt ttctggagac aaataatttg ttttcattat

1621
cattgtataa tcaggttaat gatttatttt ttgactaaat gtgcaatttc ttatcactag

1681
ataactttca gtatcagtgg tggttactta ttacttaaat cagaggaagg attttataaa

1741
gattaataaa tttaatttta ccaataaata ttcccataat ttagaaaagg atgtcgactt

1801
gctaatttca gaaataatta ttcattttta aaaagcccct tttaaagcat ctacttgaag

1861
attggtataa ttttcataaa atgtcttttt ttttagtgtc ccaaagatat cttagataaa

1921

ctattttgaa gttcagattt cagatgaggc aacattttct tgagataatt acccaagttt

1981

catccatgtt gaatggtaca aaatatttct gtgaaactaa caggaagata ttttca

gata

2041

actaggataa cttgttgctt tgttacccag cctaattgaa gagtggcaga ggctactaca

2101

aaaagc

aacc ttttcatttt cactaagagt ttaaaagcta ttgtattatt aaaaagtctt

2161

tacaatgctt gtttcaaaga accaacagaa aaaaaagcta agaaaactga gaactaacat

2221

taaaaaaatt aaatttagaa taagaatgat ttctttaatt tgtccttttt ttctttggtc

2281

taaaacatta ttaaattttt gtaaatattt tgatttaatg tgtcttagat cctcattatt

2341

ttaatacagg aaaagaaaag atttagtaat ttcttaccat gctaatatgt aa

agttcatg

2401

ccatccaggc atttaagagc gatcctcatc ccttcagcaa tatgtatttg agttcacact

2461

a

tttctgttt tacagcagtt ttgaaaaaca catactatgc caccaattgt catattattt

2521
ttagatgatg taacatagcc atcaaaatta atattatgta atgcctaata cttagtatgt

2581
aaatgtcacg agatcatttt tacattaaac gtgaaaaaaa atcaaaaaaa aaaaaaaa

AC1 ALPK1-alpha-kinase 1, mRNA-NM_025144.3

(SEQ ID NO: 117)

1
aattcctact tcctgaaact gaagccgttt atgagaaaca gtgtgtttca gagaggctgt

61
accagaatta actctgctca gagttagatt tgctggtctt aaagtacttt tcctctttaa

121
gataaaagaa gttcttctaa atcaggaatg gattgaaatc taatgaaccg aaactttggg

181
tacttcggcc ttcaaggggc tcctttattg agaatcaatg tcttctccta ggtaattgat

241
caccctagac ccagggacac ccaattcatc gtaatcatca tgaataatca aaaagtggta

301
gctgtgctac tgcaagagtg caagcaagtg ctggatcagc tcttgttgga agcgccagat

361
gtgtcggaag aggacaagag cgaggaccag cgctgcagag ctttactccc cagcgagtta

421
aggaccctga tccaggaggc aaaggaaatg aagtggccct tcgtgcctga aaagtggcag

481
tacaaacaag ccgtgggccc agaggacaaa acaaacctga aggatgtgat tggcgccggg

541

ttgcagcagt tactggcgtc cctgagggcc tccatcctcg ctcgggactg tgcggctgcg

601

gcggctattg tgttcttggt ggaccg

gttc ctgtatgggc tcgacgtctc tggaaaactt

661

ctgcaggtcg ccaaaggtct ccacaagttg cagccagcca cgccaattgc cccgcaggtg

721

gttattcgcc aagcccgaat ctccgtgaac tcaggaaaac ttttaaaagc agagtatatt

781

ctgagcagtc taataagcaa caatggagca acgggtacct ggctgtacag aaatgaaagt

841

gacaaggtcc tggtgcagtc ggtctgtata cagatcagag ggcagattct gcaaaagctg

901

gggatgtggt acgaagcagc agagttaata tgggcctcca ttgtaggata tttggcactt

961

cctcagccgg ataaaaaggg cctctccacg tcgctaggta tactggcaga catctttgtt

1021

tccatgagca agaacgatta tgaaaagttt aaaaacaatc cacaaattaa tttgagcctg

1081

ctgaaggagt ttgaccacca tttgctgtcc gctgcagaag cctgcaagct ggcagctgcc

1141

ttcagtgcct atacgccgct cttcgtgctc acagctgtga atatccgtgg cacgtgttta

1201

ttgtcctaca gtagttcaaa tgactgtcct ccagaattga aaaacttaca tctgtgtgaa

1261

gccaaagagg cctttgagat tggcctcctc accaagagag atgatgagcc tgttactgga

1321

aaacaggagc ttcacagctt tgtcaaagct gctttcggtc tcaccacagt gcacagaagg

1381

ctccatgggg agacagggac ggtccatgca gcaagtcagc tctgtaagga agcaatgggg

1441

aagctgtaca atttcagcac ttcctccaga agtcaggaca gagaagctct gtctcaagaa

1501

gttatgtctg tgattgccca ggtgaaggaa catttacaag ttcaaagctt ctcaaatgta

1561

gatgacagat cttatgttcc cgagagtttc gagtgcaggt tggataaact tatcttgcat

1621

gggcaagggg atttccaaaa aatccttgac acctattcac agcaccatac ttcggtgtgt

1681

gaagtatttg aaagtgattg tggaaacaac aaaaatgaac agaaagatgc aaaaacagga

1741

gtctgcatca ctgctctaaa aacagaaata aaaaacatag atactgtgag tactactcaa

1801

gaaaagccac attgtcaaag agacacagga atatcttcct ccctaatggg taagaatgtt

1861

cagagggaac tcagaagggg aggaaggaga aactggaccc attctgatgc atttcgagtc

1921

tccttggatc aagatgtgga gactgagact gagccatcgg actacagcaa tggtgaggga

1981

gctgttttca acaagtctct gagtggcagc cagacttcca gtgcttggag caacttatca

2041

gggtttagtt cctctgcaag ctgggaggaa gtgaattatc acgttgacga caggtcagcc

2101

agaaaagagc ctggcaaaga acatctggtg gacactcagt gttccactgc cttgtctgag

2161

gagctagaga atgacaggga aggcagagct atgcattcat tgcattcaca gcttcatgat

2221

ctctctcttc aggaacccaa caatgacaat ttggagcctt ctcaaaatca gccacagcaa

2281

cagatgccct tgacaccctt ctcgcctcat aataccccag gcattttctt ggcccctggt

2341

gcagggcttc tagaaggagc tccagaaggt atccaggaag tcagaaatat gggacccaga

2401

aatacttctg ctcactccag accctcatat cgttctgctt cttggtcttc tgattctggt

2461

aggcccaaga atatgggcac acatccttca gtccaaaaag aagaagcctt tgaaataatt

2521

gttgagtttc cagaaaccaa ctgcgatgtc aaagacaggc aggggaaaga gcagggagaa

2581

gaaattagtg aaagaggcgc aggccctaca tttaaagcta gtccctcctg ggttgaccca

2641

gaaggagaaa cagcagaaag cactgaagat gcacccttag actttcacag ggtcctgcac

2701

aattctctgg gaaacatttc catgctgcca tgtagctcct tcacccctaa ttggcctgtt

2761

caaaatcctg actccagaaa aagtggtggc ccagtcgcag agcagggcat cgaccctgat

2821

gcctccacag tggatgagga ggggcaactg ctcgacagca tggatgttcc ctgcacaaat

2881

gggcacggct ctcatagact gtgcattctg agacagccgc ctggtcagag ggcggagacc

2941

cccaattcct ctgtaagcgg taacatcctc ttccctgtcc tcagcgagga ctgcactacc

3001

acagaggaag gaaatcagcc tggaaacatg ctaaactgca gccagaactc cagctcatcc

3061

tcagtgtggt ggctgaaatc acctgcattt tccagtggtt cttctgaggg ggacagccct

3121

tggtcctatc tgaa

ttccag tgggagttct tgggtttcat tgccgggaaa gatgaggaaa

3181

gagatccttg aggctcgcac cttg
caacct gatgactttg aaaagctgtt ggcaggagtg

3241
aggcatgatt ggctgtttca gagactagag aatacggggg tttttaagcc cagtcaactc

3301
caccgagcac atagtgctct tttgttaaaa tattcaaaaa aatctgaact gtggacggcc

3361
caggaaacta ttgtctattt gggggactac ttgactgtga agaaaaaagg cagacaaaga

3421
aatgcttttt gggttcatca tcttcatcaa gaagaaattc tggggaggta tgttgggaaa

3481
gactataagg agcagaaggg gctctggcac cacttcactg atgtggagcg acagatgacc

3541
gcacagcact atgtgacaga atttaacaag agactctatg aacaaaacat tcccacccag

3601
atattctaca tcccatccac aatactactg attttagagg acaagacaat aaagggatgt

3661
atcagtgtgg agccttacat actgggagaa tttgtaaaat tgtcaaataa cacgaaagtg

3721
gtgaaaacag aatacaaagc cacagaatat ggcttggcct atggccattt ttcttatgag

3781
ttttctaatc atagagatgt tgtggtcgat ttacaaggtt gggtaaccgg taatggaaaa

3841
ggactcatct acctcacaga tccccagatt cactccgttg atcagaaagt tttcactacc

3901
aattttggaa agagaggaat tttttacttc tttaataacc agcatgtgga atgtaatgaa

3961
atctgccatc gtctttcttt gactagacct tcaatggaga aaccatgcac atagaatacg

4021
gcacagtctg gtcctttggg gcttgggcag ggccgtgaca caggttctgg ccaatgattt

4081
gcaagaggaa ttgatcagta tcactttaag tcctgcattt aattggcagc acaagatcct

4141
gcagagcctc tttccctctg ccacagttat caagaatggg tcaggagacc gctgcttctg

4201
ggcataagtc ctgcaaggaa agcaacatgg aaaacagccc caactcaccc atgagggatg

4261
aaaagcactc ttgagaaagg catgtgttgt ttaagccatt gagattttag agctttttgt

4321
cactatctgt caagactgat actactgggg cttttcctat tgatttggga gttctttaca

4381
tattaaaaaa atgtgagcct ttgtgatacg aattcaattt gttttcctgt cttttgacat

4441
ttgactttgc ataaaagttt atctgtgcat aattttatat gtagttgaat tcatcaatct

4501
tttattttgt atggcttttt ggttatgtat aatacttaga tcctccttat actctgagtt

4561
tctttctttt taattctcct gtatttcctt ctagtataat taaatctgta aaaagtaaga

4621
tggaagagtg gtacagtttt ctttatccag tctgtccttg atgggcattt aggtagactg

4681
gataaagaaa atgtggtaca tatacaccat ggaacactat gtgtattaat ccactctcac

4741
actgctatga agagatacct gagactgggt aatttagaaa gaaaagaggt ttaattaact

4801
cacagttcca catggctggg gagacctcag gaaacttaca atcatggcag aaggcacctc

4861
ttcatagggt agcaggagag agaatgagtg ccagcagggg aaatgccaga tgcttataaa

4921
gccatcagat cttgtgagaa ttcattcact ctcacgagaa cagcatggga aaaactgcct

4981
caattacctc ctaccaggtc cttcccatga cacatgggaa ttatgggact acaattcgag

5041
atgagatttg ggtggggaca caaagccaaa ccatatcaca atgtaaccat aaaaaagaat

5101
gagatcatgt cctttgcagg gacatggata gagctggagg ccattatttt tagcaaacta

5161
atgcaagaac agaaaactaa ataccacttg ttctcactta taggtgagag ctaagtgatg

5221
agagtaggtg gacacataga gggaacaaca cacaccaggg cttatcagag ggtggacagt

5281
gggaggaggg agaggatcag gaaaaataac taatgggtac taggctgaat acctgggtga

5341
tgaagtaatt cgcacaacaa acccccatga cacaaacctg cacatgtacc cctgaactta

5401
aaataaaagt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a

AC2 CREG1-cellular repressor of E1A-stimulated genes 1, mRNA NM_003851.2

(SEQ ID NO: 118)

1
ggcggggcct gggcgcgccg agctccggct gggtccctgc aggtcttggg gcccgggact

61
cttcctggag acaccgccat ggccgggcta tcccgcgggt ccgcgcgcgc actgctcgcc

121
gccctgctgg cgtcgacgct gttggcgctg ctcgtgtcgc ccgcgcgggg tcgcggcggc

181
cgggaccacg gggactggga cgaggcctcc cggctgccgc cgctaccacc ccgcgaggac

241
gcggcgcgcg tggcccgctt cgtgacgcac gtctccgact ggggcgctct ggccaccatc

301
tccacgctgg aggcggtgcg cggccggccc ttcgccgacg tcctctcgct cagcgacggg

361
cccccgggcg cgggcagcgg cgtgccctat ttctacctga gcccgctgca gctctccgtg

421
agcaacctgc aggagaatcc atatgctaca ctgaccatga ctttggcaca gaccaacttc

481
tgcaagaaac atggatttga tccacaaagt cccctttgtg ttcacataat gctgtcagga

541
actgtgacca aggtgaatga aacagaaatg gatattgcaa agcattcgtt attcattcga

601
caccctgaga tgaaaacctg gccttccagc cataattggt tctttgctaa gttgaatata

661
accaatatct gggtcctgga ctactttggt ggaccaaaaa tcgtgacacc agaagaatat

721
tataatgtca cagttcagtg aagcagactg tggtgaattt agcaacactt atgaagtttc

781
ttaaagtggc tcatacacac ttaaaaggct taatgtttct ctggaaagcg tcccagaata

841
ttagccagtt ttctgtcaca tgctggtttg tttgcttgct tgtttacttg cttgtttacc

901
aatagagttg acctgttatt ggatttcctg gaagatgtgg tagctacttt tttcctattt

961

tgaagccatt ttcgtagaga aatatccttc actataatca aataagtttt gtcccatcaa

1021

ttccaaagat gtttccagtg gtgctcttga agaggaatga gtaccagttt taaattgccc

1081

attggcattt gaaggtagtt gagtatgtgt tctttattcc tagaagccac tgtgcttggt

1141

agagtgcatc actcaccaca gctgcctcct gagctgcctg agcctggtgc aaaaggattg

1201

gcccccatta

tggtgcttct gaataaatct tgccaagata gacaaacaat gatgaaactc

1261

agatggagct tcctactcac
gttgatttat gtctcacaat cctgggtatt gttaattcaa

1321

catagggtga aactatttct gataaagaac ttttgaaaaa ctttttatac tctaaagtga

1381

tactcagaac aaaagaaagt cataaaactc ctgaatttaa tttccccacc taagtcgaaa

1441

cagtattatc aaaacacatg tgcacacaga ttattttttg gctccaaaac tggattgcaa

1501

aagaaagagg agaagaatat tttgtgtgtt

cctggtattc ttttataagt aaagtttacc

1561

caggcatgga ccagcttcag ccagggacaa aatcccctcc
caaaccactc tccacagctt

1621
tttaaaaata cttctactct taacaattac ctaaggcttc ctcaactgcc ccaaatctct

1681
taatagcttc tagtgctgct acaatctaag tcaggtcacc agagggaaga gaacatggca

1741
ttaaaagaat cacatcttca gaagagaaga cactaatatt attacccata tacatgattt

1801
cagaagatga cataagattc ctcttaaaga ggaaatgtca ggaatcaagc cactgaatcc

1861
ttaaagagaa aagttgaata tgagtcattg tgtctgaaaa ctgcaaagtg aacttaactg

1921
agatccagca aacaggttct gtttaagaaa aataatttat actaaattta gtaaaatgga

1981
cttcttattc aaagcatcaa taattaaaag aattatttta atgaaaaaaa aaaaaaaaaa

2041
aaaaaaaa

AD1 BAZ1A-bromodomain adjacent to zinc finger domain protein 1A, mRNA

NM_013448.2

(SEQ ID NO: 119)

1
cttttcccat cgtgtagtca agagtctgtg ccagacttga aggctttact ttgttagcca

61
tgtgtttatg aacccccagc gctttcccta gatcttttgg ctgataatct caaacatgga

121
ggatgcttct gaatcttcac gaggggttgc tccattaatt aataatgtag ttctcccagg

181
ctctccgctg tctcttcctg tatcagtgac aggctgtaaa agtcatcgag tagccaataa

241
aaaggtagaa gcgaggagtg aaaagctcct cccaacagct cttcctcctt cagagccgaa

301
agtagatcag aaacttccca ggagctccga gaggcgggga agtggcggtg ggacgcaatt

361
ccccgcgcgg agtcgggcag tggcagcggg agaagcggca gccaggggcg cggcggggcc

421
ggagagaggc ggtcccctgg gaggacgggg tctcccctcg ttgcctttgt agtggagaag

481
gtggacaagt ggcagtcggc gtgatcgcag ggaagcgggg ccggcgcggg cggccgaggg

541
tccaggcgag cccgcgggcg gacgggagat gccgctgcta caccgaaagc cgtttgtgag

601
acagaagccg cccgcggacc tgcggcccga cgaggaagtt ttctactgta aagtcaccaa

661
cgagatcttc cgccactacg atgacttttt tgaacgaacc attctgtgca acagccttgt

721
gtggagttgt gctgtgacgg gtagacctgg actgacgtat caggaagcac ttgagtcaga

781
aaaaaaagca agacagaatc ttcagagttt tccagaacca ctaattattc cagttttata

841
cttgaccagc cttacccatc gttcgcgctt acatgaaatt tgtgatgata tctttgcata

901
tgtcaaggat cgatattttg tcgaagaaac tgtggaagtc attaggaaca atggtgcaag

961
gttgcagtgt aggattttgg aagtcctccc tccatcacat caaaatggtt ttgctaatgg

1021
acatgttaac agtgtggatg gagaaactat tatcatcagt gatagtgatg attcagaaac

1081
acaaagctgt tcttttcaaa atgggaagaa aaaagatgca attgatccct tactattcaa

1141
gtataaagtg caacccacta aaaaagaatt acatgagtct gctattgtta aagcaacaca

1201
aatcagccgg agaaaacacc tattttctcg tgataaacta aagctttttc tgaagcaaca

1261
ctgtgaacca caagatggag tcattaaaat aaaggcatca tctctttcaa cgtataaaat

1321
agcagaacaa gatttttctt atttcttccc tgatgatcca cccacattta tcttcagtcc

1381
tgctaacaga cgaagaggga gacctcccaa acgaatacat attagtcaag aggacaatgt

1441
tgctaataaa cagactcttg caagttatag gagcaaagct actaaagaaa gagataaact

1501
tttgaaacaa gaagaaatga agtcactggc ttttgaaaag gctaaattaa aaagagaaaa

1561
agcagatgcc ctagaagcga agaaaaaaga aaaagaagat aaagagaaaa agagggaaga

1621
attgaaaaaa attgttgaag aagagagact aaagaaaaaa gaagaaaaag agaggcttaa

1681
agtagaaaga gaaaaggaaa gagagaagtt acgtgaagaa aagcgaaagt atgtggaata

1741
cttaaaacag tggagtaaac ctagagaaga tatggaatgt gatgacctta aggaacttcc

1801
agaaccaaca ccagtgaaaa ctagactacc tcctgaaatc tttggtgatg ctctgatggt

1861
tttggagttc cttaatgcat ttggggaact ttttgatctt caagatgagt ttcctgatgg

1921
agtaacccta gaagtattag aggaagctct tgtaggaaat gacagtgaag gcccactgtg

1981
tgaattgctt tttttcttcc tgactgcaat cttccaggca atagctgaag aagaagagga

2041
agtagccaaa gagcaactaa ctgatgctga caccaaagat ttaacagagg ctttggatga

2101
agatgcagac cccacaaaat ctgcactgtc tgcagttgca tctttggcag ctgcatggcc

2161
acagttacac cagggctgca gtttgaaaag tttggatctt gatagctgca ctctttcaga

2221
aatcctcaga ctgcacatct tagcttcagg tgctgatgta acatcagcaa atgcaaagta

2281
tagatatcaa aaacgaggag gatttgatgc tacagatgat gcttgtatgg agcttcgttt

2341
gagcaatccc agtctagtga agaaactgtc aagcacctca gtgtatgatt tgacaccagg

2401
agaaaaaatg aagatactcc atgctctctg tggaaagcta ctgaccctag tttcaactag

2461
ggattttatt gaagattatg ttgatatatt acgacaggca aagcaggagt tccgggaatt

2521
aaaagcagaa caacatcgaa aagagaggga agaagcagct gccagaattc gtaaaaggaa

2581
ggaagaaaaa cttaaggagc aagaacaaaa aatgaaagag aaacaagaaa aactgaaaga

2641
agatgagcaa agaaattcaa cggcagatat atctattggg gaggaagaaa gggaagattt

2701
tgatactagc attgagagca aagacacaga gcaaaaggaa ttagatcaag atatggtcac

2761
tgaagatgaa gatgacccag gatcacataa aagaggcaga agggggaaaa gaggacaaaa

2821
tggatttaaa gaatttacaa ggcaagaaca gatcaactgt gtaacaagag agcctcttac

2881
tgctgatgag gaagaagcat taaaacagga acaccaacga aaagagaaag agctcttaga

2941
aaaaatccaa agtgccatag cctgtaccaa tatctttccc ttgggtcgcg accgcatgta

3001
tagacgatac tggattttcc cttctattcc tggactcttt attgaagagg attattctgg

3061
tcttactgaa gacatgctgt tgcctagacc ttcatcattt cagaataatg tacagtctca

3121
agatcctcag gtatccacta aaactggaga gcctttgatg tctgaatcta cctccaacat

3181
tgaccaaggt ccacgtgacc attctgtgca gctgccaaaa ccagtgcata agccaaatcg

3241
gtggtgcttt tacagttctt gtgaacagct agaccagctt attgaagctc ttaattctag

3301
aggacataga gaaagtgcct taaaagaaac tttgttacaa gagaaaagca gaatatgtgc

3361
acagctagcc cgtttttctg aagagaaatt tcatttttca gacaaacctc agcctgatag

3421
caaaccaaca tatagtcggg gaagatcttc caatgcatat gatccatctc agatgtgtgc

3481
agaaaagcaa cttgaactaa ggctgagaga ttttctttta gatattgaag atagaatcta

3541
ccaaggaaca ttaggagcca tcaaggttac agatcgacat atctggagat cagcattaga

3601
aagtggacgg tatgagctgt taagtgagga aaacaaggaa aatgggataa ttaaaactgt

3661
gaatgaagac gtagaagaga tggaaattga tgaacaaaca aaggtcatag taaaagacag

3721
acttttgggg ataaaaacag aaactccaag tactgtatca acaaatgcaa gtacaccaca

3781
atcagtgagc agtgtggttc attatctggc aatggcactc tttcaaatag agcagggcat

3841
tgagcggcgt tttctgaaag ctccacttga tgccagtgac agtgggcgtt cttataaaac

3901
agttctggac cgttggagag agtctctcct ttcttctgct agtctatccc aagtttttct

3961
tcacctatcc accttggatc gtagcgtgat atggtctaaa tctatactga atgcgcgttg

4021
caagatatgt cgaaagaaag gcgatgctga aaacatggtt ctttgtgatg gctgtgatag

4081
gggtcatcat acctactgtg ttcgaccaaa gctcaagact gtgcctgaag gagactggtt

4141
ttgtccagaa tgtcgaccaa agcaacgttc tagaagactc tcctctagac agagaccatc

4201
cttggaaagt gatgaagatg tggaagacag tatgggaggt gaggatgatg aagttgatgg

4261
cgatgaagaa gaaggtcaaa gtgaggagga agagtatgag gtagaacaag atgaagatga

4321
ctctcaagaa gaggaagaag tcagcctacc caaacgagga agaccacaag ttagattgcc

4381
agttaaaaca agagggaaac ttagctcttc tttctcaagt cgtggccaac aacaagaacc

4441
tggaagatac ccttcaagga gtcagcagag cacacccaaa acaactgttt cttctaaaac

4501
tggtagaagc ctaagaaaga taaactctgc tcctcctaca gaaacaaaat ctttaagaat

4561

tgccagtcgt tctactcgcc acagtcatgg cccactgcaa gcagatgtat ttgtggaatt

4621

gcttagtcct cgtagaaaac gcagaggcag gaaaagtgct aataatacac cagaaaatag

4681

tcccaacttc cctaacttca

gagtcattgc cacaaagtca agtgaacagt caagatctgt

4741

aaatattgct tcaaaacttt ctctccaaga
gagtgaatcc aaaagaagat gcagaaaaag

4801

acaatctcca gagccatcgc ctgtgacact gggtcgaagg agttctggcc gacagggagg

4861

agttcatgaa ttgtctgctt ttgaacaact tgttgtagaa ttggtacgac atgatgacag

4921

ctggcctttt ttgaaacttg tttctaaaat ccaggtccca gactactatg acatcatcaa

4981

aaagcccatt gccttaaata taattcgtga aaaagtgaat aagtgtgaat ataaattagc

5041

atctgagttt attgatgaca ttgagttaat gttttcgaac tgctttgaat acaaccctcg

5101

taacacaagt gaagcaaaag ctggaactag gcttcaagca ttttttcata ttcaggctca

5161

aaagcttgga ctccacgtca

cacccagtaa tgtggaccaa gttagcacac caccggctgc

5221

gaaaaagtca cgaatctgac tttgtccttc
taaaggatat atttgaagaa aaacaaattg

5281
ttcatgaaaa tggaacatta aatcatgctg tataaagcaa taacaattga ttgaccacat

5341
gaaagtgtgg cctgcactat attctcaatt ttaatattaa gcactcagga gaatgtagga

5401
aagatatcct ttgctacagt tttgttcagt atctaataag tttgatagat gtattggata

5461
cagtactggt ttacagaggt ttttgtacat ttttgagatc attcatgtgt ccagagatct

5521
tggaaaatat tttttcaccc acgatttatt ttgttattga tgattttttt ttaaagtggt

5581
ggtattaagg gagagttatc tacatggatg agtcttccgc tatagcacag tttagaaaag

5641
gtgtttatgt cttaattaat tgtttgagta cattctttca acactacaca tgaatgaatc

5701
caatcttata accttgaagt gctgtaccag tgctggctgc aggtattaag tccaagttta

5761
ttaactagat atttatttag tattgagagt aatttgtgaa tttgttttgt atttataaaa

5821
tttatacctg aaaaatgttc cttaatgttt taaacctttt actgtgtttt tattcctcta

5881
acttccttaa tgatcaatca aaaaaagtaa caccctccct ttttcctgac agttctttca

5941
gctttacaga actgtattat aagtttctat gtataacttt ttaactgtac aaataaaata

6001
acattttttc aaataaaaaa aaaaaaaaaa a

AD2 LYN-v-yes-1 Yamaguchi sarcoma viral related oncogene homolog,

mRNA NM_001111097.2

(SEQ ID NO: 120)

1
agacagccag ttcctctccc gccgcgccgg gccgcgctgc cgctcgctcc ccggccgtgg

61
cgcctccggg ccagacgcgc tgcagcctcc agcccgcggc aagcggggcg gccgcgccac

121
ccccggcccc gcgccagcag cccctcgccg cgcgtccagc gttcccggcc agcagcctcc

181
ccatacgcag gtcctgctgg gccgccccgt cgcgcccccc actctgaact caagtcaccg

241
tggagctccg ccgccccgaa actttcaccg cgagcgggaa atatgggatg tataaaatca

301
aaagggaaag acagcttgag tgacgatgga gtagatttga agactcaacc agttccagaa

361
tctcagcttt tacctggaca gaggtttcaa actaaagatc cagaggaaca aggagacatt

421
gtggtagcct tgtaccccta tgatggcatc cacccggacg acttgtcttt caagaaagga

481
gagaagatga aagtcctgga ggagcatgga gaatggtgga aagcaaagtc ccttttaaca

541
aaaaaagaag gcttcatccc cagcaactat gtggccaaac tcaacacctt agaaacagaa

601
gagtggtttt tcaaggatat aaccaggaag gacgcagaaa ggcagctttt ggcaccagga

661
aatagcgctg gagctttcct tattagagaa agtgaaacat taaaaggaag cttctctctg

721
tctgtcagag actttgaccc tgtgcatggt gatgttatta agcactacaa aattagaagt

781
ctggataatg ggggctatta catctctcca cgaatcactt ttccctgtat cagcgacatg

841
attaaacatt accaaaagca ggcagatggc ttgtgcagaa gattggagaa ggcttgtatt

901
agtcccaagc cacagaagcc atgggataaa gatgcctggg agatcccccg ggagtccatc

961
aagttggtga aaaggcttgg cgctgggcag tttggggaag tctggatggg ttactataac

1021
aacagtacca aggtggctgt gaaaaccctg aagccaggaa ctatgtctgt gcaagccttc

1081
ctggaagaag ccaacctcat gaagaccctg cagcatgaca agctcgtgag gctctacgct

1141
gtggtcacca gggaggagcc catttacatc atcaccgagt acatggccaa gggcagtttg

1201
ctggatttcc tgaagagcga tgaaggtggc aaagtgctgc ttccaaagct cattgacttt

1261
tctgctcaga ttgcagaggg aatggcatac atcgagcgga agaactacat tcaccgggac

1321
ctgcgagcag ctaatgttct ggtctccgag tcactcatgt gcaaaattgc agattttggc

1381
cttgctagag taattgaaga taatgagtac acagcaaggg aaggtgctaa gttccctatt

1441
aagtggacgg ctccagaagc aatcaacttt ggatgtttca ctattaagtc tgatgtgtgg

1501
tcctttggaa tcctcctata cgaaattgtc acctatggga aaattcccta cccagggaga

1561
actaatgccg acgtgatgac cgccctgtcc cagggctaca ggatgccccg tgtggagaac

1621
tgcccagatg agctctatga cattatgaaa atgtgctgga aagaaaaggc agaagagaga

1681

ccaacgtttg actacttaca gagcgtcctg gatgatttct acacagccac ggaagggcaa

1741

taccagcagc agccttagag cacagggaga cccgtccatt tggcaggggt ggctgcctca

1801

tttagagagg

aaaagtaacc atcactggtt gcacttatga tttcatgtgc ggggatcatc

1861

tgccgtgcct ggatcctgaa
atagaggcta aattactcag gaagaacacc ctctaaatgg

1921

gaaagtattc tgtactctta gatggattct ccactcagtt gcaacttgga cttgtcctca

1981

gcagctggta atcttgctct gcttgacaac atctgagtgc agccgtttga gaagaaaaca

2041

tctattctct ccaaaaatgc acccaactag ctctatgttt acaaatggac ataggactca

2101

aagtttcaga gaccattgca atgaatcccc aataattgca gaactaaact catttataaa

2161

gctaaaataa ccggatatat acatagcatg acatttcttt gtgctttggc ttacttgttt

2221

aaaaaaaaaa aaaaactaat ccaacctgtt agattttgca ggtgaagtca gcagcttaaa

2281

aatgtctttc ccagatttca atgatttttt tccccctacc tcccaaaatc tgagactgtt

2341

aaaacatttt

tcttctatga acactgctca gacctgctag acatgccata ggagtggcgt

2401

gcacatctct ctctcttcca
gcaggaggag cccgtgagca cgcacagctg ccctgtctgc

2461

tcacccgaag gcaccgggct cacctggacc tcccaggaaa gggagaagag cctcagaaac

2521
tgctctgtgt ttagaaggaa tatttttaag agtccagctt tttcatttcc acaatttcct

2581
atatccagat ttgttttgac aatgtagttt ggaagaacta agattctaat ctctgaagaa

2641
ccttataggg ccttctaaaa cataagagtt tcctttgttg cttcaaatat ttgaacatta

2701
tgttaaagat caagtattaa ttttagttgt actctagaaa gctaaagtgc cacattcggg

2761
gctattttta tgattcagca atcttttcta aattgtgtag catgtgtatg agactattta

2821
tacccaagga tatgaaggaa cataagtgac tacaaggctc taataagcca cggtggcagg

2881
aggttcaagc ggttctgttc actaaatttt tctcctgtaa gctttgaatg gaaacttctg

2941
tatcacatga tgtgtttcac ttatgctgtt gtgtatatac ctaatatttc tatttttgat

3001
tttattttaa tacacctcgt ccaataacat ctcaagcttt ttatttgcat ttacattttc

3061
agctgtggtc agtgtaaaaa ttggtcatca gctgggggcg gggtggttag aagtgattca

3121
acagagctac atgctttaaa cttgcccaag ttctacctcc ttcctttgaa catttcagat

3181
tggagaacca aggagttgat tgcctgaaca cctgaacatc cgtttatggg ggccagatag

3241
aatttgtttt caaataggct taacaggcat cattaaaatt tcattctgtg tgttttgttt

3301
aggcttgagg tgcttagaag atgggataaa atattctact tttttctaaa ttttaacttt

3361
gtttcctatg tgattttttt aaatgtcctt tctaaaatat tctaaaatta ttgattcaca

3421
agtgccatgt tcagaactat agaatattac tgttacataa tgtctgcaca gctggtccct

3481
tgattcagtg gtaaggtttt tgtgtacacc cccctgcttg cattttattt cagaaccaca

3541
agtattaccc aatatgttac atggagagga actataaaga atccctaagg caaaaagaag

3601
tctctagaaa atgactagag gttttttttt tagcataaca aatttattta aagaaaatta

3661
ttaaatttat cttcgccttg ttttgcttct cccagttcct cctcttcttg ccattttcca

3721
cttgtctttc cctcccaatc aagcctgtga tccttacctc catgtgggcc cttcaccagc

3781
ttgggcctca tctctggtgt ccagcatgtg tggaagtcac acgttccctt gatgaacagc

3841
acacacagtc tccttactta gctataggtt tccagcctcc ctgtgacaga caggcataat

3901
gaggggctga ataggtgttt gtagcatttt cgggtatcca gtggtgtgca aaatggctca

3961
tgtcatcaca cctcaggtta ttgtagagaa ctggaaagac agaatccata ctccctaccg

4021
ccaagattct gacttagctg ttgtgcagcg ggagatgtat gtcagtctat tttaaaagct

4081
tctccagtca gctag

AD3 TAPBP-tapasin isoform 1 precursor, mRNA NM_003190.4

(SEQ ID NO: 121)

1
ggggacgcgg cacagatagg gggaagccgg agtaatggtt ttcgggcaag tggatgttgg

61
agagcacaca caggagttgg ggggcggggg agggcctggg gttggggagg gctcgaactc

121
ggggctgctg ggtagtccag gagggcgcgg taaggctggg gtgtcctggt gagaactgga

181

gaggatctac ccgggtccct gcctggccag tggggaaaca ccggtccccc aggcaccttc

241

acctaaccag agcggggatt

tccaccgccc ctcatgccgc cctttggagg aaagtgaaag

301

tgaaaggagg aagaggaggc ttcatggctg
aggaggtcgc agcgccatga agtccctgtc

361

tctgctcctc gctgtggctt tgggcctggc gaccgccgtc tcagcaggac ccgcggtgat

421

cgagtgttgg ttcgtggagg atgcgagcgg aaagggcctg gccaagagac ccggtgcact

481

gctgttgcgc cagggaccgg gggaaccgcc gccccggccg gacctcgacc ctgagctcta

541

tctcagtgta cacgaccccg cgggcgccct ccaggctgcc ttcaggcggt atccccgggg

601

cgcccccgca ccacactgcg agatgagccg cttcgtgcct ctccccgcct ctgcgaaatg

661

ggccagcggc ctgacccccg cgcagaactg cccgcgggcc ctggatgggg cttggctgat

721

ggtcagcata tccagcccag tcctcagcct ctccagcctc ttgcgaccac agccagagcc

781

tcagcaggag cctgttctca tcaccatggc aacagtggta ctgactgtcc tcacccacac

841

ccctgcccct cgagtgagac tgggacaaga tgctctgctg gacttgagct ttgcctacat

901

gccccccacc tccgaggccg cctcatctct ggctccgggt ccccctccct ttgggctaga

961

gtggcgacgc cagcacctgg gtaagggaca tctgctcctg gctgcaactc ctgggctgaa

1021

tggccagatg ccagcagccc aagaaggggc cgtggcattt gctgcttggg atgatgatga

1081

gccatggggc ccatggaccg gaaatgggac cttctggctg cctacagttc aaccctttca

1141

ggagggcacc tatctggcca ccatacacct gccatacctg caaggacagg tcaccctgga

1201

gcttgctgtg tacaaacccc ccaaagtgtc cctgatgcca gcaacccttg cacgggccgc

1261

cccaggggag gcacccccgg aattgctctg ccttgtgtcc cacttctacc cttctggggg

1321

cctggaggtg gagtgggaac tccggggtgg cccagggggc cgctctcaga aggccgaggg

1381

gcagaggtgg ctctcggccc tgcgccacca ttccgatggc tctgtcagcc tctctgggca

1441

cttgcagccg cccccagtca ccactgagca gcatggggca cgctatgcct gtcgaattca

1501

ccatcccagc ctgcctgcct cggggcgcag cgctgaggtc accctggagg tagcaggtct

1561

ttcagggccc tcccttgagg acagcgtagg ccttttcctg tctgcctttc ttctgcttgg

1621

gctcttcaag gcactgggct gggctgctgt ctacctgtcc

acctgcaagg attcaaagaa

1681

gaaagcagag tgagggcact cactgccatc ctgtggaagc caccatcatc
tctggcccaa

1741

gcttctgtag tagctcccta aaataatacc ctatcatctg ctcctaatcc ctccaatctc

1801

tctccactga gtggctggaa tgcttttttt tttttctttc acttatataa gggataattt

1861
ttcttttttt tttttttttg agacggagtc tcactcttcc gcccaggctg cagtgcagtg

1921
gcatgatctt ggcttactgc aacctccgcc tcctgggttc aagcaattct gtggcttcag

1981
cctccggagt agctgggatt acaggcacat gccaccacac ccagtgaatt tttgtatttt

2041
tagtagagac ggggtttcac catgttggcc aggctggtct tgaattcctg acctcaggtg

2101
atctgcccac ctcagcctcc caaagtgctg ggattacagg cgtgagccac cacaccaggc

2161
ccgagaaatg cttttttaaa aaacacacat cttatggcat tcaccttctt ggagctctag

2221
gacagtggtt ctcaaaattt ttttctctca ggacctctta aaaatcatca aggaccccaa

2281
aaagcttttg ggtatgtggg ttatagctat caatatttat ggtactagaa cttaaaagtg

2341
agaaaaattt aaaacacgag aatacatagg cacacattct attcatcgtg ggaaccatgg

2401
tgtcaataca tatcatgtag cttctgaaaa actccactgt acacttatag aatgaagaag

2461
gcaaaaaact tttttttttt tttttttgag acggagtctc gctctgtcgc ccaggctgga

2521
gtgcagtggc gcgatctcgg ctcactgcaa gctccgcctc tcgggttcac gccattctcc

2581
tgcctcagcc tcccaagtag ctcggactac aggcgtcctc caccatgcct ggctaatatt

2641
ttgtattttt tagtagagac ggggtttcac cgtgttagcc aggatggtct cgatctccta

2701
acctggtgat ccgcccgcct cggcctccca aagtattggg attacccgcg tgagccaccg

2761
cgcccggctg caaataatct ttcttttttt ctgagacaga gtctcgctct gttgcccagg

2821
ctggagtgca gtggcacgat ctcggctcac ggcacgctcc gcctcccggg ttcacgccat

2881
tctcctgcct cagcttcccg agtagctggg actacagggg cccgccacca cgcccggcta

2941
actttttgtg tttttagtag agacggggtt tcaccgtgtt agccaggatg gtctcgatct

3001
cctgaccttg tgatctgccc gcctcggcct cccaaagtgc tgggattaca ggcgtgagcc

3061
accgcgcccg gcggcgaaac acgatattgt actaacatct taattttgtt ataaaatctc

3121
acaaaccccc tgacatagtc tcagagatct gtagggccga ggttacattt ggagaacccg

3181
tactctaggg ccaaatccat tcttcttgcc ctggctcact tgtccccccc accgccccgc

3241
gctggagcca ctgcctagtt cttcagccct agatggtgct cgccagacct cctctcaatg

3301
ctcatcacac acagggctat tcctttcctc caatgaacca aacgcctccc gcccacctcc

3361
aggtcccagt cctctgttcc ctttgcctgg tccacccttg ccctccctgg gtcgcagacg

3421
aggtcggcct cgtcattccc cgcagaccgc cgcgcgtccc tcttgtgcgg ttcaccacag

3481
ttgtatttaa gtgatcgtgt gagtcgtcgt taaatgcctg tctccccgcg gatcatgggc

3541
tcctcgagga cagggactgg cctgtctgtc cactgctgta accccgcgcc ggcataggga

3601
cctaaggccc actggagggc gctcatcaag tagctgctgg atgttgacga aggaagcggc

3661
ggcgcagctc agggatctcc gagtcaggac ggtcggccag acccacgggg taacgggtct

3721
aatcgtgtag gaataaagct gtattccagt gcttccaaaa aaaaaaaaaa aaaaaaaaa

AE1 SERPINB1-serpin peptidase inhibitor, clade B (ovalbumin) member

1, mRNA-NM_030666.3

(SEQ ID NO: 122)

1
agaaagaagc cgcgcccctg aggagggcgc tgcccggaag ccacgctcac ttctgcttgc

61
acttaggcga cctcgggagc tcggactcct acgcagtcac cgggaagggc cgccgccccg

121
cccgcggctg ctggcccggg tgacgcttcc gcctgctata agagcagcgg ccctcggtgc

181
ctccttcctg acctcgcacc cagctcggag cccggagcgt gcctcggcgg cctgtcggtt

241
ttcaccatgg agcagctgag ctcagcaaac acccgcttcg ccttggacct gttcctggcg

301
ttgagtgaga acaatccggc tggaaacatc ttcatctctc ccttcagcat ttcatctgct

361
atggccatgg tttttctggg gaccagaggt aacacggcag cacagctgtc caagactttc

421
catttcaaca cggttgaaga ggttcattca agattccaga gtctgaatgc tgatatcaac

481
aaacgtggag cgtcttatat tctgaaactt gctaatagat tatatggaga gaaaacttac

541
aatttccttc ctgagttctt ggtttcgact cagaaaacat atggtgctga cctggccagt

601
gtggattttc agcatgcctc tgaagatgca aggaagacca taaaccagtg ggtcaaagga

661
cagacagaag gaaaaattcc ggaactgttg gcttcgggca tggttgataa catgaccaaa

721
cttgtgctag taaatgccat ctatttcaag ggaaactgga aggataaatt catgaaagaa

781
gccacgacga atgcaccatt cagattgaat aagaaagaca gaaaaactgt gaaaatgatg

841
tatcagaaga aaaaatttgc atatggctac atcgaggacc ttaagtgccg tgtgctggaa

901
ctgccttacc aaggcgagga gctcagcatg gtcatcctgc tgccggatga cattgaggac

961
gagtccacgg gcctgaagaa gattgaggaa cagttgactt tggaaaagtt gcatgagtgg

1021
actaaacctg agaatctcga tttcattgaa gttaatgtca gcttgcccag gttcaaactg

1081
gaagagagtt acactctcaa ctccgacctc gcccgcctag gtgtgcagga tctctttaac

1141
agtagcaagg ctgatctgtc tggcatgtca ggagccagag atatttttat atcaaaaatt

1201

gtccacaagt catttgtgga agtgaatgaa gagggaacag aggcggcagc tgcc

acagca

1261

ggcatcgcaa ctttctgcat gttgatgccc gaagaaaatt tcactgccga ccatccattc

1321

cttt

tcttta ttcggcataa ttcctcaggt agcatcctat tcttggggag attttcttcc

1381

ccttagaaga aagagactgt agcaatacaa aaatcaagct tagtgcttta ttacctgagt

1441

ttttaataga gccaatatgt cttatatctt taccaataaa accactgttc agaaacaagt

1501

ctttcatttt ctttgtaagt ttggctctgt tggctgttta cacccatgaa ttttggcatg

1561

ggtatctatt tttctttttt acattgaaaa aaatccagtg gttgcttttg aatgcatcaa

1621

gtaaagaaga agaaaagaat

acatccgatg cgtagattct tgaccatgta gtaatctata

1681

aaattgctat atcctcctga tagccatggg
aaaacatgat aagatggtca tttattttgc

1741

agttagaatt ttggaagcca caaaatagac agacaccctg actgttgaag ggaggtttaa

1801

aaacagatat tcaattgaaa tgtaagagag caccccaatt gagagcccag gttacgaaga

1861

caagcttgcc tcgcctgact tttctgtccc ttgttctgca ggattagtat tctgttacag

1921

acctctagtt tttagactct tcaattaaag ggccaatggt tataacctgc attccctttt

1981

ttgttcttct ttatgtataa tatatagttc atgtggcgct gcatgaaatc aagaagtggg

2041

tgtcttagga taaaagatac caagagtcta caaaaataac catgtagtaa gataaactgc

2101
tgaacaaagg ttttactgtt agccaccttc tcatgtgttt tcttttctct ttttcttttt

2161
ctttctttct ttcttttttt tttttttgag acagagtctt gctctgttac ccaggctgga

2221
gtgcagtggc acgatctcag ctcaccgcaa cctctgcctc ctgggttcaa gtgattctct

2281
tgcttcagcc tcctgagtag ctgggattat aggcatgcac cactaggcct ggctaatttt

2341
tgtattttta gtagagatgg ggtttttcca tgttggccag gctggtcccg aactcctgac

2401
ctcaggtgat ccgcgcacct cagcctccca aagtgctggg attacaggca tgagctacca

2461
tgcctggcct tctcatgtgt tttctgatta aggctcttga cttccaaggc tgtgtgggga

2521
gatggggtgg gggctcttgg actgatataa aactttgtca aatgtagttc tttgaatgga

2581
gcttgaaacg ccgcatattc ttgctcccac aaggatagtg ggcatcatga attaataaaa

2641
cgtcctagga ttctgcaagc taaaaaaaaa aaaaaaaa

AE2 PSMB9-proteasome (prosome, macropain) subunit, beta type 9, mRNA

NM_002800.4

(SEQ ID NO: 123)

1
gcgcgttgtg cgctgtccca ggttggaaac cagtgcccca ggcggcgagg agagcggtgc

61
cttgcaggga tgctgcgggc gggagcacca accggggact taccccgggc gggagaagtc

121
cacaccggga ccaccatcat ggcagtggag tttgacgggg gcgttgtgat gggttctgat

181
tcccgagtgt ctgcaggcga ggcggtggtg aaccgagtgt ttgacaagct gtccccgctg

241

cacgagcgca tctactgtgc actctctggt tcagctgctg atgcccaagc cgtggccgac

301

atggccgcct accagctgga gctccatggg atagaactgg aggaacctcc acttgttttg

361

gctgctgcaa atgtggtgag aaatatcagc tataaatatc gagaggactt gtctgcacat

421

ctcatggtag ctggctggga ccaacgtgaa ggaggtcagg tatatggaac cctgggagga

481

atgctgactc gacagccttt tgccattggt ggctccggca gcacctttat ctatggttat

541

gtggatgcag catataagcc aggcatgtct cccgaggagt gcaggcgctt caccacagac

601

gctattgctc tggccatgag ccgggatggc tcaagcgggg gtgtcatcta cctggtcact

661

attacag
c

tg ccggtgtgga ccatcgagtc atcttgggca atgaactgcc aaaattctat

721

gatgagtgaa ccttccccag acttctcttt cttattttgt aataaactct ctagggccaa

781

aacctggtat ggtcattggg aaatgagtgc tcagggagat ggagcttagg

ggaggtgggt

841

gcttccctcc tagatgtcag catacactct ttcttctttt gtcccaggtc taaaacatct

901
ttcctagaga aaacaaaagg gactaaacta gaaatataaa gagccctata catgacaggt

961
gatcacgtac tgaatgattt tgaagtagta caaacaataa aaattctcat tccgcatcat

1021
catgcggtcc atgatgatga ggccgcaa

AE3 WSB1-WD repeat and SOCS box containing 1 mRNA-NM_015626.8

(SEQ ID NO: 124)

1
agatatctcc ggcgccgccc gccattttga ctccagtgtc tcgtttgcag tcggcgcttt

61
aggggaactg tcttcctccg caggcgcgag gctgggtaca gggtctattg tctgtggttg

121
actccgtact ttggtctgag gccttcggga gctttcccga ggcagttagc agaagccgca

181
gcggccgccc ccgcccgtct cctctgtccc tgggcccggg agggaccaac ttggcgtcac

241
gcccctcagc ggtcgccact ctcttctctg ttgttgggtc cgcatcgtat tcccggaatc

301
agacggtgcc ccatagatgg ccagctttcc cccgagggtc aacgagaaag agatcgtgag

361
attacgtact ataggtgaac ttttagctcc tgcagctcct tttgacaaga aatgtggtcg

421
tgaaaattgg actgttgctt ttgctccaga tggttcatac tttgcttggt cacaaggaca

481
tcgcacagta aagcttgttc cgtggtccca gtgccttcag aactttctct tgcatggcac

541
caagaatgtt accaattcaa gcagtttaag attgccaaga caaaatagtg atggtggtca

601
gaaaaataag cctcgtgaac atattataga ctgtggagat atagtctgga gtcttgcttt

661
tgggtcatca gttccagaaa aacagagtcg ctgtgtaaat atagaatggc atcgcttcag

721
atttggacaa gatcagctac ttcttgctac agggttgaac aatgggcgta tcaaaatatg

781
ggatgtatat acaggaaaac tcctccttaa cttggtagat catactgaag tggtcagaga

841
tttaactttt gctccagatg gaagcttgat cctggtgtca gcttcaagag acaaaactct

901

cagagtatgg gacctgaaag atgatggaaa catgatgaaa gtattgaggg ggcatcagaa

961

ttgggtgtac agctgtgcat tctctcctga ctcttctatg ctgtgttcag tcggagccag

1021

taaagcagtt ttcctttgga atatggataa atacaccatg atacggaaac tagaaggaca

1081

tcaccatgat gtggtagctt gtgacttttc tcctgatgga gcattactgg ctactgcatc

1141

ttatgatact cgagtatata tctgggatcc acataatgga gacattctga tggaatttgg

1201

gcacctgttt cccccaccta ctccaatatt tgctggagga gcaaatgacc ggtgggtacg

1261

atctgtatct tttagccatg atggactgca tgttgcaagc cttgctgatg ataaaatggt

1321

gaggttctgg agaattgatg aggattatcc agtgcaagtt gcacctttga gcaatggtct

1381

ttgctgtgcc ttctctactg atggcagtgt tttagctgct gggacacatg acggaagtgt

1441

gtatttttgg gccactccac ggcaggtccc tagcctgcaa catttatgtc gcatgtcaat

1501

ccgaagagtg atgcccaccc aagaagttca ggagctgccg attccttcca agcttttgga

1561

gtttctctcg tat

cgtattt agaagattct gccttcccta gtagtaggga ctgacagaat

1621

acacttaaca caaacctc
aa gctttactga cttcaattat ctgtttttaa agacgtagaa

1681

gatttattta atttgatatg ttcttgtact gcattttgat cagttgagct tttaaaatat

1741

tatttataga caatagaagt atttctgaac atatcaaata taaatttttt taaagatcta

1801

actgtgaaaa catacatacc tgtacatatt tagatataag ctgctatatg ttgaatggac

1861

ccttttgctt ttctgatttt tagttctgac atgtatatat tgcttcagta gagccacaat

1921

atgtatcttt gctgtaaagt gcaaggaaat tttaaattct gggacactga gtt

agatggt

1981

aaatactgac ttacgaaagt tgaattgggt gaggcgggca aatcacctga ggtcagcagt

2041

ttg

agactag cctggcaaac atgatgaaac cctgtctcta ctaaaaatac aaaaaaaaaa

2101

aaaattagcc aggcgtggtg gtgcacacct gtagtcctag ctacttggga ggctgaggca

2161

ggagaattgc ttgaacccag gaggtggagg ttgcagtaag ccaagatcac accactgcac

2221

tccaacctgg acaacagagc gagactccat ctcaaaaaaa aaaaaaaatt gtgttgcctc

2281
atacgaaatg tatttggttt tgttggagag tgtcagactg atctggaagt gaaacacagt

2341
ttatgtacag ggaaaaggat tttattatcc ttaggaatgt catccaagac gtagagcttg

2401
aatgtgacgt tatttaaaaa caacaacaaa gaaggcagag ccaggatata actagaaaaa

2461
ggatgtcttt tttttttttt ttactccccc tctaaacact gctgctgcct taattttaga

2521
aagcagctta ctagtttacc cttgtggtat aaagtattat aaattgttgt gaatttgaag

2581
aatccgtcta ctgtattatt gctaaatatt ttgtttatac taagggacaa ttattttaag

2641
accatggatt taaaaaaaaa aaaaaaaact ctgtttctgc aggggatgat attggtgagt

2701
tgccaaagaa gcaatacagc atatctgctt ttgccttctg ttgtttatct tacctgcaga

2761
tattaagaat gtatgcatta tgtaaaatgc tcaattatat atttttgttg agttttttaa

2821
ttaaagactt gttaaaaaaa aaaaaaaaa

AF1 MVP-major vault protein, mRNA-NM_005115.4

(SEQ ID NO: 125)

1
aactcccaag ccccacccct gggcttggcc tgccttgccc tgccgggaag tgatccccaa

61
ggcagggtga gagttcccca tctgaggcgt ttgttgcagc tacctgcact tctagattca

121
tcttcttgtg agccctgggc ttaggagtca ccatggcaac tgaagagttc atcatccgca

181
tccccccata ccactatatc catgtgctgg accagaacag caacgtgtcc cgtgtggagg

241
tcgggccaaa gacctacatc cggcaggaca atgagagggt actgtttgcc cccatgcgca

301
tggtgaccgt ccccccacgt cactactgca cagtggccaa ccctgtgtct cgggatgccc

361
agggcttggt gctgtttgat gtcacagggc aagttcggct tcgccacgct gacctcgaga

421
tccggctggc ccaggacccc ttccccctgt acccagggga ggtgctggaa aaggacatca

481
cacccctgca ggtggttctg cccaacactg ccctccatct aaaggcgctg cttgattttg

541
aggataaaga tggagacaag gtggtggcag gagatgagtg gcttttcgag ggacctggca

601
cgtacatccc ccggaaggaa gtggaggtcg tggagatcat tcaggccacc atcatcaggc

661
agaaccaggc tctgcggctc agggcccgca aggagtgctg ggaccgggac ggcaaggaga

721
gggtgacagg ggaagaatgg ctggtcacca cagtaggggc gtacctccca gcggtgtttg

781
aggaggttct ggatttggtg gacgccgtca tccttacgga aaagacagcc ctgcacctcc

841
gggctcggcg gaacttccgg gacttcaggg gagtgtcccg ccgcactggg gaggagtggc

901
tggtaacagt gcaggacaca gaggcccacg tgccagatgt ccacgaggag gtgctggggg

961
ttgtgcccat caccaccctg ggcccccaca actactgcgt gattctcgac cctgtcggac

1021
cggatggcaa gaatcagctg gggcagaagc gcgtggtcaa gggagagaag tcttttttcc

1081
tccagccagg agagcagctg gaacaaggca tccaggatgt gtatgtgctg tcggagcagc

1141
aggggctgct gctgagggcc ctgcagcccc tggaggaggg ggaggatgag gagaaggtct

1201
cacaccaggc tggggaccac tggctcatcc gcggacccct ggagtatgtg ccatctgcca

1261
aagtggaggt ggtggaggag cgccaggcca tccctctaga cgagaacgag ggcatctatg

1321
tgcaggatgt caagaccgga aaggtgcgcg ctgtgattgg aagcacctac atgctgaccc

1381
aggacgaagt cctgtgggag aaagagctgc ctcccggggt ggaggagctg ctgaacaagg

1441
ggcaggaccc tctggcagac aggggtgaga aggacacagc taagagcctc cagcccttgg

1501
cgccccggaa caagacccgt gtggtcagct accgcgtgcc ccacaacgct gcggtgcagg

1561
tgtacgacta ccgagagaag cgagcccgcg tggtcttcgg gcctgagctg gtgtcgctgg

1621
gtcctgagga gcagttcaca gtgttgtccc tctcagctgg gcggcccaag cgtccccatg

1681
cccgccgtgc gctctgcctg ctgctggggc ctgacttctt cacagacgtc atcaccatcg

1741
aaacggcgga tcatgccagg ctgcaactgc agctggccta caactggcac tttgaggtga

1801
atgaccggaa ggacccccaa gagacggcca agctcttttc agtgccagac tttgtaggtg

1861
atgcctgcaa agccatcgca tcccgggtgc ggggggccgt ggcctctgtc actttcgatg

1921

acttccataa gaactcagcc cgcatcattc gcactgctgt ctttggcttt gagacctcgg

1981

aagcgaaggg ccccgatggc atggccctgc ccaggccccg ggaccaggct gtcttccccc

2041

aaaacgggct ggtggtcagc agtgtggacg tgcagtcagt ggagcctgtg gatcagagga

2101

cccgggacgc cctgcaacgc agcgtccagc tggccatcga gatcaccacc aactcccagg

2161

aagcggcggc caagcatgag gctcagagac tggagcagga agcccgcggc cggcttgagc

2221

ggcagaagat cctggaccag tcagaagccg agaaagctcg caaggaactt ttggagctgg

2281

aggctctgag catggccgtg gagagcaccg ggactgccaa ggcggaggcc gagtcccgtg

2341

cggaggcagc ccggattgag ggagaagggt ccgtgctgca ggccaagcta aaagcacagg

2401

ccttggccat tgaaacggag gctgagctcc agagggtcca gaaggtccga gagctggaac

2461

tggtctatgc ccgggcccag ctggagctgg aggtgagcaa ggctcagcag

ctggctgagg

2521

tggaggtgaa gaagttcaag cagatgacag aggccatagg ccccagcacc atcagggacc

2581

ttgctgtggc tgggcctgag atgcaggtaa aactgctcca gtccctgggc ctgaaatcaa

2641

ccctcatcac cgatggctcc actcccatca acctcttcaa cacagccttt gggctgctgg

2701

ggatggggcc cgagggtcag cccctgggca gaagggtggc cagtgggccc agccctgggg

2761

aggggatatc cccccagtct gctcaggccc

ctcaagctcc tggagacaac cacgtggtgc

2821

ctgtactgcg ctaactcctg attaatacaa tggaagtttc
tgggcattta caatttcaac

2881
acttaaaaaa aaaaaaaaaa aa

AF2 APBB1IP-amyloid beta (A4) precursor protein-binding, family B,

member 1 interacting protein, mRNA NM_019043.3

(SEQ ID NO: 126)

1
tctcagtctt tggtggaacc atcactaggc cccaatccct tagtccctct tgcgtcgagg

61
ctgcaaaatg gttccattcg ccaggagacg ctcctgagag aagggcgcgc gcggcacagg

121
ggccttcctt gcacctcgga gcaaagcagc tcggatagcg ccacacgtct gcgcgctgcg

181
tgggaagggc agggctgaca gcacttcctc cccggggcag cgacctggag cccgggtgcg

241
gcagtctgca ccgcgcgtcg ctttcccggc cggagtctcg ccgccttccc gcgccccgca

301

gcgccccgca gagcagtcga gatgggtgag tcaagtgaag acatagacca aatgttcagc

361

actttgctgg gagagatgga tcttctgact cagagtttag gagttgacac tctccctcct

421

cctgacccta atccacccag agctgaattt aactacagtg tggggtttaa agatttaaat

481

gagtccttaa atgcactgga agaccaagat ttagatgctc tcatggcaga tctggtagca

541

gacataagtg aggctgagca gaggacaatc caggcacaga aagagtcctt gcagaatcaa

601

catcattcag catctctaca agcatcaatt ttcagtggtg cagcctctct tggttatgga

661

acaaatgttg ctgccactgg tatcagccaa tatgaggatg acttaccacc tccaccagcc

721

gatcctgtgt tagaccttcc actgccacca ccacctcctg aacctctctc tcaggaagag

781

gaagaagccc aagccaaggc tgataaaatt aagctggcgc tggaaaaact gaaggaggcc

841

aaggttaaga agctcgtcgt caaggtgcac

atgaatgata acagcacaaa gtcactgatg

901

gtggatgagc ggcagctggc ccgagatgtt ctggacaacc
ttttcgagaa aactcattgt

961

gactgcaatg tagactggtg tctttatgaa atctacccgg aactacaaat tgagaggttt

1021

tttgaagacc atgaaaatgt tgttgaagtc ttatcagact ggacaagaga cacagaaaat

1081

aaaatactat ttttggagaa agaggagaaa tatgctgtat ttaaaaaccc ccagaatttc

1141

tacttggata acagaggaaa aaaagaaagc aaggaaacta atgagaaaat gaatgctaaa

1201

aacaaggaat ccttacttga ggaaagtttc tgtggaacat ctatcattgt accagaactg

1261

gaaggagctc tttatttgaa agaagatgga aagaaatcct ggaaaaggcg ctattttctt

1321

ttacgggctt ctggaattta ttatgtaccc aaaggaaaga ctaagacatc tcgagatctg

1381

gcgtgtttta tacagtttga aaatgtcaac atttactatg ggactcagca taaaatgaaa

1441

tataaagcgc ccactgacta ttgctttgtt ttaaagcacc cccaaattca gaaggagtcc

1501

cagtatatca agtatctctg ctgtgatgac acaagaaccc ttaaccagtg ggtcatggga

1561

atacggatag ccaagtatgg gaagactctc tatgataact accagcgggc

tgtggcaaag

1621

gctggacttg cctctcggtg gacaaacttg gggacagtca atgcagctgc accagctcag

1681

ccatctacag gacctaaaac aggcaccacc cagcccaatg gacagattcc ccaggctaca

1741

cattctgtca gtgctgttct ccaagaggcc cagagacatg ctgaaacatc gaaggataag

1801

aagccagccc tcgggaacca ccacgacccg gcagtgcccc gggccccgca cgcccccaag

1861
tccagcctgc ccccgccccc tccggtgcgg aggtcctccg acaccagcgg cagtcccgcc

1921
acgcccctca aggccaaggg cacaggcggc gggggcttgc ccgccccacc cgacgacttc

1981
ctgccgccgc cgccaccgcc gccgcccctc gatgaccctg agctcccgcc gccgcccccg

2041
gacttcatgg agccgccccc agacttcgtg cccccgcccc cgccgtcgta cgcagggatc

2101
gcgggctcag agctgccccc gccgccgccg ccgccgcccg cgcccgcgcc cgcccccgtc

2161
cccgactccg ccaggccgcc ccccgcggtg gccaagaggc ctcctgtgcc ccccaagagg

2221
caagagaacc cagggcaccc cggcggagca ggaggcgggg agcaagattt catgtcagac

2281
ctcatgaaag ctttgcaaaa gaagagaggc aacgtgtcct agggacgggc atgatgagtg

2341
ttccagaggg agaagcatcg ctgaccccga gcgcaggttt tgctagcaga ttgccctgac

2401
atcttgttca tttcagataa aatgtgatgg gaaacttctc actgatgtgc tcaagtacag

2461
gcataaccat taacccagta gagttcagaa tatctgccca aatgtacata tcgttcccat

2521
gtattttaac ctaaatggaa tgtatcttcc cttccaagct gcctaaagcg ctgttttagg

2581
ttcatttatt ttattatgtt cagaagcatc aaataaaagt taaacgtttt tccggaaaaa

2641
aaaaaaaaaa aaaaaaaaa

AF3 FYB-FYN binding protein, mRNA NM_001243093.1

(SEQ ID NO: 127)

1
gcatagctaa cttgcacatt cactatccaa gctgcaccat cttcggggtc attgtgtgcc

61
aggcatatca actcttttca ataaaaatgg atggaaaggc agatgtaaag tccctcatgg

121
cgaaatataa cacggggggc aacccgacag aggatgtctc agtcaatagc cgacccttca

181
gagtcacagg gccaaactca tcttcaggaa tacaagcaag aaagaactta ttcaacaacc

241
aaggaaatgc cagccctcct gcaggaccca gcaatgtacc taagtttggg tccccaaagc

301
cacctgtggc agtcaaacct tcttctgagg aaaagcctga caaggaaccc aagcccccgt

361
ttctaaagcc cactggagca ggccaaagat tcggaacacc agccagcttg accaccagag

421
accccgaggc gaaagtggga tttctgaaac ctgtaggccc caagcccatc aacttgccca

481
aagaagattc caaacctaca tttccctggc ctcctggaaa caagccatct cttcacagtg

541
taaaccaaga ccatgactta aagccactag gcccgaaatc tgggcctact cctccaacct

601
cagaaaatga acagaagcaa gcgtttccca aattgactgg ggttaaaggg aaatttatgt

661
cagcatcaca agatcttgaa cccaagcccc tcttccccaa acccgccttt ggccagaagc

721
cgcccctaag taccgagaac tcccatgaag acgaaagccc catgaagaat gtgtcttcat

781
caaaagggtc cccagctccc ctgggagtca ggtccaaaag cggcccttta aaaccagcaa

841
gggaagactc agaaaataaa gaccatgcag gggagatttc aagtttgccc tttcctggag

901
tggttttgaa acctgctgcg agcaggggag gcccaggtct ctccaaaaat ggtgaagaaa

961
aaaaggaaga taggaagata gatgctgcta agaacacctt ccagagcaaa ataaatcagg

1021
aagagttggc ctcagggact cctcctgcca ggttccctaa ggccccttct aagctgacag

1081
tgggggggcc atggggccaa agtcaggaaa aggaaaaggg agacaagaat tcagccaccc

1141
cgaaacagaa gccattgcct cccttgttta ccttgggtcc acctccacca aaacccaaca

1201
gaccaccaaa tgttgacctg acgaaattcc acaaaacctc ttctggaaac agtactagca

1261
aaggccagac gtcttactca acaacttccc tgccaccacc tccaccatcc catccggcca

1321
gccaaccacc attgccagca tctcacccat cacaaccacc agtcccaagc ctacctccca

1381
gaaacattaa acctccgttt gacctaaaaa gccctgtcaa tgaagacaat caagatggtg

1441
tcacgcactc tgatggtgct ggaaatctag atgaggaaca agacagtgaa ggagaaacat

1501
atgaagacat agaagcatcc aaagaaagag agaagaaaag ggaaaaggaa gaaaagaaga

1561
ggttagagct ggagaaaaag gaacagaaag agaaagaaaa gaaagaacaa gaaataaaga

1621

agaaatttaa actaacaggc cctattcaag tcatccatct tgcaaaagct tgttgtgatg

1681

tcaaaggagg aaagaatgaa ctgagcttca agcaaggaga gcaaattgaa atcatccgca

1741

tcacagacaa cccagaagga

aaatggttgg gcagaacagc aaggggttca tatggctata

1801

ttaaaacaac tgctgtagag attgactatg
attctttgaa actgaaaaaa gactctcttg

1861

gtgccccttc aagacctatt gaagatgacc aagaagtata tgatgatgtt gcagagcagg

1921

atgatattag cagccacagt cagagtggaa gtggagggat attccctcca ccaccagatg

1981

atgacattta tgatgggatt gaagaggaag atgctgatga tggctccaca ctacaggttc

2041

aagagaagag taatacgtgg tcctggggga ttttgaagat gttaaaggga aaagatgaca

2101

gaaagaaaag tatacgagag aaacctaaag tctctgactc agacaataat gaaggttcat

2161

ctttccctgc tcctcctaaa caattggaca tgggagatga agtttacgat gatgtggata

2221

cctctgattt ccctgtttca tcagcagaga tgagtcaagg aactaatgtt ggaaaagcta

2281

agacagaaga aaaggacctt aagaagctaa aaaagcagga aaaagaagaa aaagacttca

2341

ggaaaaaatt taaatatgat ggtgaaatta gagtcctata ttcaactaaa gttacaactt

2401

ccataacttc taaaaagtgg ggaaccagag atctacaggt aaaacctggt gaatctctag

2461

aagttataca aaccacagat gacacaaaag ttctctgcag aaatgaagaa gggaaatatg

2521

gttatgtcct tcggagttac ctagcggaca atgatggaga gatctatgat gatattgctg

2581

atggctgcat ctatgacaat gactagcact caactttggt cattctgctg tgttcattag

2641

gtgccaatgt

gaagtctgga ttttaattgg catgttattg ggtatcaaga aaattaatgc

2701
acaaaaccac ttattatcat ttgttatgaa atcccaatta tctttacaaa gtgtttaaag

2761
tttgaacata gaaaataatc tctctgctta attgttaact cagaagacta cattagtgag

2821
atgtaagaat tattaaatat tccatttccg ctttggctac aattatgaag aagttgaagg

2881
tacttctttt agaccaccag taaataatcc tccttcaaaa aataaaaata aaagaaaaag

2941
gaaaatcatt caggaagaaa tgacctgtct aaaaaaacct aaggaagaat aataatataa

3001
gaaaggaaat ttaaaaacat tccacaagaa gaaaaattat tgtttatact tctacttatg

3061
gttatatctt atattctcta ttcaagtgac ctgtctttta aaaaggcagt gctgtcttac

3121
ctcttgctag tgggttaaat gttttcaaaa attatagcag tagtagaagt tttgtataaa

3181
atttgtcctt atttgttaat tgtatataaa tgttaattat ttgatacgaa tgttatgcat

3241
ttagtatgca cattgaagtc taaactgtag aagagtctaa aacaagttct ctttttgcag

3301
attcacatac taatggttta attctgtgct ctgtttaaag tactattata actagagtag

3361
atctgaatga ggataaccct aaaatcatga ggaatggaag aatggacctt gaaactacct

3421
aggcttttat gcatggcacc tctttataat gaagacactt tttaaagttt ttgtttttgt

3481
ttcaattacc gctagatttt tttttctctt tttttaaaat ccattttact ggaaagttgg

3541
ccagcagagg gagtagaaat tattaaaatt ctagtgtttg gattgggccc ttctctaaca

3601
gtacatactc attcccaaag caatccaaaa acaaaatgtg aaccatttgg gtttcaaatg

3661
ttaagaacac taaatagcat gatttaaaaa atgaaaaatg ctaacaccca agaaaagaag

3721
atattaagtg ctttttaaca actcctagag tacaaaatga gtacatcata atgctggctc

3781
ttctactaat gaaccatcga gtgatattga ataaattatt tatcttctca gtttccttat

3841
ctgtaaatta caatattaga ctaagtaagt ttttccaact cttcactacc aattacctta

3901
ggcttttata atgctccgcc tacttcagtc ccatgtttca gaagcttttg tctatttttt

3961
aaactcattg attaaataat gattaatgca ttctccacat tttaatattg caaaggccca

4021
ttggagtttc tgaagtggct ccacagaatt gaaataattt caaataactg taaaggaact

4081
gaaaatcttc acagagatga agtggggttt ccattaggtg ctttgaaatt tgataacaaa

4141
tcatcaactt ccactggtca atatatagat tttgggtgtc tgaggcccca agattagatg

4201
ccactaatct ccaaagattc cctccaatta tgaaatattt taatgtctac ttttagagag

4261
cactagccag tatatgacca tgtgattaat ttcttttcac actagataaa attacctggt

4321
tcaaaagtgg tttttgttta ttaaatttgg taataaatat atataataca cagacaggat

4381
agtttttatg ctgaagtttt tggccagctt tagtttgagg actccttgat aagcttgcta

4441
aactttcaga gtgccctgag acacttccag ccatccctcc tcctgccttc attggggcag

4501
acttgcattg cagtctgaca gtaatttttt ttctgattga gaattatgta aattcagtac

4561
aatgtcagtt tttaaaagtc aaagttagat caagagaata tttcagagtt ttggtttaca

4621
catcaagaaa cagacacaca tacctaggaa agatttacac aatagataat catcttaatg

4681
tgaaagatat ttgaagtatt aattttaata tattaaatat gatttctgtt atagtcttct

4741
gtatggaatt ttgtcactta agatgagctg caaataaata ataccttcaa tggataaaaa

4801
aaaaaaaaaa a

AG1 MB21D1/C6orf150-Mab-21 domain containing 1, mRNA NM_138441.2

(SEQ ID NO: 128)

1
agcctggggt tccccttcgg gtcgcagact cttgtgtgcc cgccagtagt gcttggtttc

61
caacagctgc tgctggctct tcctcttgcg gccttttcct gaaacggatt cttctttcgg

121
ggaacagaaa gcgccagcca tgcagccttg gcacggaaag gccatgcaga gagcttccga

181
ggccggagcc actgccccca aggcttccgc acggaatgcc aggggcgccc cgatggatcc

241
caccgagtct ccggctgccc ccgaggccgc cctgcctaag gcgggaaagt tcggccccgc

301
caggaagtcg ggatcccggc agaaaaagag cgccccggac acccaggaga ggccgcccgt

361
ccgcgcaact ggggcccgcg ccaaaaaggc ccctcagcgc gcccaggaca cgcagccgtc

421
tgacgccacc agcgcccctg gggcagaggg gctggagcct cctgcggctc gggagccggc

481
tctttccagg gctggttctt gccgccagag gggcgcgcgc tgctccacga agccaagacc

541
tccgcccggg ccctgggacg tgcccagccc cggcctgccg gtctcggccc ccattctcgt

601
acggagggat gcggcgcctg gggcctcgaa gctccgggcg gttttggaga agttgaagct

661
cagccgcgat gatatctcca cggcggcggg gatggtgaaa ggggttgtgg accacctgct

721
gctcagactg aagtgcgact ccgcgttcag aggcgtcggg ctgctgaaca ccgggagcta

781
ctatgagcac gtgaagattt ctgcacctaa tgaatttgat gtcatgttta aactggaagt

841
ccccagaatt caactagaag aatattccaa cactcgtgca tattactttg tgaaatttaa

901
aagaaatccg aaagaaaatc ctctgagtca gtttttagaa ggtgaaatat tatcagcttc

961
taagatgctg tcaaagttta ggaaaatcat taaggaagaa attaacgaca ttaaagatac

1021
agatgtcatc atgaagagga aaagaggagg gagccctgct gtaacacttc ttattagtga

1081

aaaaatatct gtggatataa ccctggcttt ggaatcaaaa agtagctggc ctgctagcac

1141

ccaagaaggc ctgcgcattc aaaactggct ttcagcaaaa gttaggaagc aactacgact

1201

aaagccattt

taccttgtac ccaagcatgc aaaggaagga aatggtttcc aagaagaaac

1261

atggcggcta tccttctctc acatcgaaaa ggaaattttg aacaatcatg gaaaatctaa

1321

aacgtgctgt gaaaacaaag aagagaaatg ttgcaggaaa gattgtttaa aactaatgaa

1381

atacctttta gaacagctga aagaaaggtt taaagacaaa aaacatctgg ataaattctc

1441

ttcttatcat gtgaaaactg ccttctttca

cgtatgtacc cagaaccctc aagacagtca

1501

gtgggaccgc aaagacctgg gcctctgctt tgataactgc
gtgacatact ttcttcagtg

1561

cctcaggaca gaaaaacttg agaattattt tattcctgaa ttcaatctat tctctagcaa

1621
cttaattgac aaaagaagta aggaatttct gacaaagcaa attgaatatg aaagaaacaa

1681
tgagtttcca gtttttgatg aattttgaga ttgtattttt agaaagatct aagaactaga

1741
gtcaccctaa atcctggaga atacaagaaa aatttgaaaa ggggccagac gctgtggctc

1801
ac

AG2 CPVL-carboxypeptidase, vitellogenic-like, mRNA NM_019029.2

(SEQ ID NO: 129)

1
gtgactgggt ggggctgcct cacttctgcc tgatttggga agcgctgcaa ggacaaccgg

61
ctggggtcct tgcgcgccgc ggctcaggga ggagcaccga ctgcgccgcg taagtgccgc

121
ctgccctgcg tgggtcgtgc cagctcagcg ggacaggtcc tcgcctcggt ccctcggact

181
tagggagcgc ggggcagacc ctgagagatg gttggtgcca tgtggaaggt gattgtttcg

241
ctggtcctgt tgatgcctgg cccctgtgat gggctgtttc gctccctata cagaagtgtt

301
tccatgccac ctaagggaga ctcaggacag ccattatttc tcacccctta cattgaagct

361
gggaagatcc aaaaaggaag agaattgagt ttggtcggcc ctttcccagg actgaacatg

421

aagagttatg ccggcttcct caccgtgaat aagacttaca acagcaacct cttcttctgg

481

ttcttcccag ctcagataca gccagaagat gccccagtag ttctctggct acagggtggg

541

ccgggaggtt catccatgtt tggactcttt gtggaacatg ggccttatgt

tgtcacaagt

601

aacatgacct tgcgtgacag agacttcccc tggaccacaa cgctctccat gctttacatt

661

gacaatccag tgggcacagg cttcagtttt actgatgata cccacggata tgcagtcaat

721

gaggacgatg tagcacggga tttatacagt gcactaattc agtttttcca gatatttcct

781

gaatataaaa ataatgactt ttatgtcact ggggagtctt atgcagggaa atatgtgcca

841

gccattgcac acctcatcca ttccctcaac cctgtgagag aggtgaagat caacctgaac

901

ggaattgcta ttggagatgg

atattctgat cccgaatcaa ttataggggg ctatgcagaa

961

ttcctgtacc aaattggctt gttggatgag
aagcaaaaaa agtacttcca gaagcagtgc

1021

catgaatgca tagaacacat caggaagcag aactggtttg aggcctttga aatactggat

1081

aaactactag atggcgactt aacaagtgat ccttcttact tccagaatgt tacaggatgt

1141
agtaattact ataacttttt gcggtgcacg gaacctgagg atcagcttta ctatgtgaaa

1201
tttttgtcac tcccagaggt gagacaagcc atccacgtgg ggaatcagac ttttaatgat

1261
ggaactatag ttgaaaagta cttgcgagaa gatacagtac agtcagttaa gccatggtta

1321
actgaaatca tgaataatta taaggttctg atctacaatg gccaactgga catcatcgtg

1381
gcagctgccc tgacagagcg ctccttgatg ggcatggact ggaaaggatc ccaggaatac

1441
aagaaggcag aaaaaaaagt ttggaagatc tttaaatctg acagtgaagt ggctggtta

1501
atccggcaag cgggtgactt ccatcaggta attattcgag gtggaggaca tattttaccc

1561
tatgaccagc ctctgagagc ttttgacatg attaatcgat tcatttatgg aaaaggatgg

1621
gatccttatg ttggataaac taccttccca aaagagaaca tcagaggttt tcattgctga

1681
aaagaaaatc gtaaaaacag aaaatgtcat aggaataaaa aaattatctt ttcatatctg

1741
caagattttt ttcatcaata aaaattatcc ttgaaacaa

AG3 TICAM2-toll-like receptor adaptor molecule 2, mRNA NM_021649.6

(SEQ ID NO: 130)

1
acattaaccc ctgactcaca gctggaccgc cccggcccgc agcgccacgt cccgggtggg

61
gcctgccacg gcaaagcagc agtccggcct cgagcggccc ctcgggggcg gcggggtggg

121
cgccaacagc agtcaggcct gacaagcggc gacctccaag ggtgaggcct ctgcgggccc

181
ccgactcacg cgcgtccggg ctctgcaagc gcggtgggga gcaggctgct gtggtcgcgg

241
ggactgggtt gcggcgcgcc gcgtacggga cggccccaaa ctctcgacgc ccggggcaag

301
acgcccaccc cctgggcgct ctcgctgggc cagaaaggaa gacagaaaag ccgcgggctg

361
actgtggtgg cgctcgcctg cagattgaaa agaaatgctg agaaatacat aaagttttcc

421
tcttctgcct tggatattta taatgggtat cgggaagtct aaaataaatt cctgccctct

481
ttctctctct tggggtaaaa ggcacagtgt ggatacaagt ccaggatatc atgagtcaga

541
ttccaagaag tctgaagatc tatccttgtg taatgttgct gagcacagca atacaacaga

601
ggggccaaca ggaaagcagg agggagctca gagcgtggaa gagatgtttg aagaagaagc

661
tgaagaagag gtgttcctca aatttgtgat attgcatgca gaagatgaca cagatgaagc

721
cctcagagtc cagaatctgc tacaagatga ctttggtatc aaacccggaa taatctttgc

781
tgagatgcca tgtggcagac agcatttaca gaatttagat gatgctgtaa atgggtctgc

841
atggacaatc ttattactga ctgaaaactt tttaagagat acttggtgta atttccagtt

901
ctatacgtcc ctaatgaact ccgttaacag gcagcataaa tacaactctg ttatacccat

961
gcggcccctg aacaatcccc ttccccgaga aaggactccc tttgccctcc aaaccatcaa

1021
tgccttagag gaagaaagtc gtggatttcc tacacaagta gaaagaattt ttcaggagtc

1081
tgtgtataag acacaacaaa ctatatggaa agagacaaga aatatggtac aaagacaatt

1141
tattgcctga gatgaaacat ataacatgtg gctggctctt gttttgtaaa ccaaatgatt

1201
aatcttcact tgagaaagca gtttctagga aatgtttaaa taaaagagag tcttcacctt

1261
aaagaaacct atggagcaca agaaagataa atttctgcag gacagcctat aaaattgtgg

1321
tactttttga tgtttcagta aacttgacat tgtcagagtt tcaaggactt ttctttcaca

1381
attttcctag ttcatggata tgaaaaagga attctcaatc catattcctt gtattgaacc

1441
ttgaacaaaa acttgtatga cagacatttt taaaaatgtg acaacacttt tattctctga

1501
attttgatct caaaggacac agaaaaaaaa tggccccagg agatctgatc acacttcctc

1561
ctgaggcacc tctcatggat gttgcaataa gcattcgggt actatcaccc agaaatatga

1621
attgccagaa tagaacattt agcatgttaa gcgttgatgc atataaaatc agaaatagat

1681
gtgagaatgg tggaactttt taaaagaacc cagtcaaatg tattttctgc tgaaatctgc

1741
atatttggag gcatttccca ccaccgattc acagcccatt tgatagtgtg gtagttaggg

1801
acttcgtgga gtggtgttca gacgtcccct ggggcttaaa tctcttcata ttagtcatca

1861
tttgtaacta tggctttatt tgcagagctt ctaaaaggcg tataactgtg tgagtggcca

1921
gatattcact ttttaaatca aaaacctctc ttatggaagc tttaaaagtt tccgtcacac

1981
acaattctct tctcaggaag tatttctcat ttaggtcttc aaagtagcct gactgtgtgc

2041
atgtgtgtgt gtgataggtt atttataaag actttggata gaaggagatg tattttatta

2101
cctcctattc tagagcccca tgctcctaac aagccagaga ggccccaaac aggattgttt

2161
ctttcctcca cagcccttct gcccatctga gattgaggga gcatcgtcca cttgagatca

2221
gggatggggt ggagaatggg tcatgtcatg taatgagaaa agccctcttc gggatcatga

2281
gacttggttc tagtccaatt tctgccactg aggatgaatg taactgtggg caaactattt

2341
accctccttt atctgtgaaa tgaaagggtt gaattgatgg atctctaaag gcttttgtcc

2401
tctatgagga tgtgaaaaac tagggaccac aaaagggaac aagcaaaaaa gtttggattc

2461
gataaagtga tatgtaatag ttgcagaagg ctttatatat gcttataatg aaaagatatt

2521
ttttgtatat tgacagcata atttattttt aatgctgtca ttacacttaa agtcacagga

2581
aaaaaatata catgcttact caggctttct taaaaataaa tttttataga gatccttgag

2641
taaagacatt ttgcttaatt tcttttttct tattccccac ttgtatatcc cctaccagta

2701

ccgggatctg cacacatctt tttgcagtta cctcttcata gccatgaacc
aaaacgttct

2761

atgaggagca tgcaagtaag tcaagcctcc tattctgtta gtacttatta gaggaggaga

2821

tggttttcat tgcatagtga cattttctta gccttaacgt tctgatagta gcttactact

2881

cacttctctt tttcagtttt cataataagt attcattttt ttgccataat gcttcctgta

2941

aagccaattt

tatatactaa taaaacatga actgcccact cttcatgcct gccaaacttg

3001

gggcaattga tgctaaatgg tatttttaaa ataaatgttt ttattcttta ctcttgaaaa

3061
aaaaaaaaaa aaaa

AH1 CD52-CD52 molecule/CAMPATH1, mRNA NM_001803.2

(SEQ ID NO: 131)

1
ctcctggttc aaaagcagct aaaccaaaag aagcctccag acagccctga gatcacctaa

61

aaagctgcta ccaagacagc cacgaagatc ctaccaaaat gaagcgcttc ctcttcctcc

121

tactcaccat cagcctcctg gttatggtac agatacaaac tggactctca ggacaaaacg

181

acaccagcca aaccagcagc ccctcagcat ccagcaacat aagcggaggc attttccttt

241

tcttcgtggc

caatgccata atccacctct tctgcttcag ttgaggtgac acgtctcagc

301

cttagccctg tgccccctga
aacagctgcc accatcactc gcaagagaat cccctccatc

361

tttgggaggg

gttgatgcca gacatcacca ggttgtagaa gttgacaggc agtgccatgg

421

gggcaacagc caaaataggg
gggtaatgat gtaggggcca agcagtgccc agctgggggt

481
caataaagtt acccttgtac ttgcaaaaaa aaaaaaaaaa aaa

AI1 HERC2 Homo sapiens HECT and RLD domain containing E3 ubiquitin

protein ligase 2, mRNA NM_004667.5

(SEQ ID NO: 132)

1
gcgccggctg agccagcggc tcttgggagg ctgcgtccgc gcgccggcga ggcgaggcgg

61
ccgggccctg cgcgtcaggc ctgagacctg ggaggaagct ggagaaaaga tgccctctga

121
atctttctgt ttggctgccc aggctcgcct cgactccaaa tggttgaaaa cagatataca

181
gcttgcattc acaagagatg ggctctgtgg tctgtggaat gaaatggtta aagatggaga

241
aattgtatac actggaacag aatcaaccca gaacggagag ctccctccta gaaaagatga

301
tagtgtcgaa ccaagtggaa caaagaaaga agatctgaat gacaaagaga aaaaagatga

361
agaagaaact cctgcaccta tatatagggc caagtcaatt ctggacagct gggtatgggg

421
caagcaacca gatgtgaatg aactgaagga gtgtctttct gtgctggtta aagagcagca

481
ggccctggcc gtccagtcag ccaccaccac cctctcagcc ctgcgactca agcagaggct

541
ggtgatcttg gagcgctatt tcattgcctt gaatagaacc gtttttcagg agaatgtcaa

601
agttaagtgg aaaagcagcg gtatttctct gcctcctgtg gacaaaaaaa gttcccggcc

661
tgcgggcaaa ggtgtggagg ggctcgccag agtgggatcc cgagcggcgc tgtcttttgc

721
ctttgccttc ctgcgcaggg cctggcgatc aggcgaggat gcggacctct gcagtgagct

781
gttgcaggag tccctggacg ccctgcgagc acttcccgag gcctcgctct ttgacgagag

841
caccgtgtcc tctgtgtggc tggaggtggt ggagagagcg accaggttcc tcaggtccgt

901
cgtgacgggg gatgttcacg gaacgccagc caccaaaggg ccaggaagca tccccctgca

961
ggaccagcac ttggccctgg ccatcctgct ggagctggct gtgcagagag gcacgctgag

1021
ccaaatgttg tctgccatcc tgttgttgct tcagctgtgg gacagcgggg cacaggagac

1081
tgacaatgag cgttccgccc agggcaccag cgccccactt ttgcccttgc tgcaaaggtt

1141
ccagagcatc atttgcagga aggatgcacc ccactccgag ggcgacatgc accttttgtc

1201
tggccctctg agccccaatg agagtttcct gaggtacctc acccttccac aagacaacga

1261
gcttgccatt gatctgcgac aaacggcggt tgttgtcatg gcccatttag accgtctggc

1321
tacgccctgt atgcctccgc tgtgtagctc tccgacatct cataagggat cattgcaaga

1381
ggtcataggt tgggggttaa taggatggaa atactatgcc aatgtgattg gtccaatcca

1441
gtgcgaaggc ctggccaacc tgggagtcac acagattgcc tgtgcagaga agcgtttcct

1501
gattctgtca cgcaatggcc gcgtgtacac acaggcctat aatagtgaca cgctggcccc

1561
acagctggtc caaggccttg cctccagaaa cattgtaaaa attgctgccc attctgatgg

1621
tcaccactac ctagccttgg ctgctactgg agaggtgtac tcctggggct gtggggacgg

1681
cggacggctg ggccatgggg acactgtgcc tttggaggag cctaaggtga tctccgcctt

1741
ctctggaaag caggccggga agcacgtggt gcacatcgct tgcgggagca cttacagtgc

1801
ggccatcact gccgaggggg agctgtacac ctggggccgc gggaactacg gccggctggg

1861
ccatggctcc agtgaggacg aggccattcc gatgctggta gccgggctta aaggactgaa

1921
ggtcatcgat gtggcgtgtg ggagtgggga tgctcaaacc ctggctgtca ctgagaacgg

1981
gcaagtgtgg tcttggggag atggtgacta tgggaaattg ggcagaggtg gtagtgatgg

2041
ctgcaaaacc ccaaagctga ttgaaaagct tcaagacttg gatgtggtca aagtccgctg

2101
tggaagtcag ttttccattg ctttgacgaa agatggccaa gtttattcat ggggaaaagg

2161
tgacaaccag agacttggac atggaacaga ggaacatgtt cgttatccaa aactcttaga

2221
aggcttgcaa gggaagaagg tgattgatgt ggctgcaggc tccacccact gcctggctct

2281
gactgaggac agcgaggtcc acagctgggg gagcaacgac cagtgccagc actttgacac

2341
cttgcgcgtg accaagccag aacctgcagc attgccagga ctggacacca aacacatagt

2401
gggaattgcc tgtgggcctg cccagagctt tgcttggtca tcatgttctg agtggtccat

2461

tggcctccgt gtcccttttg tggtggacat

ctgctcaatg acttttgagc agctggatct

2521

cctgcttcgg caggtgagtg aggggatgga tggttccgcg
gactggcccc cgccccagga

2581

gaaagagtgt gtggccgtgg caacgctgaa tcttctacga cttcagttgc atgctgccat

2641

tagtcaccag gttgacccgg aattccttgg tttaggtctg ggcagcatcc tcctgaacag

2701

cctgaagcag acggtggtga ccctggccag cagtgcgggc gtgctgagca ccgtgcagtc

2761

ggccgcccag gccgtgctgc agagtggctg gtccgtgctg ctgcccaccg cggaggagcg

2821

ggcccgggca ctctctgctc tcctgccctg cgcagtttca ggcaatgaag tgaacataag

2881

tccaggtcgt cgattcatga ttgatcttct ggtgggcagc ttgatggctg atggagggtt

2941

ggagtcagcc ttacacgcag ccattactgc agagatccag gatattgaag ccaaaaaaga

3001

agcacagaag gaaaaagaaa ttgatgaaca ggaagcgaat gcctcaacat ttcatagaag

3061

caggactcca ctggataaag accttattaa tacggggatc tgtgagtctt ctggcaaaca

3121

gtgtttgcct ctggttcagc tcatacaaca gcttcttaga aacattgctt ctcagactgt

3181

agccagattg aaagatgttg cccgtcggat ttcatcatgt ctggactttg agcaacacag

3241

tcgtgaaaga tctgcttcat tggatttgtt actgcgtttt caacgtttgc ttattagtaa

3301

actttatcca ggagaaagta ttggtcagac ctcagatatt tctagtccag agctaatgga

3361

tgttggttcc ttgctgaaga agtacacagc cctcctgtgc acgcacattg gagatatact

3421

gcctgtggcc gccagcattg cttctaccag ctggcggcac ttcgcggagg tggcttacat

3481

tgtggaaggg gactttactg gtgttctcct tccagaacta gtagtttcta tagtgcttct

3541

gctcagtaaa aatgctggtc tcatgcaaga ggctggagct gtacctctgc tgggtggcct

3601

gttggaacat ctggatcggt tcaaccatct ggcaccagga aaggaacggg atgatcatga

3661

agagttagcc tggcctggca taatggagtc attttttaca ggtcagaact gtagaaataa

3721

tgaggaagtg acacttatac gcaaagctga tttggagaac cataataaag atggaggctt

3781

ctggactgtg

attgacggga aggtgtatga tataaaggac ttccagacac agtcgttaac

3841

aggaaatagt attcttgctc agtttgcagg ggaagaccca gtggtagctt tggaagctgc

3901

tttgcagttt gaagacaccc gggaatccat gcacgcgttt tgtgttggcc agtatttgga

3961

gcctgaccaa gaaatcgtca ccataccaga tctggggagt ctctcttcac ctctgataga

4021

cacagagagg aatctgggcc tgcttctcgg attacacgct tcgtatttgg caatgagcac

4081

accgctgtct cctgtcgaga ttgaatgtgc caaatggctt cagtcatcca tcttctctgg

4141

aggcctgcag accagccaga tccactacag ctacaacgag gagaaagacg aggaccactg

4201

cagctcccca gggggcacac ctgccagcaa atctcgactc tgctcccaca gacgggccct

4261

gggggaccat tcccaggcat ttctgcaagc cattgcagac aacaacattc aggatcacaa

4321

cgtgaaggac tttttgtgtc aaatagaaag gtactgtagg cagtgccatt tgaccacacc

4381

gatcatgttt ccccccgagc atcccgtgga agaggtcggt cgcttgttgt tatgttgcct

4441

cttaaaacat gaagatttag gtcatgtggc attatcttta gttcatgcag gtgcacttgg

4501

tattgagcaa gtaaagcaca gaacgttgcc taagtcagtg gtggatgttt gtagagttgt

4561

ctaccaagca aaatgttcgc tcattaagac tcatcaagaa cagggccgtt cttacaagga

4621

ggtctgcgct cctgtcatcg aacgtttgag attcctcttt aatgaattga gacctgctgt

4681

ttgtaatgac ctctctataa tgtctaagtt taaattgtta agttctttgc cccgttggag

4741

gaggatagct caaaagataa ttcgagaacg aaggaaaaag agagttccta agaagccaga

4801

atctacggat gatgaagaaa aaattggaaa cgaagagagt gatttagaag aagcttgcat

4861

tttgcctcat agtccaataa atgtggacaa gagacccatt gcaattaaat cacccaagga

4921

caaatggcag ccgctgttga gtactgttac aggtgttcac aaatacaagt ggttgaagca

4981

gaatgtgcag ggtctttatc cgcagtctcc actcctcagt acaattgctg aatttgccct

5041

taaagaagag ccagtggatg tggaaaaaat gagaaagtgc ctactaaaac agttggagag

5101

agcagaggtt cgcctggaag ggatagatac aattttaaaa ctggcgagca agaatttctt

5161

acttccat

ct gtgcagtatg cgatgttttg tggatggcaa agacttattc ctgagggaat

5221

cgatataggg gaacctct
ta ctgattgttt aaaggatgtt gatttgatcc cgccttttaa

5281

tcggatgctg ctggaagtca cctttggcaa gctgtacgct tgggctgtac agaacattcg

5341

aaatgttttg atggatgcca gtgccaaatt taaagagctt ggtatccagc cggttcccct

5401

gcaaaccatc accaatgaga acccgtcagg accgagcctg gggaccatcc cgcaagccca

5461

cttcctcctg gtgatgctca gcatgctcac cctgcagcac ggcgcaaaca acctcgacct

5521

tctgctcaat tccggcatgc tggccctcac gcagacggca ctgcgcctga ttggccccag

5581

ttgtgacaac gttgaggaag atatgaatgc ttctgctcaa ggtgcttctg ccacagtttt

5641
ggaagaaaca aggaaggaaa cggctcctgt gcagctccct gtttcaggac cagaactggc

5701
tgccatgatg aagattggaa caagggtcat gagaggtgtg gactggaaat ggggcgatca

5761
ggatgggcct cctccaggcc taggccgcgt gattggtgag ctgggagagg acggatggat

5821
aagagtccag tgggacacag gcagcaccaa ctcctacagg atggggaaag aaggaaaata

5881
cgacctcaag ctggcagagc tgccggctgc tgcacagccc tcagcagagg attcggacac

5941
agaggatgac tctgaagccg aacaaactga aaggaacatt caccccactg caatgatgtt

6001
taccagcact attaacttac tgcagactct ttgtctgtct gctggagttc atgctgagat

6061
catgcagagc gaagccacca agactttatg cggactgctg cgaatgttag tggaaagcgg

6121
aacgacggac aagacatctt ctccaaacag gctggtgtac agggagcaac accggagctg

6181
gtgcacgctg gggtttgtgc ggagcatcgc tctcacgccg caggtatgcg gcgccctcag

6241
ctccccgcag tggatcacgc tgctcatgaa ggtcgtggaa gggcacgcac ccttcactgc

6301
cacctcgctg cagaggcaga tcttagctgt gcatttgttg caagcagtcc ttccatcatg

6361
ggacaagacc gaaagggcga gggacatgaa atgcctcgtg gagaagctgt ttgacttctt

6421
gggaagcttg ctcactacct gctcctctga cgtgccatta ctcagagagt ccacgctgag

6481
gcggcgcagg gtgcgcccgc aggcctcgct gactgccacc cacagcagca cactggcgga

6541
ggaggtggtg gcactgctgc gcacgctgca ctccctgact cagtggaatg ggctcatcaa

6601
caagtacatc aactcccagc tccgctccat cacccacagc tttgtgggaa ggccttccga

6661
aggggcccag ttagaggact acttccccga ctccgagaac cctgaagtgg ggggcctcat

6721
ggcagtcctg gctgtgattg gaggcatcga tggtcgcctg cgcctgggcg gtcaagttat

6781
gcacgatgag tttggagaag gcactgtgac tcgcatcacc ccaaagggca aaatcaccgt

6841
gcagttctct gacatgcgga cgtgtcgcgt ttgcccattg aatcagctga aaccactccc

6901
tgccgtggcc tttaatgtga acaacctgcc cttcacagag cccatgctgt ctgtctgggc

6961
tcagttggtg aacctcgctg gaagcaagtt agaaaagcac aaaataaaga aatcgactaa

7021
acaggccttt gcaggacaag tggacctgga cctgctgcgg tgccagcagt tgaagctata

7081
catcctgaaa gcaggtcggg cgctgctctc ccaccaggat aaactgcggc agatcctgtc

7141
tcagccagct gttcaggaga ctggaactgt tcacacagat gatggagcag tggtatcacc

7201
tgaccttggg gacatgtctc ctgaagggcc gcagcccccc atgatcctct tgcagcagct

7261
gctggcctcg gccacccagc cgtctcctgt gaaggccata tttgataaac aggaacttga

7321
ggctgctgca ctggccgttt gccagtgctt ggctgtggag tccactcacc cttcgagccc

7381
aggatttgaa gactgcagct ccagtgaggc caccacgcct gtcgccgtgc agcacatccg

7441
ccctgccaga gtgaagaggc gcaagcagtc gcccgttccc gctctgccga tcgtggtgca

7501
gctcatggag atgggatttt ccagaaggaa catcgagttt gccctgaagt ctctcactgg

7561
tgcttccggg aatgcatcca gcttgcctgg tgtggaagcc ttggtcgggt ggctgctgga

7621
ccactccgac atacaggtca cggagctctc agatgcagac acggtgtccg acgagtattc

7681
tgacgaggag gtggtggagg acgtggatga tgccgcctac tccatgtcta ctggtgctgt

7741
tgtgacggag agccagacgt acaaaaaacg agctgatttc ttgagtaatg atgattatgc

7801
tgtatatgtg agagagaata ttcaggtggg aatgatggtt agatgctgcc gagcgtatga

7861
agaagtgtgc gaaggtgatg ttggcaaagt catcaagctg gacagagatg gattgcatga

7921
tctcaatgtg cagtgtgact ggcagcagaa agggggcacc tactgggtta ggtacattca

7981
tgtggaactt ataggctatc ctccaccaag ttcttcttct cacatcaaga ttggtgataa

8041
agtgcgggtc aaagcctctg tcaccacacc aaaatacaaa tggggatctg tgactcatca

8101
gagtgtgggg gttgtgaaag ctttcagtgc caatggaaaa gatatcattg tcgactttcc

8161

ccagcagtct cactggactg ggttgctatc agaaatggag ttggtaccca gtattcatcc

8221

tggggttacg tgtgatggat gtcagatgtt tcctatcaat ggatccagat tcaaatgcag

8281

aaactgtgat gactttgatt tttgtgaaac gtgtttcaag

accaaaaaac acaataccag

8341

gcatacattt ggcagaataa atgaaccagg tcagtctgcg gtattttgtg
gccgttctgg

8401

aaaacagctg aagcgttgcc acagcagcca gccaggcatg ctgctggaca gctggtcccg

8461

catggtgaag agcctgaatg tgtcgtcctc cgtgaaccag gcatcccgtc tcattgacgg

8521

cagcgagccc tgctggcagt catcggggtc gcaaggaaag cactggattc gtttggagat

8581

tttcccagat gttcttgttc atagattaaa aatgatcgta gatcctgctg acagtagcta

8641

catgccgtcc ctggttgtag tgtcaggtgg aaattccctg aataacctta ttgaactaaa

8701

gacaatcaat attaaccctt ctgacaccac agtgcccctt ctgaatgact gcacagagta

8761

tcacaggtat attgaaattg ctataaagca gtgcaggagc tcaggaatcg attgtaaaat

8821

ccatggtctc atcctgctgg gacggatccg tgcagaagag gaagatttgg ctgcagttcc

8881

tttcttagct tcggataatg aagaggagga ggatgagaaa ggcaacagcg gaagcctcat

8941

tagaaagaag gctgctgggc tggaatcagc agctacgata agaaccaagg tgtttgtgtg

9001

gggcctgaat gacaaggacc agctgggcgg gctgaaaggc tccaagataa aggttccttc

9061

gttctctgag acactgtcag ctttgaatgt ggtacaggtg gctggtggat ctaaaagttt

9121

gtttgcagtg actgtggaag ggaaggtgta tgcctgtgga gaagccacga atggccggct

9181

ggggctgggc atttccagcg ggacggtgcc catcccacgg cagatcacag ctctcagcag

9241

ctacgtggtc aagaaggtgg ctgttcactc aggtggccgg cacgcgacgg ctttaactgt

9301

cgatggaaaa gtgttttcgt ggggcgaagg tgacgatgga aaacttggac acttcagcag

9361

aatgaactgt gacaaaccaa ggctgatcga ggccctgaaa accaagcgta tccgggatat

9421

cgcctgtggg agctcgcaca gcgcagccct cacatccagc ggagaactgt acacctgggg

9481

cctcggcgag tacggccggc tgggacatgg ggataatacg acacagctaa agcccaaaat

9541

ggtgaaagtc cttctcggtc acagagtaat ccaggttgca tgtgggagta gagacgcgca

9601

gaccctggct ctgaccgatg aaggtttggt attttcctgg ggtgatggtg actttggaaa

9661
actgggccgg ggcggaagtg aaggctgtaa cattccccag aacattgaga gactaaatgg

9721
acagggggtg tgccagattg agtgtggagc tcagttctcc ctggcgctca ccaagtctgg

9781
agtggtgtgg acatggggaa agggggatta cttcagattg ggccacggct ctgacgtgca

9841
cgtgcggaaa ccacaggtgg tggaagggct gagagggaag aagatcgtgc atgtggctgt

9901
cggggccctg cactgcctgg cggtcacgga ctcggggcag gtgtatgctt ggggtgacaa

9961
cgaccacggc cagcagggca atggcacgac cacggttaac aggaagccca cactcgtgca

10021
aggcttagaa ggccagaaga tcacacgcgt ggcttgtggg tcgtcccaca gtgtggcgtg

10081
gacaactgtg gatgtggcca cgccctctgt ccacgagccc gtcctcttcc agactgcaag

10141
agacccttta ggtgcttcct atttaggcgt gccttcagat gctgattctt ctgctgccag

10201
taataaaata agtggtgcaa gtaattctaa gccaaatcgc ccttctcttg ccaagattct

10261
cttgtcattg gatggaaatc tggccaaaca gcaggcctta tcacatattc ttacagcatt

10321
gcaaatcatg tatgccagag atgctgttgt cggggccctg atgccggccg ccatgatcgc

10381
cccggtggag tgcccctcgt tctcctcggc ggccccttcc gacgcatctg cgatggctag

10441
tcccatgaat ggagaagaat gcatgctggc tgttgatatc gaagacagac tgagtccaaa

10501
tccatggcaa gaaaagagag agattgtttc ctctgaggac gcagtgaccc cctctgcagt

10561
gactccgtcg gccccctcag cctccgctcg gccttttatc ccagtgacgg atgacctggg

10621
agccgcaagc atcattgcag aaaccatgac caaaaccaaa gaggatgttg aaagccaaaa

10681
taaagcagca ggtccggagc ctcaggcctt ggatgagttc accagtctgc tgattgcgga

10741
tgacactcgt gtggtggtag acctgctcaa gctgtcagtg tgcagccggg ccggggacag

10801
gggcagggat gtgctctccg cggtgctttc cggcatgggg accgcctacc cacaggtggc

10861
agatatgctg ttggagctct gtgtcaccga gttggaggat gtggccacag actcgcagag

10921
cggccgcctc tcttctcagc ctgtggtggt ggagagtagc cacccttaca ccgacgacac

10981
ctccaccagt ggcacagtga agataccagg tgcagaagga ctcagggtag aatttgaccg

11041
gcagtgctcc acagagaggc gccacgaccc tctcacagtc atggacggcg tcaacaggat

11101
cgtctccgtg cggtcaggcc gagagtggtc cgactggtcc agcgagctgc gcatcccagg

11161
ggatgagtta aagtggaagt tcatcagcga tgggtctgtg aatggctggg gctggcgctt

11221
caccgtctat cccatcatgc cagctgctgg ccctaaagaa ctcctctctg accgctgcgt

11281
cctctcctgt ccatccatgg acttggtgac gtgtctgtta gacttccgac tcaaccttgc

11341
ctctaacaga agcatcgtcc ctcgccttgc ggcctcgctg gcagcttgtg cacagctgag

11401
tgccctagct gccagtcaca gaatgtgggc ccttcagaga ctgaggaagc tgcttacaac

11461
tgaatttggg cagtcaatta acataaatag gctgcttgga gaaaatgatg gggaaacaag

11521
agctttgagt tttacaggta gtgctcttgc tgctttggtg aaaggtcttc cagaagcttt

11581
gcaaaggcag tttgaatatg aagatcctat tgtgaggggt ggcaaacagc tgctccacag

11641
cccattcttt aaggtactgg tagctcttgc ttgtgacctg gagctggaca ctctgccttg

11701
ctgtgccgag acgcacaagt gggcctggtt ccggaggtac tgcatggcct cccgtgttgc

11761
tgtggccctt gacaaaagaa caccgttgcc ccgtctgttt cttgatgagg tggctaagaa

11821
aattcgtgaa ttaatggcag acagcgaaaa catggatgtt ctgcatgaga gccatgacat

11881
ttttaaaaga gagcaagacg aacaacttgt gcagtggatg aacaggcgac cagatgactg

11941
gactctctct gctggtggca gtggaacaat ttatggatgg ggacataatc acaggggcca

12001
gctcgggggc attgaaggcg caaaagtcaa agttcccact ccctgtgaag cccttgcaac

12061
tctcagaccc gtgcagttaa tcggagggga acagaccctc tttgctgtga cggctgatgg

12121
gaagctgtat gccactgggt atggtgcagg tggcagacta ggcattggag ggacagagtc

12181
ggtgtccacc ccaacattgc ttgaatccat tcagcatgtg tttattaaga aagtagctgt

12241
gaactctgga ggaaagcact gccttgccct gtcttcagaa ggagaagttt actcttgggg

12301
tgaggcagaa gatgggaagt tggggcatgg caacagaagt ccgtgtgacc gccctcgtgt

12361
catcgagtct ctgagaggaa ttgaagtggt cgatgttgct gctggcggag cccacagcgc

12421
ctgtgtcaca gcagccgggg acctctacac atggggcaaa ggccgctacg gccggctggg

12481
gcacagcgac agtgaggacc agctgaagcc gaagctggtg gaggcgctgc agggccaccg

12541
tgtggttgac atcgcctgtg gcagtggaga tgcccagacc ctctgcctca cagatgacga

12601
cactgtctgg tcctgggggg acggggacta cggcaagctc ggccggggag gcagcgatgg

12661
ctgtaaagtg cctatgaaga ttgattctct tactggtctt ggagtagtta aagtggaatg

12721
cggatcccag ttttctgttg cccttaccaa atctggagct gtttatacct ggggcaaagg

12781
cgattatcac aggttgggcc atggatcaga tgaccatgtt cgaaggcctc ggcaggtcca

12841
agggttgcag gggaagaaag tcatcgccat cgccactggc tccctgcact gtgtgtgctg

12901
cacagaggat ggtgaggttt atacatgggg cgacaatgat gagggacaac tgggagacgg

12961
aaccaccaat gccatccaga ggcctcggtt ggtagctgcc cttcagggta agaaggtcaa

13021
ccgtgtggcc tgtggctcag cacataccct cgcctggtcg accagcaagc ccgccagtgc

13081
tggcaaactc cctgcacagg tccccatgga gtacaatcac ctgcaggaga tccccatcat

13141
tgcgctgagg aaccgtctgc tgctgctgca ccacctctcc gagctcttct gcccctgcat

13201
ccccatgttc gacctggaag gctcgctcga cgaaactgga ctcgggcctt ctgttgggtt

13261
cgacactctc cgaggaattc tgatatccca gggaaaggag gcggctttcc ggaaagtagt

13321
acaagcaact atggtacgcg atcgtcagca tggccccgtc gtggagctga accgcatcca

13381
ggtcaaacga tcaaggagca aaggcgggct ggccggcccc gacggcacca agtctgtctt

13441
tgggcagatg tgtgctaaga tgagctcgtt tggtcccgac agcctcctcc ttcctcaccg

13501
tgtctggaaa gtcaagtttg tgggtgaatc tgtggatgac tgtgggggcg gctacagcga

13561
gtccatagct gagatctgtg aggagctgca gaacggactc acgcccctgc tgatcgtgac

13621
acccaacggg agggatgagt ctggggccaa ccgagactgc tacctgctca gcccggccgc

13681

cagagcaccc gtgcacagca gcatgttccg cttcctgggt gtgttgctgg gcattgccat

13741

ccgaaccggg agtcccctga gcctcaacct tgccgagcct gtctggaagc agctggctgg

13801

gatgagcctc accatcgcgg acctcagtga

ggttgataag gattttattc ctggactcat

13861

gtacatccga gacaatgaag ccacctca a ggagtttgaa
gccatgagcc tgcccttcac

13921

agtgccaagt gccagtggcc aggacattca gttgagctcc aagcacacac acatcaccct

13981

ggacaaccgc gcggagtacg tgcggctggc gataaactat agactccatg aatttgatga

14041

gcaggtggct gctgttcggg aaggaatggc ccgcgttgtg cctgttcccc tcctctctct

14101

gttcaccggc tacgaactgg agacgatggt gtgtggcagc cctgacatcc cgctgcacct

14161

tctcaagtcg gtggccacct ataaaggcat cgagccttcc gcatcgctga tccagtggtt

14221

ctgggaggtg atggagtcct tctccaacac agagcgctct cttttccttc gcttcgtctg

14281

gggccggacg aggctgccca ggaccatcgc cgacttccgg ggccgagact tcgtcatcca

14341

ggtgttggat aaatacaacc ctccagacca cttcctccct gagtcctaca cctgtttctt

14401

cttgctgaag ctgcccaggt attcctgcaa gcaggtgctg gaggagaagc tcaagtacgc

14461

catccacttc tgcaagtcca tagacacaga tgactacgct cgcatcgcac ttacaggaga

14521

gccagccgcc gacgacagca gcgacgattc agataacgag gatgtcgact cctttgcttc

14581

ggactctaca caagattatt taacaggaca ctaagatggg gaaacgtcct cgtgagatga

14641

gagcctgagc caggcagcag agcgctcgct gctgtgtaga ctgtaggctg cctggtgtgt

14701

ctgatgagaa gcgtccgtcc tcgagccagg cgggaggagg gagtggagag actgactggc

14761

cgtgatggga atgacagtga gaaggtccgc ctgtgcgcgt ggaacactgt ggacgctcga

14821

cttccaaggg tcttctcacc cgtaatgctg cattacatgt aggactgtgt ttactaaagt

14881

gtgtaaatgt ttatataaat accaaattgc agcatcccca aaatgaataa agccttttta

14941
cttgtgggtg caatcgattt tttttctttc tcctttcttt caagtgtcgt gagtcgtctt

15001
gattgtatat tggaaataac tgtgtaacaa atcgtattat aaatatttca attaatttta

15061
ctctgaattt gtttattaaa agacttttga acatgaaatg attagtatta cttgaatgca

15121
tccagaggat atttaaacca aaatgaaaaa ccagaaggcc atttggtgtc ccccctccca

15181
ggtgtcccct tgtagcatat gcattatgtc atctgaattg aggcctttct gtgaacagca

15241
tcataacttc tatcatggaa agtgtactat atataatgtt tgtgtcatgt atatgcctaa

15301
attttaatta tctataaata aaacatctga cataaaagtg

AI2 KLRAP1 (KLRA1)-KLRAP1 killer cell lectin-like receptor subfamily A

pseudogene 1, mRNA NR_028045.1

(SEQ ID NO: 133)

1
ttcagccctc aaatattgat tttgaacatt attttgcaaa gagtactaag tggttggtta

61
gttgagatag aggaatatgc agcttttgac tatctttcct ttcccgtcag taccagcttt

121
catgatacaa tttcctctta tcactttggt caagaggtgg ggcagaaaat tttgagttac

181
agtatcattc gaagagaatt tatttctgcc tttcatgtta tagcccctaa gggatccagg

241
acccgaaagg ccagcttctc cctcattttg aaatcagttt tctccacctg caccactgca

301
tagcacagat acagaaacca tcctatttca ggatttgaat gcaaaactta ccttcttact

361
ctaaagatga atgatcaggg agagatttat tcaaccctga gatttttgca gtctccttca

421
gagtcacaga atagattaag gcctgatgat actcaaaggc ctgggaaaac tgatgacaaa

481
gaattttcag tgccctggca cctcattgca gtgactcttg ggatcctctg tttacttctt

541
ctgatgatag tcacagtgtt ggtgacaaat atctttcagt gtattcaaga aaaacatcaa

601
cggcaggaaa ttctaagaaa ctgtagtgaa aagtacatca tgcaaaatga caactactta

661
aaagagcaga ttttgacaaa taagacttta aaatttgacg ttctcaaaaa tagctttcag

721
cagaaaaagg aactggattc acgccttata caaaagaaca gatgtcatag agaaaatgag

781
atcgttttta aagttttgca aaatacaggc aaattctctg aagaccacgg gtcctgttgt

841

ggagtaaact gttattattt taccatgcag aagaaagact ggaagggatg taaacagact

901

tgtcaacatt gtagatcatc ccttttgaag atagatgaca aagatgaact cgtattttac

961

attcactttt attctcttgg

actctgtttc tcaatgttgg acctaagata ttgaagacag

1021

gctggagccc agagccttca ttcaatctca
gatttatgaa aataattact ggattggatt

1081

atcatatgat gaaagggaaa gtaagtggaa atggattgat aatggcacat ctcctggaat

1141

taattctaca ataatgcgtt tttcttctgg gagaggagaa tgtgcatttt tgacctcaac

1201

aagaatggca actattgatt

gcattcaaac gtacaattgt atctgtggga agagaataga

1261

ctctattttc tctgattcgg tgtgcgccaa
gaagaaaagg tgaaaatgga atgttttctt

1321
tttttgtttc ccataataat ttctgattat aaatcattgc ttttaactgt gggacttagt

1381
taattcttca aaagataaag atgaacagga agaaaaagaa aattattttg gactatgact

1441
ttaaagatca gatgccatct ttcttcctgg agaagaggag attttctctt ttgagagtgg

1501
ttgttccttc ctttaatgtc cctgaggaat tattcattct ttctaattct cagaactacc

1561
tatacaacca gttagagaac tctgatatta tatcctgggt cttttttctt atcaatagga

1621
taaatcattc cagcatcttc tggttttgaa agcagttgtg aactagaatg tagttatttt

1681
tttcttccca tctagaagtt acctcatctt ttaaaacatt tgttttgcta caaaatataa

1741
cttcaaactt actgaaagtt gcaagcatag tacaaggaac ttctatataa cctttactca

1801
tacttactag ttgtttatat tttgctctgc tttatatttc tctctttcta tcttccactt

1861
aataataaat tgaggacttc atgtcccttt gtctaaatat tttccaagat caagggcttt

1921
gttttatata atcacagtgc aattatcaaa ctcaggaaat ttagcattag tacagtacta

1981
tgatctaatc tgtaatccat gttcaaattt tgtcaattgt cccaataatg ccatttatgt

2041
gtatttctta aaaatccatg ttcaggataa ttcaatgcat ttgattgtaa tgtctcttta

2101
gtcttcttta atctgaaaca gttctttagg ctttttcttg accttgacat tttaaaaatt

2161
aactttattg agttatactt ttcatgcaat aaaatgcact cactttaa

AJ1 Homo sapiens platelet factor 4 variant 1 (PF4V1/CXCL4L1), mRNA

NM_0026202

(SEQ ID NO: 134)

1
actgcctgca gaaccccagc ccgactttcc ctgcgcactg ggatcctgct ggaacctcag

61

ctgcaacatg agctccgcag ccaggtcccg cctcacccgc gccacccgcc

aggagatgct

121

gttcttggcg ttgctgctcc tgccagttgt ggtcgccttc gccagagctg aagctgaaga

181

agatggggac ctgcagtgcc tgtgtgtgaa gaccacctcc caggtccgtc ccaggcacat

241

caccagcctg gaggtgatca aggccggacc ccactgcccc actgcccaac tcatagccac

301

gctgaagaat gggaggaaaa tttgcttgga tctgcaagcc ctgctgtaca agaaaatcat

361

taaggaacat ttggagagtt

agctactagc tgcctaagtg tgcactttca atctaactgt

421

gaaagaatct tctgatgttt gtattatcct
tcttatatta tattaacaaa ataaatcaag

481
ttgtggtata gtcaatctat ttcttaataa tactgcaaaa ataatgctga cacatcacaa

541
tttcatattt taaaatttcc agaattttaa gcaaaaagca ttatgaagga aggcttggtt

601
taataaagac tgattttgtt cagtgttata tgttagctga tacatatttg ttcatttatg

661
tgattgcagt actttatagc tacatattta ccttgaatgt tacaattagc ttgccaataa

721
atattagtag ctcttaagca t

AL1 DEFB128 defensin, beta 128, mRNA NM_001037732.1

(SEQ ID NO: 135)

1
atgaagctgt ttctggttct cattattctg ctgtttgagg tactcacaga cggggcaaga

61

ctcaaaaaa

t gcttcaataa agtaacaggc tattgcagga agaaatgcaa ggtaggagaa

121

agatatgaaa taggatgtc
t aagtgggaaa ttatgttgtg ctaatgatga agaagagaaa

181

aaaca

tgtgt catttaagaa gccacatcaa cattctggtg agaagctgag tgtgctgcag

241

gattacatca tctta
cccac catcaccatt ttcacagtct aa

AM1 IL8 interleukin 8, mRNA NM_000584.3

(SEQ ID NO: 136)

1
gagggtgcat aagttctcta gtagggtgat gatataaaaa gccaccggag cactccataa

61
ggcacaaact ttcagagaca gcagagcaca caagcttcta ggacaagagc caggaagaaa

121
ccaccggaag gaaccatctc actgtgtgta aacatgactt ccaagctggc cgtggctctc

181
ttggcagcct tcctgatttc tgcagctctg tgtgaaggtg cagttttgcc aaggagtgct

241

aaagaactta gatgtcagtg cataaagaca tactccaaac ctttccaccc caaatttatc

301

aaagaactga gagtgattga gagtggacca cactgcgcca acacagaaat tattgtaaag

361

ctttctgatg

gaagagagct ctgtctggac cccaaggaaa actgggtgca gagggttgtg

421

gagaagtttt tgaagagggc tgagaattca taaaaaaatt cattctctgt ggtatccaag

481

aatcagtgaa gatgccagtg aaacttcaag caaatctact tcaacacttc atgtattgtg

541

tgggtctgtt gtagggttgc cagatgcaat acaagattcc tggttaaatt tgaatttcag

601

taaacaatga atagtttttc attgtaccat gaaatatcca gaacatactt atatgtaaag

661

tattatttat ttgaatctac aaaaaacaac aaataatttt taaatataag gattttccta

721

gatattgcac gggagaatat acaaatagca aaattgaggc caagggccaa gagaatatcc

781

gaactttaat ttcaggaatt gaatgggttt gctagaatgt gatatttgaa gcatcacata

841

aaaatgatgg gacaataaat tttgccataa agtcaaattt agctggaaat cctggatttt

901

tttctgttaa atctggcaac cctagtctgc tagccaggat ccacaagtcc ttgttccact

961

gtgccttggt ttctccttta tttctaagtg gaaaaagtat tagccaccat cttacctcac

1021

agtgatgttg tgaggacatg tggaagcact ttaagttttt tcatcataac ataaattatt

1081

ttcaagtgta acttattaac ctatttatta tttatgtatt tatttaagca tcaaatattt

1141

gtgcaagaat ttggaaaaat agaagatgaa tcattgattg aatagttata aagatgttat

1201

agtaaattta ttttatttta gatattaaat gatgttttat tagataaatt tcaatcaggg

1261

tttttagatt aaacaaacaa acaattgggt acccagttaa attttcattt cagataaaca

1321

acaaataatt ttttagtata agtacattat tgtttatctg

aaattttaat tgaactaaca

1381

atcctagttt gatactccca gtcttgtcat tgccagctgt gttggtagtg

ctgtgttgaa

1441

ttacggaata atgagttaga actattaaaa cagccaaaac tccacagtca atattagtaa

1501
tttcttgctg gttgaaactt gtttattatg tacaaataga ttcttataat attatttaaa

1561
tgactgcatt tttaaataca aggctttata tttttaactt taagatgttt ttatgtgctc

1621
tccaaatttt ttttactgtt tctgattgta tggaaatata aaagtaaata tgaaacattt

1681
aaaatataat ttgttgtcaa agtaaaaaaa aaaaaaaa

B1 AIM2-interferon-inducible protein AIM2/absent in melanoma 2 mRNA

NM_004833.1

(SEQ ID NO: 137)

1
tcagccaatt agagctccag ttgtcactcc tacccacact gggcctgggg gtgaagggaa

61
gtgtttatta ggggtacatg tgaagccgtc cagaagtgtc agagtctttg tagctttgaa

121
agtcacctag gttatttggg catgctctcc tgagtcctct gctagttaag ctctctgaaa

181
agaaggtggc agacccggtt tgctgatcgc cccagggatc aggaggctga tcccaaagtt

241
gtcagatgga gagtaaatac aaggagatac tcttgctaac aggcctggat aacatcactg

301
atgaggaact ggataggttt aagttctttc tttcagacga gtttaatatt gccacaggca

361
aactacatac tgcaaacaga atacaagtag ctaccttgat gattcaaaat gctggggcgg

421
tgtctgcagt gatgaagacc attcgtattt ttcagaagtt gaattatatg cttttggcaa

481
aacgtcttca ggaggagaag gagaaagttg ataagcaata caaatcggta acaaaaccaa

541

agccactaag tcaagctgaa atgagtcctg ctgcatctgc agccatcaga aatgatgtcg

601

caaagcaacg tgctgcacca aaagtctctc ctcatgttaa gcctgaacag aaacagatgg

661

tggcccagca ggaatctatc agagaagggt ttcagaagcg ctgtttgcca gttatggtac

721

tgaaagcaaa gaagcccttc acgtttgaga cccaagaagg caagcaggag atgtttcatg

781

ctacagtggc tacagaaaag gaattcttct ttgtaaaagt ttttaataca ctgctgaaag

841

ataaattcat tccaaagaga ataattataa

tagcaagata ttatcggcac agtggtttct

901

tagaggtaaa tagcgcctca cgtgtgttag atgctgaatc
tgaccaaaag gttaa

tgtcc

961

cgctgaacat tatcagaaaa gctggtgaaa ccccgaagat caacacgctt caaactcagc

1021

ccctt

ggaac aattgtgaat ggtttgtttg tagtccagaa ggtaacagaa aagaagaaaa

1081
acatattatt tgacctaagt gacaacactg ggaaaatgga agtactgggg gttagaaacg

1141
aggacacaat gaaatgtaag gaaggagata aggttcgact tacattcttc acactgtcaa

1201
aaaatggaga aaaactacag ctgacatctg gagttcatag caccataaag gttattaagg

1261
ccaaaaaaaa aacatagaga agtaaaaagg accaattcaa gccaactggt ctaagcagca

1321
tttaattgaa gaatatgtga tacagcctct tcaatcagat tgtaagttac ctgaaagctg

1381
cagttcacag gctcctctct ccaccaaatt aggatagaat aattgctgga taaacaaatt

1441
cagaatatca acagatgatc acaataaaca tctgtttctc attcc

B2 CD274-CD274 molecule/B7-H, mRNA NM_014143.3

(SEQ ID NO: 138)

1
ggcgcaacgc tgagcagctg gcgcgtcccg cgcggcccca gttctgcgca gcttcccgag

61
gctccgcacc agccgcgctt ctgtccgcct gcagggcatt ccagaaagat gaggatattt

121
gctgtcttta tattcatgac ctactggcat ttgctgaacg catttactgt cacggttccc

181
aaggacctat atgtggtaga gtatggtagc aatatgacaa ttgaatgcaa attcccagta

241
gaaaaacaat tagacctggc tgcactaatt gtctattggg aaatggagga taagaacatt

301
attcaatttg tgcatggaga ggaagacctg aaggttcagc atagtagcta cagacagagg

361
gcccggctgt tgaaggacca gctctccctg ggaaatgctg cacttcagat cacagatgtg

421
aaattgcagg atgcaggggt gtaccgctgc atgatcagct atggtggtgc cgactacaag

481
cgaattactg tgaaagtcaa tgccccatac aacaaaatca accaaagaat tttggttgtg

541

gatccagtca cctctgaaca tgaactgaca tgtcaggctg agggctaccc caaggccgaa

601

gtcatctgga caagcagtga ccatcaagtc ctgagtggta agaccaccac caccaattcc

661

aagagagagg agaagctttt caatgtgacc agcacactga gaatcaacac
aacaactaat

721

gagattttct actgcacttt taggagatta gatcctgagg aaaaccatac agctgaattg

781

gtcatcccag aactacctct ggcacatcct ccaaatgaaa ggactcactt ggtaattctg

841

ggagccatct tattatgcct tggtgtagca ctgacattca tcttccgttt aagaaaaggg

901

agaatgatgg atgtgaaaaa atgtggcatc caagatacaa actcaaagaa gcaaagtgat

961

acacatttgg aggagacgta atccagcatt ggaacttctg atcttcaagc agggattctc

1021

aacctgtggt ttaggggttc atcggggctg agcgtgacaa gaggaaggaa tgggcccgtg

1081

ggatgcaggc aatgtgggac ttaaaaggcc caagcactga aaatggaacc tggcgaaagc

1141

agaggaggag aatgaagaaa gatggagtca aacagggagc ctggagggag accttgatac

1201

tttcaaatgc ctgaggggct catcgacgcc tgtgacaggg agaaaggata cttctgaaca

1261

aggagcctcc aagcaaatca tccattgctc atcctaggaa gacgggttga gaatccctaa

1321

tttgagggtc agttcctgca gaagtgccct ttgcctccac tcaatgcctc aatttgtttt

1381

ctgcatgact gagagtctca gtgttggaac gggacagtat ttatgtatga gtttttccta

1441

tttattttga gtctgtgagg tcttcttgtc atgtgagtgt ggttgtgaat gatttctttt

1501

gaagatatat tgtagtagat gttacaattt tgtcgccaaa ctaaacttgc tgcttaatga

1561

tttgctcaca tctagtaaaa catggagtat ttgtaaggtg cttggtctcc tctataacta

1621

caagtataca ttggaagcat aaagatcaaa ccgttggttg cataggatgt cacctttatt

1681

taacccatta atactctggt tgacctaatc ttattctcag acctcaagtg tctgtgcagt

1741

atctgttcca

tttaaatatc agctttacaa ttatgtggta gcctacacac ataatctcat

1801

ttcatcgctg taaccaccct gttgtgataa ccactattat tttacccatc gtacagctga

1861

ggaagcaaac agattaagta acttgcccaa accagtaaat agcagacctc agactgccac

1921

ccactgtcct tttataatac aatttacagc tatattttac tttaagcaat tcttttattc

1981
aaaaaccatt tattaagtgc ccttgcaata tcaatcgctg tgccaggcat tgaatctaca

2041
gatgtgagca agacaaagta cctgtcctca aggagctcat agtataatga ggagattaac

2101
aagaaaatgt attattacaa tttagtccag tgtcatagca taaggatgat gcgaggggaa

2161
aacccgagca gtgttgccaa gaggaggaaa taggccaatg tggtctggga cggttggata

2221
tacttaaaca tcttaataat cagagtaatt ttcatttaca aagagaggtc ggtacttaaa

2281
ataaccctga aaaataacac tggaattcct tttctagcat tatatttatt cctgatttgc

2341
ctttgccata taatctaatg cttgtttata tagtgtctgg tattgtttaa cagttctgtc

2401
ttttctattt aaatgccact aaattttaaa ttcatacctt tccatgattc aaaattcaaa

2461
agatcccatg ggagatggtt ggaaaatctc cacttcatcc tccaagccat tcaagtttcc

2521
tttccagaag caactgctac tgcctttcat tcatatgttc ttctaaagat agtctacatt

2581
tggaaatgta tgttaaaagc acgtattttt aaaatttttt tcctaaatag taacacattg

2641
tatgtctgct gtgtactttg ctatttttat ttattttagt gtttcttata tagcagatgg

2701
aatgaatttg aagttcccag ggctgaggat ccatgccttc tttgtttcta agttatcttt

2761
cccatagctt ttcattatct ttcatatgat ccagtatatg ttaaatatgt cctacatata

2821
catttagaca accaccattt gttaagtatt tgctctagga cagagtttgg atttgtttat

2881
gtttgctcaa aaggagaccc atgggctctc cagggtgcac tgagtcaatc tagtcctaaa

2941
aagcaatctt attattaact ctgtatgaca gaatcatgtc tggaactttt gttttctgct

3001
ttctgtcaag tataaacttc actttgatgc tgtacttgca aaatcacatt ttctttctgg

3061
aaattccggc agtgtacctt gactgctagc taccctgtgc cagaaaagcc tcattcgttg

3121
tgcttgaacc cttgaatgcc accagctgtc atcactacac agccctccta agaggcttcc

3181
tggaggtttc gagattcaga tgccctggga gatcccagag tttcctttcc ctcttggcca

3241
tattctggtg tcaatgacaa ggagtacctt ggctttgcca catgtcaagg ctgaagaaac

3301
agtgtctcca acagagctcc ttgtgttatc tgtttgtaca tgtgcatttg tacagtaatt

3361
ggtgtgacag tgttctttgt gtgaattaca ggcaagaatt gtggctgagc aaggcacata

3421
gtctactcag tctattccta agtcctaact cctccttgtg gtgttggatt tgtaaggcac

3481
tttatccctt ttgtctcatg tttcatcgta aatggcatag gcagagatga tacctaattc

3541
tgcatttgat tgtcactttt tgtacctgca ttaatttaat aaaatattct tatttatttt

3601
gttacttggt acaccagcat gtccattttc ttgtttattt tgtgtttaat aaaatgttca

3661
gtttaacatc ccagtggaga aagttaaaaa a

B3 CD96-CD96 antigen; T cell activation mRNA NM_198196.2

(SEQ ID NO: 139)

1
ttcctgtcta cgtttcattt cctgggggct tgccaagtga taaacagacc caggcgtgtg

61
tggtagagtt cgggtttttt agcacgaagt gggtggctgg agtttgcttg aaaacatcaa

121
ttgactttgt gatcattaca gaaatgctgg tgtaaggtgt tcagaagaca atggagaaaa

181
aatggaaata ctgtgctgtc tattacatca tccagataca ttttgtcaag ggagtttggg

241
aaaaaacagt caacacagaa gaaaatgttt atgctacact tggctctgat gtcaacctga

301
cctgccaaac acagacagta ggcttcttcg tgcagatgca atggtccaag gtcaccaata

361
agatagacct gattgctgtc tatcatcccc aatacggctt ctactgtgcc tatgggagac

421
cctgtgagtc acttgtgact ttcacagaaa ctcctgagaa tgggtcaaaa tggactctgc

481
acttaaggaa tatgtcttgt tcagtcagtg gaaggtacga gtgtatgctt gttctgtatc

541
cagagggcat tcagactaaa atctacaacc ttctcattca gacacacgtt acagcagatg

601
aatggaacag caaccatacg atagaaatag agataaatca gactctggaa ataccatgct

661

ttcaaaatag ctcctcaaaa atttcatctg agttcaccta

tgcatggtcg gtggaaaaca

721

gcagcacgga ttcttgggtc cttctttcta agggtataaa ggaggataat
ggaactcagg

781

aaacacttat ctcccaaaat cacctcatca gcaattccac attacttaaa gatagagtca

841

agcttggtac agactacaga ctccacctct ctccagtcca aatcttcgat gatgggcgga

901

agttctcttg ccacattaga gtcggtccta acaaaatctt gaggagctcc accacagtca

961

aggtttttgc taaaccagaa atccctgtga ttgtggaaaa taactccacg gatgtcttgg

1021

tagagagaag atttacctgc ttactaaaga atgtatttcc caaagcaaat atcacatggt

1081

ttatagatgg aagttttctt catgatgaaa aagaaggaat atatattact aatgaagaga

1141

gaaaaggcaa agatggattt ttggaactga agtctgtttt aacaagggta catagtaata

1201

aaccagccca atcagacaac ttgaccattt ggtgtatggc tctgtctcca gtcccaggaa

1261

ataaagtgtg gaacatctca tcagaaaaga tcacttttct cttaggttct gaaatttcct

1321

caacagaccc tccactgagt gttacagaat ctacccttga cacccaacct tctccagcca

1381

gcagtgtatc tcctgcaaga tatccagcta catcttcagt gacccttgta gatgtgagtg

1441

ccttgaggcc aaacaccact cctcaaccca gcaattccag tatgactacc cgaggcttca

1501

actatccctg gacctccagt gggacagata ccaaaaaatc agtttcacgg atacctagtg

1561

aaacatacag ttcatccccg tcaggtgcag gctcaacact tcatgacaat gtctttacca

1621

gcacagccag agcattttca gaagtcccca caactgccaa tggatctacg aaaactaatc

1681

acgtccatat cactggtatt gtggtcaata agcccaaaga tggaatgtcc tggccagtga

1741

ttgtagcagc tttactcttt tgctgcatga tattgtttgg tcttggagtg agaaaatggt

1801

gtcagtacca aaaagaaata atggaaagac ctccaccttt caagccacca ccacctccca

1861

tcaagtacac ttgcattcaa gagcccaacg aaagtgatct gccttatcat gagatggaga

1921

ccctctagtc tcgtgagact ttgccccatg gcagaactct gctggaatcc tattgagaag

1981

gtagacattg tgctttatta atatagtcgc tcttcagcca tgcctttgct gcagctgaaa

2041

tggaagtcag aagtgagtga cctgttttcc cagcaactca ccctcttcca tctccaaacg

2101

cctgaagctt aaccaagagt gagaggatat gtcatgttca cactcaatgc aattcgtagt

2161

ggttttcttg cttatgtaag aagtacatat tagtctgcca tctttaaaaa aaaatacagt

2221

attttcattt aaattctctg atggagggac aacaatggtt tcaactgtat gcccatgcct

2281

gatcctctta tttgaacatc tatcaacatt gtaaactctt tgccaaaatc ctggggcttt

2341

gctgcattcc ctaagataat tacaggaaaa agaaaatgta aaagtgctaa caaggctgcc

2401

aagtaatgga gaagtatggt tagtcttcat attgaaattc tgttgcttat tttcatggaa

2461

ggaaacagaa tactttgcac aggaaccaca ttttcaatcc tccttcactg tcttcctacc

2521

atgttcagcc cagactcctg ccacatggac caggatgaag agggatcaaa gagataatta

2581

gccaaaaacc cagtagccta gaagatacaa aactccactg gcctctaaaa ttatattagc

2641

caagagtggt ttcatttgag tgccttcgtg tgtatgtcca tcaaactgga accaaactgt

2701

tttgtaagta aacaggcagc ctaagcccaa ccctactttc taattccagt tattctcttt

2761

ttcatctggg gatttacctg ttcatttaat ctgcctgttt tgatctgttt tgaaaaagat

2821

aaagagcctc aaatcagacc agcactgatt aattaaccct gctcctacca atctttttta

2881

aagcagttga agcagaatgt ataggtgtca gagaagaaac ctagtcagcc agacgtgctc

2941

tgtattcagc aatagtttgt gaatgaataa attactaatc ctccttgtcg cttgaaacct

3001

tcccacactc cctgctccag gagggaaaaa cagatgttgt tgacagatag agtgataggc

3061

aaattctgtg tggactttag tcccaaaagg aaactttagt tcacttgcag tatgcttatc

3121

cttgactgca catgagaatg ccttgtgcag agttatttgg agattatgtc tttttcttaa

3181

acaccatggc tgtcacactt cagttcaatt aaatcagaat gtctgaggag tgagacacag

3241

gcatcaacac tctcaaatga ttcacatgtt cagccaaagt tgagaaccat cgagcctgtg

3301

gaagttcttt ctcatggctc agaatcttag gtaggtgctt aactcttgtg gtggccagcc

3361

tccaagatga gccccagtgt tcttgcctcc tactattcac atctttatgt ggtcccctcc

3421

aatgctgaat acagatgatt tgtgtaacct gaggccagga ttaaggggag gcaatcaatg

3481

cacctaggga aaaaatttaa

ggaggtattc acactcaggg tcatgcactt gcacaatgtt

3541

gagaatgagt accactctca ccattggtat
agccaaaaaa gcttggaagt gaccaaggct

3601

aggtcacaaa atacactgtg gcttcttctt tgatctctct ttgaccatac tgacactggg

3661

aaaagcccat tcccatgcca tgaagacacc aaggcagccc tattgagaaa tctacctgtc

3721

gtggccgggc gcagtggctc acgcctgtaa tcccagcact ttgggaggcc gaggtgggtg

3781
gatcacgagg tcaggagatc gagaccatcc tggctaacac agtgaaaccc cgtctctact

3841
aaaaatacaa aaaattagcc gggtgtggtg tcgggcacct gtagtcccag ctactcagga

3901
ggctgaggca ggagaagggt gggaacccgg gaggcagagc ttgcagtgag ccgagattgt

3961
gccactgcac actccaatct gggtgaaaga ccgagactcc gcctcaaaaa aaaaaaaaaa

4021
agaaagaaag aaagaaagaa agaaatctac ctgtcaagga actaaggtat tttgctaaca

4081
agcaccaact tgccagccat gtaagggagc catcttggaa gcagatcctc cagcctccag

4141
tcaagtcttc agataattgc aacttcagtt gatcttttga ccaagacctc aagagagcca

4201
gaactaccca gctaagcctt ttactaaatt tctgaacttc taacactatt agataataag

4261
tgcttattgt ttaacaccat taattttgag tataatttgt tacatagcga cagataacta

4321
tacagctcaa caactagaaa aataaactgt ttacctgcct taattattta tctttagttc

4381
cttattagtt ctcaagaaac aaatgctagc ttcatatgta tggctgttgc tttgcttcat

4441
gtgtatggct atttgtattt aacaagactt aatcatcagt a

B4 CDH23-cadherin-related 23 mRNA NM_022124.5

(SEQ ID NO: 140)

1
gcggcggcgg cggctcggga gagagggacg cgggctgcag gcgcgatgct tggctagagg

61
acgcgtccga cggcggccgg acgctgaggt ggtcggggct agtcagcccg gcctgggcat

121
ggagcgcggg gtggcagagc ctctggacgt ttggggcgcg cccagtccga gcccccggcg

181
cgcctgaagt tgcgagcggc gagcggcgag cggcgagcgg cccgcggaga cccaggagct

241
gccggcacgc cgcggatgag ccttcgcgcc ggcgggaaga cgcggcggtg gccagggcca

301
gagcaggcgg cccgcggggg ccgatccggc ggagagcaga gcccgaggcg aggcgaggcg

361
cggcgccgct gcacacacgc acacggagcc atggggcgcc atgttgccac cagctgccac

421
gtggcctggc ttttggtgct gatctctgga tgctggggcc aggtgaaccg gctgcccttc

481
ttcaccaacc acttctttga tacatacctg ctgatcagcg aggacacgcc tgtgggttct

541
tctgtgaccc agttgctggc ccaagacatg gacaatgacc ccctggtgtt tggcgtgtct

601
ggggaggagg cctctcgctt ctttgcagtg gagcctgaca ctggcgtggt gtggctccgg

661
cagccactgg acagagagac caagtcagag ttcaccgtgg agttctctgt cagcgaccac

721
cagggggtga tcacacggaa ggtgaacatc caggttgggg atgtgaatga caacgcgccc

781
acatttcaca atcagcccta cagcgtccgc atccctgaga atacaccagt ggggacgccc

841
atcttcatcg tgaatgccac agaccccgac ttgggggcag ggggcagcgt cctctactcc

901
ttccagcccc cctcccaatt cttcgccatt gacagcgccc gcggtatcgt cacagtgatc

961
cgggagctgg actacgagac cacacaggcc taccagctca cggtcaacgc cacagatcaa

1021
gacaagacca ggcctctgtc caccctggcc aacttggcca tcatcatcac agatgtccag

1081
gacatggacc ccatcttcat caacctgcct tacagcacca acatctacga gcattctcct

1141
ccgggcacga cggtgcgcat catcaccgcc atagaccagg ataaaggacg tccccggggc

1201
attggctaca ccatcgtttc agggaatacc aacagcatct ttgccctgga ctacatcagc

1261
ggagtgctga ccttgaatgg cctgctggac cgggagaacc ccctgtacag ccatggcttc

1321
atcctgactg tgaagggcac ggagctgaac gatgaccgca ccccatctga cgctacagtc

1381
accacgacct tcaatatcct ggttattgac atcaatgaca atgccccgga gttcaacagc

1441
tccgagtaca gcgtggccat cactgagctg gcacaggtcg gctttgccct tccactcttc

1501
atccaggtgg tggacaagga tgagaatttg ggcctgaaca gcatgtttga ggtgtacttg

1561
gtggggaaca actcccacca cttcatcatc tccccgacct ccgtccaggg gaaggcggac

1621
attcgtattc gggtggccat cccactggac tacgagaccg tggaccgcta cgactttgat

1681
ctctttgcca atgagagtgt gcctgaccat gtgggctatg ccaaggtgaa gatcactctc

1741
atcaatgaaa atgacaaccg gcccatcttc agccagccac tgtacaacat cagcctgtac

1801
gagaacgtca ccgtggggac ctctgtgctg acagtcctgg caactgacaa tgatgcaggc

1861
acctttgggg aagtcagcta cttcttcagt gatgaccctg acaggttctc gctggacaag

1921
gacacgggac tcatcatgct gattgccagg ctggactatg agctcatcca gcgcttcacc

1981
ctgacgatca ttgcccggga cgggggcggc gaggagacca caggccgggt caggatcaat

2041
gtgttggatg tcaacgacaa cgtgcccacc ttccagaagg atgcctacgt gggtgctctg

2101
cgggagaacg agccttctgt cacacagctg gtgcggctcc gggcaacaga tgaagactcc

2161
cctcccaaca accagatcac ctacagcatt gtcagtgcat ctgcctttgg cagctacttc

2221
gacatcagcc tgtacgaggg ctatggagtg atcagcgtca gtcgccccct ggattatgaa

2281
cagatatcca atgggctgat ttatctgacg gtcatggcca tggatgctgg caacccccct

2341
ctcaacagca ccgtccctgt caccatcgag gtgtttgatg agaatgacaa ccctcccacc

2401
ttcagcaagc ccgcctactt cgtctccgtg gtggagaaca tcatggcagg agccacggtg

2461
ctgttcctga atgccacaga cctggaccgc tcccgggagt acggccagga gtccatcatc

2521
tactccttgg aaggctccac ccagtttcgg atcaatgccc gctcagggga aatcaccacc

2581
acgtctctgc ttgaccgaga gaccaagtct gaatacatcc tcatcgttcg cgcagtggac

2641
gggggtgtgg gccacaacca gaaaactggc atcgccaccg taaacatcac cctcctggac

2701
atcaatgaca accaccccac gtggaaggac gcaccctact acatcaacct ggtggagatg

2761
acccctccag actctgatgt gaccacggtg gtggctgttg acccagacct gggggagaat

2821
ggcaccctgg tgtacagcat ccagccaccc aacaagttct acagcctcaa cagcaccacg

2881
ggcaagatcc gcaccaccca cgccatgctg gaccgggaga accccgaccc ccatgaggcc

2941
gagctgatgc gcaaaatcgt cgtctctgtt actgactgtg gcaggccccc tctgaaagcc

3001
accagcagtg ccacagtgtt tgtgaacctc ttggatctca atgacaatga ccccaccttt

3061
cagaacctgc cttttgtggc cgaggtgctt gaaggcatcc cggcgggggt ctccatctac

3121
caagtggtgg ccatcgacct cgatgagggc ctgaacggcc tggtgtccta ccgcatgccg

3181
gtgggcatgc cccgcatgga cttcctcatc aacagcagca gcggcgtggt ggtcaccacc

3241
accgagctgg accgcgagcg catcgcggag taccagctgc gggtggtggc cagtgatgca

3301
ggcacgccca ccaagagctc caccagcacg ctcaccatcc atgtgctgga tgtgaacgac

3361
gagacgccca ccttcttccc ggccgtgtac aatgtgtctg tgtccgagga cgtgccacgc

3421
gagttccggg tggtctggct gaactgcacg gacaacgacg tgggcctcaa tgcagagctc

3481
agctacttca tcacaggtgg caacgtggat gggaagttca gcgtgggtta ccgcgatgcc

3541
gttgtgagaa ccgtggtggg cctggaccgg gagaccacag ccgcctacat gctcatcctg

3601
gaggccatcg acaacggccc tgtagggaag cgacacacgg gcacagccac cgtgttcgtc

3661
actgtcctgg atgtgaatga caaccggccc atctttctgc agagcagcta tgaggccagc

3721
gtccctgagg acatccctga aggccacagc atcttgcagc tgaaagccac ggacgcagat

3781
gagggcgagt ttgggcgtgt gtggtaccgc atcctccatg gtaaccatgg caacaacttc

3841
cggatccatg tcagcaatgg gctcctgatg cgagggcccc ggcccctgga ccgggagcgg

3901
aactcatccc acgtgctgat agtggaggcc tacaaccacg acctgggccc catgcggagc

3961
tccgtcaggg tgattgtgta cgtggaggac atcaacgatg aggcccccgt gttcacacag

4021
cagcagtaca gccgtctggg gcttcgagag accgcaggca ttggaacgtc agtcatcgtg

4081
gtccaagcca cagaccgaga ctctggggat ggtggcctgg tgaactaccg catcctgtcg

4141
ggcgcagagg ggaagtttga gattgacgag agcacagggc ttatcatcac cgtgaattac

4201
ctggactacg agaccaagac cagctacatg atgaatgtgt cggccactga ccaggccccg

4261
cccttcaacc agggcttctg cagcgtctac atcactctgc tcaacgagct ggacgaggcc

4321
gtgcagttct ccaatgcctc atacgaggct gccatcctgg agaatctggc actgggtact

4381
gagattgtgc gggtccaggc ctactccatc gacaacctca accaaatcac gtaccgcttc

4441
aacgcctaca ccagcaccca ggccaaagcc ctcttcaaga tagacgccat cacgggtgtg

4501
atcacagtcc agggcctggt ggaccgtgag aagggcgact tctatacctt gacagtggtg

4561
gcagatgacg gcggccccaa ggtggactcc accgtgaagg tctacatcac tgtgctggac

4621
gagaatgaca acagcccccg gtttgacttc acctccgact cggcggtcag catacccgag

4681
gactgccctg tgggccagcg agtggctact gtcaaggcct gggaccctga tgctggcagc

4741
aatgggcagg tggtcttctc cctggcctct ggcaacatcg cgggggcctt tgagatcgtc

4801
accaccaatg actccattgg cgaagtgttt gtggccaggc ccctggacag agaagagctg

4861
gatcactaca tcctccaggt tgtggcttct gaccgaggca cccctccacg gaagaaggac

4921
cacatcctgc aggtgaccat cctggacatc aatgacaacc ctccagtcat cgagagcccc

4981
tttggataca atgtcagtgt gaatgagaac gtgggtggag gtactgctgt ggtccaggtg

5041
agagccactg accgtgacat cgggatcaac agtgttctgt cctactacat caccgagggc

5101
aacaaggaca tggccttccg catggaccgc atcagcggtg agatcgccac acggcctgcc

5161
ccgcctgacc gcgagcgcca gagcttctac cacctggtgg ccactgtgga ggacgagggc

5221
accccaaccc tgtcggccac cacgcacgtg tacgtgacca ttgtggatga gaatgataac

5281
gcgcccatgt tccagcagcc ccactatgag gtgctgctgg atgagggccc agacacgctc

5341
aacaccagcc tcatcaccat ccaggcactg gacctggatg agggtcccaa cggcacagtc

5401
acctatgcca tcgtcgcagg caacatcgtc aacaccttcc gcatcgacag acacatgggt

5461
gtcatcactg ctgccaaaga gctggactac gagatcagcc acggccgcta caccctgatc

5521
gtcactgcca cagaccagtg ccccatctta tcccaccgcc tcacctctac caccacggtg

5581
cttgtgaatg tgaatgacat caacgacaat gtgcctacct tcccccggga ctatgaggga

5641
ccatttgaag tcactgaggg ccagccgggg cccagagtgt ggaccttcct ggcccatgac

5701
cgagactcag gacccaacgg gcaggtggag tacagcatca tggatggaga ccctctgggg

5761
gagtttgtga tctctcctgt ggagggggtg ctaagggtcc ggaaggacgt ggagctggac

5821
cgggagacca tcgccttcta caacctgacc atctgtgccc gtgaccgggg gatgccccca

5881
ctcagctcca caatgctggt ggggatccgg gtgctggaca tcaacgacaa cgaccctgtg

5941
ctgctgaacc tgcccatgaa catcaccatc agcgagaaca gccctgtctc cagctttgtc

6001
gcccatgtcc tggccagtga cgctgacagt ggctgcaatg cacgcctcac cttcaacatc

6061
actgcgggca accgcgagcg ggccttcttc atcaatgcca cgacagggat cgtcactgtg

6121
aaccggcccc tggaccgcga gcggatccca gagtacaagc tgaccatttc tgtgaaggac

6181
aacccggaga atccacgcat agccaggagg gattatgact tgcttctgat cttcctttct

6241
gatgagaatg acaaccaccc cctcttcact aaaagcacct accaggcaga ggtgatggaa

6301
aactctcccg ctggcacccc tctcacggtg ctcaatgggc ccatcctggc cctggatgca

6361
gaccaagaca tctacgccgt ggtgacctac cagctgctgg gtgcccagag tggcctcttt

6421
gacatcaaca gcagcaccgg tgtggtgacc gtgaggtcag gtgtcatcat tgaccgggag

6481
gcattctcgc cacccatcct ggagctgctg ctgctggctg aggacatcgg gctgctcaac

6541
agcacggccc acctgctcat caccatcctg gatgacaatg acaaccggcc cacctttagc

6601
cctgccaccc tcactgtcca tctgctagag aactgcccgc ctggattctc agtccttcaa

6661
gtcacagcca cagatgagga cagtggcctc aatggggagc tggtctaccg aatagaagct

6721
ggggctcagg accgcttcct cattcatctg gtcaccgggg tcatccgtgt tggtaatgcc

6781
accatcgaca gagaggagca ggagtcctac aggctaacgg tggtggccac cgaccggggc

6841
accgttcctc tctcgggcac agccattgtc accattctga tcgatgacat caatgactcc

6901
cgccccgagt tcctcaaccc catccagaca gtgagcgtgc tggagtcggc tgagccaggc

6961
actgtcattg ccaatatcac ggccattgac cacgacctca acccaaagct agagtaccac

7021
attgtcggca ttgtggccaa ggacgacact gatcgcctgg tgcccaacca ggaggacgcc

7081
tttgctgtga atatcaacac aggatctgta atggtgaagt cccccatgaa tcgggagctg

7141
gttgccacct atgaggtcac tctctcagtg attgacaatg ccagcgacct accagagcgc

7201
tctgtcagtg tgccaaatgc caagctgact gtcaacgtcc tggacgtcaa tgacaatacg

7261
ccccagttca agccctttgg gatcacctac tacatggagc ggatcctgga gggggccacc

7321
cctgggacca cactcattgc tgtggcagcc gtggaccctg acaagggcct taatgggctg

7381
gtcacctaca ccctgctgga cctggtgccc ccagggtatg tccagctgga ggactcctcg

7441
gcagggaagg tcattgccaa ccggacagtg gactacgagg aggtgcactg gctcaacttt

7501
accgtgaggg cctcagacaa cgggtccccg ccccgggcag ctgagatccc tgtctacctg

7561
gaaatcgtgg acatcaatga caacaacccc atctttgacc agccctccta ccaggaggct

7621
gtctttgagg atgtgcctgt gggcacaatc atcctgacag tcactgccac tgatgctgac

7681
tcaggcaact ttgcactcat tgagtacagc cttggagatg gagagagcaa gtttgccatc

7741
aaccccacca cgggtgacat ctatgtgctg tcttctctgg accgggagaa gaaggaccac

7801
tatatcctga ctgccttggc caaagacaac cctggggatg tagccagcaa ccgtcgcgaa

7861
aattcagtgc aggtggtgat ccaagtgctg gatgtcaatg actgccggcc acagttctcc

7921
aagccccagt tcagcacaag cgtgtatgag aatgagccgg cgggcacctc ggtcatcacc

7981
atgatggcca ctgaccagga tgaaggtccc aatggagagt tgacctactc acttgagggc

8041
cctggcgtgg aggccttcca tgtggacatg gactcgggct tggtgaccac acagcggcca

8101
ctgcagtcct acgagaagtt cagtctgacc gtggtggcca cagatggtgg agagccccca

8161
ctctggggca ccaccatgct cctggtggag gtcatcgacg tcaatgacaa ccgccctgtc

8221
tttgtgcgcc cacccaacgg caccatcctc cacatcagag aggagatccc gctgcgctcc

8281
aacgtgtacg aggtctacgc cacggacaag gatgagggcc tcaacggggc ggtgcgctac

8341
agcttcctga agactgcggg caaccgggac tgggagttct tcatcatcga cccaatcagc

8401
ggcctcatcc agactgctca gcgcctggac cgcgagtcgc aggcggtgta cagcctcatc

8461
ttggtggcca gcgacctggg ccagccagtg ccatacgaga ctatgcagcc gctgcaggtg

8521
gccctggagg acatcgatga caacgaaccc cttttcgtga ggcctccaaa aggcagcccc

8581
cagtaccagc tgctgacagt gcctgagcac tcaccacgcg gcaccctcgt gggcaacgtg

8641
acaggcgcag tggatgcaga tgagggcccc aacgcgatcg tgtactactt catcgcagcc

8701
ggcaacgaag agaagaactt ccatctgcag cccgatgggt gtctgctggt gctgcgggac

8761
ctggaccggg agcgagaagc catcttctcc ttcatcgtca aggcctccag caatcgcagc

8821
tggacacctc cccgtggacc ctccccaacc ctcgacctgg ttgctgacct cacactgcag

8881
gaggtgcgcg ttgtgctaga ggacatcaac gaccagccac cacgcttcac caaggctgag

8941
tacactgcag gggtggccac cgacgccaag gtgggctcag agttgatcca ggtgctggcc

9001
ctggatgcag acattggcaa caacagcctt gtcttctaca gcattctggc catccactac

9061
ttccgggccc ttgccaacga ctctgaagat gtgggccagg tcttcaccat ggggagcatg

9121
gacggcattc tgcgcacctt cgacctcttc atggcctaca gccccggcta cttcgtggtg

9181
gacattgtgg cccgagacct ggcaggccac aacgacacgg ccatcatcgg catctacatc

9241
ctgagggacg accagcgcgt caagatcgtc attaacgaga tccccgaccg tgtgcgcggc

9301
ttcgaggagg agttcatcca cctgctctcc aacatcactg gggccattgt caatactgac

9361
aatgtgcagt tccatgtgga caagaagggc cgggtgaact ttgcgcagac agaactgctt

9421
atccacgtgg tgaaccgcga taccaaccgc atcctggacg tggaccgggt gatccagatg

9481
atcgatgaga acaaggagca gctacggaat cttttccgga actacaacgt cctggacgtg

9541
cagcctgcca tctctgtccg gctgccggat gacatgtctg ccctgcagat ggcgatcatc

9601
gtcctggcta tcctcctgtt cctggccgcc atgctctttg tcctcatgaa ctggtactac

9661
aggactgtac acaagaggaa gctcaaggcc attgtggctg gctcagctgg gaatcgtggc

9721

ttcatcgaca tcatggacat gcctaacacc aacaagtact cctttgatgg agccaaccct

9781

gtgtggctgg atcccttctg tcggaacctg gagctggccg cccaggcgga gcatgaggat

9841

gacctaccgg agaacctgag tgagatcgcc gacctgtgga acagccccac gcgcacccat

9901

ggaacttttg ggcgtgagcc agcagctgtc aagcctgatg atgaccgata cctgcgggct

9961

gccatccagg agtatgacaa cattgccaag ctgggccaga tcattcgtga ggggccaatc

10021

aagggctcgc

tgctgaaggt ggtcctggag gattacctgc ggctcaaaaa gctctttgca

10081

cagcggatgg tgcaaaaagc

ctcctcctgc cactcctcca tctctgagct gatacagact

10141

gagctggacg aggagccagg agaccacagc ccagggcagg gtagcctgcg cttccgccac

10201

aagccaccag tggagctcaa ggggcccgat gggatccatg tggtgcacgg cagcacgggc

10261

acgctgctgg ccaccgacct caacagcctg cccgaggaag accagaaggg cctgggccgc

10321

tcgctggaga cgctgaccgc tgccgaggcc actgccttcg agcgcaacgc ccgcacagaa

10381

tccgccaaat ccacacccct gcacaaactt cgcgacgtga tcatggagac ccccctggag

10441

atcacagagc tgtgactaga cagggaagcc ttgtgggtgt gagcagcacc catccaccgt

10501

cccctcccag ggagcaaggg cagggacagg gccggtcggg ggggaccctc caaggccagg

10561

ccttggggac aaccttggct tggccctggc agcccgcatc agctgctcag

atcccacttt

10621

tgccagacgc tcattcagca tctgacctct accttcataa gatctgttat ttttataaga

10681

aaaccaaaca aaaatgttaa gcatctaagg acaaggtaag gagggtcact ggggcccaag

10741

agtctgggga ccagcttggc tcaggctgag ctgaaagagg ccaaacaggc cctcctccct

10801

cccagctcca ccccgcaagc accatcccct ccggctaagc aggcgcaagg gaggcccagc

10861

gcggacatcc cctgctggcc ggacacccga ctccagtcca agtctcgcta catttccgcc

10921

acatccctct ctgctggacg tccaggtgga ggtggcatcc ccacgtggac aagaaagtca

10981

atgtcaatga acaagcattc tctccatttc actggcttcc caaatgtgtg cccagcttat

11041
aaacagaagt gactgatgtt ccctccggtt ttgaatgtgg agtgtttgtg tgtgttcctt

11101
ttttaaatta agttattccc tcaaaaaaaa aaaa

B5 IRF1-interferon regulatory factor 1 mRNA-NM_002198.2

(SEQ ID NO: 141)

1
agagctcgcc actccttagt cgaggcaaga cgtgcgcccg agccccgccg aaccgaggcc

61
acccggagcc gtgcccagtc cacgccggcc gtgcccggcg gccttaagaa cccggcaacc

121
tctgccttct tccctcttcc actcggagtc gcgctccgcg cgccctcact gcagcccctg

181
cgtcgccggg accctcgcgc gcgaccgccg aatcgctcct gcagcagagc caacatgccc

241
atcactcgga tgcgcatgag accctggcta gagatgcaga ttaattccaa ccaaatcccg

301
gggctcatct ggattaataa agaggagatg atcttccaga tcccatggaa gcatgctgcc

361
aagcatggct gggacatcaa caaggatgcc tgtttgttcc ggagctgggc cattcacaca

421
ggccgataca aagcagggga aaaggagcca gatcccaaga cgtggaaggc caactttcgc

481
tgtgccatga actccctgcc agatatcgag gaggtgaaag accagagcag gaacaagggc

541
agctcagctg tgcgagtgta ccggatgctt ccacctctca ccaagaacca gagaaaagaa

601
agaaagtcga agtccagccg agatgctaag agcaaggcca agaggaagtc atgtggggat

661
tccagccctg ataccttctc tgatggactc agcagctcca ctctgcctga tgaccacagc

721
agctacacag ttccaggcta catgcaggac ttggaggtgg agcaggccct gactccagca

781
ctgtcgccat gtgctgtcag cagcactctc cccgactggc acatcccagt ggaagttgtg

841
ccggacagca ccagtgatct gtacaacttc caggtgtcac ccatgccctc cacctctgaa

901
gctacaacag atgaggatga ggaagggaaa ttacctgagg acatcatgaa gctcttggag

961
cagtcggagt ggcagccaac aaacgtggat gggaaggggt acctactcaa tgaacctgga

1021
gtccagccca cctctgtcta tggagacttt agctgtaagg aggagccaga aattgacagc

1081
ccaggggggg atattgggct gagtctacag cgtgtcttca cagatctgaa gaacatggat

1141
gccacctggc tggacagcct gctgacccca gtccggttgc cctccatcca ggccattccc

1201
tgtgcaccgt agcagggccc ctgggcccct cttattcctc taggcaagca ggacctggca

1261
tcatggtgga tatggtgcag agaagctgga cttctgtggg cccctcaaca gccaagtgtg

1321
accccactgc caagtgggga tggggcctcc ctccttgggt cattgacctc tcagggcctg

1381
gcaggccagt gtctgggttt ttcttgtggt gtaaagctgg ccctgcctcc tgggaagatg

1441

aggttctgag accagtgtat caggtcaggg acttggacag gagtcagtgt ctggcttttt

1501

cctctgagcc cagctgcctg gagagggtct cgctgtcact ggctggctcc taggggaaca

1561

gaccagtgac cccagaaaag cataacacca
atcccagggc tggctctgca ctaagagaaa

1621

attgcactaa atgaatctcg ttcccaaaga actaccccct
tttcagctga gccctgggga

1681

ctgttccaaa gccagtgaaa tgtgaaggaa agtggggtcc ttcggggcga tgctccctca

1741

gcctcagagg agctctaccc tgctccctgc tttggctgag gggcttggga aaaaaacttg

1801

gcactttttc gtgtggatct tgccacattt ctgatcagag gtgtacacta acatttcccc

1861

cgagctcttg gcctttgcat ttatttatac agtgccttgc tcggcgccca ccaccccctc

1921

aagccccagc agccctcaac aggcccaggg agggaagtgt gagcgccttg gtatgactta

1981

aaattggaaa tgtcatctaa
ccattaagtc atgtgtgaac acataaggac gtgtgtaaat

2041

atgtacattt gtctttttat aaaaagtaaa ttgtttataa ggggtgtggc ctttttagag

2101
agaaatttaa cttgtagatg attttacttt ttatggaaac actgatggac ttattattgg

2161
catcccgcct gaacttgact ttggggtgaa cagggacatg catctattat aaaatccttt

2221
cggccaggcg cggtggctca cacctgtaat cccagcactt tgggaggccg agatgggtgg

2281
atcacctgag gtcaggagtt cgagaccagc ctggtgaaac tccatttcta ctaaaaatgc

2341
aaaaattagc tgggcgtggt tgcgggtgct tgtaatccca gctactcagg aggctgaggc

2401
aagagaatcg cttgaacctg ggaggtggag gttgcagtga gccgagaaca tgccattgca

2461
ctccagcccg ggcaccaaaa aaaaaaaaaa aaaaaaaaac ctttcatttg gccgggcatg

2521
gtggcttatg cctgtaatcc tggcactttg ggaggccaag gtgggcagat cacctgaggt

2581
caggagtttg agaccagcct ggccaacatg gtgaaacctc atctctacta aaaatacaaa

2641
aattaggccg ggcacggtgg ctcacgcctg taatcccagc actttgggag gcagaggcgg

2701
gcggatcacg aggtcaggag atcaagacca tcctggctaa cacggtgaaa ccccgtctct

2761
actaaaaata taaaaaatta gccgggccta gtggcgggtg cctgtagtcc cagctactcg

2821
ggaggctgag gcaggagaat ggcatgaacc ccggaggcag agcttgcagt gagccgagat

2881
tgcaccactg cactacagcc tgggcgacag agcgagactc cgtctcaaaa aaaaaaaaaa

2941
aaattagccg ggcctggtgg cgggcgcctg taatcccagc tactgtggag gctgaagcac

3001
aagaatcact tgaacccggg agatggaggt tgcagtgagc tgagactgtg ccactgcact

3061
ccagcctggg tgacaagagt gagactttgt ctcaaaaaaa aaaaaatcct tttgtttatg

3121
ttcacataga caatggcaga aggaggggac attcctgtca taggaacatg cttatataaa

3181
catagtcacc tgtccttgac tatcaccagg gctgtcagtt gattctgggc tcctggggcc

3241
caaggagtgt taagttttga ggcatgtgcc ataggtgatg tgtcctgcta acacacagat

3301
gctgctccaa aaagtcagtt gatatgacac agtcacagac agaacagtca gcagcccaag

3361
aaaggtcctc acggctgctg tgctgggtag cacttgccat ccagtttcta gagtgatgaa

3421
atgctctgtc tgtaccgttc aatacagtag gcactggcac tagccacatg tgccagctaa

3481
gcacttgaaa tgtggccagt gcaataagga attgaacttt taattgcatt taataaactg

3541
tatgtaaata gtcaaaaaaa aaaaaaa

B6 GBP1-interferon-induced guanylate-binding protein 1 mRNA

NM_002053.2

(SEQ ID NO: 142)

1
ggagtcagtg atttgaacga agtactttca gtttcatatt actctaaatc cattacaaat

61
ctgcttagct tctaaatatt tcatcaatga ggaaatccca gccctacaac ttcggaacag

121
tgaaatatta gtccagggat ccagtgagag acacagaagt gctagaagcc agtgctcgtg

181
aactaaggag aaaaagaaca gacaagggaa cagcctggac atggcatcag agatccacat

241
gacaggccca atgtgcctca ttgagaacac taatgggcga ctgatggcga atccagaagc

301
tctgaagatc ctttctgcca ttacacagcc tatggtggtg gtggcaattg tgggcctcta

361
ccgcacaggc aaatcctacc tgatgaacaa gctggctgga aagaaaaagg gcttctctct

421
gggctccacg gtgcagtctc acactaaagg aatctggatg tggtgtgtgc cccaccccaa

481
gaagccaggc cacatcctag ttctgctgga caccgagggt ctgggagatg tagagaaggg

541
tgacaaccag aatgactcct ggatcttcgc cctggccgtc ctcctgagca gcaccttcgt

601
gtacaatagc ataggaacca tcaaccagca ggctatggac caactgtact atgtgacaga

661
gctgacacat agaatccgat caaaatcctc acctgatgag aatgagaatg aggttgagga

721
ttcagctgac tttgtgagct tcttcccaga ctttgtgtgg acactgagag atttctccct

781
ggacttggaa gcagatggac aacccctcac accagatgag tacctgacat actccctgaa

841
gctgaagaaa ggtaccagtc aaaaagatga aacttttaac ctgcccagac tctgtatccg

901
gaaattcttc ccaaagaaaa aatgctttgt ctttgatcgg cccgttcacc gcaggaagct

961
tgcccagctc gagaaactac aagatgaaga gctggacccc gaatttgtgc aacaagtagc

1021
agacttctgt tcctacatct ttagtaattc caaaactaaa actctttcag gaggcatcca

1081
ggtcaacggg cctcgtctag agagcctggt gctgacctac gtcaatgcca tcagcagtgg

1141
ggatctgccg tgcatggaga acgcagtcct ggccttggcc cagatagaga actcagctgc

1201
agtgcaaaag gctattgccc actatgaaca gcagatgggc cagaaggtgc agctgcccac

1261
agaaaccctc caggagctgc tggacctgca cagggacagt gagagagagg ccattgaagt

1321
cttcatcagg agttccttca aagatgtgga ccatctattt caaaaggagt tagcggccca

1381
gctagaaaaa aagcgggatg acttttgtaa acagaatcag gaagcatcat cagatcgttg

1441
ctcagcttta cttcaggtca ttttcagtcc tctagaagaa gaagtgaagg cgggaattta

1501
ttcgaaacca gggggctatc gtctctttgt tcagaagcta caagacctga agaaaaagta

1561
ctatgaggaa ccgaggaagg ggatacaggc tgaagagatt ctgcagacat acttgaaatc

1621
caaggagtct atgactgatg caattctcca gacagaccag actctcacag aaaaagaaaa

1681
ggagattgaa gtggaacgtg tgaaagctga gtctgcacag gcttcagcaa aaatgttgca

1741
ggaaatgcaa agaaagaatg agcagatgat ggaacagaag gagaggagtt atcaggaaca

1801
cttgaaacaa ctgactgaga agatggagaa cgacagggtc cagttgctga aagagcaaga

1861
gaggaccctc gctcttaaac ttcaggaaca ggagcaacta ctaaaagagg gatttcaaaa

1921
agaaagcaga ataatgaaaa atgagataca ggatctccag acgaaaatga gacgacgaaa

1981
ggcatgtacc ataagctaaa gaccagagcc ttcctgtcac ccctaaccaa ggcataattg

2041
aaacaatttt agaatttgga acaagcgtca ctacatttga taataattag atcttgcatc

2101
ataacaccaa aagtttataa aggcatgtgg tacaatgatc aaaatcatgt tttttcttaa

2161
aaaaaaaaaa agactgtaaa ttgtgcaaca aagatgcatt tacctctgta tcaactcagg

2221

aaatctcata

agctggtacc actcaggaga agtttattct tccagatgac cagcagtaga

2281

caaatggata ctgagcagag
tcttaggtaa aagtcttggg aaatatttgg gcattggtct

2341
ggccaagtct acaatgtccc aatatcaagg acaaccaccc tagcttctta gtgaagacaa

2401
tgtacagtta tccgttagat caagactaca cggtctatga gcaataatgt gatttctgga

2461
cattgcccat gtataatcct cactgatgat ttcaagctaa agcaaaccac cttatacaga

2521
gatctagaat ctctttatgt tctccagagg aaggtggaag aaaccatggg caggagtagg

2581

aattgagtga taaacaattg ggctaatgaa
gaaaacttct cttattgttc agttcatcca

2641

gattataact tcaatgggac actttagacc attagacaat tgacactgga ttaaacaaat

2701

tcacataatg ccaaatacac aatgtattta tagcaacgta taatttgcaa agatggactt

2761

taaaagatgc tgtgtaacta aactgaaata attcaattac ttattattta gaatgttaaa

2821
gcttatgata gtcttttcta actcttaaca ctcatacttg aaaactttct gagtttcccc

2881
agaagagaat atgggatttt ttttgacatt tttgactcat ttaataatgc tcttgtgttt

2941
acctagtata tgtagacttt gtcttatgtg tgaaaagtcc taggaaagtg gttgatgttt

3001
cttatagcaa ttaaaaatta tttttgaact gaaaatacaa tgtatttcac

B7 IFIT3-interferon-induced protein with tetratricopeptide repeats 3

mRNA NM_001549.4

(SEQ ID NO: 143)

1
attttcctcc tcccaacgat tttaaattag tttcactttc cagtttcctc ttccttcccc

61
taaaagcaat tactcaaaaa cggagaaaac atcagctgat gcgtgcccta ctctcccacc

121
cctttatata gttccttcag tatttacttg aggcagacag gaagacttct gaagaacaaa

181
tcagcctggt caccagcttt tcggaacagc agagacacag agggcagtca tgagtgaggt

241
caccaagaat tccctggaga aaatccttcc acagctgaaa tgccatttca cctggaactt

301
attcaaggaa gacagtgtct caagggatct agaagataga gtgtgtaacc agattgaatt

361
tttaaacact gagttcaaag ctacaatgta caacttgttg gcctacataa aacacctaga

421
tggtaacaac gaggcagccc tggaatgctt acggcaagct gaagagttaa tccagcaaga

481
acatgctgac caagcagaaa tcagaagtct agtcacttgg ggaaactacg cctgggtcta

541
ctatcacttg ggcagactct cagatgctca gatttatgta gataaggtga aacaaacctg

601
caagaaattt tcaaatccat acagtattga gtattctgaa cttgactgtg aggaagggtg

661
gacacaactg aagtgtggaa gaaatgaaag ggcgaaggtg tgttttgaga aggctctgga

721
agaaaagccc aacaacccag aattctcctc tggactggca attgcgatgt accatctgga

781
taatcaccca gagaaacagt tctctactga tgttttgaag caggccattg agctgagtcc

841
tgataaccaa tacgtcaagg ttctcttggg cctgaaactg cagaagatga ataaagaagc

901
tgaaggagag cagtttgttg aagaagcctt ggaaaagtct ccttgccaaa cagatgtcct

961
ccgcagtgca gccaaatttt acagaagaaa aggtgaccta gacaaagcta ttgaactgtt

1021
tcaacgggtg ttggaatcca caccaaacaa tggctacctc tatcaccaga ttgggtgctg

1081

ctacaaggca aaagtaagac aaatgcagaa tacaggagaa tctgaagcta gtggaaataa

1141

agagatgatt gaagcactaa agcaatatgc tatggactat tcgaataaag ctcttgagaa

1201

gggactgaat cctctgaatg catactccga tctcgctgag ttcctggaga cggaatgtta

1261

tcagacacca

ttcaataagg aagtccctga tgctgaaaag caacaatccc atcagcgcta

1321

ctgcaacctt cagaaatata atgggaagtc tgaagacact gctgtgcaac atggtttaga

1381

gggtttgtcc ataagcaaaa aatcaactga caaggaagag atcaaagacc aaccacagaa

1441

tgtatctgaa aatctgcttc cacaaaatgc accaaattat tggtatcttc aaggattaat

1501

tcataagcag aatggagatc tgctgcaagc agccaaatgt tatgagaagg aactgggccg

1561

cctgctaagg gatgcccctt caggcatagg cagtattttc ctgtcagcat ctgagcttga

1621

ggatggtagt gaggaaatgg gccagggcgc agtcagctcc agtcccagag agctcctctc

1681

taactcagag caactgaact

gagacagagg aggaaaacag agcatcagaa gcctgcagtg

1741

gtggttgtga cgggtaggac gataggaaga
cagggggccc caacctggga ttgctgagca

1801
gggaagcttt gcatgttgct ctaaggtaca tttttaaaga gttgtttttt ggccgggcgc

1861
agtggctcat gcctgtaatc ccagcacttt gggaggccga ggtgggcgga tcacgaggtc

1921
tggagtttga gaccatcctg gctaacacag tgaaatcccg tctctactaa aaatacaaaa

1981
aattagccag gcgtggtggc tggcacctgt agtcccagct acttgggagg ctgaggcagg

2041
agaatggcgt gaacctggaa ggaagaggtt gcagtgagcc aagattgcgc ccctgcactc

2101
cagcctgggc aacagagcaa gactccatct caaaaaaaaa aaaaaaaaaa aaaaagagtt

2161
gttttctcat gttcattata gttcattaca gttacatagt ccgaaggtct tacaactaat

2221
cactggtagc aataaatgct tcaggcccac atgatgctga ttagttctca gttttcattc

2281
agttcacaat ataaccacca ttcctgccct ccctgccaag ggtcataaat ggtgactgcc

2341
taacaacaaa atttgcagtc tcatctcatt ttcatccaga cttctggaac tcaaagatta

2401
acttttgact aaccctggaa tatctcttat ctcacttata gcttcaggca tgtatttata

2461
tgtattcttg atagcaatac cataatcaat gtgtattcct gatagtaatg ctacaataaa

2521
tccaaacatt tcaactctgt taaaaaaaaa aa

B8 IFITM3-interferon induced transmembrane protein 3 mRNA-

NM_021034.2

(SEQ ID NO: 144)

1
aggaaaagga aactgttgag aaaccgaaac tactggggaa agggagggct cactgagaac

61
catcccagta acccgaccgc cgctggtctt cgctggacac catgaatcac actgtccaaa

121
ccttcttctc tcctgtcaac agtggccagc cccccaacta tgagatgctc aaggaggagc

181
acgaggtggc tgtgctgggg gcgccccaca accctgctcc cccgacgtcc accgtgatcc

241

acatccgcag cgagacctcc gtgcccgacc atgtcgtctg gtccctgttc aacaccctct

301

tcatgaaccc ctgctgcctg ggcttcatag cattcgccta ctccgtgaag tctagggaca

361

ggaagatggt tggcgacgtg accggggccc

aggcctatgc ctccaccgcc aagtgcctga

421

acatctgggc cctgattctg ggcatcctca

tgaccattct gctcatcgtc atcccagtgc

481

tgatcttcca ggcctatgga tagatcagga ggcatcactg aggccaggag ctctgcccat

541

gacctgtatc ccacgtactc

caacttccat tcctcgccct gcccccggag ccgagtcctg

601
tatcagccct ttatcctcac acgcttttct acaatggcat tcaataaagt gcacgtgttt

661
ctggtgctaa aaaaaaaa

B9 GK-glycerol kinase mRNA NM_203391.3

(SEQ ID NO: 145)

1
gggccggagg ggcggggtga gaaggctgcg cgcgggtaaa ggggccgcct cgagcgcggt

61
ccgagcgttc agcggacgcg cgcggcctcg atctctggac tcgtcacctg cccctccccc

121
tcccgccgcc gtcacccagg aaaccggccg caatcgccgg ccgacctgaa gctggtttca

181
tggcagcctc aaagaaggca gttttggggc cattggtggg ggcggtggac cagggcacca

241
gttcgacgcg ctttttggtt ttcaattcaa aaacagctga actacttagt catcatcaag

301
tagaaataaa acaagagttc ccaagagaag gatgggtgga acaggaccct aaggaaattc

361
tacattctgt ctatgagtgt atagagaaaa catgtgagaa acttggacag ctcaatattg

421
atatttccaa cataaaagct attggtgtca gcaaccagag ggaaaccact gtagtctggg

481
acaagataac tggagagcct ctctacaatg ctgtggtgtg gcttgatcta agaacccagt

541
ctaccgttga gagtcttagt aaaagaattc caggaaataa taactttgtc aagtccaaga

601
caggccttcc acttagcact tacttcagtg cagtgaaact tcgttggctc cttgacaatg

661
tgagaaaagt tcaaaaggcc gttgaagaaa aacgagctct ttttgggact attgattcat

721
ggcttatttg gagtttgaca ggaggagtca atggaggtgt ccactgtaca gatgtaacaa

781
atgcaagtag gactatgctt ttcaacattc attctttgga atgggataaa caactctgcg

841
aattttttgg aattccaatg gaaattcttc caaatgtccg gagttcttct gagatctatg

901
gcctaatgaa aatctctcat agcgtgaaag ctggggcctt ggaaggtgtg ccaatatctg

961
ggtgtttagg ggaccagtct gctgcattgg tgggacaaat gtgcttccag attggacaag

1021
ccaaaaatac gtatggaaca ggatgtttct tactatgtaa tacaggccat aagtgtgtat

1081
tttctgatca tggccttctc accacagtgg cttacaaact tggcagagac aaaccagtat

1141
attatgcttt ggaaggttct gtagctatag ctggtgctgt tattcgctgg ctaagagaca

1201
atcttggaat tataaagacc tcagaagaaa ttgaaaaact tgctaaagaa gtaggtactt

1261

cttatggctg ctacttcgtc ccagcatttt cggggttata tgcaccttat tgggagccca

1321

gcgcaagagg gataatctgt ggactcactc agttcaccaa taaatgccat attgcttttg

1381

ctgcattaga agctgtttgt ttccaaactc gagagatttt ggatgccatg aatcgagact

1441

gtggaattcc actcagtcat ttgcaggtag atggaggaat

gaccagcaac aaaattctta

1501

tgcagctaca agcagacatt ctgtatatac cagtagtgaa gccctcaatg

cccgaaacca

1561

ctgcactggg tgcggctatg gcggcagggg ctgcagaagg agtcggcgta tggagtctcg

1621

aacccgagga tttgtctgcc gtcacgatgg agcggtttga acctcagatt aatgcggagg

1681

aaagtgaaat tcgttattct acatggaaga aagctgtgat gaagtcaatg ggttgggtta

1741

caactcaatc tccagaaagt ggtattccat aaaacctacc

aactcatgga ttcccaagat

1801

gtgagctttt tacataatga aagaacccag caattctgtc tcttaatgca

atgacactat

1861

tcatagactt tgattttatt tataagccac ttgctgcatg accctccaag tagacctgtg

1921

gcttaaaata aagaaaatgc agcaaaaaga atgctataga aatatttggt ggtttttttt

1981
ttttttaaac atccacagtt aaggttgggc cagctacctt tggggctgac cccctccatt

2041
gccataacat cctgctccat tccctctaag atgtaggaag aattcggatc cttaccattg

2101
gaatcttcca tcgaacatac tcaaacactt ttggaccagg atttgagtct ctgcatgaca

2161
tatacttgat taaaaggtta ttactaacct gttaaaaatc agcagctctt tgcttttaac

2221
agacacccta aaagtcttct tttctacata gttgaagaca gcaacatctt cactgaatgt

2281
ttgaatagaa acctctacta aattattaaa atagacattt agtgttctca cagcttggat

2341
atttttctga aaagttattt gccaaaactg aaatccttca gatgttttcc atggtcccac

2401
taattataat gactttctgt ctggatctta taggaaaaga tactttcttt tttcttccat

2461
ctttcctttt tatatttttt actttgtatg tataacatac atgcctatat attttataca

2521
ctgagggtag cccatttata aattaagagc acattatatt cagaaggttc taacagggct

2581
ggtcttaagt gaaccactgt gtatataaat atgttggaaa acagctgtat acatttttgg

2641
gcaacggtta tgcataatat ttaccaggag aatttttttc ttaacaagcc aacatttaaa

2701
atttatgttt tatgtcaata aaagaaaata tactttattg tgacttcaac tatatttctt

2761
atcccttaca tttttattta attgtcttag cttaaaaaaa gaagaaactg tggaatacta

2821
cagtaaatat tgttttcaaa cacaagcaat aattcaaata gttatttttc ttttgaatta

2881
attttagaca tattttggat cctattgagg ggataagagg atgtcaaaaa agttaaatac

2941
ctaagtagaa aaaaatatag aaataaagcc aagaatctct ttcagttcaa atgttatcaa

3001
ttgttaataa gaaattgcta tctgggatga cagaattacc tctgcttagt atctcattat

3061
aactgaaaga aggtttatca ttacaaatac cttccaatga aaccaagaat ttctcaaaat

3121
atttaatgtc acatattata agaagttacc taatcctgct tcttaacatc aatttttaaa

3181
aatatcttaa aattactttg ttttgtagta aacagtgaag aaaagattgc ctcctaatta

3241
tttttttcaa tgagtgctga atgggaaaac atttatatct tactataaaa ggttctgttt

3301
tgtttggaat caatggtagc tttattgact gttctgattg tgctgtttct aatttattga

3361
atctgctagg ttttattgat gcagccacca cttaagtgac ataaatatta tagaaaggta

3421
ctgtgaaatg atcactttgt ggcaggggta cttttaaaca taaatgtttc tacaaaagta

3481
ggttgagttc attgtaaata attgtgaaag ccactgttca aataatttta agattacatt

3541
aatttttcta taaattggaa gatttataaa tgtttgaaat tgtacacatt gatatttaat

3601
gacaaattta cttaaaataa attgacccct tgttcttact tgcatttctc atttacagac

3661
tagaacttag ttgaaagtta aattaagaaa gatgtttcag aggccgggca cggtggctga

3721
cgcctgtaat cccagcactt tgggaggccg aggtgggcag atcacctgag gtcaggagtt

3781
cgggactagc ctaaccgaca tggagaaacc ctatctctac taaaaaaaaa aaaagatgtt

3841
tcaggacatg tgaaacttgg ctgttagcgc ttgatagggc acactctgaa gagttaacca

3901
acagccaaag aagtaatttc tgtaatgatg aacactttaa tcattctatt agaagaaact

3961
acactgtccc atctcagcat ttgcaaaaaa taatgttggt aaggtcagca gccattatca

4021
acagggcctt gcatggctaa ctttgaccac catttttctc tcaacctgat aggcaacacc

4081
tcaatccttt gttctccaac taatcagtaa aataagtaat gcatctctgc ttctgtaatg

4141
atatcttaga atttttagta tgtttctttt gaagtgccca aagcccaatt ctttgggata

4201
tcttttgggt atctggtatc atgtgggagt gaagaaagaa agtttttgga gaaaccaaca

4261
aatgaaagct gtgatagcac agaagctaat ggcattgaca gtggagtagg tagtatttaa

4321
tctgtagtgt ttacaacata gtagatagaa gtacaaaaat ttttttaact ataactcttt

4381
aatagcttgt tttatctagt aatatttaaa taatgaagtt tccttgatcc tttgcttttg

4441
caacctaaca actttaataa taagttcaca caataaacaa attagtagaa aaaaaaaaaa

4501
aaa

B10 NELL2-NEL-like 2 (chicken) mRNA NM_001145107.1

(SEQ ID NO: 146)

1
ctctacctac tttgcccagc tccacctcgg cagtgcagcg tgttttggtg gccttcctcc

61
gcacgccctg gagggggagt gccctgcacc ccggggctgc tccggagccc agtgcacgag

121
tgcacatggg cttccctcct ttgcttaaag ggcaggcgag cgctactcgc tccagccttg

181
cctcctgcag ctgggtggtc ttttttctct cctgtctttc aagacacgcg cccgaaatcg

241
agggagggag acgatggact gagctgatcc gcaccatgga gtctcgggtc ttactgagaa

301
cattctgttt gatcttcggt ctcggagcag tttgggggct tggtgtggac ccttccctac

361
agattgacgt cttaacagag ttagaacttg gggagtccac gaccggagtg cgtcaggtcc

421
cggggctgca taatgggacg aaagcctttc tctttcaaga tactcccaga agcataaaag

481
catccactgc tacagctgaa cagttttttc agaagctgag aaataaacat gaatttacta

541
ttttggtgac cctaaaacag acccacttaa attcaggagt tattctctca attcaccact

601
tggatcacag gtacctggaa ctggaaagta gtggccatcg gaatgaagtc agactgcatt

661
accgctcagg cagtcaccgc cctcacacag aagtgtttcc ttacattttg gctgatgaca

721
agtggcacaa gctctcctta gccatcagtg cttcccattt gattttacac attgactgca

781
ataaaattta tgaaagggta gtagaaaagc cctccacaga cttgcctcta ggcacaacat

841
tttggctagg acagagaaat aatgcgcatg gatattttaa gggtataatg caagatgtcc

901
aattacttgt catgccccag ggatttattg ctcagtgccc agatcttaat cgcacctgtc

961
caacttgcaa tgacttccat ggacttgtgc agaaaatcat ggagctacag gatattttag

1021
ccaaaacatc agccaagctg tctcgagctg aacagcgaat gaatagattg gatcagtgct

1081
attgtgaaag gacttgcacc atgaagggaa ccacctaccg agaatttgag tcctggatag

1141
acggctgtaa gaactgcaca tgcctgaatg gaaccatcca gtgtgaaact ctaatctgcc

1201
caaatcctga ctgcccactt aagtcggctc ttgcgtatgt ggatggcaaa tgctgtaagg

1261
aatgcaaatc gatatgccaa tttcaaggac gaacctactt tgaaggagaa agaaatacag

1321
tctattcctc ttctggagta tgtgttctct atgagtgcaa ggaccagacc atgaaacttg

1381
ttgagagttc aggctgtcca gctttggatt gtccagagtc tcatcagata accttgtctc

1441
acagctgttg caaagtttgt aaaggttatg acttttgttc tgaaaggcat aactgcatgg

1501
agaattccat ctgcagaaat ctgaatgaca gggctgtttg tagctgtcga gatggtttta

1561
gggctcttcg agaggataat gcctactgtg aagacatcga tgagtgtgct gaagggcgcc

1621
attactgtcg tgaaaataca atgtgtgtca acaccccggg ttcttttatg tgcatctgca

1681
aaactggata catcagaatt gatgattatt catgtacaga acatgatgag tgtatcacaa

1741
atcagcacaa ctgtgatgaa aatgctttat gcttcaacac tgttggagga cacaactgtg

1801
tttgcaagcc gggctataca gggaatggaa cgacatgcaa agcattttgc aaagatggct

1861
gtaggaatgg aggagcctgt attgccgcta atgtgtgtgc ctgcccacaa ggcttcactg

1921
gacccagctg tgaaacggac attgatgaat gctctgatgg ttttgttcaa tgtgacagtc

1981
gtgctaattg cattaacctg cctggatggt accactgtga gtgcagagat ggctaccatg

2041
acaatgggat gttttcacca agtggagaat cgtgtgaaga tattgatgag tgtgggaccg

2101
ggaggcacag ctgtgccaat gataccattt gcttcaattt ggatggcgga tatgattgtc

2161
gatgtcctca tggaaagaat tgcacagggg actgcatcca tgatggaaaa gttaagcaca

2221
atggtcagat ttgggtgttg gaaaatgaca ggtgctctgt gtgctcatgt cagaatggat

2281
tcgttatgtg tcgacggatg gtctgtgact gtgagaatcc cacagttgat cttttttgct

2341
gccctgaatg tgacccaagg cttagtagtc agtgcctcca tcaaaatggg gaaactttgt

2401

ataacagtgg tgacacctgg gtccagaatt gtcaacagtg ccgctgcttg caaggggaag

2461

ttgattgttg gcccctgcct tgcccagatg tggagtgtga attcagcatt ctcccagaga

2521

atgagtgctg

cccgcgctgt gtcacagacc cttgccaggc tgacaccatc cgcaatgaca

2581

tcaccaagac ttgcctggac gaaatgaatg tggttcgctt caccgggtcc tcttggatca

2641

aacatggcac tgagtgtact ctctgccagt gcaagaatgg ccacatctgt tgctcagtgg

2701

atccacagtg ccttcaggaa ctgtgaagtt aactgtctca tgggagattt ctgttaaaag

2761

aatgttcttt cattaaaaga ccaaaaagaa gttaaaactt aaattgggtg atttgtgggc

2821

agctaaatgc agctttgtta atagctgagt gaactttcaa ttatgaaatt tgtggagctt

2881

gacaaaatca caaaaggaaa attactgggg caaaattaga cctcaagtct gcctctactg

2941

tgtctcacat caccatgtag aagaatgggc gtacagtata

taccgtgaca tcctgaaccc

3001

tggatagaaa gcctgagccc attggatctg tgaaagcctc tagcttcact

ggtgcagaaa

3061

attttcctct agatcagaat cttcaagaat cagttaggtt cctcactgca agaaataaaa

3121
tgtcaggcag tgaatgaatt atattttcag aagtaaagca aagaagctat aacatgttat

3181
gtacagtaca ctctgaaaag aaatctgaaa caagttattg taatgataaa aataatgcac

3241
aggcatggtt acttaatatt ttctaacagg aaaagtcatc cctatttcct tgttttactg

3301
cacttaatat tatttggttg aatttgttca gtataagctc gttcttgtgc aaaattaaat

3361
aaatatttct cttaccttat aacac

B11 S100A11 S100 calcium binding protein A11 mRNA NM_005620.1

(SEQ ID NO: 147)

1
gggcaaggct gggccgggaa gggcgtgggt tgaggagagg ctccagaccc gcacgccgcg

61
cgcacagagc tctcagcgcc gctcccagcc acagcctccc gcgcctcgct cagctccaac

121
atggcaaaaa tctccagccc tacagagact gagcggtgca tcgagtccct gattgctgtc

181
ttccagaagt atgctggaaa ggatggttat aactacactc tctccaagac agagttccta

241
agcttcatga atacagaact agctgccttc acaaagaacc agaaggaccc tggtgtcctt

301

gaccgcatga tgaagaaact ggacaccaac agtgatggtc agctagattt ctcagaattt

361

cttaatctga

ttggtggcct agctatggct tgccatgact ccttcctcaa ggctgtccct

421

tcccagaagc ggacctgagg

accccttggc cctggccttc aaacccaccc cctttccttc

481

cagcctttct gtcatcatct ccacagccca cccatcccct gagcacacta accacctcat

541

gcaggcccca

cctgccaata gtaataaagc aatgtcactt ttttaaaaca tgaaa

B12 SAMD9L-sterile alpha motif domain containing 9-like mRNA

NM_152703.2

(SEQ ID NO: 148)

1
aaagtcagag tactgggaga acagaagact tcacaattta atgcctcagt ttttaaaaaa

61
ggatccttac acttcatgtc tcctagccat cagaagagga atgagacagc aaaagttcaa

121
atggcctgtt tcaagtttct gatataaaac gatgacattt tcaggaaaat cctgcatttc

181
cagagagaga ctggctggtt aaatttctga aagaggacac cagctaaaag aaggtattgc

241
atctcacccg agcagactgt gtctgtggaa agtgtaagcc ccttgccaga agagcagctt

301
cccagcaaag gcagagggtg aaaacagcaa aggtcttaag acactgggga cctagagtca

361
aaagggacct cctccaggga aaacgctgtg tgagaaatgg cctcattcgg tgactgtgag

421
tgacacagca gaaagttggg tcattccggc tgcttttttg agaagtccct gaagagatca

481

ataacagcaa gagggaacct ggcaaggaag ctattcctat aatccaggaa agagatgagg

541

aaggcttgga

ccaggtggta gtggtgtcag gtagtcaaat gctgggtata ttttgaagat

601

acaccccata ggatttgctc cacattgaat gtggaatgct ggaagagaga taaagtgtac

661

ctgtcacata ctttttgagt tttatttatt ttcttagaag taagtacaca aagagatgct

721

acctaggaga agggtattct tttcactatt ctttcaaatt ttctgtatgt tcgaacattt

781

tcatagtaga aagttggggg gaaaatctgt ttcataaaca tttcctcagc agcagtccag

841

tctattgcat tttaattggt tgtgatatca ttgttttatg caatacgttc tcaacaagta

901

tatcctccgg caaactgaac aaggaccaag tctgttctgc ctacagctct gcttcctcat

961

agctgctttc cagaacgtga ctcttgcaaa ttatcaagaa aggggaacta atctaaggga

1021

tccagatcaa acagcctcat gaagacttat tttatgtttc taatataaag atagaagttt

1081

tcagaaaagc cctgctacac agaggatcag agcaggggtg ggcctgctgg gctgcagctg

1141

ggattctgag catcctttcc cggaggcacg gaaagtgagt gagtgagccc agtgaggaag

1201

aagttgaagc tttgatatga gtaaacaagt atctctacct gaaatgatta aagactggac

1261

caaagagcat gtgaaaaaat gggtaaatga agaccttaag attaatgagc aatacgggca

1321

aattctgctc agtgaagaag taacaggatt agtcctgcag gaattaactg agaaggacct

1381

tgtagaaatg gggctaccat ggggtccagc acttttgata aaacgttcat acaacaaatt

1441

gaatagtaag tcccctgaaa gtgacaatca tgatccggga caattagata attcaaaacc

1501

gtccaaaaca gaacaccaga aaaatccaaa acacaccaaa aaggaagaag aaaattcaat

1561

gtcatctaat attgattatg atcccagaga gatcagagat atcaaacaag aagaatcaat

1621

tcttatgaaa gaaaatgtgt tagatgaagt agcaaatgct aaacacaaga aaaagggtaa

1681

gctaaaacct gaacaattga cttgtatgcc atatcctttt gatcagttcc atgacagcca

1741

tcgctacata gaacattata ctctacaacc tgaaacagga gcactcaatc tcattgatcc

1801

aatacatgag ttcaaagctc tcacaaacac agaaacagcc acggaagtgg acattaagat

1861

gaaattcagc aatgaagtct tccgatttgc atcagcttgt atgaattcac gcaccaatgg

1921

caccatccat tttggagtca aggacaaacc ccatggagaa attgttggtg tgaaaatcac

1981

cagtaaggct gccttcattg accacttcaa tgtaatgatc aaaaagtatt ttgaagaaag

2041

tgagatcaat gaagccaaga agtgtattcg ggagccaagg tttgtggaag tccttctgca

2101

gaacaataca ccatctgaca gatttgtcat tgaagttgat actattccaa aacactctat

2161

atgtaatgat aagtatttct acattcagat gcaaatttgt aaagataaaa tatggaaaca

2221

aaaccaaaat ctttcactgt ttgtaagaga aggggctagc tctagggata tcctggccaa

2281

ttccaagcaa cgggatgtag atttcaaggc atttttacaa aatttaaagt cactggtagc

2341

atctagaaaa gaggctgaag aagagtatgg aatgaaggca atgaagaagg agagtgaagg

2401

actaaagctg gttaaacttc tcataggaaa ccgagactca ctggataatt catactatga

2461

ctggtacatt cttgtaacaa ataaatgcca tccaaaccaa ataaagcact tagatttttt

2521

aaaagaaatt aaatggtttg ctgtgttgga gtttgatcct gaatctatga tcaatggagt

2581

ggtcaaagct tacaaagaaa gtcgggtggc aaaccttcac tttccaaatc aatatgaaga

2641

caagacaact aacatgtggg agaagatttc tactcttaat ctttaccaac agcccagctg

2701

gattttctgc aacggcagat cagacctgaa aagcgagaca tataaacctc tagaaccaca

2761

tttatggcag agagaaagag cttcagaagt caggaaacta attttatttc tcacagatga

2821

aaatataatg acaagaggaa aatttttggt agtgtttcta ttactctctt cagtggaaag

2881

cccaggagat ccactcattg aaactttctg ggctttctat caagctctca aaggaatgga

2941

aaatatgttg tgtatctctg taaactcaca tatttatcaa cgatggaaag atctactaca

3001

aacaagaatg aagatggaag atgaactaac aaaccacagt atttccactt taaatataga

3061

actggtaaac agcactatcc ttaaactaaa atcggtgact cggtcatcaa gaaggttttt

3121

gcccgcccgt ggatcttctt cagttatcct agagaaaaag aaagaggatg tcttgactgc

3181

actggaaatc ctctgtgaaa atgagtgtac agagacagac atcgagaaag acaaatctaa

3241

attcctggag

tttaagaaat caaaagaaga acacttttat cgaggtggca aagtatcctg

3301
gtggaacttc tatttttctt ctgaaaacta ttcttcagat tttgttaaaa gggacagtta

3361
tgaaaagctt aaagatttaa tacactgctg ggcagagtct cctaaaccaa tatttgcaaa

3421
aatcatcaat ctttatcatc atccaggctg tggaggtacc acactggcta tgcatgttct

3481
ctgggactta aagaaaaact tcagatgtgc tgtgttaaaa aacaagacaa ctgattttgc

3541
agaaattgca gagcaagtga tcaatctggt cacctatagg gcaaagagcc atcaggatta

3601
cattcctgtg cttctccttg tggatgattt tgaagaacaa gaaaatgtct actttctaca

3661
aaatgccatc cattccgttt tagcagaaaa ggatttgcga tatgaaaaaa cattggtaat

3721
tatcttaaac tgcatgagat cccggaatcc agatgaaagt gcaaaattgg cagacagtat

3781
tgcactaaat taccaacttt cttccaagga acaaagagct tttggtgcca aactgaagga

3841
aattgaaaag cagcacaaga actgtgaaaa cttttattcc ttcatgatca tgaaaagcaa

3901
ttttgatgaa acatatatag aaaatgtagt caggaatatc ctaaaaggac aggatgttga

3961
cagcaaggaa gcacaactca tttccttcct ggctttactc agctcttatg ttactgactc

4021
tacaatttca gtttcacagt gtgaaatatt tttgggaatc atatacacta gtacaccctg

4081
ggaacctgaa agcttagaag acaagatggg aacttattct acacttctaa taaaaacaga

4141
agttgcagaa tatgggagat acacaggtgt gcgtatcatt caccctctga ttgccctgta

4201
ctgtctaaaa gaactggaaa gaagctatca cttggataaa tgtcaaattg cattgaatat

4261
attagaagag aatttattct atgattctgg aataggaaga gacaaatttc aacatgatgt

4321
tcaaactctt ctgcttacaa gacagcgcaa ggtgtatgga gatgaaacag acactctgtt

4381
ttccccatta atggaagctt tacagaataa agacattgaa aaggtcttga gtgcaggaag

4441
tagacgattc ccacaaaatg cattcatttg tcaagcctta gcaagacatt tctacattaa

4501
agagaaggac tttaacacag ctctggactg ggcacgtcag gccaaaatga aagcacctaa

4561
aaattcctat atttcagata cactaggtca agtctacaaa agtgaaatca aatggtggtt

4621
ggatgggaac aaaaactgta ggagcattac tgttaatgac ctaacacatc tcctagaagc

4681
tgcggaaaaa gcctcaagag ctttcaaaga atcccaaagg caaactgata gtaaaaacta

4741
tgaaaccgag aactggtcac cacagaagtc ccagagacga tatgacatgt ataacacagc

4801
ttgtttcttg ggtgaaatag aagttggtct ttacactatc cagattcttc agctcactcc

4861
ctttttccac aaagaaaatg aattatccaa aaaacatatg gtgcaatttt tatcaggaaa

4921
gtggaccatt cctcctgatc ccagaaatga atgttatttg gctcttagca agttcacatc

4981
ccacctaaaa aatttacaat cagatctgaa aaggtgcttt gactttttta ttgattatat

5041
ggttcttctg aaaatgaggt atacccaaaa agaaattgca gaaatcatgt taagcaagaa

5101
agtcagtcgt tgtttcagga aatacacaga acttttctgt catttggatc catgtctatt

5161
acaaagtaaa gagagtcaat tactccagga ggagaattgc aggaaaaagc tagaagctct

5221
gagagcagat aggtttgctg gactcttgga atatcttaat ccaaactaca aagatgctac

5281
caccatggaa agtatagtga atgaatatgc cttcctactg cagcaaaact caaaaaagcc

5341
catgacaaat gagaaacaaa attccatttt ggccaacatt attctgagtt gtctaaagcc

5401
caactccaag ttaattcaac cacttaccac gctaaaaaaa caactccgag aggtcttgca

5461
atttgtagga ctaagtcatc aatatccagg tccttatttc ttggcctgcc tcctgttctg

5521
gccagaaaat caagagctag atcaagattc caaactaata gaaaagtatg tttcatcctt

5581
aaatagatcc ttcaggggac agtacaagcg catgtgcagg tccaagcagg caagcacact

5641
tttctatctg ggcaaaagga agggtctaaa cagtattgtt cacaaggcca aaatagagca

5701
gtactttgat aaagcacaaa atacaaattc cctctggcac agtggggatg tgtggaaaaa

5761
aaatgaagtc aaagacctcc tgcgtcgtct aactggtcag gctgaaggca agctaatctc

5821
tgtagaatat ggaacagagg aaaaaataaa aataccagta atatctgttt attcaggtcc

5881
actcagaagt ggtaggaaca tagaaagagt gtctttctac ctaggatttt ccattgaagg

5941
ccctctggca tatgatatag aagtaattta agacaataca tcacctgtag ttcaaatacg

6001
tttatttata tctttatgat tttattctct ctctctattc tcatggcact ttcataacat

6061
tatggctaac ctctaattac agattttgct tttgcctccc tgaatgaatt acaagccttt

6121
ttaagatatg aaatatgcct acccgcagag cttggcacaa agtggagtca atcttttaat

6181
gttttaaata tgcattttca gactcaaata attaagaagt ttcattgata tccactggtc

6241
acatcataac tgtctatagg gcaataaaat ctgtgttaaa ctcaattgct tttataagtt

6301
ttctaaatta tttcttcact gtgacagcaa agatttaaat aagatgaatg taaaagagaa

6361
agcttattgg actcaaaccc acagatccac accagagttc tatttacctc atcttggtat

6421
caataaaaac ttatgtggaa ggtaaatata ttgttcccca tccaccacat aacactctcc

6481
ccaacacaca cacacacaca cacacacaca cacacacaca cacacactcc ttgtacccct

6541
tgcccttctc ccagctcatt gctccaggag agagaagagt tcaaaaaata aagtaatcat

6601
aaacttgaac tctctccatt ctcttgttcc catttacagg tgaatctctt cctttaagcc

6661
atttttgtct cctgtgaata cagccttatc tccacctgtt tcttagatcc catctcccct

6721
ggcttatttt ttccattcat taccctcttt gttcccttta cttctcaacc tgtgctatat

6781
acatgctgtt ctctctgttg agattgcctt atttccatct aacattctct ctcctgctat

6841
tctgatttgt cattcacaac tgatttcaag agtcaccttc accaggaagt cttccttgac

6901
caccatcatt cctgcctgat tagagggctt cctcatggta atatgtgttc tcaagttttc

6961
agtgtcaagg aatgccatcc cagaagctca ttctcagatg cacaacagcc agaacagtct

7021
caagcagcat tctagagctt ggaatttaag aactacgcat tgcctataaa gtgaaacata

7081
ggctaatata gattaaattg aatattgaat aaaaaatata tttatttatc cac

B13 STAT1-signal transducer and activator of transcription 1-alpha/beta

isoform alpha mRNA NM_007315.3

(SEQ ID NO: 149)

1
gctgagcgcg gagccgcccg gtgattggtg ggggcggaag ggggccgggc gccagcgctg

61
ccttttctcc tgccgggtag tttcgctttc ctgcgcagag tctgcggagg ggctcggctg

121
caccgggggg atcgcgcctg gcagacccca gaccgagcag aggcgaccca gcgcgctcgg

181
gagaggctgc accgccgcgc ccccgcctag cccttccgga tcctgcgcgc agaaaagttt

241
catttgctgt atgccatcct cgagagctgt ctaggttaac gttcgcactc tgtgtatata

301
acctcgacag tcttggcacc taacgtgctg tgcgtagctg ctcctttggt tgaatcccca

361
ggcccttgtt ggggcacaag gtggcaggat gtctcagtgg tacgaacttc agcagcttga

421
ctcaaaattc ctggagcagg ttcaccagct ttatgatgac agttttccca tggaaatcag

481
acagtacctg gcacagtggt tagaaaagca agactgggag cacgctgcca atgatgtttc

541
atttgccacc atccgttttc atgacctcct gtcacagctg gatgatcaat atagtcgctt

601
ttctttggag aataacttct tgctacagca taacataagg aaaagcaagc gtaatcttca

661
ggataatttt caggaagacc caatccagat gtctatgatc atttacagct gtctgaagga

721
agaaaggaaa attctggaaa acgcccagag atttaatcag gctcagtcgg ggaatattca

781
gagcacagtg atgttagaca aacagaaaga gcttgacagt aaagtcagaa atgtgaagga

841
caaggttatg tgtatagagc atgaaatcaa gagcctggaa gatttacaag atgaatatga

901
cttcaaatgc aaaaccttgc agaacagaga acacgagacc aatggtgtgg caaagagtga

961
tcagaaacaa gaacagctgt tactcaagaa gatgtattta atgcttgaca ataagagaaa

1021
ggaagtagtt cacaaaataa tagagttgct gaatgtcact gaacttaccc agaatgccct

1081
gattaatgat gaactagtgg agtggaagcg gagacagcag agcgcctgta ttggggggcc

1141
gcccaatgct tgcttggatc agctgcagaa ctggttcact atagttgcgg agagtctgca

1201
gcaagttcgg cagcagctta aaaagttgga ggaattggaa cagaaataca cctacgaaca

1261
tgaccctatc acaaaaaaca aacaagtgtt atgggaccgc accttcagtc ttttccagca

1321
gctcattcag agctcgtttg tggtggaaag acagccctgc atgccaacgc accctcagag

1381
gccgctggtc ttgaagacag gggtccagtt cactgtgaag ttgagactgt tggtgaaatt

1441
gcaagagctg aattataatt tgaaagtcaa agtcttattt gataaagatg tgaatgagag

1501
aaatacagta aaaggattta ggaagttcaa cattttgggc acgcacacaa aagtgatgaa

1561
catggaggag tccaccaatg gcagtctggc ggctgaattt cggcacctgc aattgaaaga

1621
acagaaaaat gctggcacca gaacgaatga gggtcctctc atcgttactg aagagcttca

1681
ctcccttagt tttgaaaccc aattgtgcca gcctggtttg gtaattgacc tcgagacgac

1741
ctctctgccc gttgtggtga tctccaacgt cagccagctc ccgagcggtt gggcctccat

1801
cctttggtac aacatgctgg tggcggaacc caggaatctg tccttcttcc tgactccacc

1861
atgtgcacga tgggctcagc tttcagaagt gctgagttgg cagttttctt ctgtcaccaa

1921
aagaggtctc aatgtggacc agctgaacat gttgggagag aagcttcttg gtcctaacgc

1981
cagccccgat ggtctcattc cgtggacgag gttttgtaag gaaaatataa atgataaaaa

2041
ttttcccttc tggctttgga ttgaaagcat cctagaactc attaaaaaac acctgctccc

2101
tctctggaat gatgggtgca tcatgggctt catcagcaag gagcgagagc gtgccctgtt

2161
gaaggaccag cagccgggga ccttcctgct gcggttcagt gagagctccc gggaaggggc

2221
catcacattc acatgggtgg agcggtccca gaacggaggc gaacctgact tccatgcggt

2281

tgaaccctac acgaagaaag aactttctgc tgttactttc

cctgacatca ttcgcaatta

2341

caaagtcatg gctgctgaga atattcctga gaatcccctg aagtatctgt

atccaaatat

2401

tgacaaagac catgcctttg gaaagtatta ctccaggcca aaggaagcac cagagccaat

2461

ggaacttgat ggccctaaag gaactggata tatcaagact gagttgattt ctgtgtctga

2521

agttcaccct tctagacttc agaccacaga caacctgctc cccatgtctc ctgaggagtt

2581

tgacgaggtg tctcggatag tgggctctgt agaattcgac agtatgatga acacagtata

2641

gagcatgaat ttttttcatc ttctctggcg acagttttcc ttctcatctg tgattccctc

2701

ctgctactct gttccttcac atcctgtgtt tctagggaaa tgaaagaaag gccagcaaat

2761

tcgctgcaac ctgttgatag caagtgaatt tttctctaac tcagaaacat cagttactct

2821

gaagggcatc atgcatctta ctgaaggtaa aattgaaagg cattctctga agagtgggtt

2881

tcacaagtga aaaacatcca

gatacaccca aagtatcagg acgagaatga gggtcctttg

2941

ggaaaggaga agttaagcaa catctagcaa

atgttatgca taaagtcagt gcccaactgt

3001

tataggttgt tggataaatc agtggttatt tagggaactg cttgacgtag gaacggtaaa

3061

tttctgtggg agaattctta catgttttct ttgctttaag tgtaactggc agttttccat

3121

tggtttacct gtgaaatagt tcaaagccaa gtttatatac aattatatca gtcctctttc

3181
aaaggtagcc atcatggatc tggtaggggg aaaatgtgta ttttattaca tctttcacat

3241
tggctattta aagacaaaga caaattctgt ttcttgagaa gagaatatta gctttactgt

3301
ttgttatggc ttaatgacac tagctaatat caatagaagg atgtacattt ccaaattcac

3361
aagttgtgtt tgatatccaa agctgaatac attctgcttt catcttggtc acatacaatt

3421
atttttacag ttctcccaag ggagttaggc tattcacaac cactcattca aaagttgaaa

3481
ttaaccatag atgtagataa actcagaaat ttaattcatg tttcttaaat gggctacttt

3541
gtcctttttg ttattagggt ggtatttagt ctattagcca caaaattggg aaaggagtag

3601
aaaaagcagt aactgacaac ttgaataata caccagagat aatatgagaa tcagatcatt

3661
tcaaaactca tttcctatgt aactgcattg agaactgcat atgtttcgct gatatatgtg

3721
tttttcacat ttgcgaatgg ttccattctc tctcctgtac tttttccaga cacttttttg

3781
agtggatgat gtttcgtgaa gtatactgta tttttacctt tttccttcct tatcactgac

3841
acaaaaagta gattaagaga tgggtttgac aaggttcttc ccttttacat actgctgtct

3901
atgtggctgt atcttgtttt tccactactg ctaccacaac tatattatca tgcaaatgct

3961
gtattcttct ttggtggaga taaagatttc ttgagttttg ttttaaaatt aaagctaaag

4021
tatctgtatt gcattaaata taatatgcac acagtgcttt ccgtggcact gcatacaatc

4081
tgaggcctcc tctctcagtt tttatataga tggcgagaac ctaagtttca gttgatttta

4141
caattgaaat gactaaaaaa caaagaagac aacattaaaa caatattgtt tctaattgct

4201
gaggtttagc tgtcagttct ttttgccctt tgggaattcg gcatggtttc attttactgc

4261
actagccaag agactttact tttaagaagt attaaaattc taaaattcaa aaaaaaaaaa

4321
aaaaaa

B14 TLR6-toll-like receptor 6 mRNA NM_006068.4

(SEQ ID NO: 150)

1
aattgtattt ccgttcattt acaagttatt ttctcttctt ctgaaaaaga gatcttgaat

61
ttggactcat atcaagatgc tctgaagaag aacaaccctt taggatagcc actgcaacat

121
catgaccaaa gacaaagaac ctattgttaa aagcttccat tttgtttgcc ttatgatcat

181
aatagttgga accagaatcc agttctccga cggaaatgaa tttgcagtag acaagtcaaa

241
aagaggtctt attcatgttc caaaagacct accgctgaaa accaaagtct tagatatgtc

301
tcagaactac atcgctgagc ttcaggtctc tgacatgagc tttctatcag agttgacagt

361
tttgagactt tcccataaca gaatccagct acttgattta agtgttttca agttcaacca

421
ggatttagaa tatttggatt tatctcataa tcagttgcaa aagatatcct gccatcctat

481
tgtgagtttc aggcatttag atctctcatt caatgatttc aaggccctgc ccatctgtaa

541
ggaatttggc aacttatcac aactgaattt cttgggattg agtgctatga agctgcaaaa

601
attagatttg ctgccaattg ctcacttgca tctaagttat atccttctgg atttaagaaa

661
ttattatata aaagaaaatg agacagaaag tctacaaatt ctgaatgcaa aaacccttca

721
ccttgttttt cacccaacta gtttattcgc tatccaagtg aacatatcag ttaatacttt

781
agggtgctta caactgacta atattaaatt gaatgatgac aactgtcaag ttttcattaa

841
atttttatca gaactcacca gaggttcaac cttactgaat tttaccctca accacataga

901
aacgacttgg aaatgcctgg tcagagtctt tcaatttctt tggcccaaac ctgtggaata

961
tctcaatatt tacaatttaa caataattga aagcattcgt gaagaagatt ttacttattc

1021
taaaacgaca ttgaaagcat tgacaataga acatatcacg aaccaagttt ttctgttttc

1081
acagacagct ttgtacaccg tgttttctga gatgaacatt atgatgttaa ccatttcaga

1141
tacacctttt atacacatgc tgtgtcctca tgcaccaagc acattcaagt ttttgaactt

1201
tacccagaac gttttcacag atagtatttt tgaaaaatgt tccacgttag ttaaattgga

1261
gacacttatc ttacaaaaga atggattaaa agaccttttc aaagtaggtc tcatgacgaa

1321
ggatatgcct tctttggaaa tactggatgt tagctggaat tctttggaat ctggtagaca

1381
taaagaaaac tgcacttggg ttgagagtat agtggtgtta aatttgtctt caaatatgct

1441
tactgactct gttttcagat gtttacctcc caggatcaag gtacttgatc ttcacagcaa

1501
taaaataaag agcgttccta aacaagtcgt aaaactggaa gctttgcaag aactcaatgt

1561
tgctttcaat tctttaactg accttcctgg atgtggcagc tttagcagcc tttctgtatt

1621
gatcattgat cacaattcag tttcccaccc atcggctgat ttcttccaga gctgccagaa

1681
gatgaggtca ataaaagcag gggacaatcc attccaatgt acctgtgagc taagagaatt

1741
tgtcaaaaat atagaccaag tatcaagtga agtgttagag ggctggcctg attcttataa

1801

gtgtgactac ccagaaagtt atagaggaag cccactaaag gactttcaca tgtctgaatt

1861

atcctgcaac ataactctgc tgatcgtcac catcggtgcc accatgctgg tgttggctgt

1921

gactgtgacc tccctctgca tctacttgga tctgccctgg tatctcagga tggtgtgcca

1981

gtggacccag

actcggcgca gggccaggaa cataccctta gaagaactcc aaagaaacct

2041

ccagtttcat gcttttattt catatagtga acatgattct gcctgggtga aaagtgaatt

2101

ggtaccttac ctagaaaaag aagatataca gatttgtctt catgagagaa actttgtccc

2161

tggcaagagc attgtggaaa atatcatcaa ctgcattgag aagagttaca agtccatctt

2221

tgttttgtct cccaactttg tccagagtga gtggtgccat tacgaactct attttgccca

2281

tcacaatctc tttcatgaag gatctaataa cttaatcctc atcttactgg aacccattcc

2341

acagaacagc

attcccaaca agtaccacaa gctgaaggct ctcatgacgc agcggactta

2401

tttgcagtgg cccaaggaga

aaagcaaacg tgggctcttt tgggctaaca ttagagccgc

2461
ttttaatatg aaattaacac tagtcactga aaacaatgat gtgaaatctt aaaaaaattt

2521
aggaaattca acttaagaaa ccattattta cttggatgat ggtgaatagt acagtcgtaa

2581
gtaactgtct ggaggtgcct ccattatcct catgccttca ggaaagactt aacaaaaaca

2641
atgtttcatc tggggaactg agctaggcgg tgaggttagc ctgccagtta gagacagccc

2701
agtctcttct ggtttaatca ttatgtttca aattgaaaca gtctcttttg agtaaatgct

2761
cagtttttca gctcctctcc actctgcttt cccaaatgga ttctgttgtg agcaagagtt

2821
tatatggctt catggcagca agggaacagt caacttcagc atcatatgca ccagtcctcg

2881
gagtgccctg tgaatcatat tggtctttgg gtcagtgtca tcattctctt caagtctggg

2941
gcttggggaa aaaattagat cagctacggc atataaaaaa gtcttttgtt tcacatatgt

3001
gtaatagctt atttaatttt ttatcctgct acacaaatat gtaattaacc aatgaggact

3061
catgacttga tagtgtatgt atgtaaaggg atatatggac ttaatcataa gctgttgagg

3121
tgaaagacgt ggatccacct gctttccaag aaaactcggc caaatttatt tgcagctgga

3181
tattgaatgg gacttttctg gttgtcttag aattctggct aaaggctcaa agctgacgaa

3241
agacagtaac tgcaccaaca tgatactaga cacagccagt ctggacttat caaaagagca

3301
gaaagagacc aatgactccc agtccgtatt atccatctct agaagactag agtcaaaagc

3361
gtgattaaag agtcattaag cggaggttct aggccatagg gagattgctt tgaatttctt

3421
gcagacaagt gtgagggact cagcatggta gaaggtagcc tggcatccca ctccaagact

3481
gaaagcttgc agagtaacag gagcacacag gttcagtgca gcagatgtgg tgtggcttga

3541
gaattcttgg aagagcttga tgagtgtttg ctggagtccg agggtgggca ctgggaacac

3601
agagactggt aaatagtgtt tggcaaatac aagtgcttga tgaatatttg ttgaatgaat

3661
agatgagttc ttcccccctg gggaattcag gaggtgaaag gttggcttga gcacccaaaa

3721
tggcaggatg agagaagaga agcactgata gcaacctgcc ctcccattat tgacatggta

3781
aaaggatgtg aatttcttca catggctttg actatggaag agtagctggg cttgcattgt

3841
catgacggga tatcagccaa cagggtagcc tgttgtgcaa agaaactata gcagtaagag

3901
gacacggggt taggcagaag aggggtttgg ggtggaggtt gctgcaagag gtcagccaga

3961
taatgtggcc ctgcatcatg gaactgtgca atgtggggta cactcaaggc cctccaataa

4021
ctcacagatg tgccctatga aaaagccagc atttggactc tgccatagca gctggcagga

4081
tcatgctggc ctgtctgcct tattcaatag ttaactacag gaagatctgc tcctctttgt

4141
gtaataccct cttcccttgc aatggcatag ggacatctag aatatagaga agacagagac

4201
aatggaggaa gagtaaagaa actgactata tgccttcgtc atttcactgc aaggaaggcc

4261
aagcagattt ttgaatgagg tgtgagattg ctgttaaatt ggactggcct ggacatttta

4321
atcccttaaa tagaggtgca atgactaaag tgagatttgt cactaaaatt tatggtatct

4381
gcccaagatt caggagtgat gatgggagga gatccaacag aactttgttg taaggcaatg

4441
gttagagaaa aatgaagccc tcgctttctg gacttagttc attcaataaa ccagtttcgg

4501
ccaggcacgt tggctcacat ctataatccc agtactgtgg gaggctgagg caggtggatc

4561
acttgaggtc aggagttcga gaccagcctg gccaacatgg tgaaaccctg tctgtactaa

4621
aaatacaaaa attagccggg tgtggtggtg tgcacctgta gtcccagcta ctcgggaggc

4681
tgaggcagga aaatcacttg aacctgggag acagaggctg tagtgagctg agacagcgct

4741
actgtactcc ccgctgggca acagagtgag actccatctc aaaaaagtta aaagaaaaaa

4801
aatctggttt cataatagct gtaacgaaat aagccttaat gatattttat tagcatcatc

4861
ttctgtctgc attagccctt ccttgctctt caggagaaca acatttgttt tcctccctag

4921
gctctatccc aaacggcaca ttcttccaca acccctgttg aacagatttt ttaaactgtt

4981
gcctaatcta aaaacaataa aaacaacaaa caaccacagt aacaacaacg acaaaaaaaa

5041
ctgccacaga ttctaaataa tcagatcttt ttaaatggta tcaatgtttc ccacaaaata

5101
ttgttgacat tgaaaatata gaattttagc attaattttg ttaaacctac atcccctcgg

5161
cagaggggcc tccctgcatc ccagtggaaa gtaggttcct cacagtcctc tccgtcacat

5221
tcttcccatt tcttttcttc acagaacaca tcactgtcta aaattatctt gtttgcttag

5281
ttgcttactc atcttcttct tctctcctct gaagtctaag ctccaggaaa aagggagact

5341
tctccacctg ttccctgcct ctccccagtg ccgaggggac actgtgcacc ccattgtaga

5401
tgcgcagtaa aaactcgtgg gatgagcaaa tgactctgaa acggtcccat gcgggaaatg

5461
tccatgaagt cctggatttt atctaaaaag cccaggcagg ggggggcggg ggcggcgggg

5521
ctacagttcc acgctgagct gcctcctggc cgctcgtccc cgccgcagtg cctgggcggc

5581
ccgggcgccc gaccttggcc gtggacacct tcgcggtggg tgctgctcct ccccatctgc

5641
cactggaaga tgctggggcg acccggctcc aggtttagca ggacactgag aaaagggaat

5701
ggctgccttt cggaggctgg gtgagccctt ctctgtgcct cacctgcccg ccccacagcg

5761
gccctgcacc tcgtcccacg gggcccattg ccccggtagg atgcgcgctt ttgttttgag

5821
ggtcaggcat cttccctgcc gtcgtttctg ggaggttgaa aaattgatcc agaaagacct

5881
aaaacaaaaa a

B15 WARS-tryptophanyl-tRNA synthetase, cytoplasmic isoform a mRNA

NM_004184.3

(SEQ ID NO: 151)

1
tcgattctca agagggtttc attggtctca acctggcccc ccaggcaacc cacccctgat

61
tggacagtct catcaagaag gttggtcaag agctcaagtg tttctgagaa tctgggtgat

121
ttataagaaa cccttagctg aatgcagggt ggggagaacg aaagacaaaa gcatcttttt

181
tcagaaggga aactgaaaga aagaggggaa gagtattaaa gaccatttct ggctgggcag

241
ggcactctca gcagctcaac tgcccagcgt gaccagtggc cacctctgca gtgtcttcca

301
caacctggtc ttgactcgtc tgctgaacaa atcctctgac ctcaggccgg ctgtgaacgt

361
agttcctgag agatagcaaa catgcccaac agtgagcccg catctctgct ggagctgttc

421
aacagcatcg ccacacaagg ggagctcgta aggtccctca aagcgggaaa tgcgtcaaag

481
gatgaaattg attctgcagt aaagatgttg gtgtcattaa aaatgagcta caaagctgcc

541
gcgggggagg attacaaggc tgactgtcct ccagggaacc cagcacctac cagtaatcat

601
ggcccagatg ccacagaagc tgaagaggat tttgtggacc catggacagt acagacaagc

661
agtgcaaaag gcatagacta cgataagctc attgttcggt ttggaagtag taaaattgac

721
aaagagctaa taaaccgaat agagagagcc accggccaaa gaccacacca cttcctgcgc

781
agaggcatct tcttctcaca cagagatatg aatcaggttc ttgatgccta tgaaaataag

841
aagccatttt atctgtacac gggccggggc ccctcttctg aagcaatgca tgtaggtcac

901
ctcattccat ttattttcac aaagtggctc caggatgtat ttaacgtgcc cttggtcatc

961
cagatgacgg atgacgagaa gtatctgtgg aaggacctga ccctggacca ggcctatagc

1021
tatgctgtgg agaatgccaa ggacatcatc gcctgtggct ttgacatcaa caagactttc

1081
atattctctg acctggacta catggggatg agctcaggtt tctacaaaaa tgtggtgaag

1141
attcaaaagc atgttacctt caaccaagtg aaaggcattt tcggcttcac tgacagcgac

1201
tgcattggga agatcagttt tcctgccatc caggctgctc cctccttcag caactcattc

1261
ccacagatct tccgagacag gacggatatc cagtgcctta tcccatgtgc cattgaccag

1321
gatccttact ttagaatgac aagggacgtc gcccccagga tcggctatcc taaaccagcc

1381
ctgctgcact ccaccttctt cccagccctg cagggcgccc agaccaaaat gagtgccagc

1441
gaccccaact cctccatctt cctcaccgac acggccaagc agatcaaaac caaggtcaat

1501
aagcatgcgt tttctggagg gagagacacc atcgaggagc acaggcagtt tgggggcaac

1561
tgtgatgtgg acgtgtcttt catgtacctg accttcttcc tcgaggacga cgacaagctc

1621
gagcagatca ggaaggatta caccagcgga gccatgctca ccggtgagct caagaaggca

1681
ctcatagagg ttctgcagcc cttgatcgca gagcaccagg cccggcgcaa ggaggtcacg

1741
gatgagatag tgaaagagtt catgactccc cggaagctgt ccttcgactt tcagtagcac

1801

tcgttttaca tatgcttata aaagaagtga tgtatcagta atgtatcaat aatcccagcc

1861

cagtcaaagc accgccacct gtaggcttct gtctcatggt aattactggg

cctggcctct

1921

gtaagcctgt gtatgttatc aatactgttt cttcctgtga gttccattat ttctatctct

1981

tatgggcaaa gcattgtggg taattggtgc tggctaacat tgcatggtcg gatagagaag

2041

tccagctgtg agtctctccc caaagcagcc ccacagtgga gcctttggct ggaagtccat

2101

gggccaccct gttcttgtcc atggaggact ccgagggttc caagtatact cttaagaccc

2161

actctgttta aaaatatata ttctatgtat gcgtatatgg aattgaaatg tcattattgt

2221

aacctagaaa gtgctttgaa atattgatgt ggggaggttt attgagcaca agatgtattt

2281

cagcccatgc cccctcccaa aaagaaattg ataagtaaaa gcttcgttat acatttgact

2341

aagaaatcac ccagctttaa agctgctttt aacaatgaag attgaacaga gttcagcaat

2401

tttgattaaa ttaagacttg ggggtgaaac tttccagttt actgaactcc agaccatgca

2461

tgtagtccac tccagaaatc atgctcgctt cccttggcac accagtgttc tcctgccaaa

2521

tgaccctaga ccctctgtcc tgcagagtca gggtggcttt tcccctgact gtgtccgatg

2581

ccaaggagtc ctggcctccg cagatgcttc attttgaccc ttggctgcag tggaagtcag

2641

cacagagcag

tgccctggct gtgtccctgg acgggtggac ttagctaggg agaaagtcga

2701

ggcagcagcc ctcgaggccc tcacagatgt ctaggcaggc ctcatttcat cacgcagcat

2761

gtgcaggcct ggaagagcaa agccaaatct cagggaagtc cttggttgat gtatctgggt

2821

ctcctctgga gcactctgcc ctcctgtcac ccagtagagt aaataaactt ccttggctcc

2881
tgct

B16 MMP9-matrix metallopeptidase 9, mRNA NM_0049942

(SEQ ID NO: 152)

1
agacacctct gccctcacca tgagcctctg gcagcccctg gtcctggtgc tcctggtgct

61
gggctgctgc tttgctgccc ccagacagcg ccagtccacc cttgtgctct tccctggaga

121
cctgagaacc aatctcaccg acaggcagct ggcagaggaa tacctgtacc gctatggtta

181
cactcgggtg gcagagatgc gtggagagtc gaaatctctg gggcctgcgc tgctgcttct

241
ccagaagcaa ctgtccctgc ccgagaccgg tgagctggat agcgccacgc tgaaggccat

301
gcgaacccca cggtgcgggg tcccagacct gggcagattc caaacctttg agggcgacct

361
caagtggcac caccacaaca tcacctattg gatccaaaac tactcggaag acttgccgcg

421
ggcggtgatt gacgacgcct ttgcccgcgc cttcgcactg tggagcgcgg tgacgccgct

481
caccttcact cgcgtgtaca gccgggacgc agacatcgtc atccagtttg gtgtcgcgga

541
gcacggagac gggtatccct tcgacgggaa ggacgggctc ctggcacacg cctttcctcc

601
tggccccggc attcagggag acgcccattt cgacgatgac gagttgtggt ccctgggcaa

661
gggcgtcgtg gttccaactc ggtttggaaa cgcagatggc gcggcctgcc acttcccctt

721
catcttcgag ggccgctcct actctgcctg caccaccgac ggtcgctccg acggcttgcc

781
ctggtgcagt accacggcca actacgacac cgacgaccgg tttggcttct gccccagcga

841
gagactctac acccaggacg gcaatgctga tgggaaaccc tgccagtttc cattcatctt

901
ccaaggccaa tcctactccg cctgcaccac ggacggtcgc tccgacggct accgctggtg

961
cgccaccacc gccaactacg accgggacaa gctcttcggc ttctgcccga cccgagctga

1021
ctcgacggtg atggggggca actcggcggg ggagctgtgc gtcttcccct tcactttcct

1081
gggtaaggag tactcgacct gtaccagcga gggccgcgga gatgggcgcc tctggtgcgc

1141

taccacctcg aactttgaca gcgacaagaa gtggggcttc tgcccggacc aaggatacag

1201

tttgttcctc

gtggcggcgc atgagttcgg ccacgcgctg ggcttagatc attcctcagt

1261

gccggaggcg ctcatgtacc ctatgtaccg cttcactgag gggcccccct tgcataagga

1321

cgacgtgaat ggcatccggc acctctatgg tcctcgccct gaacctgagc cacggcctcc

1381

aaccaccacc acaccgcagc ccacggctcc cccgacggtc tgccccaccg gaccccccac

1441

tgtccacccc tcagagcgcc ccacagctgg ccccacaggt cccccctcag ctggccccac

1501

aggtcccccc actgctggcc cttctacggc cactactgtg cctttgagtc cggtggacga

1561

tgcctgcaac gtgaacatct tcgacgccat cgcggagatt gggaaccagc tgtatttgtt

1621

caaggatggg aagtactggc gattctctga gggcaggggg agccggccgc agggcccctt

1681

ccttatcgcc gacaagtggc ccgcgctgcc ccgcaagctg gactcggtct ttgaggagcg

1741

gctctccaag aagcttttct tcttctctgg gcgccaggtg tgggtgtaca caggcgcgtc

1801

ggtgctgggc ccgaggcgtc tggacaagct gggcctggga gccgacgtgg cccaggtgac

1861

cggggccctc cggagtggca gggggaagat gctgctgttc agcgggcggc gcctctggag

1921

gttcgacgtg aaggcgcaga tggtggatcc ccggagcgcc agcgaggtgg accggatgtt

1981

ccccggggtg cctttggaca cgcacgacgt cttccagtac cgagagaaag cctatttctg

2041

ccaggaccgc

ttctactggc gcgtgagttc ccggagtgag ttgaaccagg tggaccaagt

2101

gggctacgtg acctatgaca

tcctgcagtg ccctgaggac tagggctccc gtcctgcttt

2161

ggcagtgcca tgtaaatccc cactgggacc aaccctgggg aaggagccag tttgccggat

2221
acaaactggt attctgttct ggaggaaagg gaggagtgga ggtgggctgg gccctctctt

2281
ctcacctttg ttttttgttg gagtgtttct aataaacttg gattctctaa cctttaaaaa

2341
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa

B17 DOCK9-dedicator of cytokinesis 9 mRNA NM_015296.2

(SEQ ID NO: 153)

1
gcggccgggc cgggccgcgg gagcaggcgg aggcggaggc ggcgggggca ggaggatgtc

61
gcagccgccg ctgctccccg cctcggcgga gactcggaag ttcacccggg cgctgagtaa

121
gccgggcacg gcggccgagc tgcggcagag cgtgtctgag gtggtgcgcg gctccgtgct

181
cctggcaaag ccaaagctaa ttgagccact cgactatgaa aatgtcatcg tccagaagaa

241
gactcagatc ctgaacgact gtttacggga gatgctgctc ttcccttacg atgactttca

301
gacggccatc ctgagacgac agggtcgata catatgctca acagtgcctg cgaaggcgga

361
agaggaagca cagagcttgt ttgttacaga gtgcatcaaa acctataact ctgactggca

421
tcttgtgaac tataaatatg aagattactc aggagagttt cgacagcttc cgaacaaagt

481
ggtcaagttg gataaacttc cagttcatgt ctatgaagtt gacgaggagg tcgacaaaga

541
tgaggatgct gcctcccttg gttcccagaa gggtgggatc accaagcatg gctggctgta

601
caaaggcaac atgaacagtg ccatcagcgt gaccatgagg tcatttaaga gacgattttt

661
ccacctgatt caacttggcg atggatccta taatttgaat ttttataaag atgaaaagat

721
ctccaaagaa ccaaaaggat caatatttct ggattcctgt atgggtgtcg ttcagaacaa

781
caaagtcagg cgttttgctt ttgagctcaa gatgcaggac aaaagtagtt atctcttggc

841
agcagacagt gaagtggaaa tggaagaatg gatcacaatt ctaaataaga tcctccagct

901
caactttgaa gctgcaatgc aagaaaagcg aaatggcgac tctcacgaag atgatgaaca

961
aagcaaattg gaaggttctg gttccggttt agatagctac ctgccggaac ttgccaagag

1021
tgcaagagaa gcagaaatca aactgaaaag tgaaagcaga gtcaaacttt tttatttgga

1081
cccagatgcc cagaagcttg acttctcatc agctgagcca gaagtgaagt catttgaaga

1141
gaagtttgga aaaaggatcc ttgtcaagtg caatgattta tctttcaatt tgcaatgctg

1201
tgttgccgaa aatgaagaag gacccactac aaatgttgaa cctttctttg ttactctatc

1261
cctgtttgac ataaaataca accggaagat ttctgccgat ttccacgtag acctgaacca

1321
tttctcagtg aggcaaatgc tcgccaccac gtccccggcg ctgatgaatg gcagtgggca

1381
gagcccatct gtcctcaagg gcatccttca tgaagccgcc atgcagtatc cgaagcaggg

1441
aatattttca gtcacttgtc ctcatccaga tatatttctt gtggccagaa ttgaaaaagt

1501
ccttcagggg agcatcacac attgcgctga gccatatatg aaaagttcag actcttctaa

1561
ggtggcccag aaggtgctga agaatgccaa gcaggcatgc caaagactag gacagtatag

1621
aatgccattt gcttgggcag caaggacatt gtttaaggat gcatctggaa atcttgacaa

1681
aaatgccaga ttttctgcca tctacaggca agacagcaat aagctatcca atgatgacat

1741
gctcaagtta cttgcagact ttcggaaacc tgagaagatg gctaagctcc cagtgatttt

1801
aggcaatcta gacattacaa ttgataatgt ttcctcagac ttccctaatt atgttaattc

1861
atcatacatt cccacaaaac aatttgaaac ctgcagtaaa actcccatca cgtttgaagt

1921
ggaggaattt gtgccctgca taccaaaaca cactcagcct tacaccatct acaccaatca

1981
cctttacgtt tatcctaagt acttgaaata cgacagtcag aagtcttttg ccaaggctag

2041
aaatattgcg atttgcattg aattcaaaga ttcagatgag gaagactctc agccccttaa

2101
gtgcatttat ggcagacctg gtgggccagt tttcacaaga agcgcctttg ctgcagtttt

2161
acaccatcac caaaacccag aattttatga tgagattaaa atagagttgc ccactcagct

2221
gcatgaaaag caccacctgt tgctcacatt cttccatgtc agctgtgaca actcaagtaa

2281
aggaagcacg aagaagaggg atgtcgttga aacccaagtt ggctactcct ggcttcccct

2341
cctgaaagac ggaagggtgg tgacaagcga gcagcacatc ccggtctcgg cgaaccttcc

2401
ttcgggctat cttggctacc aggagcttgg gatgggcagg cattatggtc cggaaattaa

2461
atgggtagat ggaggcaagc cactgctgaa aatttccact catctggttt ctacagtgta

2521
tactcaggat cagcatttac ataatttttt ccagtactgt cagaaaaccg aatctggagc

2581
ccaagcctta ggaaacgaac ttgtaaagta ccttaagagt ctgcatgcga tggaaggcca

2641
cgtgatgatc gccttcttgc ccactatcct aaaccagctg ttccgagtcc tcaccagagc

2701
cacacaggaa gaagtcgcgg ttaacgtgac tcgggtcatt attcatgtgg ttgcccagtg

2761
ccatgaggaa ggattggaga gccacttgag gtcatatgtt aagtacgcgt ataaggctga

2821
gccatatgtt gcctctgaat acaagacagt gcatgaagaa ctgaccaaat ccatgaccac

2881
gattctcaag ccttctgccg atttcctcac cagcaacaaa ctactgaagt actcatggtt

2941
tttctttgat gtactgatca aatctatggc tcagcatttg atagagaact ccaaagttaa

3001
gttgctgcga aaccagagat ttcctgcatc ctatcatcat gcagtggaaa ccgttgtaaa

3061
tatgctgatg ccacacatca ctcagaagtt tcgagataat ccagaggcat ctaagaacgc

3121
gaatcatagc cttgctgtct tcatcaagag atgtttcacc ttcatggaca ggggctttgt

3181
cttcaagcag atcaacaact acattagctg ttttgctcct ggagacccaa agaccctctt

3241
tgaatacaag tttgaatttc tccgtgtagt gtgcaaccat gaacattata ttccgttgaa

3301
cttaccaatg ccatttggaa aaggcaggat tcaaagatac caagacctcc agcttgacta

3361
ctcattaaca gatgagttct gcagaaacca cttcttggtg ggactgttac tgagggaggt

3421
ggggacagcc ctccaggagt tccgggaggt ccgtctgatc gccatcagtg tgctcaagaa

3481
cctgctgata aagcattctt ttgatgacag atatgcttca aggagccatc aggcaaggat

3541
agccaccctc tacctgcctc tgtttggtct gctgattgaa aacgtccagc ggatcaatgt

3601
gagggatgtg tcacccttcc ctgtgaacgc gggcatgact gtgaaggatg aatccctggc

3661
tctaccagct gtgaatccgc tggtgacgcc gcagaaggga agcaccctgg acaacagcct

3721
gcacaaggac ctgctgggcg ccatctccgg cattgcttct ccatatacaa cctcaactcc

3781
aaacatcaac agtgtgagaa atgctgattc gagaggatct ctcataagca cagattcggg

3841
taacagcctt ccagaaagga atagtgagaa gagcaattcc ctggataagc accaacaaag

3901
tagcacattg ggaaattccg tggttcgctg tgataaactt gaccagtctg agattaagag

3961
cctactgatg tgtttcctct acatcttaaa gagcatgtct gatgatgctt tgtttacata

4021
ttggaacaag gcttcaacat ctgaacttat ggattttttt acaatatctg aagtctgcct

4081
gcaccagttc cagtacatgg ggaagcgata catagccaga acaggaatga tgcatgccag

4141
attgcagcag ctgggcagcc tggataactc tctcactttt aaccacagct atggccactc

4201
ggacgcagat gttctgcacc agtcattact tgaagccaac attgctactg aggtttgcct

4261
gacagctctg gacacgcttt ctctatttac attggcgttt aagaaccagc tcctggccga

4321
ccatggacat aatcctctca tgaaaaaagt ttttgatgtc tacctgtgtt ttcttcaaaa

4381
acatcagtct gaaacggctt taaaaaatgt cttcactgcc ttaaggtcct taatttataa

4441
gtttccctca acattctatg aagggagagc ggacatgtgt gcggctctgt gttacgagat

4501
tctcaagtgc tgtaactcca agctgagctc catcaggacg gaggcctccc agctgctcta

4561
cttcctgatg aggaacaact ttgattacac tggaaagaag tcctttgtcc ggacacattt

4621
gcaagtcatc atatctgtca gccagctgat agcagacgtt gttggcattg ggggaaccag

4681
attccagcag tccctgtcca tcatcaacaa ctgtgccaac agtgaccggc ttattaagca

4741
caccagcttc tcctctgatg tgaaggactt aaccaaaagg atacgcacgg tgctaatggc

4801
caccgcccag atgaaggagc atgagaacga cccagagatg ctggtggacc tccagtacag

4861
cctggccaaa tcctatgcca gcacgcccga gctcaggaag acgtggctcg acagcatggc

4921
caggatccat gtcaaaaatg gcgatctctc agaggcagca atgtgctatg tccacgtaac

4981
agccctagtg gcagaatatc tcacacggaa agaagcagtc cagtgggagc cgccccttct

5041
cccccacagc catagcgcct gcctgaggag gagccgggga ggcgtgttta gacaaggatg

5101
caccgccttc agggtcatta ccccaaacat cgacgaggag gcctccatga tggaagacgt

5161
ggggatgcag gatgtccatt tcaacgagga tgtgctgatg gagctccttg agcagtgcgc

5221
agatggactc tggaaagccg agcgctacga gctcatcgcc gacatctaca aacttatcat

5281
ccccatttat gagaagcgga gggattttga gaggctggcc catctgtatg acacgctgca

5341
ccgggcctac agcaaagtga ccgaggtcat gcactcgggc cgcaggcttc tggggaccta

5401
cttccgggta gccttcttcg ggcaggcagc gcaataccag tttacagaca gtgaaacaga

5461
tgtggaggga ttctttgaag atgaagatgg aaaggagtat atttacaagg aacccaaact

5521
cacaccgctg tcggaaattt ctcagagact ccttaaactg tactcggata aatttggttc

5581
tgaaaatgtc aaaatgatac aggattctgg caaggtcaac cctaaggatc tggattctaa

5641
gtatgcatac atccaggtga ctcacgtcat ccccttcttt gacgaaaaag agttgcaaga

5701
aaggaaaaca gagtttgaga gatcccacaa catccgccgc ttcatgtttg agatgccatt

5761
tacgcagacc gggaagaggc agggcggggt ggaagagcag tgcaaacggc gcaccatcct

5821

gacagccata cactgcttcc cttatgtgaa gaagcgcatc

cctgtcatgt accagcacca

5881

cactgacctg aaccccatcg aggtggccat tgacgagatg agtaagaagg tggcggagct

5941

ccggcagctg tgctcctcgg ccgaggtgga catgatcaaa ctgcagctca aactccaggg

6001

cagcgtgagt gttcaggtca atgctggccc actagcatat gcgcgagctt tcttagatga

6061

tacaaacaca aagcgatatc ctgacaataa agtgaagctg cttaaggaag ttttcaggca

6121

atttgtggaa gcttgcggtc aagccttagc ggtaaacgaa cgtctgatta aagaagacca

6181

gctcgagtat caggaagaaa tgaaagccaa ctacagggaa atggcgaagg agctttctga

6241

aatcatgcat gagcagctgg gatgatctgc cccctggagg agaagacgag cgtcttaccg

6301

aattcccttc

acatcttcaa cgccatcagt gggactccaa caagcacaat ggttcacggg

6361

atgaccagct cgtcttcggt

cgtgtgatta catctcatgg cccgtgtgtg gggacttgct

6421

ttgtcatttg caaactcagg atgctttcca aagccaatca ctggggagac cgagcacagg

6481
gaggaccaag gggaagggga gagaaaggaa ataaagaaca acgttatttc ttaacagact

6541
ttctatagga gttgtaagaa ggtgcacata tttttttaaa tctcactggc aatattcaaa

6601
gttttcattg tgtcttaaca aaggtgtggt agacactctt gagctggact tagattttat

6661
tcttccttgc agagtagtgt tagaatagat ggcctacaga aaaaaaaggt tctgggatct

6721
acatggcagg gagggctgca ctgacattga tgcctggggg accttttgcc tcgaggctga

6781
gctggaaaat cttgaaaata tttttttttt cctgtggcac attcaggttg aatacaagaa

6841
ctatttttgt gactagtttt tgatgaccta agggaactga ccattgtaat ttttgtacca

6901
gtgaaccagg agatttagtg cttttatatt catttccttg catttaagaa aatatgaaag

6961
cttaaggaat tatgtgagct taaaactagt caagcagttt agaaccaaag gcctatatta

7021
ataaccgcaa ctatgctgaa aagtacaaag tagtacagta tattgttatg tacatatcat

7081
tgttaataca gtcctggcat tctgtacata tatgtattac atttctacat ttttaatact

7141
cacatgggct tatgcattaa gtttaattgt gataaatttg tgctgttcca gtatatgcaa

7201
tacactttaa tgttttattc ttgtacataa aaatgtgcaa tatggagatg tatacagtct

7261
ttactatatt aggtttataa acagttttaa gaatttcatc cttttgccaa aatggtggag

7321
tatgtaattg gtaaatcata aatcctgtgg tgaatggtgg tgtactttaa agctgtcacc

7381
atgttatatt ttcttttaag actttaattt agtaatttta tatttgggaa aataaaggtt

7441
tttaatttta tttaactgga atcactgccc tgctgtaatt aaacattctg taccacatct

7501
gtattaaaaa gacattgctg accattaaaa aaaaaaaaaa aaaa

B18 SIRPB2 signal-regulatory protein beta 2, mRNA NM_001122962.1

(SEQ ID NO: 154)

1
ttagcacagt gactacagga atcacagccc agacacaaaa gcaggaaacc ctttgaccgg

61
gctccttcct attgcaccaa cagccttgtg ttgctgcaat gaaaacactt ccccaagcag

121
ctgtggccaa gagacgcaga aactgccttg tccacgggcc ccgcctcaga ctccaacact

181
cacaagagag cagaggagcc ccaagtcttg gggaccacag aagatgccat gtgctccacg

241
atgtcggccc ccacctgcct ggcccacttg cctccctgct tcctgctgct ggcactggtc

301
cttgtcccct cagatgcctc tgggcagagc agcaggaatg actggcaggt gctacagccc

361
gagggcccca tgctggtggc agaaggtgag acacttctac tgaggtgtat ggtggtcggc

421
tcctgcactg atggtatgat aaaatgggtg aaggtgagca ctcaggacca acaggaaatt

481
tataacttta aacgtggctc cttccctggg gtaatgccca tgatccaacg gacatcagaa

541
ccactgaatt gtgattattc catctatatc cacaatgtca ccagggagca cactggaacc

601
taccactgtg tgaggtttga tggtttgagt gaacactcag aaatgaaatc ggatgaaggc

661
acctcagtgc ttgtgaaggg agctggggac cctgaaccag acctgtggat catccagccc

721
caggaattgg tgttggggac cactggagac actgtctttc tgaactgcac agtgcttgga

781

gacggtcccc ctggacccat caggtggttc cagggagctg gtctgagccg ggaggccatt

841

tacaactttg gaggcatctc ccaccccaag gagacagcgg tgcaggcctc caacaatgac

901

ttcagcattc

ttctgcaaaa cgtctccagt gaggatgcag gcacctatta ctgtgtaaag

961

tttcagagga aacccaacag

gcaatacctg tctggacagg gcaccagcct gaaagtgaaa

1021

gcaaaatcta cctcttccaa agaggcagaa ttcaccagtg aacctgcaac tgagatgtct

1081

ccaacaggcc tcctggttgt gttcgcacct gtggtcctgg ggctgaaggc aattaccttg

1141

gctgcactcc tactggccct ggctacctct cggaggagcc ctgggcaaga agatgtcaag

1201

accacaggcc cagcaggagc catgaacacc ttagcatgga gcaagggtca agagtgaggg

1261

gtcagcccca gagtgaggac cctctgagtt ggagaggagc cagggctcct caaccatttc

1321

cctacctcca gtcccagcct ctaggtgccc ccaggcctca tgacaaactc ctagatccct

1381

acatctggtt ttggtccacc tagtgaaatt cccttctttg caccgggctt ccctctaaaa

1441

tgtctccctt tctctttttg gcctgttcaa gacctccttg cttttcagtc cctggctcag

1501

tctctcctca acacccttgc ccctgctgca gccctttctg gtgcgccctg cccctttccc

1561

cacctcgcta catccttctt ggcctccaac atccaactca gagtcttctt cccaggagat

1621

gtctgtaaga atctctgaac tcaaccagcc agaccatctg tgcccctcca tctacacctt

1681

tctccccact ccttcctgcc ttccttccat ccccctcatg gctggcttgg gcaggtataa

1741

tattagaatg caggttcagc aactataaca aagctcttaa ataacagtgg cttaaaccag

1801

tggaaatcaa

ccagaaagtt gaccatcagc aggccaagca atacagagac tccctggtat

1861

tgagacccag gattcactga tctcattgct accaggtcca ccttctaggc agccagactg

1921

gaaaagaggg caggaaaggg gagcaggacc ctccccttta agtgcacagt caggaacttg

1981
gccacctcac ttatctctac ttggctggaa tgtggtcaca tggtcacacc tagctgcaag

2041
aaacactggg agatgtagtc tttatttctg gcagcaatgc gcccagctgc aagttttcac

2101
tagagaaacc agatggcaga tatcagggga taaccagtta tctccaccac agcagcatac

2161
agacagcctc tcacctgccc tgtgggacac ctgagttcaa tgcccagcta gctagccagc

2221
acttcttccc actatcacct cccctggggc agcatgatgt ggggcagtag ttcccaagat

2281
gagtgatttt gcccccactg gacttttggc aatgtctaga gatgtttttg gttggcacaa

2341
cctggggggg tgctaccacc atctagtgga ctgagaagcc ctgacatggg gaagagtgtg

2401
catgcccagg agtcagacac acctgccttt aaccctgagg cctctgcctc ctccctgtgc

2461
accctcagtg actaatcaga gtcccttccc atcacggaac atccaggata ctaatgtgga

2521
cttctctgca ttgtgtaaga accaattcaa gaccaggcac ggtggcttat gcatgtaatc

2581
ccagcacttt gggaggccga ggtgggtgga tcacctgagt tcaggagttt gagaccagcc

2641
tggctaacat ggtgaaacct cgtctctact aaaaatacaa aaaattagcc aggcgtggtg

2701
gtgtgcacct gtaatcccag ctacttggga ggatggggca ggagaaccgc ttgaactggg

2761
aggcagaggc tgcagtgagc tgagatcgcg ccattgcact ccagcctggg caacaagagc

2821
aaaactccgt ctc

B19 ANKRD22 ankyrin repeat domain 22, mRNA NM_1445902

(SEQ ID NO: 155)

1
aatgtaagaa cttttcttcc tcccttaact ttgcttcctt ctttcctgca tgttaccact

61
ggcagagcaa atatgactca gaaaccggct cctcagggtt gtaacattag atgatacagg

121
cttgggtcgt tacacatgac accagtgcct ttgtttcatt gggctgggct ctctggaagg

181
tgtgctgctg cctgagctgc tggaaaagca ctgacaggtg tttgctagaa aagcactcct

241
ggagcttgcc accagcttgg acttctaggg actttcctct cagccaggaa ggattttgat

301
attcatcaga aatacctcca gaagattcaa ggagctgtag aggtgaagta agcctgtgaa

361
ggaccagcat gggaatccta tactctgagc ccatctgcca agcagcctat cagaatgact

421
ttggacaagt gtggcggtgg gtgaaagaag acagcagcta tgccaacgtt caagatggct

481
ttaatggaga cacgcccctg atctgtgctt gcaggcgagg gcatgtgaga atcgtttcct

541
tccttttaag aagaaatgct aatgtcaacc tcaaaaacca gaaagagaga acctgcttgc

601
attatgctgt gaagaaaaaa tttaccttca ttgattatct actaattatc ctcttaatgc

661
ctgttctgct tattgggtat ttcctcatgg tatcaaagac aaagcagaat gaggctcttg

721
tacgaatgct acttgatgct ggcgtcgaag ttaatgctac agattgttat ggctgtaccg

781
cattacatta tgcctgtgaa atgaaaaacc agtctcttat ccctctgctc ttggaagccc

841
gtgcagaccc cacaataaag aataagcatg gtgagagctc actggatatt gcacggagat

901
taaaattttc ccagattgaa ttaatgctaa ggaaagcatt gtaatccttg tgaccacacc

961
gatggagata cagaaaaagt taacgactgg attctatctt cattttagac ttttggtctg

1021

tgggccattt aacctggatg ccaccatttt atggggataa tgatgcttac catggt

taat

1081

gttttggaag agctttttat ttatagcatt gtttactcag

tcaagttcac catggccgta

1141

atccttctaa gggaaacact aaagttgttg tagtctccac ttcagtcaga

aactgatgtt

1201

tcagctaggc acagtggtac atgcctgtaa tcccagctac ttgggaggct gaggtgggag

1261

gatcacttga actcaggagt ttgagagcag ccagggcaac acagcgagac cctgtctcaa

1321
aaaaaaaaaa aaaaaaaaaa gccctggtgt tccaaactca gtctttcctg aagaagagga

1381
tctgagttat cttctgaaac agcgttctcc cttcccagtt gtatcactct tataaaaaga

1441
ctgtccagtc tatgtcatgc cctaggagac aaactgttcc tcccagcccc ctttgagtat

1501
tgagcagaag aatcaaatta ttaaatacgt atgtttgtac agaatggtat ttgtgtatgt

1561
gtgtgggctt agagattcac aagtaaatat tcctttggtg aaggaatttc aataaaaaca

1621
tctatcaagt gtcagcggtg agtgtgttta caccacagaa attggcaaat tgacaaatca

1681
gagtttgttt ttgttttttt gttttttact ttccataaag ttcgtttacc agcataccac

1741
tagagatttc ggtttacaaa taaaagccat cttggtttga gcaagactat gcaactatga

1801
aaatgttcgt ttaaaaaaat cttcatgatc cttttgtaaa tacaaggtgg ttgccaagct

1861
tgttagtttt gtttatttta ttgatagatg taaaatatta ttgtaactta tttggataaa

1921
gttcttcaaa agaaacagag ctatacaatg aggtaggatc tggattattt gtctaagtga

1981
gagattgcga atatcaaaat atctgtctca cttcttctgt gaatgacaca gagtagaaat

2041
aaattcactt taaaaatatg actgaatttt gaaaatcaag actgaatctc acatagctgc

2101
agacaggaac taagccagcc tctttgtatg tggtaacaag tacagtataa gaatgaaaga

2161
tttaccatcc ttgaaagctc taatgaaaat caaatccagc aatatatatt caactgtgta

2221
caggatttaa gaaacttatt ttatgaagga agtaatagtg tgtagatata gattctgaag

2281
tctttaaacg tgtcttaata aattaagatt cactggcatt gagctgagct accaggtgac

2341
ccttggggac aaaaaaccca cacaagtgaa tttcacacac cagtatacct tcaacaatat

2401
acttttgaca cacacaaacc tttgatttgg tttcagagat tttgcaaaat agtaccaatg

2461
taatttacaa ctgtcatctt tgaaattgtg taaaagtgga ataattttct gaagaaataa

2521
atcatggttt gtcaatgagt tgcagagact gtctgacatt aactttgtca agattaaagg

2581
ataaagtata tgacaatttg tttcatcatg ctcatgacat tatgcaattt tctccctagc

2641
ttttaatttt tggaggcaga aaattgagcc agaaattttt agtcattagg tctcctagca

2701
acaagctgta aaccttccaa caagcttgga ctagaatcta gacactgaaa tgcacataca

2761
tgctttatgt aatgcagaat gcatttattg gagaactcat aaacatccta taaaattttc

2821
ttccctgaga tgcaactata aaacttggcc ttattctgag aatgcttaac atagatttca

2881
tccatactgt aacactgatt ttgttgttgt tgtccttaaa gcagctcagc ttcctgaggt

2941
agtgttatgt ctctgtggca acaaggtgaa aatgtctagc ttattttgtc aaagtcaaca

3001
ataatccaca gactccagac ctcaatatct gtcccaattt gccattttac tttagtgctc

3061
caaaaatatg gcttatagaa aaaacaatag gtgttttaaa gagatttacc tgaatgatat

3121
agagaatgtc tagatatttt ctggctatca ggtaaaacct acccttcaag atggtagaat

3181
atataatagc atacaaaacc tctatttacc taataagtac tttaatttac agaaaaaaaa

3241
tgtaaatgta agtgtcggat ttagtgccaa gtgcagggaa tctgaaaaat gtatactagg

3301
tctctgctct ccgtaattct gccttcatgg gtcctagccc catccctcag gaggttgtcc

3361
taagatcgtc agtgtcagat gcttcacaat acggcctcac accgtccctg ggaaaggttg

3421
gtctcctcct gctgcatcag atggatgatt tcattgtaca tacggtgagg agcatccaaa

3481
ccccagatga aatccacgtg agcccattca ggaatattct tatggtagat gaggttggtc

3541
acctcagaga gcagcatttt cacgtcttct ggatttgaaa gccagtcctg acctcctgtc

3601
cacattgctg tagggaccgt catatctctg actctgtacc ttacaggagt tggctagaga

3661
aaaggaatag ttcttaactc taggtaacat ttggactttc aggctcataa tttatgtttc

3721
aaatagacat aataaacatg ccatctgttg tggtgaaggg tacatgggtg ttagagccac

3781
acaactctgt taagaatttc tgttcccgcc cttactttaa ggtaaaatta cttaacatta

3841
ttgaacctca gtttcttctt ctgtgactgg ggataatatc tgtaataact tgctagatca

3901
aatgacaaaa cacataaaaa catgtaatgc cttgtatttc ttttttcttc ctattaaata

3961
ttttgtaaat aaattgtttt taaaaaaaaa aaa

C1 ABCF2-ATP-binding cassette, sub-family F (GCN20), member 2 mRNA

NM_005692.4

(SEQ ID NO: 156)

1
ggcgtcacgc ggccccgcga ggtctgtggg atacatagta gtcctcaagg cgggtctcac

61
tcttggccgc tgcaacttga ggactacact tccaaggagg cagcgcggcg cgccgagaac

121
cacccgaggc cgtgattggc tggtgagccg gccgcacgcg gaggatccta aggagcagct

181
ctgttgcgac ataggccgag cagcgaggcc cagctccctg aaacaacagt aacctacccc

241
tgtgggtcat catcatgccc tccgacctgg ccaagaagaa ggcagccaaa aagaaggagg

301
ctgccaaagc tcgacagcgg cccagaaaag gacatgaaga aaatggagat gttgtcacag

361
aaccacaggt ggcagagaag aatgaggcca atggcagaga gaccacagaa gtagatttgc

421
tgaccaagga gctagaggac tttgagatga agaaagctgc tgctcgagct gtcactggcg

481
tcctggcctc tcaccccaac agtactgatg ttcacatcat caacctctca cttacctttc

541
atggtcaaga gctgctcagt gacaccaaac tggaattaaa ctcaggccgt cgttatggcc

601
tcattggttt aaatggaatt ggaaagtcca tgctgctctc tgctattggg aagcgtgaag

661
tgcccatccc tgagcacatc gacatctacc atctgactcg agagatgccc cctagtgaca

721
agacaccctt gcattgtgtg atggaagtcg acacagagcg ggccatgctg gagaaagagg

781
cagagcggct ggctcatgag gatgcggagt gtgagaagct catggagctc tacgagcgcc

841
tggaggagct ggatgccgac aaggcagaga tgagggcctc gcggatcttg catggactgg

901
gtttcacacc tgccatgcag cgcaagaagc taaaagactt cagtgggggc tggaggatga

961
gggttgccct tgccagagcc ctctttattc ggcccttcat gctgctcctg gatgagccta

1021
ccaaccacct ggacctagat gcttgcgtgt ggttggaaga agaactaaaa acttttaagc

1081
gcatcttggt cctcgtctcc cattcccagg attttctgaa tggtgtctgt accaatatca

1141
ttcacatgca caacaagaaa ctgaagtatt atacgggtaa ttatgatcag tacgtgaaga

1201
cgcggctaga gctggaggag aaccagatga agaggtttca ctgggagcaa gatcagattg

1261
cacacatgaa gaactacatt gcgaggtttg gtcatggcag tgccaagctg gcccggcagg

1321
cccagagcaa ggagaagacg ctacagaaaa tgatggcatc aggactgaca gagagggtcg

1381
tgagcgataa gacactgtca ttttatttcc caccatgtgg caagatccct ccacctgtca

1441
ttatggtgca aaatgtgagc ttcaagtata caaaagatgg gccttgcatc tacaataatc

1501
tagaatttgg aattgacctt gacacacgag tggctctggt agggcccaat ggagcaggga

1561
agtcaactct tctgaagctg ctaactggag agctactacc cacagatggc atgatccgaa

1621
aacactctca tgtcaagata gggcgttacc atcagcattt acaagagcag ctggacttag

1681
atctctcacc tttggagtac atgatgaagt gctacccaga gatcaaggag aaggaagaaa

1741

tgaggaagat cattgggcga tacggtctca ctgggaaaca acaggtgagc ccaatccgga

1801

acttgtcaga cgggcagaag tgccgagtgt gtctggcctg gctggcctgg cagaaccccc

1861

acatgctctt cctggatgaa cccaccaatc acctggatat cgagaccatc gacgccctgg

1921

cagatgccat caatgagttt gagggtggta tgatgctggt

cagccatgac ttcagactca

1981

ttcagcaggt tgcacaggaa atttgggtct gtgagaagca gacaatcacc

aagtggcctg

2041

gagacatcct ggcttacaag gagcacctca agtccaagct ggtggatgag gagccccagc

2101

tcaccaagag gacccacaac gtgtgcaccc tgacattggc atctctgcca aggccatgag

2161

catcatgaac tcgtttgtaa acgacgtgtt tgagcagctg gcgtgtgagg ctgcccggct

2221

ggcccagtac

tcgggccgga ccaccctgac atcccgagaa gtccagacgg ctgtgcgtct

2281

gctgctgcct ggggagctgg ccaagcacgc tgtgtctgag ggcaccaagg ctgtcaccaa

2341

gtacaccagc tccaagtgac ccagggcctg acaaaaataa agggtgaact gttaaaaaaa

2401
aaaaa

C2 FNBP1L formin binding protein 1-like, mRNA NM_0010249482

(SEQ ID NO: 157)

1
tcactcactg gggagcccgg cggtggcggc acctttcgag gtagacccgc tgagctgcta

61
gcccgccggc cagcgagtga gaggtcggac agactgtgga gccgacagac tgaaggacag

121
cggcaccgcc agacggccag aaagttccgc catgagctgg ggcacggagc tgtgggatca

181
gttcgacagc ttagacaagc atacacaatg gggaattgac ttcttggaaa gatatgccaa

241
atttgttaaa gagaggatag aaattgaaca gaactatgcg aaacaattga gaaatctggt

301
taagaagtac tgccccaaac gttcatccaa agatgaagag ccacggttta cctcgtgtgt

361
agcctttttt aatatcctta atgagttaaa tgactatgca ggacagcgag aagttgtagc

421
agaagaaatg gcgcacagag tgtatggtga attaatgaga tatgctcatg atctgaaaac

481
tgaaagaaaa atgcatctgc aagaaggacg aaaagctcaa caatatcttg acatgtgctg

541
gaaacagatg gataatagta aaaagaagtt tgaaagagaa tgtagagagg cagaaaaggc

601
acaacagagt tatgaaagat tggataatga tactaatgca accaaggcag atgttgaaaa

661
ggccaaacag cagttgaatc tgcgtacgca tatggccgat gaaaataaaa atgaatatgc

721
tgcacaatta caaaacttta atggagaaca acataaacat ttttatgtag tgattcctca

781
gatttacaag caactacaag aaatggacga acgaaggact attaaactca gtgagtgtta

841
cagaggattt gctgactcag aacgcaaagt tattcccatc atttcaaaat gtttggaagg

901
aatgattctt gcagcaaaat cagttgatga aagaagagac tctcaaatgg tggtagactc

961
cttcaaatct ggttttgaac ctccaggaga ctttccattt gaagattaca gtcaacatat

1021
atatagaacc atttctgatg ggactatcag tgcatccaaa caggagagtg ggaagatgga

1081
tgccaaaacc acagtaggaa aggccaaggg caaattgtgg ctctttggaa agaagccaaa

1141
gggcccagca ctagaagatt tcagtcatct gccaccagaa cagagacgta aaaaactaca

1201
gcagcgcatt gatgaactta acagagaact acagaaagaa tcagaccaaa aagatgcact

1261
caacaaaatg aaagatgtat atgagaagaa tccacaaatg ggggatccag ggagtttgca

1321
gcctaaatta gcagagacca tgaataacat tgaccgccta cgaatggaaa tccataagaa

1381
tgaggcttgg ctctctgaag tcgaaggcaa aacaggtggg agaggagaca gaagacatag

1441
cagtgacata aatcatcttg taacacaggg acgagaaagt cctgagggaa gttacactga

1501
tgatgcaaac caggaagtcc gtgggccacc ccagcagcat ggtcaccaca atgagtttga

1561
tgatgaattt gaggatgatg atcccttgcc tgctattgga cactgcaaag ctatctaccc

1621
ttttgatgga cataatgaag gtactctagc aatgaaagaa ggtgaagttc tctacattat

1681
agaggaggac aaaggtgacg gatggacaag agctcggaga cagaacggtg aagaaggcta

1741
cgttcccacg tcatacatag atgtaactct agagaaaaac agtaaaggtg cagtaactta

1801
tatctaaact aaccaggcac ctttgtgcca tgtgtgacat aggaagagta acataaaatg

1861
aaaacacatt caacaggttg aaaaaaataa ggaaacttaa agggcatcca agattaattg

1921
ttcactatgt gagctgagtg taggcttgat cttgtgaata ttaccacaag aaacattttg

1981
tggcacttta ctgtttgagt aacgttggtg tgaagcttaa ttgatgcctt ttgctttatg

2041
tcccgcttaa gtctgtgtga aggatttgtg tttttctgcc ttacaaatag aatttgattt

2101
attgggcagg aattcatgga tagtaatgct ctctgccccc tttacttcag aaaacacagt

2161
gactttagtg aatttgaata gtgaaactgc tctgaaatgc tatggaaagc cgactcccca

2221
aagagtggtt tcttctagaa gtttgaattt gtagctacag tttccaagaa gaaaaatagt

2281
agttggataa tttagtaaaa taataacatc attttcattt tcttacctat tcttaacttt

2341
ggtttcctaa aggaagaaaa tgagcaggta gcacataatc tatttaagta gatttaaaga

2401
gagtttcaaa ataaatctcc tggtctagct cttaggtgaa taaaatagat tttgtttgag

2461
acctcaaaat attttgaggt tagctggtaa ttttcaataa tttacaagct tccttccaaa

2521
ctaatctcat acttttgtat gtttcatctt gaaaatatct tttgggaaat accactttag

2581
tgattattta gcatttagca gttacacata ggaaaataca cagttacata gaaaaataca

2641
catttgaaga tagaggaaac cttgaatgga ggggaagtgt tgacaaattt taatttttaa

2701
aggagaaact ttttgactat ctgggttaga ggaagatatg tgtaccgcct ttagggcatt

2761
ttgttatttc cgctgaatca ttagttatta ggatagataa atttttccaa ttagtttcag

2821
caagcgttgt tggaaacact gtgcagtcaa ggattgtgca gtgctggttg tgtgaccaca

2881
ccctgagtca gtggtgtggg gaagtaaagt gtgaagaagc agtaagattg gtttttaatt

2941
ttgcccatgt tttaaatttt cctggtgttt tcggtagctg actataaaat gatagagaca

3001
tttgggacag gcactttaaa ctgaacaccc cttttggttt taccaaaggt cttcagtaat

3061
tgttcttttc tttttcctcc tggactgcag gttcctgaag agggtttctg aggaaatggg

3121
caagatgttg aaggaggtta catgcagctg cttttggggg agggtattag agttgtcagg

3181
ctcaaagaga gtgagagaag caagttgcat gagtgcatgc agacatgatt ttttttttac

3241
taacttcatt agcatttcca tacattgttt ttaaaaatca taataccaac ccttaagttc

3301
ctagttcaca gttattccca caaaagaaaa agccaacaat agtgtaccat ttttctattt

3361
attttattgc tgtctaatca ataaagaatg cagagctgtc aaaaaatgtg tcttacatta

3421
gctgtcccaa caggattgtc ttccctccca gctctgtttt aattggcttt tagacccact

3481
atctgtcaga tccttgccat ctgtcagtgt ctgcctgcgc cacctccgtg cttgcttaac

3541
atcctgttgc atgtctagcg tgattgagct agatttttca ggcatgtctt tagattccct

3601
tgttcttgtc aaagccttgt tttgttttac atttgtagtg caaatcactt tgtcaaacat

3661
ctccagcact aatgtttcca tcttagtatt tgtgcacact gctataactt ccccactgca

3721
aacattccag ttttggcatt acgaagaagt agctgtgaac ctgaagtatt tatgataaga

3781
aaaagaaaac atctctgctg tagcctacag cccagttgaa agaactcttt gaaacgtgat

3841
acatcttcag cacctcagtc tgggaagaat ctagtcagca ctgaaatcct ggcataataa

3901
acacagaaga tattcaccac ctcaagacaa aggactattg tcaaaagtca gctgcttcca

3961
ttcaaatgct gccttaaact tgagtgccta aatctgttga ttgccaacac taccactaca

4021
gtatcccaca aagggcttta tgtgtcagct cagtgcgacc tgctttaact ctgcagcacc

4081
gctgcagctg ccgatgtagc ctcggtaggt ggctattaga gctctaccat atacagtggt

4141
gcatcttcaa atttatgcat caaactaaag acatgtccaa gtccatttta atttcctcag

4201
tggttttatg agaagtttta tgggcctccc ccaattgtct ttttattttg ggttatgacg

4261
atcatgtttg ataattacaa tgatagtctc tttccacgtg atgcttttgt ttgaacctga

4321
taaaatttag tgaaactttg taatgatcta tgtgcacttt tacttgtaaa atggaatttc

4381
tgtatgttta tacttgtaaa tatgattgtt gttagtgctc ctgttgctca tggtgtcctg

4441
cctcgcattt gtgattctgt taatgacatg tatcttaact aatttcttag tggtgttgta

4501
atagggagat ggggcaggtg gggggttatt tgtaccactg aatcttcatt aatttggttc

4561
tttactgttt tgaggggaga aagaacgtga aatggtttgt gtattattga attttaagca

4621

atattttaga agctgtgtga ctgctttaat aactttttcc cagtgttatt tgaatcatac

4681

tacccgttat actaaagctg aatgacaatt gtgtgaaagt

tactgccttc ataagatcaa

4741

gtcaccactg ttacacagct gacatatagt gtattacctt tgcagctagt

aaactataaa

4801

gtttagatat tgaatctcgt tacagggtta tttatataat gtgacattat tcagtactga

4861

cagactacat gaagtagttt taaaatctag tgctattttt attttaaagg ttagcaatga

4921

ggaggaaatg
tgatctggct gtgtttgtct tctgtacaaa gcctgaagtg cttatggttt

4981

tttggctaac

agccacagag ggcaaagttt aagactttct tgtaaggact aactgttctt

5041

ttcaagctac tgtttgtttt tctaaaagca ggatttgctt ccgtaggagg caagttcctt

5101

gatgtggaat agtgcaacct gtatatgggt tattataata ggaaagacat ttgtacttgc

5161

acagtttaaa tcattcttaa attttgaaca tgtgaattgt cccaaaaaat ctttaatttt

5221
ttggtaattt ttactctttt tgtgcacatg ttgatttctt aatggtaaat ccttcattta

5281
aagatagtgt tctctgttga gaatatttac atggaataaa acaatctttt catggcctgt

5341
taaaaaaaaa aaaaaaaaaa aaaaaaaaaa a

C3 NCF1C neutrophil cytosolic factor 1C pseudogene, mRNA NR 0031872

(SEQ ID NO: 158)

1
agtgcattta aggcgcagcc tggaagtgcc agggagcact ggaggccacc cagtcatggg

61
ggacaccttc atccgtcaca tcgccctgct gggctttgag aagcgcttcg tacccagcca

121
gcactatgta catgttcctg gtgaaatggc aggacctgtc ggagaaggtg gtctaccggc

181
gcttcaccga gatctacgag ttccataaaa ccttaaaaga aatgttccct attgaggcag

241
gggcgatcaa tccagagaac aggatcatcc cccacctccc agctcccaag tggtttgacg

301
ggcagcgggc cgccgagaac caccagggca cacttaccga gtactgcagc acgctcatga

361
gcctgcccac caagatctcc cgctgtcccc acctcctcga cttcttcaag gtgcgccctg

421
atgacctcaa gctccccacg gacaaccaga caaaaaagcc agagacatac ttgatgccca

481

aagatggcaa gagtaccgcg acagacatca ccggccccat catcctgcag acgtaccgcg

541

ccattgccga ctacgagaag acctcgggct ccgagatggc tctgtccacg ggggacgtgg

601

tggaggtcgt ggagaagagc gagagcggtt

ggtggttctg tcagatgaaa gcaaagcgag

661

gctggatccc agcatccttc ctcgagcccc tggacagtcc

tgacgagacg gaagaccctg

721

agcccaacta tgcaggtgag ccatacgtcg ccatcaaggc ctacactgct gtggaggggg

781

acgaggtgtc cctgctcgag ggtgaagctg ttgaggtcat tcacaagctc ctggacggct

841

ggtgggtcat caggaaagac

gacgtcacag gctactttcc gtccatgtac ctgcaaaagt

901

cggggcaaga cgtgtcccag gcccaacgcc

agatcaagcg gggggcgccg ccccgcaggt

961
cgtccatccg caacgcgcac agcatccatc agcggtcgcg gaagcgcctc agccaggacg

1021
cctatcgccg caacagcgtc cgttttctgc agcagcgacg ccgccaggcg cggccgggac

1081
cgcagagccc cgggagcccg ctcgaggagg agcggcagac gcagcgctct aaaccgcagc

1141
cggcggtgcc cccgcggccg agcgccgacc tcatcctgaa ccgctgcagc gagagcacca

1201
agcggaagct ggcgtctgcc gtctgaggct ggagcgcagt ccccagctag cgtctcggcc

1261
cttgccgccc cgtgcctgta catacgtgtt ctatagagcc tggcgtctgg acgccgaggg

1321
cagccccgac ccctgtccag cgcggctccc gccaccctca ataaatgttg cttggagtgg

1381
accgaggctc tgcaggaatg cagggagggc cgggctccgc cccagggtta tttctaagtt

1441
gaaaaaaaaa aaaaaaaaa

C4 TBC1D3B TBC1 domain family, member 3B, mRNA NM_001001417.5

(SEQ ID NO: 159)

1
actggtgctt agcacctatc tgctctctgg cctgcgtcag tggtctacag cagttacaca

61
caggcagtgg tatctgtgag cagctctgtg gactcaaagg ttttctccct gagaggcatg

121
acccaggcca gctgattcat cagaatcagg atggacgtgg tagaggtcgc gggtagttgg

181
tgggcacaag agcgagagga catcattatg aaatacgaaa agggacaccg agctgggctg

241
ccagaggaca aggggcctaa gccttttcga agctacaaca acaacgtcga tcatttgggg

301
attgtacatg agacggagct gcctcctctg actgcgcggg aggcgaagca aattcggcgg

361
gagatcagcc gaaagagcaa gtgggtggat atgctgggag actgggagaa atacaaaagc

421
agcagaaagc tcatagatcg agcgtacaag ggaatgccca tgaacatccg gggcccgatg

481
tggtcagtcc tcctgaacat tgaggaaatg aagttgaaaa accccggaag ataccagatc

541
atgaaggaga agggcaagag gtcatctgag cacatccagc gcatcgaccg ggacataagc

601
gggacattaa ggaagcatat gttcttcagg gatcgatacg gaaccaagca gcgggaacta

661
ctccacatcc tcctggcata tgaggagtat aacccggagg tgggctactg cagggacctg

721
agccacatcg ccgccttgtt cctcctctat tttcctgagg aggatgcatt ctgggcactg

781
gtgcagctgc tggccagtga gaggcactcc ctgcagggat ttcacagccc aaatggcggg

841
accgtccagg ggctccaaga ccaacaggag catgtggtag ccacgtcaca atccaagacc

901
atggggcatc aggacaagaa agatctatgt gggcagtgtt ccccgttagg ctgcctcatc

961
cggatattga ttgacgggat ctctctcggg ctcaccctgc gcctgtggga cgtgtatctg

1021
gtagaaggcg aacaggcgtt gatgccgata acaagaatcg cctttaaggt tcagcagaag

1081
cgcctcacga agacgtccag gtgtggcccg tgggcacgtt tttgcaaccg gttcgttgat

1141
acctgggcca gggatgagga cactgtgctc aagcatctta gggcctctat gaagaaacta

1201
acaagaaagc agggggacct gccaccccca gccaaacccg agcaagggtc gtcggcatcc

1261
aggcctgtgc cggcttcacg tggcgggaag accctctgca agggggacag gcaggcccct

1321
ccaggcccac cagcccggtt cccgcggccc atttggtcag cttccccgcc acgggcacct

1381
cgttcttcca caccctgtcc tggtggggct gtccgggaag acacctaccc tgtgggcact

1441

cagggtgtgc ccagcccggc cctggctcag ggaggacctc agggttcctg gagattcctg

1501

cagtggaact ccatgccccg cctcccaacg gacctggacg tagagggccc ttggttccgc

1561

cattatgatt tcagacagag ctgctgggtc cgtgccatat cccaggagga ccagctggcc

1621

ccctgctggc aggctgaaca ccctgcggag cgggtgagat cggctttcgc tgcacccagc

1681

actgattccg accagggcac ccccttcaga gctagggacg aacagcagta tgctcccacc

1741

tcagggcctt

gcctctgcgg cctccacttg gaaagttctc agttccctcc

aggcttctag

1801

aagcatctgg gccagggctc atggctggat aatttcccta ggcttaacaa cccaagcaag

1861

cttcgcgtcc tcgttttatt tttggttaaa cttatgaaaa tgtattaaga aagagtgcag

1921

ctcgagagag attcagagat ggaacacacc agaccccaga tcacaaagcc aaccatgccc

1981

agcccctccc agcaccccca gccccacgac catcgttctg aattctgacg acaccgtgag

2041
cctgcctttg tactttaaac tcatggaagg ataactacct tcacgttttg aaataaatgt

2101
ttcctgttga aatg

C5 SLC14A1 solute carrie family 14 (urea transporter), member 1, mRNA

NM_001128588.3

(SEQ ID NO: 160)

1
acacagagca gagtggggct ctgagtatat aactgttagg tgcctccctc cagcaccatc

61
tcctgagaag cactctccct tgtcgtggag gtgggcaaat ctttatcagc cactgccttc

121
tgctgccagg aagccagcta gagtggtctt taaagaaaac tgggcatctc ctgctactta

181
aaatcaaaaa ctacctaaaa taaagattat aaaaaagtaa ggatgaatgg acggtctttg

241
attggcggcg ctggtgacgc ccgtcatggt cctgtttgga aggacccttt tggaactaaa

301
gctggtgacg cagcgcgcag aggcatcgcc cggctaagct tggccctggc agatgggtcg

361
caggaacagg agccagagga agagatagcc atggaggaca gccccactat ggttagagtg

421
gacagcccca ctatggttag gggtgaaaac caggtttcgc catgtcaagg gagaaggtgc

481
ttccccaaag ctcttggcta tgtcaccggt gacatgaaag aacttgccaa ccagcttaaa

541
gacaaacccg tggtgctcca gttcattgac tggattctcc ggggcatatc ccaagtggtg

601
ttcgtcaaca accccgtcag tggaatcctg attctggtag gacttcttgt tcagaacccc

661
tggtgggctc tcactggctg gctgggaaca gtggtctcca ctctgatggc cctcttgctc

721
agccaggaca ggtcattaat agcatctggg ctctatggct acaatgccac cctggtggga

781
gtactcatgg ctgtcttttc ggacaaggga gactatttct ggtggctgtt actccctgta

841
tgtgctatgt ccatgacttg cccaattttc tcaagtgcat tgaattccat gctcagcaaa

901
tgggacctcc ccgtcttcac cctccctttc aacatggcgt tgtcaatgta cctttcagcc

961
acaggacatt acaatccatt ctttccagcc aaactggtca tacctataac tacagctcca

1021
aatatctcct ggtctgacct cagtgccctg gagttgttga aatctatacc agtgggagtt

1081
ggtcagatct atggctgtga taatccatgg acagggggca ttttcctggg agccatccta

1141
ctctcctccc cactcatgtg cctgcatgct gccataggat cattgctggg catagcagcg

1201
ggactcagtc tttcagcccc atttgaggac atctactttg gactctgggg tttcaacagc

1261
tctctggcct gcattgcaat gggaggaatg ttcatggcgc tcacctggca aacccacctc

1321
ctggctcttg gctgtgccct gttcacggcc tatcttggag tcggcatggc aaactttatg

1381
gctgaggttg gattgccagc ttgtacctgg cccttctgtt tggccacgct attgttcctc

1441
atcatgacca caaaaaattc caacatctac aagatgcccc tcagtaaagt tacttatcct

1501
gaagaaaacc gcatcttcta cctgcaagcc aagaaaagaa tggtggaaag ccctttgtga

1561
gaacaagccc catttgcagc catggtcacg agtcatttct gcctgactgc tccagctaac

1621
ttccagggtc tcagcaaact gctgtttttc acgagtatca actttcatac tgacgcgtct

1681
gtaatctgtt cttatgctca ttttgtattt tcctttcaac tccaggaata tccttgagca

1741
tatgagagtc acatccaggt gatgtgctct ggtatggaat ttgaaacccc aatggggcct

1801
tggcactaag actggaatgt atataaagtc aaagtgctcc aacagaagga ggaagtgaaa

1861
acaaactatt agtatttatt gatattcttg gtgtttagct ggctcgatga tgttaacagt

1921
attaaaaatt aaaccccata aaccaactaa gccttatgga attcacagtc acaaaatcga

1981
agttaatcca gaattctgtg ataagcagct tggctttttt tttaaatcaa tgcaagttac

2041

acattatagc cagaatctgt atcacagagg tgcaagctga cagcagagct cagtccccac

2101

ttcctgcaaa caatggcctg caccctatcc cttgtgtgtg

tgacattctc tcatgggaca

2161

atgttggggt ttttcagact gacaggactg caagagggag aaaggaattt

tgtcaatcaa

2221

aattattctg tattgcaact tttctcagag attgcaaagg attttttagg tagagattat

2281

ttttccttat gaaaaatgat ctgttttaaa tgagataaaa taggagaagt tcctggctta

2341

acctgttctt acatattaaa gaaaagttac ttactgtatt tatgaaatac tcagcttagg

2401

catttttact ttaaccccta aattgatttt gtaaatgcca caaatgcata gaattgttac

2461

caacctccaa agggctcttt aaaatcatat tttttattca tttgaggatg tcttataaag

2521

actgaaggca aaggtcagat tgcttacggg tgttattttt ataagttgtt gaattcctta

2581

atttaaaaaa gctcattatt ttttgcacac

tcacaatatt ctctctcaga aatcaatggc

2641

atttgaacca ccaaaaagaa ataaagggct gagtgcggtg

gctcacgcct gtaatcccag

2701

cactttgggg agcccaggcg ggcagattgc ttgaacccag gagttcaaga ccagcctggg

2761

cagcatggtg aaaccctgta tctacaaaaa atacaaaaat tagccaggca tggtggtggg

2821

tgcctgtagt tccagctact tgggaggctg aggtgggaaa atgacttgag cccaggagga

2881

ggaggctgca gtgagctaag attgcaccac tgcactccaa cctgggcgac aagagtgaaa

2941

ctgtgtctct caaaaaaaaa aaaaaacaaa caaaaacaaa aacaaaacaa aacaaaacaa

3001
aacaaaacag gtaaggattc ccctgttttc ctctctttaa ttttaaagtt atcagttccg

3061
taaagtctct gtaaccaaac atactgaaga cagcaacaga agtcacgttc agggactggc

3121
tcacacctgt aatcccagca ctttgggaga tggaggtaaa aggatctctt gagcccagga

3181
gttcaagacc agcttgggca acatagcaag actccatctc ttaaaaaata aaaatagtaa

3241
cattagccag gtgtagcagc acacatctgc agcagctact caggaggctg aggtggaaag

3301
atcgcttgtg cacagaagtt cgaggctgca gtgagctata tgatcatgtc actgcactcc

3361
agcctgtgtg accgagcaag accctatctc aaaaaaatta attaattaat taattaatta

3421
atttaaaaag gaagtcatgt tcatttactt tccacttcag tgtgtatcgt gtagtatttt

3481
ggaggttgga aagtgaaacg taggaatcct gaagattttt tccacttcta gtttgcagtg

3541
ctcagtgcac aatatacatt ttgctgaatg aataaacaga aatagggaag taaacctaca

3601
aatattttag ggagaagctc acttcttcct tttctcagga aaccaagcaa gcaaacatat

3661
cgttccaatt ttaaaaccca gtgaccaaag cctttggaac tatgaatttg caactgtcat

3721
aggtttatgg atattgctgt ggagaagctc aattttcagt gtttgaactg aaccctttct

3781
tgttagggaa cgtgtgaaag aagaattgtg gggaaaaaaa agcaagcata accaaagatc

3841
atcagcagtg aagaatctag gctgtggctg agagaaccag aggcctctaa aatggacccg

3901
agtcgatctt cagaacaggg atctaccatg caggagcttc ttgtgctcac acaaatctgt

3961
aaatgggaac attgtacatt gtcgaattta aatgatatta attttctcaa gctatttttg

4021
ttactatttt cctaaaattg aatatttgca gggagcactt atactttttc ctaatgtctg

4081
tataacaaat ttctatgcaa gtacatgaat aaattatgct cacagctca

D1 CALCOCO2-calcium binding and coiled-coildomain 2 mRNA NM_001261390.1

(SEQ ID NO: 161)

1
caggcgggac gggctctccc ttgggtgctt agccccgccc ccgtcccact ctgccctgtt

61
gctgtcgcgc cgctgctggt tgctgtccct ggacccctac catggaggag accatcaaag

121
atccccccac atcagctgtc ttgctggatc actgtcattt ctctcaggtc atctttaaca

181
gtgtggagaa gttctacatc cctggagggg acgtcacatg tcattatacc ttcacccagc

241
atttcatccc tcgtcgaaag gattggattg gcatctttag agcatttaaa tgtttccaag

301
acaaattgga acaagaacta ctcaaatgga ggagccaagg acagaaattg caggtggggt

361
ggaagacaac ccgtgagtat tacaccttca tgtgggttac tttgcccatt gacctaaaca

421
acaaatcagc taaacagcag gaagtccaat tcaaagctta ctacctgccc aaggatgatg

481
agtattacca gttctgctat gtggatgagg atggtgtggt ccggggagca agtattcctt

541
tccaattccg tccagaaaat gaggaagaca tcctggttgt taccactcag ggagaggtgg

601
aagagattga gcagcacaac aaggagcttt gcaaagaaaa ccaggagctg aaggacagct

661
gtatcagcct ccagaagcag aactcagaca tgcaggctga gctccaaaag aagcaggagg

721
agctagaaac cctacagagc atcaataaga agttggaact gaaagtgaaa gaacagaagg

781
actattggga gacagagctg cttcaactga aagaacaaaa ccagaagatg tcctcagaaa

841
atgagaagat gggaatcaga gtggatcagc ttcaggccca gctgtcaact caagagaaag

901
aaatggagaa gcttgttcag ggagatcaag ataagacaga gcagttagag cagctgaaaa

961
aggaaaatga ccacctcttt ctcagtttaa ctgaacagag gaaggaccag aagaagctcg

1021
agcagacagt ggagcaaatg aagcagaatg aaactactgc aatgaagaaa caacaggaat

1081
taatggatga aaactttgac ctgtcaaaaa gactgagtga gaacgaaatt atatgtaatg

1141
ctctgcagag acagaaagag agattggaag gagaaaatga tcttttgaag agggagaaca

1201
gcagattgct cagttacatg ggtctggatt ttaattcttt gccgtatcaa gtacctactt

1261
cagatgaagg aggcgcaaga caaaatccag gacttgccta tggaaaccca tattctggta

1321
tccaagaaag ttcttccccc agcccgctct ccatcaagaa atgccctatc tgcaaagcag

1381
atgatatttg tgatcacacc ttggagcaac agcagatgca gcccctttgt ttcaattgtc

1441
caatttgtga caagatcttc ccagctacag agaagcagat ctttgaagac cacgtgttct

1501
gccactctct ctgagtatcc caacctcttg gatgtataca gagattttat agaatagaac

1561
ctatagcttc taccatgagt tatatgagtc aagatcctgc ctaacctgaa attattaggg

1621
atttactcag ccctgctgcc gctaacagtg gagttatgtc actgatctga aggtcactgt

1681
taagggcttc tgctgccatc cttgtgggtt gctaccttta agtcgcataa ctctagctgt

1741
atcatcctct cacctgtcat tcttctgagg gtctcagtac aagggccctg ggatggagcc

1801
aacctgggta ttcacaacag gcctgacttg atactaagtg attagttttc caagttgtcc

1861
cactgccatt caaagtcagc ccttgagtgt atttgttctc agtcctaacc ctggggccag

1921
agattggtcc gaggttgaga attccttcct cctcatcctt ggtgttgctt tctccaaatg

1981
attgttttag actagccaaa aatgccgtgg caaagagctc agaaatccaa tttggatacc

2041
aaaggtttct catgttaatt tctcagcccc caaagaagca tcttactcct gaaccttaga

2101
caggaagtat tgtttcagtc acagaaagct tttctgggta cctctggtta gcactttcta

2161
ctctctgata tttcctatgt acatagcttt tattgttgta aatcctttct taatggttaa

2221
ataggattgt tagcaactat gggtttgcag ttttctgagt aggtgagttt tgaatatggg

2281
taaatcagaa taatgagaca acttgttaat ctctttaata ctaaaaataa attactcttc

2341
tatttcaggg acttaggtaa tttaaaataa accttcaatt tatggtcttc tgttttgaag

2401
ctcatgggaa aattgtgatc aaaagggcta tgggaagggc agaccccgcc aatgatttct

2461
cttcacctgt cttaagatta aataaaaaag agtgtcctgg cagttatctt gaggtgggga

2521
aggaggtgat gaaacattag tttgtgaaat ccaaggccct ggcttgcttt ctttcttttt

2581
tttttttttt ttttgaaaca gtctctctct gtcacccagg ctggcgtgca atggcgcagt

2641

tgactcacta cagcctctgc ctcccaggtt caagcgattc tcatgcctta gcctcccaag

2701

tagctgggat tacaggtgtg tgccgcaatg cccagctaat ttttgtgttt ttagtagaga

2761

cagggtttca ctatgttggc caggctggtc tcgaactcct ggcctcacgt gatctgtcca

2821

cttcagccgt ccaaagtgct gggattacaa gcgtgagcca ctgtgctggg cccgaggccc

2881

tgacttcttg ctgtaacttt ccatgcattt tttttaaaag gagcagtgtg gattttcgca

2941

ccctttgtga actaagttca atgcgctcta tccaaatttg cctaattgaa ctataagaaa

3001

gtaataattc

cattttctat cccctcaggg actgaacaaa tggaaataac tcccaggcag

3061

tatcaggtgg tcactacaga

gacttccaca aaaacttttg aatgatgtga aacacgatgt

3121

catgaataag ggttgagcca actatagctc tgtgttccta

ctgggctttc cctaatgtgg

3181

ttgggagtta tgccctagac taactgtatt gtcctagtca cagctccttg

ctttgatttc

3241

atccttgata aaatgaagat gaaacttaca ctacttctcc aagccttttg ctgtcttaag

3301

aataagacct gagattaaca ctaaccctag aatagaaatg taatagggag atggtaataa

3361

aggagttttt ctggcacata ccctccctac agaatttctg ttgctcccca gatccagtga

3421

agaattgcag tttcatttat tttgtaccag tcagctctta attaagtaca tgaatggaga

3481

ggaacagtgg tgcacataat ccaaatcagt gaataccatt ttctggtgaa ttacccaccc

3541

ctttgcccct gctaccccga gggttaccat gattgtcaac agcagcagga gcccttccac

3601
agggcttggt aaaaaaacca gttgaggtgt taatgaccct ttttgctggg tgtaaaacaa

3661
agcatcttta accactgttc attatcccca gctgctctta ccaaggcttt gaagggggaa

3721
attatgctct aggcagccac tagtagtaaa caat

D2 GTF2B-general transcription factor IIB mRNA NM_001514.5

(SEQ ID NO: 162)

1
acgactgcgt gggtgagtcg tctataaaaa ctcatctctg cgcgtctctt cgccacattc

61
gcttcctgct ttcggtgtgt ctgttgtgtc ttgttgcggg caccgcagtc gccgtgaaga

121
tggcgtctac cagccgtttg gatgctcttc caagagtcac atgtccaaac catccagatg

181
cgattttagt ggaggactac agagccggtg atatgatctg tcctgaatgt ggcttggttg

241
taggtgaccg ggttattgat gtgggatctg aatggcgaac tttcagcaat gacaaagcaa

301
caaaagatcc atctcgagtt ggagattctc agaatcctct tctgagtgat ggagatttgt

361
ctaccatgat tggcaagggc acaggagctg caagttttga cgaatttggc aattctaagt

421
accagaatcg gagaacaatg agcagttctg atcgggcaat gatgaatgca ttcaaagaaa

481
tcactaccat ggcagacaga atcaatctac ctcgaaatat agttgatcga acaaataatt

541
tattcaagca agtatatgaa cagaagagcc tgaagggaag agctaatgat gctatagctt

601
ctgcttgtct ctatattgcc tgtagacaag aaggggttcc taggacattt aaagaaatat

661

gtgccgtatc acgaatttct aagaaagaaa ttggtcggtg ttttaaactt attttgaaag

721

cgctagaaac cagtgtggat ttgattacaa ctggggactt catgtccagg ttctgttcca

781

acctttgtct
tcctaaacaa gtacagatgg cagctacaca tatagcccgt aaagctgtgg

841

aattggactt ggttcctggg aggagcccca

tctctgtggc agcggcagct atttacatgg

901

cctcacaggc atcagctgaa aagaggaccc aaaaagaaat

tggagatatt gctggtgttg

961

ctgatgttac aatcagacag tcctatagac tgatctatcc tcgagcccca gatctgtttc

1021

ctacagactt caaatttgac accccagtgg acaaactacc acagctataa attgaggcag

1081

ctaacgtcaa attcttgaat acaaaacttt gcctgttgta catagcctat acaaaatgct

1141

gggttgagcc tttcatgagg aaaaacaaaa gacatggtac gcattccagg gctgaatact

1201
attgcttggc attctgtatg tatatactag tgaaacatat ttaatgattt aaatttctta

1261
tcaaatttct tttgtagcaa tctaggaaac tgtattttgg aagatatttg aaattatgta

1321
attcttgaat aaaacatttt tcaaaactca agtttttgtt atatgttaca tgtaacttat

1381
gatacataat tacaaataat gcaaatcatt gcagctaata aagctgatag actttatttc

1441
cattacttat atatacatag ttttttattt taataaattt atggaaagag caaaagcttt

1501
tgagaaccat tgttaacatc aacatcatag tttccagttt gaaaggatgt gtatgtgaga

1561
tttattatgt atattattaa acaagaagtg atgagcttgg gccttgaaag gcaccagctt

1621
gagagacatt aaaatgttct aagtaaaaaa a

D3 HLA-B-major histocompatibility complex, class I, B mRNA NM_005514.6

(SEQ ID NO: 163)

1
agttctaaag tccccacgca cccacccgga ctcagagtct cctcagacgc cgagatgctg

61
gtcatggcgc cccgaaccgt cctcctgctg ctctcggcgg ccctggccct gaccgagacc

121
tgggccggct cccactccat gaggtatttc tacacctccg tgtcccggcc cggccgcggg

181
gagccccgct tcatctcagt gggctacgtg gacgacaccc agttcgtgag gttcgacagc

241
gacgccgcga gtccgagaga ggagccgcgg gcgccgtgga tagagcagga ggggccggag

301
tattgggacc ggaacacaca gatctacaag gcccaggcac agactgaccg agagagcctg

361
cggaacctgc gcggctacta caaccagagc gaggccgggt ctcacaccct ccagagcatg

421
tacggctgcg acgtggggcc ggacgggcgc ctcctccgcg ggcatgacca gtacgcctac

481
gacggcaagg attacatcgc cctgaacgag gacctgcgct cctggaccgc cgcggacacg

541
gcggctcaga tcacccagcg caagtgggag gcggcccgtg aggcggagca goggagagcc

601
tacctggagg gcgagtgcgt ggagtggctc cgcagatacc tggagaacgg gaaggacaag

661
ctggagcgcg ctgacccccc aaagacacac gtgacccacc accccatctc tgaccatgag

721
gccaccctga ggtgctgggc cctgggtttc taccctgcgg agatcacact gacctggcag

781
cgggatggcg aggaccaaac tcaggacact gagcttgtgg agaccagacc agcaggagat

841
agaaccttcc agaagtgggc agctgtggtg gtgccttctg gagaagagca gagatacaca

901
tgccatgtac agcatgaggg gctgccgaag cccctcaccc tgagatggga gccgtcttcc

961

cagtccaccg tccccatcgt gggcattgtt gctggcctgg ctgtcctagc agttgtggtc

1021

atcggagctg tggtcgctgc tgtgatgtgt aggaggaaga gttcaggtgg aaaaggaggg

1081

agctactctc aggctgcgtg cagcgacagt gcccagggct ctgatgtgtc tctcacagct

1141

tgaaaagcct

gagacagctg tcttgtgagg gactgagatg caggatttct tcacgcctcc

1201

cctttgtgac ttcaagagcc tctggcatct ctttctgcaa aggcacctga atgtgtctgc

1261

gtccctgtta

gcataatgtg aggaggtgga gagacagccc acccttgtgt ccactgtgac

1321

ccctgttccc atgctgacct

gtgtttcctc cccagtcatc tttcttgttc cagagaggtg

1381

gggctggatg tctccatctc tgtctcaact ttacgtgcac tgagctgcaa cttcttactt

1441
ccctactgaa aataagaatc tgaatataaa tttgttttct caaatatttg ctatgagagg

1501
ttgatggatt aattaaataa gtcaattcct ggaatttgag agagcaaata aagacctgag

1561
aaccttccag aaaaaaaa

D4 HLA-F-major histocompatibility complex, class I, F mRNA

NM_ 001098479.1

(SEQ ID NO: 164)

1
tttctcactc ccattgggcg tcgcgtttct agagaagcca atcagtgtcg ccgcagttcc

61
caggttctaa agtcccacgc accccgcggg actcatattt ttcccagacg cggaggttgg

121
ggtcatggcg ccccgaagcc tcctcctgct gctctcaggg gccctggccc tgaccgatac

181
ttgggcgggc tcccactcct tgaggtattt cagcaccgct gtgtcgcggc ccggccgcgg

241
ggagccccgc tacatcgccg tggagtacgt agacgacacg caattcctgc ggttcgacag

301
cgacgccgcg attccgagga tggagccgcg ggagccgtgg gtggagcaag aggggccgca

361
gtattgggag tggaccacag ggtacgccaa ggccaacgca cagactgacc gagtggccct

421
gaggaacctg ctccgccgct acaaccagag cgaggctggg tctcacaccc tccagggaat

481

gaatggctgc gacatggggc ccgacggacg cctcctccgc gggtatcacc agcacgcgta

541

cgacggcaag gattacatct ccctgaacga ggacctgcgc tcctggaccg cggcggacac

601

cgtggctcag

atcacccagc gcttctatga ggcagaggaa tatgcagagg agttcaggac

661

ctacctggag ggcgagtgcc

tggagttgct ccgcagatac ttggagaatg ggaaggagac

721

gctacagcgc gcagatcctc caaaggcaca cgttgcccac caccccatct ctgaccatga

781

ggccaccctg aggtgctggg ccctgggctt ctaccctgcg

gagatcacgc tgacctggca

841

gcgggatggg gaggaacaga cccaggacac agagcttgtg gagaccaggc
ctgcagggga

901

tggaaccttc cagaagtggg ccgctgtggt ggtgcctcct ggagaggaac agagatacac

961

atgccatgtg cagcacgagg ggctgcccca gcccctcatc ctgagatggg agcagtctcc

1021

ccagcccacc atccccatcg tgggcatcgt tgctggcctt gttgtccttg gagctgtggt

1081

cactggagct gtggtcgctg ctgtgatgtg gaggaagaag agctcagata gaaacagagg

1141

gagctactct caggctgcag cctactcagt ggtcagcgga aacttgatga taacatggtg

1201

gtcaagctta tttctcctgg gggtgctctt ccaaggatat ttgggctgcc tccggagtca

1261

cagtgtcttg ggccgccgga aggtgggtga catgtggatc ttgttttttt tgtggctgtg

1321

gacatctttc aacactgcct tcttggcctt gcaaagcctt cgctttggct tcggctttag

1381

gaggggcagg agcttccttc ttcgttcttg gcaccatctt atgaaaaggg tccagattaa

1441

gatttttgac tgagtcattc taaagtaagt tgcaagaccc atgatactag accactaaat

1501

acttcatcac acacctccta agaataagaa ccaacattat cacaccaaag aaaataaata

1561
attccataat attaaaaaaa aaaaaaaaaa a

D5 MGST2-microsomal glutathione S-transferase 2 mRNA NM_002413.4

(SEQ ID NO: 165)

1
gctggccgtg ggagaggctt aaaacaaacg ccggaagcaa ctcccagccc cataaagatc

61
tgtgaccggc agccccagac ctgcctgcct tcctgacttc tgttccagag caaaggtcat

121
tcagccgctt gaatcagcct tttcccccca cccggtcccc aactttgttt acccgataag

181

gaaggtcagc attcaaagtc aagaagcgcc atttatcttc ccgtgcgctc tacaaatagt

241

tccgtgagaa agatggccgg gaactcgatc ctgctggctg ctgtctctat tctctcggcc

301

tgtcagcaaa gttattttgc tttgcaagtt ggaaaggcaa gattaaaata caaagttacg

361

cccccagcag

tcactgggtc accagagttt gagagagtat ttcgggcaca acaaaactgt

421

gtggagtttt atcctatatt

cataattaca ttgtggatgg ctgggtggta tttcaaccaa

481

gtttttgcta cttgtctggg tctggtgtac atatatggcc gtcacctata cttctgggga

541

tattcagaag ctgctaaaaa

acggatcacc ggtttccgac tgagtctggg gattttggcc

601

ttgttgaccc tcctaggtgc cctgggaatt

gcaaacagct ttctggatga atatctggac

661

ctcaatattg ccaagaaact gaggcggcaa ttctaacttt ttctcttccc tttaatgctt

721

gcagaagctg ttcccaccat gaaggtaata tggtatcatt tgttaaataa aaataaagtc

781
tttattctgt ttttcttgaa aaaaaaaaaa aaaaaaa

D6 SPAST-spastin mRNA NM_014946.3

(SEQ ID NO: 166)

1
ggcccgagcc accgactgca ggaggagaag gggttgtgct cctggccgag gaaggagaaa

61
ggggcggggc cggcgggcag cgtgcggcag tgcggagctc ctgagaccgg cgggcacacg

121
ggggtctgtg gcccccgccg tagcagtggc tgccgccgtc gcttggttcc cgtcggtctg

181
cgggaggcgg gttatggcgg cggcggcagt gagagctgtg aatgaattct ccgggtggac

241
gagggaagaa gaaaggctcc ggcggcgcca gcaacccggt gcctcccagg cctccgcccc

301
cttgcctggc ccccgcccct cccgccgccg ggccggcccc tccgcccgag tcgccgcata

361
agcggaacct gtactatttc tcctacccgc tgtttgtagg cttcgcgctg ctgcgtttgg

421
tcgccttcca cctggggctc ctcttcgtgt ggctctgcca gcgcttctcc cgcgccctca

481
tggcagccaa gaggagctcc ggggccgcgc cagcacctgc ctcggcctcg gccccggcgc

541
cggtgccggg cggcgaggcc gagcgcgtcc gagtcttcca caaacaggcc ttcgagtaca

601
tctccattgc cctgcgcatc gatgaggatg agaaagcagg acagaaggag caagctgtgg

661
aatggtataa gaaaggtatt gaagaactgg aaaaaggaat agctgttata gttacaggac

721

aaggtgaaca gtgtgaaaga gctagacgcc ttcaagctaa aatgatgact aatttggtta

781

tggccaagga ccgcttacaa cttctagaga agatgcaacc agttttgcca ttttccaagt

841

cacaaacgga cgtctataat gacagtacta acttggcatg ccgcaatgga catctccagt

901

cagaaagtgg

agctgttcca aaaagaaaag accccttaac acacactagt aattcactgc

961

ctcgttcaaa aacagttatg aaaactggat ctgcaggcct ttcaggccac catagagcac

1021

ctagttacag tggtttatcc atggtttctg gagtgaaaca gggatctggt cctgctccta

1081

ccactcataa gggtactccg aaaacaaata ggacaaataa accttctacc cctacaactg

1141

ctactcgtaa gaaaaaagac ttgaagaatt

ttaggaatgt ggacagcaac cttgctaacc

1201

ttataatgaa tgaaattgtg gacaatggaa cagctgttaa

atttgatgat atagctggtc

1261

aagacttggc aaaacaagca ttgcaagaaa ttgttattct tccttctctg aggcctgagt

1321

tgttcacagg gcttagagct cctgccagag ggctgttact ctttggtcca cctgggaatg

1381

ggaagacaat gctggctaaa gcagtagctg cagaatcgaa tgcaaccttc tttaatataa

1441

gtgctgcaag tttaacttca aaatacgtgg gagaaggaga gaaattggtg agggctcttt

1501

ttgctgtggc tcgagaactt caaccttcta taatttttat agatgaagtt gatagccttt

1561

tgtgtgaaag aagagaaggg gagcacgatg ctagtagacg cctaaaaact gaatttctaa

1621

tagaatttga tggtgtacag tctgctggag atgacagagt acttgtaatg ggtgcaacta

1681

ataggccaca agagcttgat gaggctgttc tcaggcgttt catcaaacgg gtatatgtgt

1741

ctttaccaaa tgaggagaca agactacttt tgcttaaaaa tctgttatgt aaacaaggaa

1801
gtccattgac ccaaaaagaa ctagcacaac ttgctagaat gactgatgga tactcaggaa

1861
gtgacctaac agctttggca aaagatgcag cactgggtcc tatccgagaa ctaaaaccag

1921
aacaggtgaa gaatatgtct gccagtgaga tgagaaatat tcgattatct gacttcactg

1981
aatccttgaa aaaaataaaa cgcagcgtca gccctcaaac tttagaagcg tacatacgtt

2041
ggaacaagga ctttggagat accactgttt aaggaaatac ctttgtaaac ctgcagaaca

2101
ttttacttaa aagaggaaac acaagatctt caatgaacgt catcggctac agaaacagcc

2161
taagtttaca ggacttttta gagtcttaca tatttgtgca ccaaacttga agatgaacca

2221
gaaaacagac ttaaacaaaa tatacaatgc aaatgtaatt ttttgttgtt taaggccttg

2281
ccttgatggt cacagttatc ccaatggaca ctaagttaga gcacaacaaa acctgattct

2341
ggtcttcttt accaatataa tcataatgta aataataatt tgtatattgt gttgcagatg

2401
aaagtattcc aggaacagtg aatggtagaa gacacaagaa catttgtttg tttgtcttct

2461
gatgtttttt cttaaaatag taatttctcc tacttttctt ttctactgtt gtcttaacta

2521
caggtgattg gaatgccaaa cactcttaag tttattttct tttttcgttt tataaattca

2581
gtgtgccaaa tgaaactttt ttcctaagta actgtaatag gaaaaagttt attttgagag

2641
tttcttcttc ataaatctac agacattaaa caattgttgt gttcttttta ccttttattt

2701
ttctattacc ttgctaccaa acagtttaga tagcaatata atagcaaaaa agcaaatatg

2761
gtaaaataga gaaggtttga aggtttgagt tactctgtca tataacatgt agatcagtct

2821
tcatgtgacc tgcagtattt ttttttctaa tgtatttgtc agaaatctgt tgtagactgt

2881
taacttcttc ctgatggaat ttattttctg caagaattat tctgatattt aagagagcca

2941
attttaactg ctgtgaaaat gtttccagtg caagagaagg gaaatactag gaactaagac

3001
atttctaatt tattgcttat tactttctta attttacagg ataattataa gcaagtggaa

3061
ctaccatctt ttattcttaa taattattaa tcccttcaat gaaactttaa aaaaactgaa

3121
tttttataca tggcatacat ttttctagtt ccttctgctt gctttattaa ctcaaaagtt

3181
ctagttctag tctgttgatc tgccttttgt tctcccaaaa tgtacagtaa ttccatttgt

3241
ttgtataaat atgcctggat tttcattata aaaatgtcat tgtagggagt agagactcat

3301
atcatggcct tttaaatatt gtaataaagg caaatagata tttgccctta gtttactggt

3361
taaaagtttg tttacagaac ttttctctgg tgcttaaatg atgctatgta aaatgtcatg

3421
agtggaaaga atatttgtag tagtaacaag aatttttcat ttaggaaaga tttcttaggt

3481
tttgaaagaa tacattaaaa taaaaaactt gcccctacta ggtaagaact ttataatgaa

3541
gacatacatt cttcttaatt ttactcttgc tcttgttaaa gatttgtttg aatatagaag

3601
atgcatgatt tctgggtttt tttttttttt tgagacagag tttcgctctt gttgcccagg

3661
ctggagtgca atggcgcaat ctcgactcac cacaacctcc gcctcccagg ttcaagcaat

3721
tctcctgcct cagcctcccg agtagctggg attacaggca tgcgccacta ccccagctaa

3781
ttttgtattt ttagtagaga tggggtttct ccatgttggt caggctggtc ttgaactcct

3841
gacctcaggt gatccgcctg cctcggcctc ccaaagtgct gggattacag gcataagcca

3901
ctgcgcccag ccagaagatg catgatttct taggatcata tgctgtttgt agccataagg

3961
taaatcatgt ctcttccaat catgactttg gaactccctg aataataaaa atgagagttg

4021
agataaatag gggaaaaaaa atttttttca agccagagct atgcatatgt taggtgatgg

4081
gtagtatccc tttaaggtct caaacattac aacatcaatt atgaaatact gataacgaaa

4141
ggtagtaatg aaatatatat gatgaaaaga attgagaagt tctaaattaa gacatttcag

4201
ttaagctcat aaaatttcat tgttttcatt taaaagatta acgttattga tacttggata

4261
actggctaat catattaaag gactatgtgg ttccagctca acttttaata tattgtctcc

4321
tttaaaacta tcatggttat aattctattg ggaaagactt ttagataaca aagatttcaa

4381
atgttaaaag agataaaagt caggttaata ctatcttaaa cactgagtca gaaaatcatt

4441
actgtataga agttgctttc ctgatcaagt ctgaacttca gctagtgcta gagaactatt

4501
ttctatgact taactctaac caagttttat tttaagctgt ttctttgata gaagggccat

4561
gaaaatagag taatgatata gtaggagata agggattggt ttggtctttt tcaataaaga

4621
tagaagttgc tgaagttttc tgaattaata atgacttaga ttgtgacctt ttagattcgg

4681
tgttgagctc tgtgttgtat tacttcctaa aagataatgc ttaaacatta agcattagtg

4741
tgctcttcat gttaatatgg cagagttttg taaactaaat taaaacttac tgatatattg

4801
gactttgagc caagggaaag aatgagtact atctttccag atatcttaag ggtaaaagct

4861
tattctaaga cagtctgtcc attgagaata ttagatttct gacttgcaaa tatgtttgta

4921
ctccagaaga attagaggaa aagcagatac tagaattcta atttaattac atatacagcc

4981
gtctttgttt atagtgtaga attctttata ttttgtacaa aaactaattc ttttggtaaa

5041
atgaaccatt tacagttcgg ttttggactc tgagtcaaag gattttcctt taaatgcttg

5101
tctcaatttt agtctggtct tttgtacttt tcttcagaag aaatgaatta aagggtacag

5161
ttgcataaag tgggttttta tcctaatgta ttggaaataa atgataaact ttaaaaaaaa

5221
a

D7 WAC-WW domain containing adaptor with coiled-coil mRNA NM_016628.4

(SEQ ID NO: 167)

1
cgcccgccgc cgccgccgcc tgcgcgcccg cccgcctttc gcggccgctc tcccccctcc

61
ccgacacaca ctcacaggcc gggcattgat ggtaatgtat gcgaggaaac agcagagact

121
cagtgatggc tgtcacgacc ggagggggga ctcgcagcct taccaggcac ttaagtattc

181
atcgaagagt caccccagta gcggtgatca cagacatgaa aagatgcgag acgccggaga

241
tccttcacca ccaaataaaa tgttgcggag atctgatagt cctgaaaaca aatacagtga

301
cagcacaggt cacagtaagg ccaaaaatgt gcatactcac agagttagag agagggatgg

361
tgggaccagt tactctccac aagaaaattc acacaaccac agtgctcttc atagttcaaa

421
ttcacattct tctaatccaa gcaataaccc aagcaaaact tcagatgcac cttatgattc

481
tgcagatgac tggtctgagc atattagctc ttctgggaaa aagtactact acaattgtcg

541
aacagaagtt tcacaatggg aaaaaccaaa agagtggctt gaaagagaac agagacaaaa

601
agaagcaaac aagatggcag tcaacagctt cccaaaagat agggattaca gaagagaggt

661
gatgcaagca acagccacta gtgggtttgc cagtggaatg gaagacaagc attccagtga

721
tgccagtagt ttgctcccac agaatatttt gtctcaaaca agcagacaca atgacagaga

781
ctacagactg ccaagagcag agactcacag tagttctacg ccagtacagc accccatcaa

841
accagtggtt catccaactg ctaccccaag cactgttcct tctagtccat ttacgctaca

901
gtctgatcac cagccaaaga aatcatttga tgctaatgga gcatctactt tatcaaaact

961
gcctacaccc acatcttctg tccctgcaca gaaaacagaa agaaaagaat ctacatcagg

1021
agacaaaccc gtatcacatt cttgcacaac tccttccacg tcttctgcct ctggactgaa

1081
ccccacatct gcacctccaa catctgcttc agcggtccct gtttctcctg ttccacagtc

1141
gccaatacct cccttacttc aggacccaaa tcttcttaga caattgcttc ctgctttgca

1201
agccacgctg cagcttaata attctaatgt ggacatatct aaaataaatg aagttcttac

1261
agcagctgtg acacaagcct cactgcagtc tataattcat aagtttctta ctgctggacc

1321
atctgctttc aacataacgt ctctgatttc tcaagctgct cagctctcta cacaagccca

1381
gccatctaat cagtctccga tgtctttaac atctgatgcg tcatccccaa gatcatatgt

1441
ttctccaaga ataagcacac ctcaaactaa cacagtccct atcaaacctt tgatcagtac

1501
tcctcctgtt tcatcacagc caaaggttag tactccagta gttaagcaag gaccagtgtc

1561
acagtcagcc acacagcagc ctgtaactgc tgacaagcag caaggtcatg aacctgtctc

1621
tcctcgaagt cttcagcgct caagtagcca gagaagtcca tcacctggtc ccaatcatac

1681
ttctaatagt agtaatgcat caaatgcaac agttgtacca cagaattctt ctgcccgatc

1741
cacgtgttca ttaacgcctg cactagcagc acacttcagt gaaaatctca taaaacacgt

1801
tcaaggatgg cctgcagatc atgcagagaa gcaggcatca agattacgcg aagaagcgca

1861
taacatggga actattcaca tgtccgaaat ttgtactgaa ttaaaaaatt taagatcttt

1921
agtccgagta tgtgaaattc aagcaacttt gcgagagcaa aggatactat ttttgagaca

1981
acaaattaag gaacttgaaa agctaaaaaa tcagaattcc ttcatggtgt gaagatgtga

2041

ataattgcac atggttttga gaacaggaac tgtaaatctg ttgcccaatc ttaacatttt

2101

tgagctgcat ttaagtagac tttggaccgt taagctgggc aaaggaaatg acaaggggac

2161

ggggtctgtg agagtcaatt caggggaaag atacaagatt gatttgtaaa acccttgaaa

2221

tgtagatttc ttgtagatgt atccttcacg ttgtaaatat

gttttgtaga gtgaagccat

2281

gggaagccat gtgtaacaga gcttagacat ccaaaactaa tcaatgctga

ggtggctaaa

2341

tacctagcct tttacatgta aacctgtctg caaaattagc ttttttaaaa aaaaaaaaaa

2401

aaaaattggg ggggttaatt tatcattcag aaatcttgca ttttcaaaaa ttcagtgcaa

2461

gcgccaggcg atttgtgtct aaggatacga ttttgaacca tatgggcagt gtacaaaata

2521

tgaaacaact gtttccacac ttgcacctga tcaagagcag tgcttctcca tttgttttgc

2581

agagaaatgt ttttcatttc ccgtgtgttt ccatttcctt ctgaaattct gattttatcc

2641

atttttttaa ggctcctctt tatctccttt cttaaggcac tgttgctatg gcacttttct

2701

ataacctttt cattcctgtg tacagtagct taaaattgca gtgattgagc ataacctact

2761

tgtttgtata aattattgaa atccatttgc accctgttaa gaatggactt aaaagtacta

2821

ctggacaggc atgtgtgctc aaagtacatt gattgctcaa atataaggaa atggcccaat

2881

gaacgtggtt gtgggagggg aaagaggaaa cagagctagt cagatgtgaa ttgtatctgt

2941

tgtaataaac atgttaaaac aaacaaaaat tgttattttt cttttccttc ggtcagtgca

3001

cattagcatt tgaactacct ggggattctt tatcagaact gttcttgttg aatatttata

3061

cttaattgaa ataattcctt aagggaggtt ttgtttaaaa cgtattaaca ggaaattgtg

3121

tatgagatat ttaatgaaat aagaaattca acaagaatga ttaagtcact tcccaagtgg

3181

ttgtcatttg ttaaaccctg gtttacctgt cttgctatta tgacatttca tttggaagga

3241

tgtttgtgtt gtagctaact gttcaagtct

ggtgctgact gctgttctta gccatcacaa

3301

aacgctaaat ttgtgtaatt ggagcttcct gctgttatct

ggaaatagca ggaaagcgca

3361

gctttgtata ttgtttccta aagtatatta aaataaaaaa agaaactatt gctactataa

3421

aattaccttg actttttttt tcctttgctg aaatattagt cacatagcct tagcttcaca

3481

ctgccagtaa tgtatcaaat cacaagggtt tccgcatgaa aaaaatcttt tcttccccca

3541

caaaaaaacc tttaccatca aaatcttgcc atctgattta gaaaggtgtt tcttcttctt

3601
cttctttttt ttctttaaat tggtttaggg ttttttggtg attttttttt tttttttttt

3661
tctgttgggg cagataagtg cttccaaaac tggcagcacc aagggcttat tttttatgtt

3721
agacatcaat gtcaatgtta ctacattctc ggatgctaac ataaattttg aaattgctct

3781
tgtgctttaa gcatatattg aaagtatgga agttaaatgt tcaggctttt cagtaagctc

3841
aaaaagttaa ctgtaagcga tagtgttggt gttttctaaa atacaaaaat gttccagtgt

3901
aattaaaagg aattaaaatc ttgaagatat tttcctgtaa tttaaggata ctttttaaat

3961
gtaagaaaag acatgtcatt aatttattgt catgtttata cctctgtgag attgttaaca

4021
tctgctgaat ttaactagtg catgtaaatg aaaccccaaa gagctgtgtg ttcagctaga

4081
aaccttactg tatctttcct ggaaagaagt gagcaatttg ttgtaatagg caaatgtttc

4141
ctgatcagat ggcaatttgt gatttaggta aatttgaatt tgatttgctt atagtctact

4201
ggtctgtgta cctatgtttt gtttttcaaa aaagtttaca tccctaaatg aattagtcac

4261
atatatttag gagaagatgc ctaatttggt atttcttaat agtgaatttt tttttttctt

4321
gagacagagt ttcactcttg ttgcccaggc tggagtgcaa tggcacgatc tcggctcacc

4381
gcaacctctg cctcctgggt tcaagcgatt ctcctgcctc ggcgttccga gtagctggga

4441
ttacaggcat gcaccaccac gcctggctaa ttttgtattt ttagtagaga tggggtttct

4501
ccatgttggt caggctggtc ttgaactgcc aacttcaggt gatctgcctg ccttggcctc

4561
ccaaagtgct gggattacag gcgtgagcca ccgctcctgg ccagtagtga atttttaaac

4621
acagaaaatc taaaattttg tggaaatatt ttaaatattg caccttaata caaggtatcc

4681
agctcctaac cttaactagg gaatatctat taaaataagc ataatgttct ggactagagt

4741
attccttatc tagttggtta tggatttgaa catgtacctt ggtttagata ctttgaaaat

4801
agaagtactg aatagcctct agggaacttg agtggccttt ccctccccct gccccccccc

4861
cccccccccc gttttaaaag atcagtagtc tctattcaaa cttttaaaat gtcgtggtat

4921
tgtaacaata tatttgatga aagaaggtta cagactcccc tgaagaacca gctttcctac

4981
gctttttatt tttctaactt gtctaacctg attttaaaat gactgcaatt ccagactaaa

5041
aacatgcttc agccctgttt caagacatta tgcttctttt aacagtccaa attagtagtt

5101
ttatttttct tctaaatctt tgtttcacac ttgtaaaatc ttgggaagga ggttcttaaa

5161
actttgccag gaattgttac ccatttccaa aaacagttta ttatgttcaa aaaccaccat

5221
atctttgagg gactgtttga aaggggagag ggcaacgcgg gaaataattc actctgcgca

5281
ccggaactat tgtagttcag gacttccagc tactgtattt agatgttggg tttgaatata

5341
cagatttctt ttcaatacct gtaaatatgg ctatattctt gtatttgtac gggagtgtac

5401
aaaatgacac tgaaaagtaa taaatatgtt ttgactatat tgtgcagtta tttcagaact

5461
gtgttttgaa agtcttagaa tgcataattt gcatttgagt aaggaaattt aaaatacaga

5521
ttactgctga gatttta

BIOMARKERS AND COMBINATIONS THEREOF FOR DIAGNOSING TUBERCULOSIS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

PCT Information