This invention relates to the detection and diagnosis of tuberculosis. More specifically, the invention relates to new biomarkers and combinations thereof that enable the accurate detection and diagnosis of tuberculosis.
Tuberculosis (TB) is a progressive, often fatal, infectious disease, caused by the bacterial pathogen Mycobacterium tuberculosis (M. tuberculosis, MTB). This is a significant cause of mortality worldwide, being the eighth largest leading cause of death globally, and is primarily a disease of poverty, particularly in developing countries. Latent TB infection is believed to affect as much as one third of the world's population.
Tuberculosis is a notifiable disease and is a major concern for many governmental and other health bodies including the World Health Organisation (WHO), who have initiated numerous control and treatment programmes like the “Stop TB Partnership”.
The WHO estimates that nearly nine million new cases of TB, and nearly two million deaths, occur globally each year. The largest number of new TB cases in 2005 occurred in South-East Asia (34% of incident cases globally), and the estimated incidence rate in sub-Saharan Africa is nearly 350 cases per 100,000 population. However, TB infection is not limited to the developing world: the UK has seen a resurgence of tuberculosis since the late 1980s and there are currently over 8000 new cases each year—a rate of 14.0 per 100,000 population. About 40% of these new cases occur in the London region, where the rate of infection is 44.8 per 100,000 population.
M. tuberculosis is capable of forming intracellular infections. These infections may be exclusively intracellular, or may contain both intracellular and extracellular components. Generally, M. tuberculosis bacilli do not circulate freely in the body, for example, in the bloodstream, and as such are often difficult to detect. They are also less amenable to drug treatment regimes. Intracellular survival and multiplication of mycobacteria is suspected to be a main contributory factor for mycobacterial disease progression.
The term “latency” is synonymous with “persistence”, and describes a reversible state of low metabolic activity in which mycobacterial cells can survive for extended periods with limited or no cell division. During latency (i.e. latent infection), the clinical symptoms associated with a mycobacterial infection do not become manifest.
The presence of a large reservoir of asymptomatic individuals latently-infected with mycobacteria is a major problem for the control of M. tuberculosis infections. In addition, conventional methods for the detection of a latent mycobacterial infection by skin testing may be compromised by BCG vaccination and by exposure to environmental mycobacteria.
Timely, accurate and sensitive diagnosis is imperative for disease control. This is a key priority for many health and immigration authorities, particularly at “point of entry” for developed countries where the majority of TB cases are imported. Optimal patient management requires early initiation of drug therapy and isolation of infectious individuals as soon as possible. Left untreated, each person with active TB disease will infect on average between 10 and 15 people every year. TB infection can normally be treated by a 6 month course of antibiotics; however, patient compliance to long-term drug treatment is varied, with patients often stopping therapy when their symptoms cease. Failure to complete the treatment regime can promote the development of multiple drug-resistant mycobacteria.
Despite considerable investment in surveillance, control and treatment programmes, as well as in research and development for new diagnostics and therapeutics, TB control and eradication has proved challenging. The standard methods used for TB diagnosis have not changed significantly in recent years in many routine diagnostic laboratories, and there is substantial evidence that TB diagnosis is subject to significant error, with up to 52% under-diagnosis reported in some studies using comparative indices between TB diagnosis methods as measured against autopsy observations.
Early detection of a disease condition typically allows for a more effective therapeutic treatment with a correspondingly more favourable clinical outcome. In view of the increasing threat and global prevalence of TB, new strategies are required for more effective prevention, treatment, and diagnosis of TB and M. tuberculosis infection. Ideally, diagnosis would be made by a technique that accurately, rapidly, and simultaneously measures a plurality of biomarkers at a single point in time, thereby minimizing disease progression during the time required for diagnosis.
Previous attempts to develop new diagnostic methods for TB have proved problematic. In particular, earlier work attempting to enable the accurate and timely diagnosis of early stage or latent infection TB, where symptoms may not be apparent and where detection of M. tuberculosis by culture or specific polymerase chain reaction (PCR) is not achieved, has faced challenges.
Other groups have investigated host biomarkers in active and latent TB. However, these methods were unable to maintain the required level of specificity for TB across different subgroups, such as different ethnic groups.
The present inventors have conducted a temporal differential gene expression study in peripheral blood leukocytes (PBLs) in an aerosol Macaca fascicularis non-human primate model of TB. Using this method, the inventors have identified host biomarkers associated with early exposure to TB. Microarray hybridisation analyses to human whole genome arrays have revealed many significant gene expression changes, showing substantial temporal changes in PBL gene expression in response to M. tuberculosis challenge across the time-course of the study. Using parametric and non-parametric tools for data analysis, including artificial neural network analysis, the inventors have identified highly-significant host biomarkers associated with TB and M. tuberculosis infections. The biomarkers identified by the present invention have improved specificity for TB across different subgroups, such as different ethnic groups.
Therefore, the present invention allows for accurate, rapid, and sensitive prediction and diagnosis of TB through a measurement of one or more biomarker taken from a biological sample at a single point in time.
Accordingly, the present invention provides the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8 as a biomarker for tuberculosis.
The invention also provides a method for diagnosing tuberculosis in an individual comprising determining the presence and/or amount of one or more biomarker for tuberculosis in a sample obtained from the individual, wherein the one or more biomarker for tuberculosis is selected from SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8.
The tuberculosis detected and/or diagnosed by the method or use of the present invention may be an active tuberculosis infection and the one or more biomarker a biomarker for an active tuberculosis infection.
Typically the one or more biomarker is selected from SNX10, CPVL, PF4V1 and HERC2, or any combination thereof. In a preferred embodiment, the one or more biomarker is selected from: (i) SNX10 and CREG1; and/or (ii) PF4V1 and HERC2.
The tuberculosis detected and/or diagnosed by the method or use of the present invention may be a latent tuberculosis infection and the one or more biomarker a biomarker for a latent tuberculosis infection. Typically the one or more biomarker for a latent tuberculosis infection is selected from PF4V1, LYN, CD52, HERC2, KLRAP1, DEFB128, LGALS3BP and IL8.
A use of the invention may comprise determining the presence and/or amount of the one or more biomarker for tuberculosis in a sample obtained from an individual.
The present invention also provides a use or method as defined herein, wherein said one or more biomarker is able to identify an individual with an active tuberculosis infection and/or an individual with a latent tuberculosis infection.
The present invention also provides a use or method as defined herein, wherein said one or more biomarker is able to identify an individual with an active tuberculosis infection and/or an individual with a latent tuberculosis infection and/or an individual uninfected with tuberculosis.
One or more additional biomarker for tuberculosis may be used in the method or use of the invention. The one or more additional biomarker may be (a) a biomarker for an active tuberculosis infection selected from: (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8; and/or (ii) a biomarker listed in Table 3; and/or (b) a biomarker for a latent tuberculosis infection selected from: (i) a biomarker listed in Table 4; and/or (ii) a biomarker listed in Table 5. In a preferred embodiment, the one or more additional biomarker for an active tuberculosis infection is selected from LOC400759/GBP1P1, CREG1, PSMB9, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9 and CD96. In a more preferred embodiment, the one or more biomarkers for tuberculosis are SNX10 and CPVL and the one or more additional biomarkers for tuberculosis are LOC400759/GBP1P1 and CREG1; and/or the one or more biomarkers for tuberculosis are PF4V1 and HERC2 and the one or more additional biomarkers for tuberculosis are LOC400759/GBP1P1 and ALPK1.
One or more further additional biomarkers may be used in the methods and/or uses of the invention. In one embodiment, the one or more further additional biomarker is PSMB9 and/or PF4V1. Alternatively and/or in addition, the one or more additional biomarker for an active tuberculosis infection, or the one or more further additional biomarker is: (i) GBP1, IRF1 and HLA-B; (ii) GBP1, IRF1, IFITM3 and S100A11; and/or (iii) GBP1, IRF1, MMP9 and CD96.
The presence and/or amount of the one or more biomarker for tuberculosis may be compared with the presence and/or amount of the one or more biomarker for tuberculosis in a control sample. The specificity of the comparison of the presence and/or amount of the one or more biomarker for tuberculosis in the sample and the presence and/or absence of the one or more biomarker for tuberculosis in the control diagnoses tuberculosis may be at least about 80%.
The presence and/or amount of the one or more biomarker for tuberculosis may be determined using an antibody and/or an oligonucleotide specific for said one or more biomarker. Typically, an oligonucleotide specific for said one or more biomarker is used. Preferably: (i) the one or more biomarker for tuberculosis is LOC400759/GBP1P1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 1, 2 or 3; (ii) the one or more biomarker for tuberculosis is PF4V1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 4 or 5; (iii) the one or more biomarker for tuberculosis is ALPK1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 6 or 7; (iv) the one or more biomarker for tuberculosis is HERC2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 8, 9 or 168 to 171; (v) the one or more biomarker for tuberculosis is LGALS3BP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 10 or 11; (vi) the one or more biomarker for tuberculosis is BST1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 12 or 13; (vii) the one or more biomarker for tuberculosis is SNX10 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 14 or 15; (viii) the one or more biomarker for tuberculosis is CREG1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 16 or 17; (ix) the one or more biomarker for tuberculosis is BAZ1A and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 18 or 19; (x) the one or more biomarker for tuberculosis is LYN and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 20 or 21; (xi) the one or more biomarker for tuberculosis is TAPBP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 22 or 23; (xii) the one or more biomarker for tuberculosis is SERPINB1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 24 or 25; (xiii) the one or more biomarker for tuberculosis is PSMB9 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 26 or 27; (xiv) the one or more biomarker for tuberculosis is WSB1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 28 or 29; (xv) the one or more biomarker for tuberculosis is MVP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 30 or 31; (xvi) the one or more biomarker for tuberculosis is APBB1IP and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 32 or 33; (xvii) the one or more biomarker for tuberculosis is FYB and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 34 or 35; (xviii) the one or more biomarker for tuberculosis is MB21D1/C6orf150 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 36 or 37; (xix) the one or more biomarker for tuberculosis is CPVL and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 38 or 39; (xx) the one or more biomarker for tuberculosis is TICAM2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 40 or 41; (xxi) the one or more biomarker for tuberculosis is CD52 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 42 or 43; (xxii) the one or more biomarker for tuberculosis is KLRAP1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 44 or 45; (xxiii) the one or more biomarker for tuberculosis is DEFB128 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 46 or 47; (xxiv) the one or more biomarker for tuberculosis is IL8 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 48 or 49; (xxv) the one or more biomarker for tuberculosis is GBP1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 50 or 51; (xxvi) the one or more biomarker for tuberculosis is IRF1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 52 or 53; (xxvii) the one or more biomarker for tuberculosis is MMP9 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 54 or 55; (xxviii) the one or more biomarker for tuberculosis is CD96 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 56 or 57; (xxix) the one or more biomarker for tuberculosis is AIM2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 58 or 59; (xxx) the one or more biomarker for tuberculosis is CD274 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 60 or 61; (xxxi) the one or more biomarker for tuberculosis is CDH23 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 62 or 63; (xxxii) the one or more biomarker for tuberculosis is IFIT3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 64 or 65; (xxxiii) the one or more biomarker for tuberculosis is IFITM3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 66 or 67; (xxxiv) the one or more biomarker for tuberculosis is GK and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 68 or 69; (xxxv) the one or more biomarker for tuberculosis is NELL2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 70 or 71; (xxxvi) the one or more biomarker for tuberculosis is S100A11 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 72 or 73; (xxxvii) the one or more biomarker for tuberculosis is SAMD9L and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 74 or 75; (xxxviii) the one or more biomarker for tuberculosis is STAT1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 76 or 77; (xxxix) the one or more biomarker for tuberculosis is TLR6 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 78 or 79; (xl) the one or more biomarker for tuberculosis is WARS and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 80 or 81; (xli) the one or more biomarker for tuberculosis is DOCKS and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 82 or 83; (xlii) the one or more biomarker for tuberculosis is SIRPB2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 84 or 85; (xliii) the one or more biomarker for tuberculosis is ANKRD22 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 86 or 87; (xliv) the one or more biomarker for tuberculosis is ABCF2 (NM 005692.3 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 88 or 89; (xlv) the one or more biomarker for tuberculosis is FNBP1L and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 90 or 91; (xlvi) the one or more biomarker for tuberculosis is NCF1C and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 92 or 93; (xlvii) the one or more biomarker for tuberculosis is TBC1D3B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 94 or 95; (xlviii) the one or more biomarker for tuberculosis is SLC14A1 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 96 or 97; (xlix) the one or more biomarker for tuberculosis is CALCOCO2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 98 or 99; (l) the one or more biomarker for tuberculosis is GTF2B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 100 or 101; (li) the one or more biomarker for tuberculosis is HLA-B and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 102 or 103; (lii) the one or more biomarker for tuberculosis is HLA-F and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 104 or 105; (liii) the one or more biomarker for tuberculosis is MGST2 and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 106 or 107; (liv) the one or more biomarker for tuberculosis is SPAST and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 108 or 109; and/or (lv) the one or more biomarker for tuberculosis is WAC and the oligonucleotide comprises at least one nucleic acid sequence having at least 90% sequence identity to the nucleic acid sequence of SEQ ID NOs: 110 or 111 or 168 to 171.
The presence and/or absence of the at least one biomarker for tuberculosis in the individual may be determined at least twice using a separate sample taken each time the presence and/or absence of the at least one biomarker for tuberculosis is determined. The samples from the individual may be taken prior to, during and/or after treatment initiation.
The invention further provides a device for carrying out the use of the invention, or for use in a method of the invention, which comprises (i) one or more antibody specific for the one or more biomarker for tuberculosis; or (ii) one or more oligonucleotide specific for the one or more biomarker for tuberculosis. In a preferred embodiment, the one or more oligonucleotide specific for the one or more biomarker for tuberculosis comprised in the device is an oligonucleotide of the invention as defined herein.
The present invention allows for the rapid, sensitive, and accurate diagnosis or prediction of TB using one or more biological samples obtained from an individual at a single time point (“snapshot”) or during the course of disease progression. TB may be diagnosed or predicted prior to the onset of clinical symptoms, and/or as subsequent confirmation after the onset of clinical symptoms. Accordingly, the present invention allows for more effective therapeutic intervention and/or diagnosis in the pre-symptomatic stage of the disease.
Tuberculosis and Mycobacterium tuberculosis
Tuberculosis (TB) is a progressive, often fatal, infectious disease, caused by the bacterial pathogen Mycobacterium tuberculosis (M. tuberculosis, MTB). Pulmonary symptoms of TB include a productive, prolonged cough of three or more weeks, chest pain, and hemoptysis. Systemic symptoms include low grade remittent fever, chills, night sweats, appetite loss, weight loss, easy fatigability, and production of sputum that starts out mucoid but changes to purulent. A reference herein to the detection or diagnosis of TB is equivalent to the detection or diagnosis of M. tuberculosis infection. When the M. tuberculosis cells are metabolically active and/or undergoing cell division, this results in the symptoms of TB becoming overt, and is described as an active TB/M. tuberculosis infection.
In latent TB, an individual is infected with M. tuberculosis, but the individual does not display the symptoms of active TB disease. In latent TB, the mycobacterial cells survive for extended periods in a state of low metabolic activity and with limited or no cell division. Thus, during latency (i.e. latent infection), the clinical symptoms associated with a mycobacterial infection do not become manifest. This can make it difficult to distinguish between a latent TB infection and the absence of a TB infection using conventional methods and techniques. A reference herein to the detection or diagnosis of latent TB is equivalent to the detection or diagnosis of latent M. tuberculosis infection.
The present inventors have also found that there is a temporal aspect to the expression of some biomarkers for TB during the active phase of an infection. Specifically, some biomarkers for active TB are expressed at relatively low levels at an early stage in active TB, but become expressed at higher levels as the active stage of the infection progresses. In this context, the term “low level of expression” is relative. For example, the expression of these active TB biomarkers during the early active phase may be low relative to the expression level later in the active phase, and similar to (or slightly greater than) the expression level of the same biomarkers in an uninfected individual and/or an individual with latent TB. Typically the expression of these active TB biomarkers during the early active phase is low relative to the expression level later in the active phase, but still higher than the expression level of the same biomarkers in an uninfected individual and/or an individual with latent TB.
The present invention provides biomarkers for the detection and/or diagnosis of TB infection. In particular, the present invention provides biomarkers for the detection and/or diagnosis of an active TB infection, including an early stage active TB infection and/or a later stage active TB infection. The present invention also provides biomarkers for the detection and/or diagnosis of a latent TB infection. The present invention further provides biomarkers for distinguishing between active and latent TB infections. The present invention also provides biomarkers for distinguishing between a latent TB infection and an absence/lack of TB infection (active or latent). The present invention also provides biomarkers for distinguishing between early stage active TB and later stage active TB. The present invention also provides biomarkers for distinguishing between an individual who has no symptomatic TB infection (active or latent) and has not been exposed to TB (e.g. because they are from a non/low-TB endemic region) and an individual who has no symptomatic TB infection (active or latent) but has been exposed to TB (e.g. because they are from a high-TB endemic region).
Any appropriate technique may be used to confirm the diagnosis of active and/or latent TB according to the present invention. Standard techniques are known in the art. For example, chest x-ray, microbiological culture of M. tuberculosis in a sample (sputum, pus, cerebrospinal fluid, biopsied tissue, etc.) from the individual, CT scan, MMR, antibodies from lymphocyte secretion (ALS) assay, IFNγ assay and tuberculin skin tests (e.g. Mantoux and Heaf tests).
Biomarkers for Tuberculosis
A “biomarker” is virtually any biological compound, such as a protein and a fragment thereof, a peptide, a polypeptide, a proteoglycan, a glycoprotein, a lipoprotein, a carbohydrate, a lipid, a nucleic acid, an organic on inorganic chemical, a natural polymer, and a small molecule, that is present in the biological sample and that may be isolated from, or measured in, the biological sample. Furthermore, a biomarker can be the entire intact molecule, or it can be a portion thereof that may be partially functional or recognized, for example, by an antibody or other specific binding protein. A biomarker is considered to be informative if a measurable aspect or characteristic of the biomarker is associated with a given state of an individual, such as infection with TB. Such a measurable aspect or characteristic may include, for example, the presence, absence, or concentration of the biomarker in the biological sample from the individual and/or its presence as part of a profile of biomarkers. Such a measurable aspect of a biomarker is defined herein as a “feature.” For example, the presence of a biomarker may be a feature. As another example, the amount of a biomarker in a sample, or the amount of a biomarker in a sample compared with a control or reference sample may be a feature. A feature may also be a ratio of two or more measurable aspects of biomarkers, which biomarkers may or may not be of known identity, for example. A “biomarker profile” comprises at least two such features, where the features can correspond to the same or different classes of biomarkers such as, for example, two nucleic acids or a nucleic acid and a carbohydrate. A biomarker profile may also comprise at least three, four, five, 10, 20, 30 or more features. In one embodiment, a biomarker profile comprises hundreds, or even thousands, of features. In another embodiment, the biomarker profile comprises at least one measurable aspect of at least one internal standard.
The present inventors have conducted a temporal differential gene expression study in peripheral blood leukocytes (PBLs) in an aerosol Macaca fascicularis non-human primate model of TB. Using this method, the inventors have identified host biomarkers associated with early exposure to TB.
The new biomarkers for TB identified by the present inventors are listed in Table 2 herein (together with corresponding sequence identifiers (SEQ ID NOs). In particular, the present inventors have identified LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8 as biomarkers for TB. Therefore, the present invention provides the use of one or more of LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128 and IL8 as a biomarker for tuberculosis. Each of these biomarkers may be used alone, in combination with any of the other biomarkers, and/or in combination with one or more additional biomarker for tuberculosis as disclosed herein. For example, the invention may relate to the use of LOC400759/GBP1P1, SNX10, CPVL and/or CREG1 (alone or in any combination thereof), optionally in combination with PF4V1 and/or PSMB9 and/or in combination with any of the other biomarkers disclosed herein.
Typically the present invention provides the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRAP1, WSB1, BST1, SERPINB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, DEFB128 and IL8 as a biomarker for tuberculosis.
Any combination of these biomarkers may be used according to the present invention. For example, any two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, up to and including all of these biomarkers may be used to diagnose TB according to the present invention.
The one or more biomarker of the invention may be a hormone, a growth factor, a transcription factor, a cell surface marker or a soluble protein derived from cells. The one or more biomarker of the invention may be a nucleic acid encoding for one of said proteins.
The one or more of biomarker of the invention may be used in the detection and/or diagnosis of an active TB infection. The one or more biomarker of the invention may be used in the detection and/or diagnosis of a latent TB infection. The one or more biomarker of the invention may be used to diagnose the absence of a TB infection (active or latent). The one or more biomarker of the invention may be used to identify an individual with an active TB infection and/or an individual with a latent TB infection. The one or more biomarker of the invention may be used to identify an individual with an active TB infection and/or an individual with a latent TB infection and/or an individual uninfected with TB. The one or more biomarker of the invention may be used in the detection and/or diagnosis of an early stage active TB infection or a late/later stage active TB infection. The one or more biomarker of the invention may be used to determine exposure of an individual to TB, even in the absence of a symptomatic active or asymptomatic latent TB infection. Thus, the one or more biomarker of the invention may be used to distinguish between one or more individual with an active (early or later stage) TB infection and/or one or more individual with a latent TB infection, and/or one or more individual uninfected with TB. The one or more biomarker of the invention may also be used to distinguish between one or more individual with an early stage active TB infection and one or more individual with a late/later stage active TB infection.
Typically, the present invention relates to the use of one or more of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN as a biomarker for TB. One or more of these biomarkers may be used in the detection and/or diagnosis of an active TB infection (early or late/later stage), or to distinguish between an early stage active TB infection and a late/later stage active TB infection. Alternatively, one or more of these biomarkers may be used in the detection and/or diagnosis of a latent TB infection, or to diagnose the absence of a TB infection (active or latent). Any combination of SNX10, CPVL, PF4V1, HERC2 CD52 and LYN may be used as biomarkers for TB according to the present invention. As a non-limiting example: (i) SNX10 and CPVL; (ii) SNX10 and PF4V1; (iii) SNX10 and HERC2; (iv) CPVL and PF4V1; (v) CPVL and HERC2; (vi) PF4V1 and HERC2; (vii) SNX10, CPVL and PF4V1; (viii) SNX10, CPVL and HERC2; (ix) SNX10, PF4V1 and HERC2; (x) CPVL, PF4V1 and HERC2; and/or (xi) SNX10, CPVL, PF4V1 and HERC2 may be used in combination as biomarkers in the detection and/or diagnosis of TB according to the present invention. Any of these combinations may be used with CD52 and/or LYN. Similarly, CD52 and/or LYN may be used in combination with one or more of SNX10, CPVL, PF4V1 and HERC2, or with any combination of SNX10, CPVL, PF4V1 and HERC2. Thus, in one embodiment, the invention relates to the use of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN.
Typically the invention relates to the use of (i) SNX10 and CPVL; and/or (ii) PF4V1 and HERC2 as biomarkers for tuberculosis. In a preferred embodiment, SNX10 and CPVL are used in combination with LOC400759/GBP1P1 and/or CREG1 as biomarkers in the diagnosis of TB according to the present invention. In another preferred embodiment, PF4V1 and HERC2 are used in combination with LOC400759/GBP1P1 and/or ALPK1 as biomarkers in the diagnosis of TB according to the present invention. Any of these combinations may be used with CD52 and/or LYN.
One or more additional biomarker for TB (or further additional biomarker for TB) may also be used in the detection and/or diagnosis of TB according to the present invention. Any combination of the one or more additional biomarker (or further additional biomarker) may be used in combination with the one or more biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the one or more biomarker of the invention. As a non-limiting example, in the cases where the one or more biomarker is selected from SNX10 and/or CPVL, the one or more additional biomarker may be selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, ALPK1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, TICAM2, CD52, KLRAP1, DEFB128, HERC2 and IL8. As another non-limiting example, in the case where the one or biomarker is selected from PF4V1 and/or HERC2, the one or more additional biomarker may be selected from LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, LGALS3BP, BST1, BAZ1A, LYN, TAPBP, SERPINB1, WSB1, MVP, APBB1IP, FYB, MB21D1/C6orf150, CPVL, TICAM2, CD52, KLRAP1, DEFB128 and IL8. Again, any of these combinations may be used with CD52 and/or LYN.
Typically, the one or more additional biomarker is selected from the biomarkers listed in Tables 2, 3, 4 and/or 5 herein (corresponding sequence identifiers (SEQ ID NOs) are also given in Tables 2 to 5).
In a preferred embodiment, the one or more biomarker of the invention is selected from SNX10 and CPVL and the one or more additional biomarker is selected from the biomarkers in Tables 2 and 3 or 5. In a more preferred embodiment, the one or more biomarker of the invention is selected from SNX10 and CPVL and the one or more additional biomarker is selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, GBP1, IRF1, HLA-B, IFITM3 and S100A11. In a more preferred embodiment, the present invention provides the use SNX10 and CPVL in combination with PF4V1 and/or PSMB9, and optionally in combination with one or more additional biomarker for TB as disclosed herein. Said one or more additional biomarker is preferably selected from LOC400759/GBP1P1, CREG1, GBP1, IRF1, HLA-B, IFITM3 and S100A11. Any of these combinations may be used with CD52 and/or LYN.
In a particularly preferred embodiment, the present invention relates to the use of SNX10, CPVL, LOC400759/GBP1P1 and CREG1, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, GBP1, IRF1 and HLA-B or the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11 as biomarkers for TB. Most preferably the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PSMB9, GBP1, IRF1 and HLA-B or the combination of SNX10, CPVL, LOC400759/GBP1P1, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11 is used. Any of these combinations may be used with CD52 and/or LYN.
In another preferred embodiment, the one or more biomarker of the invention is selected from PF4V1 and HERC2 and the one or more additional biomarker is selected from the biomarkers in Tables 2 and 3 or 5. In a preferred embodiment, the one or more biomarker of the invention is selected from PF4V1 and HERC2 and the one or more additional biomarker is selected from LOC400759/GBP1P1, CREG1, PF4V1, PSMB9, GBP1, IRF1, HLA-B, IFITM3 and S100A11, MMP9 and CD96. In a more preferred embodiment, the invention relates to the use of PF4V1 and HERC2 in combination with one or more additional biomarker for TB as disclosed herein. Said one or more additional biomarker is preferably selected from LOC400759/GBP1P1, CREG1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, KLRA1, DEFB128 and IL8 and CD96. Thus, in one preferred embodiment, the present invention provides the use of PF4V1 and HERC2 in combination with one or more additional biomarker selected from LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9 and CD96; or in combination with one or more additional biomarker selected from of LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8. In a more preferred embodiment, the present invention provides the use of the combination of PF4V1, HERC2, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9 and CD96, or the combination of PF4V1, HERC2, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 as biomarkers for TB. Any of these combinations may be used with CD52 and/or LYN.
Combinations of one or more of LOC400759/GBP1P1, SNX10, CPVL and CREG1 are particularly preferred. Such combinations include: (i) LOC400759/GBP1P1 and SNX10; (ii) LOC400759/GBP1P1 and CPVL; (iii) LOC400759/GBP1P1 and CREG1; (iv) SNX10 and CPVL; (v) SNX10 and CREG1; (vi) CPVL and CREG1; (vii) LOC400759/GBP1P1, SNX10 and CPVL; (viii) LOC400759/GBP1P1, SNX10 and CREG1; (ix) LOC400759/GBP1P, CPVL and CREG1; (x) SNX10, CPVL and CREG1; and/or (xi) LOC400759/GBP1P1, SNX10, CPVL and CREG1. These combinations may be used in combination with one or more further additional biomarker as disclosed herein, with one or more of GBP1, IRF1, HLA-B, IFITM3 and/or S100A11 being particularly preferred as disclosed herein. Any of these combinations may be used with CD52 and/or LYN.
Alternatively or in addition, combinations of one or more of LOC400759/GBP1P1, PF4V1, ALPK1 and HERC2 are preferred. Such combinations include: (i) LOC400759/GBP1P1 and PF4V1; (ii) LOC400759/GBP1P1 and ALPK1; (iii) LOC400759/GBP1P1 and HERC2; (iv) PF4V1 and ALPK1; (v) PF4V1 and HERC2; (vi) ALPK1 and HERC2; (vii) LOC400759/GBP1P1, PF4V1 and ALPK1; (viii) LOC400759/GBP1P1, PF4V1 and HERC2; (ix) LOC400759/GBP1P1, ALPK1 and HERC2; (x) PF4V1, ALPK1 and HERC2; and (xi) LOC400759/GBP1P1, PF4V1, ALPK1 and HERC2. These combinations may be used in combination with one or more further additional biomarker as disclosed herein, with one or more of GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 being particularly preferred as disclosed herein. Any of these combinations may be used with CD52 and/or LYN.
The combination of SNX10, CPVL, PF4V1, HERC2, CD52 and LYN, optionally including one or more additional biomarker for TB, preferably selected from CREG1, PSMB9, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, CD96, KLRA1, DEFB128 and/or IL8, or any combination thereof, is also preferred. Similarly, the combination of SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1, optionally including one or more additional biomarker for TB, preferably selected from CREG1, PSMB9, LOC400759/GBP1P1, ALPK1, GBP1, IRF1, HLA-B, IFITM3, S100A11, MMP9, CD96, KLRA1, DEFB128 and/or IL8, or any combination thereof, is also preferred.
The present inventors have also identified biomarkers for latent TB, and which can be used to distinguish between latent and active forms of TB, i.e. between latent and active forms of M. tuberculosis infection. These biomarkers for latent TB can also be used according to the present invention to distinguish between latent TB infection and the absence of TB infection. In particular, the present inventors have identified PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 as biomarkers for latent TB. These biomarkers may be used to distinguish between active TB and/or latent TB and/or the absence of TB. In a preferred embodiment, these biomarkers are used to distinguish between latent TB and the absence of TB infection, i.e. to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB.
Accordingly, the present invention provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between latent and active M. tuberculosis infection, and hence latent and active TB. The present invention also provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between active TB and/or latent TB and/or the absence of TB. In a preferred embodiment, the present invention provides the use of one or more of the biomarkers selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 for distinguishing between one or more individual with a latent TB infection, and one or more individual uninfected with TB.
Any combination of these biomarkers may be used according to the present invention. For example, any two, three or four, or all five of these biomarkers may be used to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. For example, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. In a preferred embodiment, the combination of the biomarkers PF4V1, LYN, CD52, HERC2 is used to distinguish between latent TB and the absence of TB, and/or to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB. In another preferred embodiment, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between latent TB and the absence of TB. Thus, the combination of the biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128 and IL8 may be used to identify an individual with a latent TB infection and/or an individual uninfected with TB. In a preferred embodiment, the combination of biomarkers PF4V1, LYN, CD52, HERC2, the combination of biomarkers, HERC2, KLRAP1, PF4V1, DEFB128, IL8 or the combination of biomarkers PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8 is used to distinguish between one or more individual with a latent TB infection, and one or more individual uninfected with TB.
One or more additional biomarker for latent TB may also be used in combination with the one or more biomarker selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128, LGALS3BP and IL8. In a preferred embodiment, the one or more additional biomarker is selected from the biomarkers listed in Tables 4 and 5.
One or more additional biomarker for TB may also be used in to distinguish between latent TB and/or active TB and/or the absence of TB according to the present invention. Any combination of the one or more additional biomarker may be used in combination with the one or more biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the one or more biomarker of the invention. The one or more additional biomarker for use in distinguishing between latent TB and/or active TB and/or the absence of TB can be any biomarker disclosed herein.
Other biomarkers for distinguishing between latent TB and/or active TB and/or the absence of TB, particularly for distinguishing between latent TB and the absence of TB (i.e. to identify one or more individual with a latent TB infection and/or one or more individual uninfected with TB) include HLA-B, NCF1C, ABCF2, FNBP1L, TBC1D3B, SLC14A1, CALCOCO2, GTF2B, HLA-F, MGST2, SPAST and WAC. These biomarkers are listed in Tables 4 and 5 herein.
The present inventors have also identified biomarkers which can be used to distinguish between early stage active TB and late/later stage active TB, i.e. between early stage active and late/later stage active forms of M. tuberculosis infection. In particular, the present inventors have identified GBP1 as such a biomarker. The GBP1 biomarker may be used to distinguish between early stage active TB and late/later stage active TB. As used herein, the term “early stage active TB” refers to patients on first presentation with low to moderate symptoms, such as persistent cough and/or fever, and/or suspected pulmonary tuberculosis which is subsequently confirmed using conventional methods such as smear positivity (graded 1-4 in terms of severity of bacterial load), M. tuberculosis culture or M. tuberculosis PCR positivity (such as using the Cepheid GeneXpert™), As used herein, the term “later or later stage active TB” refers to patients with fully symptomatic active pulmonary tuberculosis, such as persistent cough of some duration, prolonged fever, weight loss, subsequently confirmed using conventional methods as above.
Accordingly, the present invention provides the use of the GBP1 biomarker for distinguishing between early stage active TB and late/later stage active TB. The present invention also provides the use of the GBP1 biomarker for distinguishing between active (early or late active stage) TB and/or latent TB and/or the absence of TB.
One or more additional biomarker for TB may also be used to distinguish between early stage active TB and late/later stage active TB according to the present invention. Any combination of the one or more additional biomarkers may be used in combination with the GBP1 biomarker of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with the GBP1 biomarker of the invention. The one or more additional biomarker for use in distinguishing between early stage active TB and late/later stage active TB can be any biomarker disclosed herein.
The present inventors have also identified biomarkers which can be used to determine exposure of an individual to TB, even in the absence of an active or latent TB infection. In particular, the present inventors have identified IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B as such biomarkers for exposure to TB. One or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers, or any combination thereof, may be used to determine exposure to TB. As used herein, the term “exposure to TB” is defined by comparison to non-exposed controls from regions of non/low-TB endemic regions. As an example, the Caucasian control group used in Example 2 below are an example of non-exposed individuals.
Accordingly, the present invention provides the use of one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers for determining exposure to TB. Any combination of these biomarkers may be used according to the present invention. For example, any one, two, or all three of these biomarkers may be used to determine exposure to TB according to the present invention. Typically, the combination of the biomarkers IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B or the combination of IRF1, S100A11, CD52, LYN, IFITM3 and NCF1C is used to determine exposure to TB according to the present invention.
One or more additional biomarker for TB may also be used to determine exposure to TB according to the present invention. Any combination of the one or more additional biomarkers may be used in combination with one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers of the invention. For example at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more additional biomarkers for TB may be used in combination with one or more of the IRF1, S100A11, CD52, LYN, IFITM3, NCF1C and HLA-B biomarkers of the invention. The one or more additional biomarker for use in determining exposure to TB can be any biomarker disclosed herein.
The one or more biomarker of the invention as described herein may have a nucleic acid sequence as shown in the sequences in the Sequence Information section herein. The relevant sequence identifiers are also shown in Tables 2 to 5. The one or more biomarker of the invention may have a sequence identity of at least 80% with the corresponding nucleic acid sequence shown in the Sequence Information section. Sequence identity may be calculated as described herein. A sequence identity of at least 80% includes at least 82%, at least 84%, at least 86%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and 100% sequence identity (to each and every nucleic acid sequence presented herein and/or to each and every SEQ ID NO presented herein).
Thus, as described herein, by studying both human and non-human primate biomarkers for TB, the present inventors have identified a robust set of biomarkers for TB that are mutually compatible (i.e. retain accurate binding specificity) within a single set of assay conditions (i.e. a singleplex format). Similarly, the present inventors have also identified robust sets of mutually compatible biomarkers for distinguishing between latent and active TB, for distinguishing between early active and late/later active TB and for determining exposure to TB. Combinations of biomarkers for use according to the present invention are discussed in detail herein. As discussed above, preferably, the present invention provides the use of the combination of: (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (ii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; and/or (iii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8 as biomarkers for TB. These combinations (and the other combinations of biomarkers disclosed herein) may be used not only as biomarkers for TB, but also to distinguish between latent TB and/or active TB and/or the absence of TB, to distinguish between early active and late/later stage active TB and/or to determine exposure to TB.
The one or more biomarkers of the invention may be used in a decision tree process. For example, the present invention may first provide one or more biomarkers for the detection and/or diagnosis of active TB (an active TB infection) in an individual. Any suitable biomarker or combination of biomarkers disclosed herein may be used for the detection and/or diagnosis of active TB. In a preferred embodiment, the one or more biomarker for the detection and/or diagnosis of active TB is selected from (i) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (ii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; and/or (iii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96; optionally in combination with one or more additional biomarker as disclosed herein. If the individual tests positive for active TB using this method, they may be treated appropriately.
If, however, the individual tests negative for active TB, they may then be tested for latent TB (a latent TB infection) according to the present invention. This is the next “branch” of the decision tree. Any suitable biomarker or combination of biomarkers disclosed herein may be used for the detection and/or diagnosis of latent TB. In a preferred embodiment, the one or more biomarker for the detection and/or diagnosis of latent TB is selected from PF4V1, LYN, CD52, HERC2, KLRA1, DEFB128 and IL8, optionally in combination with one or more additional biomarker as disclosed herein.
The present invention enables the rapid detection of TB, and also to rapidly distinguish between latent TB and/or active TB and/or the absence of TB. By way of example, the method of the invention is typically completed within 2.5 hours, preferably within 2 or 1.5 hours. In contrast, existing multiplex assays typically take at least 4-5 hours, typically at least 5 hours.
Biomarker Profiles
A “phenotypic change” is a detectable change in a parameter associated with a given state of the individual. For instance, a phenotypic change may include an increase or decrease of a biomarker in a bodily fluid, where the change is associated with TB or distinguishing between active and latent TB. The presence and/or amount of each of the one or more biomarkers of the invention is a feature or phenotypic change according to the present invention.
A phenotypic change may further include a change in a detectable aspect of a given state of the individual that is not a change in a measurable aspect of a biomarker. For example, a change in phenotype may include a detectable change in body temperature, weight loss, fatigue, respiration rate or other physiological parameter. Such changes can be determined via clinical observation and measurement using conventional techniques that are well-known to the skilled artisan. As used herein, “conventional techniques” are those techniques that classify an individual based on phenotypic changes without obtaining a biomarker profile according to the present invention.
A “decision rule” or a “decision tree” is a method used to classify individuals. This rule can take on one or more forms that are known in the art, as exemplified in Hastie et al., in “The Elements of Statistical Learning” Springer-Nerlag (Springer, New York (2001)). Analysis of biomarkers in the complex mixture of molecules within the sample generates features in a data set. A decision rule or a decision tree may be used to act on a data set of features to detect and/or diagnose, or to distinguish between active TB and/or latent TB and/or the absence of TB (for example uninfected control(s)).
The application of the decision rule or the decision tree does not require perfect classification. A classification may be made with at least about 90% certainty, or even more, in one embodiment. In other embodiments, the certainty is at least about 80%, at least about 70%, or at least about 60%. The useful degree of certainty may vary, depending on the particular method of the present invention. “Certainty” is defined as the total number of accurately classified individuals divided by the total number of individuals subjected to classification. As used herein, “certainty” means “accuracy”.
Classification may also be characterized by its “sensitivity”. The “sensitivity” of classification relates to the percentage of individuals with TB who were correctly identified as having TB, or in the case of distinguishing between active and latent TB, the percentage of individuals correctly identified as having active TB, or latent TB, or as uninfected with TB. “Sensitivity” is defined in the art as the number of true positives divided by the sum of true positives and false negatives.
The “specificity” of a method is defined as the percentage of patients who were correctly identified as not having TB, or in the case of distinguishing between active and latent TB, the percentage of individuals correctly identified as not having active or latent TB compared with an uninfected control(s). That is, “specificity” relates to the number of true negatives divided by the sum of true negatives and false positives.
Typically, the accuracy, sensitivity and/or specificity is at least about 90%, at least about 80%, at least about 70% or at least about 60%.
Diagnosing TB in an individual means to identify or detect TB in the individual. Distinguishing between active and latent TB in an individual means to identify or detect TB in the individual and to determine whether the TB is active or latent as described herein. Distinguishing between early stage active and late/later stage active TB in an individual means to identify or detect TB in the individual and to determine whether the TB is early stage active or late/later stage active as described herein. Distinguishing between latent TB and the absence of TB in an individual means to identify or detect latent TB in the individual compared with an uninfected control. Determining exposure of an individual to TB means to determine whether an individual has been exposed to TB, but is not themselves infected with active or latent TB.
Because of the sensitivity of the present invention to detect TB before an overtly observable clinical manifestation, the diagnosis, identification or detection of TB includes the detection of the onset of TB, as defined above.
According to the present invention, TB may be diagnosed or detected, or active and latent TB distinguished, by obtaining a profile of biomarkers from a sample obtained from an individual. As used herein, “obtain” means “to come into possession of”. The present invention is particularly useful in predicting and diagnosing TB in an individual, who is suspected of having TB, or who is at risk of TB infection. In the same manner, the present invention may be used to distinguish between active TB and/or latent TB and/or the absence of TB in an individual. That is, the present invention may be used to confirm a clinical suspicion of TB.
The presence and/or amount of the one or more biomarker of the invention in an individual or the profile of biomarkers in an individual may be measured relative to a control or reference population, for example relative to the corresponding biomarker profile of a reference population. Similarly, the biomarker profile of an individual may be measured relative to a biomarker profile from a control or reference population. Herein the terms “control” and “reference population” are used interchangeably. The actual amount of the one or more biomarkers, such as the mass, molar amount, concentration or molarity of the one or more biomarker of the invention may be assessed and compared with the corresponding value from the control or reference population. Alternatively, the amount of one or more biomarker of the invention may be compared with that of the control or reference population without quantifying the mass, molar amount, concentration or molarity of the one or more biomarker.
The control or reference biomarker profile can be generated from one individual or a population of two or more individuals. The control or reference population, for example, may comprise three, four, five, ten, 15, 20, 30, 40, 50 or more individuals. Furthermore, the control or reference biomarker profile and the individual's (test) biomarker profile that are compared in the methods of the present invention may be generated from the same individual, provided that the test and reference biomarker profiles are generated from biological samples taken at different time points and compared to one another. For example, a sample may be obtained from an individual at the start of a study period. A control or reference biomarker profile taken from that sample may then be compared to biomarker profiles generated from subsequent samples from the same individual. Such a comparison may be used, for example, to determine the progression of TB in the individual by repeated classifications over time.
The control or reference may be obtained, for example, from a population of TB-negative individuals, TB-positive individuals, individuals with active TB and individuals with latent TB. In the Examples herein, the Caucasian control group consists of professional individuals recruited locally to the project team who constitute a low risk group, coming from non/low-TB endemic regions, such that their risk of having been exposed to TB is extremely low. Typically this is the preferred control group. The second control group in the Examples consists of individuals of Asian descent who tested negative for TB using the standard Mantoux skin test and IFNγ test and who come from regions where TB is endemic. The likelihood is that these individuals have been exposed to TB, even if they are not themselves (currently) infected. Thus, without being bound by theory, any differences in the detection of biomarkers of the invention between this control group and the Caucasian controls may result from the likely exposure of this Asian control group to TB.
Typically the control or reference population does not have TB and/or is not infected with M. tuberculosis (i.e. is TB-negative). The control or reference population may be TB-positive and are then subsequently diagnosed with TB using conventional techniques. For example, a population of TB-positive individuals used to generate the reference profile may be diagnosed with TB about 24, 48, 72, 96 or more hours after biological samples were taken from them for the purposes of generating a reference biomarker profile. In one embodiment, the population of TB-positive individuals is diagnosed with TB using conventional techniques about 0-36 hours, about 36-60 hours, about 60-84 hours, or about 84-108 hours after the biological samples were taken. If the biomarker profile is indicative of TB, a clinician may begin treatment prior to the manifestation of clinical symptoms of TB.
The amount of the one or more biomarker of the invention, for example in a biomarker profile, may differ by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200% or more compared with a control or reference population.
For example, if the amount of the one or more biomarker of the invention, typically in a biomarker profile, is reduced compared with a control or reference population, the expression may be reduced partially or totally compared with the control or reference population. Typically the amount is reduced by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90%, at least 95%, at least 99%, up to total elimination of the one or more biomarker.
If the amount of one or more biomarker of the invention, typically in a biomarker profile, is increased compared with a control or reference population, the amount may be increased by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60, at least 70%, at least 80%, at least 90&, at least 100%, at least 150%, at least 200% compared with the control or reference population.
The amount of the one or more biomarker of the invention may be increased or decreased compared with a control or reference population as shown in Tables 2 to 5 herein (where ↑ means the one or more biomarker is upregulated/an increased amount of the one or more biomarker and ↓ means the one or more biomarker is downregulated/a decreased amount of the one or more biomarker). In instances where more than one indication of up or downregulation is given in Tables 2 to 5, the first recited statement is preferred. For example, Table 2 discloses that ALPK1 is increased in monocytes, neutrophils and CD4 positive T cells compared with a control or reference population. In this example, the amount of ALPK1 may be increased in CD4 positive T cells, preferably increased in neutrophils and most preferably increased in monocytes. The amount of ALPK1 may be increased in CD4 positive T cells, neutrophils and monocytes, and may also be increased in other cell types not listed in Tables 2 to 5.
The amount of the one or more biomarker may be increased in some cell types and/or decreased in other cell types. For example, as shown in Table 2 herein, PF4V1 is upregulated (increased amount) in monocytes of individuals with TB, whereas PF4V1 is downregulated (decreased amount) in neutrophils of individuals with TB.
The presence and/or amount of the one or more biomarker of the invention may be determined by quantitative and/or qualitative analysis. The amount of the one or more biomarker of the invention encompasses the mass of the one or more biomarker, the molar amount of the one or more biomarker, the concentration of the one or biomarker and the molarity of the one or more biomarker. This amount may be given in any appropriate units. For example, the concentration of the one or more biomarker may be given in pg/ml, ng/ml or μg/ml.
The presence and/or amount of the one or more biomarker of the invention may be measured directly or indirectly. The relative presence and/or amount of the one or more biomarker of the invention relative to a control or reference population may be determined using any appropriate technique. Suitable standard techniques are known in the art, for example Western blotting and enzyme-linked immunosorbent assays (ELISAs). Preferred methods include microarray analysis (as used in Example 1) and quantitative real-time PCR (qPCR) (as used in Example 2). Different one or more biomarkers may be used with different detection methods according to the present invention. For example, in one embodiment, the one or more biomarker is selected from PF4V1/or HERC2, preferably in combination with LOC400759/GBP1P1 and/or ALPK1 as disclosed herein, for use with microarray analysis. Typically, the one or more biomarker is selected from SNX10 and/or CPVL, preferably in combination with LOC400759/GBP1P1 and/or CREG1, for use with qPCR analysis. Again, additional one or more biomarkers as disclosed herein can be selected dependent on the preferred detection method.
As used herein, “comparison” includes any means to discern at least one difference in the presence and/or amount of the one or more biomarker in the individual and the control or reference population, or at least one difference in the individual's and the control or reference profiles. Thus, a comparison may include a visual inspection of chromatographic spectra, and a comparison may include arithmetical or statistical comparisons of values assigned to the features of the profiles. Such statistical comparisons include, but are not limited to, applying a decision rule. If the biomarker profiles comprise at least one internal standard, the comparison to discern a difference in the biomarker profiles may also include features of these internal standards, such that features of the biomarker are correlated to features of the internal standards. The comparison can confirm the presence or absence of TB, and thus to detect or diagnose TB; or the comparison can distinguish between active and latent TB.
The presence and/or amount level of the one or more biomarker may be altered compared with a control or reference population for at least 12 hours, at least 24 hours, at least 30 hours, at least 48 hours, at least 72 hours, at least 96 hours, at least 120 hours, at least 144 hours, at least 1 week, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, at least 8 weeks, at least 9 weeks, at least 10 weeks, at least 11 weeks, at least 12 weeks, at least 13 weeks, at least 14 weeks, at least 15 weeks or more.
Although the invention does not require a monitoring period to classify an individual, it will be understood that repeated classifications of the individual, i.e., repeated snapshots, may be taken over time until the individual is no longer at risk. Alternatively, a profile of biomarkers obtained from the individual may be compared to one or more profiles of biomarkers obtained from the same individual at different points in time.
As used herein, an “individual” is an animal, preferably a mammal, more preferably a human or non-human primate. The terms “individual,” “subject” and “patient” are used interchangeably herein. The individual can be normal, suspected of having TB or at risk of a TB infection. In a preferred embodiment, the present invention relates to the detection and/or diagnosis of TB in adult humans (over the age of 16 years).
The progression of an individual from normalcy (i.e., a condition characterized by not having TB) to latent or active TB, and vice versa, will be characterized by changes in biomarker profiles, as certain biomarkers are expressed at increasingly higher levels and the expression of other biomarkers becomes down regulated. These changes in biomarker profiles may reflect the progressive establishment of a physiological response in the reference population to infection. The biomarker profile of the control or reference population also will change as a physiological response subsides. As stated above, one of the advantages of the present is the capability of classifying an individual, using a biomarker profile from a single biological sample, as having membership in a particular population. The determination of whether a particular physiological response is becoming established or is subsiding may be facilitated by a subsequent classification of the individual. To this end, the present invention provides numerous biomarkers that both increase and decrease in level of expression as a physiological response to TB is established or subsides. For example, a feature of an individual's biomarker profile that is known to change in intensity as a physiological response to TB becomes established may be selected. A comparison of the same feature in a profile from a subsequent biological sample from the individual can establish whether the individual is progressing toward more severe TB or is progressing toward normalcy.
Detection and Quantification of Biomarkers and Determination of Biomarker Profiles
A feature as defined herein for the diagnosis of TB, a TB infection and/or a M. tuberculosis infection may be detected, quantified or determined by any appropriate means. For example, the one or more biomarker of the invention, a measurable aspect or characteristic of the one or more biomarker or a biomarker profile of the invention may be detected by any appropriate means. The presence and/or amount of the one or more biomarkers of the invention may be considered together as a “biomarker profile” of the invention. The presence and/or amount of the individual biomarkers within any of the biomarker combinations disclosed herein may be considered together as a “biomarker profile” of the invention. For example, in a preferred embodiment of the invention, the combination of biomarkers: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96 is used to detect and or diagnose TB. Thus, the presence and/or amount of: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9 and CD96 may be considered as a biomarker profile according to the present invention. The presence and/or amount of any other combination of biomarkers according to the present invention may also be considered as a biomarker profile. A biomarker profile of the invention may comprise: (i) SNX10 and CPVL; (ii) LOC400759/GBP1P1, SNX10, CPVL and CREG1; (iii) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PSMB9, GBP1, IRF1 and HLA-B; (iv) LOC400759/GBP1P1, SNX10, CPVL, CREG1, PF4V1, GBP1, IRF1, IFITM3 and S100A11; (v) PF4V1 and HERC2; (vi) SNX10, CPVL, PF4V1 and HERC2; (vii) SNX10, CPVL, PF4V1, HERC2, CD52, LYN, LGALS3BP, BAZ1A, KLRA1 and WSB1 and/or (viii) LOC400759/GBP1P1, PF4V1, ALPK1, HERC2, GBP1, IRF1, MMP9, CD96, KLRA1, DEFB128 and IL8.
The presence and/or amount of the one or more biomarker of the invention may be determined in a sample obtained from an individual. The sample may be any suitable biological material, for example blood, plasma, saliva, serum, sputum, urine, cerebral spinal fluid, cells, a cellular extract, a tissue sample, a tissue biopsy, a stool sample and the like. Typically the sample is blood sample. The precise biological sample that is taken from the individual may vary, but the sampling preferably is minimally invasive and is easily performed by conventional techniques. In a preferred embodiment, the sample is a whole blood sample, a purified peripheral blood leukocyte sample or a cell type sorted leukocyte sample, such as a sample of the individual's neutrophils. The biological sample may be taken from the individual before, during, and/or after treatment for TB infection. In one embodiment, the sample is taken after treatment for TB infection has been initiated.
Measurement of a phenotypic change may be carried out by any conventional technique. Measurement of body temperature, respiration rate, pulse, blood pressure, or other physiological parameters can be achieved via clinical observation and measurement. Measurements of biomarker molecules may include, for example, measurements that indicate the presence, concentration, expression level, or any other value associated with a biomarker molecule. The form of detection of biomarker molecules typically depends on the method used to form a profile of these biomarkers from a biological sample. For instance, biomarkers separated by 2D-PAGE are detected by Coomassie Blue staining or by silver staining, which are well-established in the art.
The biomarkers of the invention may be detected at the nucleic acid or protein level. Thus, the biomarkers of the invention may be DNA, RNA or protein and may be detected using any appropriate technique. The presence and/or amount of the one or more biomarker of the invention may be measured directly or indirectly. Any appropriate agent may be used to determine the presence and/or amount of the one or more biomarker of the invention. For example, the presence and/or amount of the one or more biomarker of the invention may be determined using an agent selected from peptides and peptidomimetics, antibodies, small molecules and single-stranded DNA or RNA molecules, as described herein. The relative presence and/or amount of the one or more biomarker of the invention relative to a control or reference population (see above) may be determined using any appropriate technique. Suitable standard techniques are known in the art.
For example, when the one or more biomarker is detected at the nucleic acid level this may be carried out using: (i) biomarker-specific oligonucleotide DNA or RNA or any other nucleic acid derivative probes bound to a solid surface; (ii) purified RNA (labelled by any method, for example using reverse transcription and amplification) hybridised to probes; (iii) whole lysed blood, from which the RNA is labelled by any method and hybridised to probes; (iv) purified RNA hybridised to probes and a second probe (labelled by any method) hybridised to the purified RNA; (v) whole lysed blood from which the RNA is hybridised to probes, and a second probe (labelled by any method) which is hybridised to the RNA; (vi) purified peripheral blood leukocytes, obtaining purified RNA (labelled by any method), and hybridising the purified labelled RNA to probes; (vii) purified peripheral blood leukocytes, obtaining purified RNA and hybridising the RNA to probes, then using a second probe (labelled by any method) which hybridises to the RNA; (viii) RT-PCR using any primer/probe combination or inter-chelating fluorescent label, for example SyberGreen; (ix) end-point PCR; (x) digital PCT; (xi) sequencing; (xii) array cards (RT-PCT); (xiii) lateral flow devices/methodology; and/or (xiv) digital microfluidics.
In a preferred embodiment, RNA from a sample (either purified or unpurified) is labelled via any method (typically amplification) and used to interrogate one or more probe immobilised on a surface. Typically the one or more probes are 50 to 100 nucleotides in length.
In another preferred embodiment, one or more probe is immobilised on a surface and the RNA from a sample is hybridised to one or more second probe (labelled by any method). The RNA hybridised with the second (labelled) probe is then used to interrogate the one or more probe immobilised on the surface. Examples of such methodology are known in the art, including the Vantix™ system.
For example, when the one or more biomarker is detected at the protein acid level this may be carried out using: (i) biomarker-specific primary antibodies or antibody fragments bound to a solid surface; (ii) whole lysed blood biomarker antigen bound to antibodies or antibody fragments; (iii) secondary biomarker-specific antibodies or antibody fragments used to detect biomarker antigen bound to primary antibody (labelled using any method); (iv) biomarker-specific primary aptamers bound to a solid surface; (v) whole lysed blood—biomarker antigen bound to aptamers; (vi) secondary biomarker-specific aptamer used to detect biomarker antigen bound to primary aptamer (labelled using any method); (vii) any antibody derivative i.e. phage display etc. used as above; (viii) lateral flow devices/methodology; (ix) chromatography; (x) mass spectrometry; (xi) nuclear magnetic resonance (NMR); (xii) protein gels/transfers to filter; and/or (xiii) immunoprecipitation.
Any agent for the detection of or for the determination of the amount of the one or more biomarker of the invention may be used to determine the presence of and/or amount of the one or more biomarker. Similarly, any method that allows for the detecting of the one or more biomarker, the quantification, or relative quantification of the one or more biomarker may be used.
Agents for the detection of or for the determination of the amount of one or more biomarker may be used to determine the amount of the one or more biomarker in a sample obtained from the individual. Such agents typically bind to the one or more biomarker. Such agents may bind specifically to the one or more biomarker. The agent for the detection of or for the determination of the amount of the one or more biomarker may be an antibody or other binding agent specific for the one or more biomarker. By specific, it will be understood that the agent or antibody binds to the molecule of interest, in this case the one or more biomarker, with no significant cross-reactivity to any other molecule, particularly any other protein. For example, an agent or antibody that is specific for LOC400759/GBP1P1 will show no significant cross-reactivity with human neutrophil elastase. Cross-reactivity may be assessed by any suitable method. Cross-reactivity of an agent or antibody for the one or more biomarker with a molecule other than the one or more biomarker may be considered significant if the agent or antibody binds to the other molecule at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 100% as strongly as it binds to the one or more biomarker. An agent or antibody that is specific for the one or more biomarker may bind to another molecule such as human neutrophil elastase at less than 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25% or 20% the strength that it binds to the one or more biomarker. Preferably, the agent or antibody binds to the other molecule at less than 20%, less than 15%, less than 10% or less than 5%, less than 2% or less than 1% the strength that it binds to the one or more biomarker.
As described herein, the presence and/or amount of the one or more biomarker, and hence the biomarker profile may be determined immunologically by reacting antibodies, or functional fragments thereof, specific to the biomarkers. A functional fragment of an antibody is a portion of an antibody that retains at least some ability to bind to the antigen to which the complete antibody binds. The fragments, which include, but are not limited to, scFv fragments, Fab fragments, F(ab) fragments and F(ab)2 fragments, can be recombinantly produced or enzymatically produced. Specific binding molecules other than antibodies, such as aptamers, may be used to bind the biomarkers.
The antibody may be monoclonal or polyclonal. The antibody may be produced by any suitable method known in the art. For example, polyclonal antibodies may be obtained by immunizing a mammal, typically a rabbit or a mouse, with the one or more biomarker under suitable conditions and isolating antibody molecules from, for example, the serum of said mammal. Monoclonal antibodies may be obtained by hybridoma or recombinant methods.
Hybridoma methods may involve immunizing a mammal, typically a rabbit or a mouse, with the one or more biomarker under suitable conditions, then harvesting the spleen cells of said mammal and fusing them with myeloma cells. The mixture of fused cells is then diluted and clones are grown from single parent cells. The antibodies secreted by the different clones are then tested for their ability to bind to the one or more biomarker, and the most productive and stable clone is then grown in culture medium to a high volume. The secreted antibody is collected and purified.
Recombinant methods may involve the cloning into phage or yeast of different immunoglobulin gene segments to create libraries of antibodies with slightly different amino acid sequences. Those sequences which give rise to antibodies which bind to the one or more biomarker may be selected and the sequences cloned into, for example, a bacterial cell line, for production.
Typically the antibody is a mammalian antibody, such as a primate, human, rodent (e.g. mouse or rat), rabbit, ovine, porcine, equine or camel antibody. The antibody may be a camelid antibody or shark antibody. The antibody may be a nanobody. The antibody can be any class or isotype of antibody, for example IgM, but is preferably IgG. The antibody may be a humanised antibody.
The antibody or fragment may be associated with other moieties, such as linkers which may be used to join together 2 or more fragments or antibodies. Such linkers may be chemical linkers or can be present in the form of a fusion protein with the fragment or whole antibody. The linkers may thus be used to join together whole antibodies or fragments which have the same or different binding specificities, e.g. that can bind the same or different polymorphisms. The antibody may be a bispecific antibody which is able to bind to two different antigens, typically any two of the polymorphisms mentioned herein. The antibody may be a ‘diabody’ formed by joining two variable domains back to back. In the case where the antibodies used in the method are present in any of the above forms which have different antigen binding sites of different specificities then these different specificities are typically to polymorphisms at different positions or on different proteins. In one embodiment the antibody is a chimeric antibody comprising sequence from different natural antibodies, for example a humanised antibody.
Methods to assess an amount of the one or more biomarker may involve contacting a sample with an agent or antibody capable of binding specifically to the one or more biomarker. Such methods may include dipstick assays and Enzyme-linked Immunosorbant Assay (ELISA), or similar assays, such as those using a lateral flow device. Other immunoassay types may also be used to assess the one or more biomarker amounts. Typically dipsticks comprise one or more antibodies or proteins that specifically bind to the one or more biomarker. If more than one antibody is present, the antibodies preferably have different non-overlapping determinants such that they may bind to the one or more biomarker simultaneously.
ELISA is a heterogeneous, solid phase assay that requires the separation of reagents. ELISA is typically carried out using the sandwich technique or the competitive technique. The sandwich technique requires two antibodies. The first specifically binds the one or more biomarker and is bound to a solid support. The second antibody is bound to a marker, typically an enzyme conjugate. A substrate for the enzyme is used to quantify the one or more biomarker-antibody complex and hence the amount of the one or more biomarker in a sample. The antigen competitive inhibition assay also typically requires a one or more biomarker-specific antibody bound to a support. A biomarker-enzyme conjugate is added to the sample (containing the one or more biomarker) to be assayed. Competitive inhibition between the biomarker-enzyme conjugate and unlabelled biomarker allows quantification of the amount of the one or more biomarker in a sample. The solid supports for ELISA reactions preferably contain wells.
Antibodies capable of binding specifically to the one or more biomarker may be used in methods of immunofluorescence to detect the presence of the one or more biomarker and hence in methods of diagnosing TB, a TB infection, infection with M. tuberculosis, or to distinguish between active and latent TB according to the present invention.
The present invention may also employ methods of determining the amount of the one or more biomarker that do not comprise antibodies. High Performance Liquid Chromatography (HPLC) separation and fluorescence detection is preferably used as a method of determining the amount of the one or more biomarker. HPLC apparatus and methods as described previously may be used (Tsikas D et al. J Chromatogr B Biomed Sci Appl 1998; 705: 174-6) Separation during HPLC is typically carried out on the basis of size or charge. Prior to HPLC, endogenous amino acids and an internal standard L-homoarginine are typically added to assay samples and these are phase extracted on CBA cartridges (Varian, Harbor City, Calif.). Amino acids within the samples are preferably derivatized with o-phthalaldehyde (OPA). The accuracy and precision of the assay is preferably determined within quality control samples for all amino acids.
Other methods of determining the amount the one or more biomarker that do not comprise antibodies include mass spectrometry. Mass spectrometric methods may include, for example, matrix-assisted laser desorption/ionization mass spectrometry (MALDI MS), surface-enhanced laser desorption/ionization mass spectrometry (SELDI MS), time of flight mass spectrometry (TOF MS) and liquid chromatography mass spectrometry (LC MS).
A separation method may be used to determine the presence and/or amount of the one or more biomarker and hence to create a profile of biomarkers, such that only a subset of biomarkers within the sample is analysed. For example, the biomarkers that are analysed in a sample may consist of mRNA species from a cellular extract, which has been fractionated to obtain only the nucleic acid biomarkers within the sample, or the biomarkers may consist of a fraction of the total complement of proteins within the sample, which have been fractionated by chromatographic techniques. One or more, two or more, three or more, four or more, or five or more separation methods may be used according to the present invention.
Determination of the presence and/or amount of the one or more biomarker, and hence the creation of a profile of biomarkers may be carried out without employing a separation method. For example, a biological sample may be interrogated with a labelled compound that forms a specific complex with a biomarker in the sample, where the intensity of the label in the specific complex is a measurable characteristic of the biomarker. A suitable compound for forming such a specific complex is a labelled antibody. A biomarker may be measured using an antibody with an amplifiable nucleic acid as a label. The nucleic acid label may become amplifiable when two antibodies, each conjugated to one strand of a nucleic acid label, interact with the biomarker, such that the two nucleic acid strands form an amplifiable nucleic acid.
The presence and/or amount of the one or more biomarker, and hence the biomarker profile may be derived from an assay, such as an array, of nucleic acids, where the biomarkers are the nucleic acids or complements thereof. For example, the biomarkers may be ribonucleic acids. The presence and/or amount of the one or more biomarker, and hence the biomarker profile may be obtained using a method selected from nuclear magnetic resonance, nucleic acid arrays, dot blotting, slot blotting, reverse transcription amplification and Northern analysis.
The biomarker profile may comprise any measurable aspect of M. tuberculosis or a component thereof. For example, the biomarker profile may comprise measurable aspects of small molecules, which may include fragments of proteins or nucleic acids, or which may include metabolites.
The determination of the presence and/or amount of the one or more biomarker, and hence a biomarker profile may be generated by the use of one or more separation methods. For example, suitable separation methods may include a mass spectrometry method, such as electrospray ionization mass spectrometry (ESI-MS), ESI-MS/MS, ESI-MS/(MS)n (n is an integer greater than zero), matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF-MS), surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS), desorption/ionization on silicon (DIOS), secondary ion mass spectrometry (SLMS), quadrupole time-of-flight (Q-TOF), atmospheric pressure chemical ionization mass spectrometry (APCI-MS), APCI-MS/MS, APCI-(MS)n, atmospheric pressure photoionization mass spectrometry (APPI-MS), APPI-MS/MS, and APPI-(MS)n. Other mass spectrometry methods may include, inter alia, quadrupole, fourier transform mass spectrometry (FTMS) and ion trap. Other suitable separation methods may include chemical extraction partitioning, column chromatography, ion exchange chromatography, hydrophobic (reverse phase) liquid chromatography, isoelectric focusing, one-dimensional polyacrylamide gel electrophoresis (PAGE), two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) or other chromatography, such as thin-layer, gas or liquid chromatography, or any combination thereof. The sample may be fractionated prior to application of the separation method.
The determination of the presence and/or amount of the one or more biomarker, and hence a biomarker profile may be generated by methods that do not require physical separation of the biomarkers themselves. For example, nuclear magnetic resonance (NMR) spectroscopy may be used to resolve a profile of biomarkers from a complex mixture of molecules. An analogous use of NMR to classify tumours is disclosed in Hagberg, NMR Biomed. 11: 148-56 (1998), for example. Additional procedures include nucleic acid amplification technologies, which may be used to generate a profile of biomarkers without physical separation of individual biomarkers. (See Stordeur et al, J. Immunol. Methods 259: 55-64 (2002) and Tan et al, Proc. Nat'l Acad. Sci. USA 99: 11387-11392 (2002), for example.)
In one embodiment, laser desorption/ionization time-of-flight mass spectrometry is used to determine the presence and/or amount of the one or more biomarker, and hence create a biomarker profile where the biomarkers are proteins or protein fragments that have been ionized and vaporized off an immobilizing support by incident laser radiation. A profile is then created by the characteristic time-of-flight for each protein, which depends on its mass-to-charge (“m/z”) ratio. A variety of laser desorption/ionization techniques are known in the art. (See, e.g., Guttman et al, Anal Chem. 73: 1252-62 (2001) and Wei et al, Nature 399: 243-46 (1999).)
Laser desorption/ionization time-of-flight mass spectrometry allows the generation of large amounts of information in a relatively short period of time. A sample is applied to one of several varieties of a support that binds all of the biomarkers, or a subset thereof, in the sample. Cell lysates or samples are directly applied to these surfaces in volumes as small as 0.5 μL, with or without prior purification or fractionation. The lysates or sample can be concentrated or diluted prior to application onto the support surface. Laser desorption/ionization is then used to generate mass spectra of the sample, or samples, in as little as three hours.
In a preferred embodiment, the total mRNA from a cellular extract of the individual is assayed, and the various mRNA species that are obtained from the sample are used as biomarkers. Biomarker profiles may be obtained, for example, by hybridizing these mRNAs to an array of probes, which may comprise oligonucleotides or cDNAs, using standard methods known in the art. Alternatively, the mRNAs may be subjected to gel electrophoresis or blotting methods such as dot blots, slot blots or Northern analysis, all of which are known in the art. (See, e.g., Sambrook et al. in “Molecular Cloning, 3rd ed.,” Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2001).) mRNA profiles also may be obtained by reverse transcription followed by amplification and detection of the resulting cDNAs, as disclosed by Stordeur et al, supra, for example. In another embodiment, the profile may be obtained by using a combination of methods, such as a nucleic acid array combined with mass spectroscopy.
Different methods have different advantages and may be preferred depending on numerous factors, such as the particular circumstances of the individuals to be tested and/or the availability of reagents/equipment in the diagnostics laboratory. For example, qPCR using probe/quencher hydrolysis probes as described herein is highly specific and stringent. As another example, microarray analysis can resolve subtle differences in expression of transcript variants, which may be important in disease pathology and diagnosis.
Probes
Any appropriate detection means can be used to detect or quantify the one or more biomarker of the invention, as described herein.
Typically when the one or more biomarker of the invention is a nucleic acid, the presence of the one or more biomarker may be detected, and/or the amount of the one or more biomarker determined using an oligonucleotide probe.
An oligonucleotide probe of the invention may have at least 80% sequence identity to the one or more biomarker of the invention, or a target region within said biomarker, measured over any appropriate length of sequence. Typically the % sequence identity is determined over a length of contiguous nucleic acid residues. An oligonucleotide probe of the invention may, for example, have at least 80% sequence identity to the one or more biomarker of the invention, or target region thereof, measured over at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, or more nucleic acid residues, up to the oligonucleotide probe having at least 80% sequence identity with the one or more biomarker of the invention, or target region thereof, over the entire length of the oligonucleotide probe.
An oligonucleotide probe of the invention may be complementary to the one or more nucleic acid biomarker of the invention, or a target region thereof. Typically the oligonucleotide probe of the invention is complementary over a length of contiguous nucleic acid residues. An oligonucleotide probe of the invention may, for example, be complementary to the one or more biomarker of the invention, or target region thereof, measured over at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, or more nucleic acid residues, up to the oligonucleotide probe having being complementary to the one or more biomarker of the invention, or target region thereof, over the entire length of the oligonucleotide probe.
An oligonucleotide probe of the invention may be complementary to a variant of the one or more biomarker of the invention, or a variant of a target region of said biomarker. Typically the oligonucleotide probe is complementary to a variant having at least 80% sequence identity to the one or more biomarker of the invention, or a variant having at least 80% sequence identity to the target region of said biomarker. The % sequence identity of the variant to the one or more biomarker of the invention, or a variant of a target region of said biomarker may be calculated over any appropriate length of sequence in the one or more biomarker, as described herein.
A sequence identity of at least 80% includes at least 82%, at least 84%, at least 86%, at least 88%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and 100% sequence identity (to each and every nucleic acid sequence presented herein and/or to each and every SEQ ID NO presented herein).
Any of a variety of sequence alignment methods can be used to determine percent identity, including, without limitation, global methods, local methods and hybrid methods, such as, e.g., segment approach methods. Protocols to determine percent identity are routine procedures within the scope of one skilled in the art. Global methods align sequences from the beginning to the end of the molecule and determine the best alignment by adding up scores of individual residue pairs and by imposing gap penalties. Non-limiting methods include, e.g., CLUSTAL W, see, e.g., Julie D. Thompson et al., CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice, 22 (22) Nucleic Acids Research 4673-4680 (1994); and iterative refinement, see, e.g., Osamu Gotoh, Significant Improvement in Accuracy of Multiple Protein. Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments, 264(4) J. MoI. Biol. 823-838 (1996). Local methods align sequences by identifying one or more conserved motifs shared by all of the input sequences. Non-limiting methods include, e.g., Match-box, see, e.g., Eric Depiereux and Ernest Feytmans, Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences, 8(5) CABIOS 501-509 (1992); Gibbs sampling, see, e.g., C. E. Lawrence et al., Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment, 262 (5131) Science 208-214 (1993); Align-M, see, e.g., Ivo Van WaIIe et al., Align-M—A New Algorithm for Multiple Alignment of Highly Divergent Sequences, 20 (9) Bioinformatics: 1428-1435 (2004). Thus, percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48: 603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-19, 1992.
Variants of the specific sequences provided above may alternatively be defined by reciting the number of nucleotides that differ between the variant sequences and the specific reference sequences provided above. Thus, in one embodiment, the sequence may comprise (or consist of) a nucleotide sequence that differs from the specific sequences provided above at no more than 2 nucleotide positions, for example at no more than 1 nucleotide position. Conservative substitutions are preferred. The term variants as defined herein also encompasses splice variants.
An oligonucleotide probe of the invention may be at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, or more nucleotides in length. In a preferred embodiment, the oligonucleotide probe is 40 to 100 nucleotides in length, more preferably 50 to 100 nucleotides in length, even more preferably 50 to 80 nucleotides in length and most preferably 50 to 70 nucleotides in length.
The probes of the invention are typically designed to hybridise to their target nucleic acid sequence present in the one or more biomarker of the invention.
A probe may comprise or be complementary to a nucleic acid sequence within a target nucleic acid sequence from the one or more biomarker of the invention, or to a nucleic acid sequence having at least 80% identity to said target nucleic acid sequence. Any suitable probe which comprises or is complementary (as defined herein) to a nucleic acid sequence within a target nucleic acid sequence of one or more biomarker of the invention may be used. Preferred target sequences within the one or more biomarkers of the invention are underlined in the nucleic acid sequences shown in the Sequence Information section.
In embodiments wherein the one or more biomarker for TB is LOC400759/GBP1P1, a target nucleic acid sequence may comprise bases 91 to 640 of SEQ ID NO: 112 or bases 13751 to 13950 of SEQ ID NO: 113, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is PF4V1, a target nucleic acid sequence may comprise bases 21 to 450 of SEQ ID NO: 134, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is ALPK1, a target nucleic acid sequence may comprise bases 511 to 3220 of SEQ ID NO: 117, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is HERC2, a target nucleic acid sequence may comprise bases 2411 to 5641, 8141 to 9630 and/or 13651 to 14930 of SEQ ID NO: 132, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is LGALS3BP, a target nucleic acid sequence may comprise bases 1431 to 1850 of SEQ ID NO: 114, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is BST1, a target nucleic acid sequence may comprise bases 361 to 840 of SEQ ID NO: 115, and a probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SNX10, a target nucleic acid sequence may comprise bases 1901 to 2480 of SEQ ID NO: 116, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CREG1, a target nucleic acid sequence may comprise bases 961 to 1620 of SEQ ID NO: 118, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is BAZ1A, a target nucleic acid sequence may comprise bases 4561 to 5270 of SEQ ID NO: 119, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is LYN, a target nucleic acid sequence may comprise bases 1681 to 2520 of SEQ ID NO: 120, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is TAPBP, a target nucleic acid sequence may comprise bases 171 to 1820 of SEQ ID NO: 121, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SERPINB1, a target nucleic acid sequence may comprise bases 1201 to 2050 of SEQ ID NO: 122, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is PSMB9, a target nucleic acid sequence may comprise bases 241 to 870 of SEQ ID NO: 123, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is WSB1, a target nucleic acid sequence may comprise bases 851 to 2250 of SEQ ID NO: 124, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is MVP, a target nucleic acid sequence may comprise bases 1901 to 2880 of SEQ ID NO: 125, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is APBB1IP, a target nucleic acid sequence may comprise bases 301 to 1830 of SEQ ID NO: 126, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is FYB, a target nucleic acid sequence may comprise bases 1621 to 2690 of SEQ ID NO: 127, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is MB21D1/C6orf150, a target nucleic acid sequence may comprise bases 1051 to 1570 of SEQ ID NO: 128, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CPVL, a target nucleic acid sequence may comprise bases 381 to 1140 of SEQ ID NO: 129, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is TICAM2, a target nucleic acid sequence may comprise bases 2671 to 3020 of SEQ ID NO: 130, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CD52, a target nucleic acid sequence may comprise bases 51 to 450 of SEQ ID NO: 131, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is KLRA1, a target nucleic acid sequence may comprise bases 801 to 1310 of SEQ ID NO: 133, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is DEFB128, a target nucleic acid sequence may comprise bases 11 to 270 of SEQ ID NO: 135, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is IL8, a target nucleic acid sequence may comprise bases 241 to 1460 of SEQ ID NO: 136, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is GBP1, a target nucleic acid sequence may comprise bases 2171 to 2800 of SEQ ID NO: 142, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is IRF1, a target nucleic acid sequence may comprise bases 1411 to 2050 of SEQ ID NO: 141, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is MMP9, a target nucleic acid sequence may comprise bases 1091 to 2190 of SEQ ID NO: 152, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CD96, a target nucleic acid sequence may comprise bases 641 to 3760 of SEQ ID NO: 138, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is AIM2, a target nucleic acid sequence may comprise bases 541 to 1060 of SEQ ID NO: 137, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CD274, a target nucleic acid sequence may comprise bases 541 to 1930 of SEQ ID NO: 138, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CDH23, a target nucleic acid sequence may comprise bases 9681 to 10990 of SEQ ID NO: 140, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is IFIT3, a target nucleic acid sequence may comprise bases 1041 to 1830 of SEQ ID NO: 143, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is IFITM3, a target nucleic acid sequence may comprise bases 211 to 580 of SEQ ID NO: 144, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is GK, a target nucleic acid sequence may comprise bases 1251 to 1970 of SEQ ID NO: 145, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is NELL2, a target nucleic acid sequence may comprise bases 2401 to 3110 of SEQ ID NO: 146, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is S100A11, a target nucleic acid sequence may comprise bases 291 to 580 of SEQ ID NO: 147, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SAMD9L, a target nucleic acid sequence may comprise bases 461 to 3260 of SEQ ID NO: 148, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is STAT1, a target nucleic acid sequence may comprise bases 2261 to 3170 of SEQ ID NO: 149, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is TLR6, a target nucleic acid sequence may comprise bases 1751 to 2430 of SEQ ID NO: 150, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is WARS, a target nucleic acid sequence may comprise bases 1801 to 2860 of SEQ ID NO: 151, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is DOCKS, a target nucleic acid sequence may comprise bases 5791 to 6460 of SEQ ID NO: 153, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SIRPB2, a target nucleic acid sequence may comprise bases 741 to 1950 of SEQ ID NO: 154, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is ANKRD22, a target nucleic acid sequence may comprise bases 981 to 1320 of SEQ ID NO: 155, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is ABCF2, a target nucleic acid sequence may comprise bases 1741 to 2370 of SEQ ID NO: 156, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is FNBP1L, a target nucleic acid sequence may comprise bases 4591 to 5220 of SEQ ID NO: 157, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is NCF1C, a target nucleic acid sequence may comprise bases 461 to 940 of SEQ ID NO: 158, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is TBC1D3B, a target nucleic acid sequence may comprise bases 1421 to 2090 of SEQ ID NO: 159, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SLC14A1, a target nucleic acid sequence may comprise bases 2031 to 2950 of SEQ ID NO: 160, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is CALCOCO2, a target nucleic acid sequence may comprise bases 2601 to 3600 of SEQ ID NO: 161, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is GTF2B, a target nucleic acid sequence may comprise bases 661 to 1160 of SEQ ID NO: 162, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is HLA-B, a target nucleic acid sequence may comprise bases 961 to 1430 of SEQ ID NO: 163, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is HLA-F, a target nucleic acid sequence may comprise bases 461 to 1520 of SEQ ID NO: 164, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is MGST2, a target nucleic acid sequence may comprise bases 161 to 760 of SEQ ID NO: 165, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is SPAST, a target nucleic acid sequence may comprise bases 701 to 1770 of SEQ ID NO: 166, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
In embodiments wherein the one or more biomarker for TB is WAC, a target nucleic acid sequence may comprise bases 2011 to 3590 of SEQ ID NO: 167, and a probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to a nucleic acid sequence from this target sequence.
It is preferred that the binding conditions for a probe hybridising to its target sequence are such that a high level of specificity is provided—i.e. hybridisation of the probe occurs under “stringent conditions”. In general, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target (or complement) sequence hybridises to a perfectly matched probe. In this regard, the Tm of probes of the present invention, at a salt concentration of about 0.02M or less at pH 7, is for example above 60° C., such as about 70° C.
Premixed buffer solutions are commercially available (e.g. EXPRESSHYB Hybridisation Solution from CLONTECH Laboratories, Inc.), and hybridisation can be performed according to the manufacturer's instructions.
Probes of the present invention may be screened to minimise self-complementarity and dimer formation (probe-probe binding).
Any of the probes described herein may comprise a tag and/or label. The tag and/or label may, for example, be located (independently of one another) towards the middle or towards or at the 5′ or 3′ end of the herein described probes, for example at the 5′ end.
Hence, following hybridisation of tagged/labelled probe to target nucleic acid, the tag/label is associated with the target nucleic acid in the one or more biomarker. Alternatively, if an amplification step is employed, the probes may act as primers during the method of the invention and the tag/label may therefore become incorporated into the amplification product as the primer is extended.
Examples of suitable labels include detectable labels such as radiolabels or fluorescent or coloured molecules, enzymatic markers or chromogenic markers—e.g. dyes that produce a visible colour change upon hybridisation of the probe. By way of example, the label may be digoxygenin, fluorescein-isothiocyanate (FITC), R-phycoerythrin, Alexa 532 or Cy3. The probes preferably contain a Fam label (e.g. a 5′ Fam label), and/or a minor groove binder (MGB). The label may be a reporter molecule, which is detected directly, such as by exposure to photographic or X-ray film. Alternatively, the label is not directly detectable, but may be detected indirectly, for example, in a two-phase system. An example of indirect label detection is binding of an antibody to the label.
Examples of suitable tags include “complement/anti-complement pairs”. The term “complement/anti-complement pair” denotes non-identical moieties that form a non-covalently associated, stable pair under appropriate conditions. Examples of suitable tags include biotin and streptavidin (or avidin). By way of example, a biotin tag may be captured using streptavidin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Likewise, a streptavidin tag may be captured using biotin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Other exemplary complement/anti-complement pairs include receptor/ligand pairs, antibody/antigen (or hapten or epitope) pairs, and the like. Another example is a nucleic acid sequence tag that binds to a complementary sequence. The latter may itself be pre-labelled, or may be attached to a surface (e.g. a bead) which is separately labelled. An example of the latter embodiment is the well-known LuminexR bead system. Other exemplary pairs of tags and capture molecules include receptor/ligand pairs and antibody/antigen (or hapten or epitope) pairs. Where subsequent dissociation of the complement/anti-complement pair is desirable, the complement/anti-complement pair has a binding affinity of, for example, less than 109 M−1. One exemplary tagged probe is a biotin-labelled probe, which may be detected using horse-radish peroxidase conjugated streptavidin.
The probes of the invention may be labelled with different labels or tags, thereby allowing separate identification of each probe when used in the method of the present invention.
Any conventional method may be employed to attach nucleic acid tags to a probe of the present invention (e.g. to the 5′ end of the defined binding region of the probe). Alternatively, nucleic acid probes of the invention (with pre-attached nucleic acid tags) may be constructed by commercial providers.
If an amplification step is employed, this step may be carried out using methods and platforms known in the art, for example PCR (for example, with the use of “Fast DNA Polymerase”, Life Technologies), such as real-time PCR, block-based PCR, ligase chain reaction, glass capillaries, isothermal amplification methods including loop-mediated isothermal amplification, rolling circle amplification transcription mediated amplification, nucleic acid sequence-based amplification, signal mediated amplification of RNA technology, strand displacement amplification, isothermal multiple displacement amplification, helicase-dependent amplification, single primer isothermal amplification, and circular helicase-dependent amplification. If employed, amplification may be carried using any amplification platform.
A general amplification step (e.g. pre-detection) may be employed to increase the amount of the one or more biomarker of the invention present in the sample. PCR amplification primers are typically employed to amplify approximately 100-400 base pair regions of the target/complementary nucleic acid that contain the nucleotide targets of the present invention. In the presence of a suitable polymerase and DNA precursors (dATP, dCTP, dGTP and dTTP), forward and reverse primers are extended in a 5′ to 3′ direction, thereby initiating the synthesis of new nucleic acid strands that are complementary to the individual strands of the target nucleic acid. The primers thereby drive amplification of target nucleic acid sequences in the one or more biomarker, thereby generating amplification products comprising said target nucleic acid sequences.
An amplification step may be employed in which the probes of the present invention act as primers. In this embodiment, the probes (acting as primers) are extended from their 3′ ends (i.e. in a 5′-to-′3′) direction. Such an amplification step may be employed in conjunction with a general amplification step, such as the one described above.
The detection step may be carried out by any known means. In this regard, the probe or amplification product may be tagged and/or labelled, and the detection method may therefore comprise detecting said tag and/or label.
In one embodiment, the probe(s) may comprise a tag and/or label. Thus, in one embodiment, following hybridisation of tagged/labelled probe to target nucleic acid in the one or more biomarker, the tag/label becomes associated with the target nucleic acid. Thus, in one embodiment, the assay may comprise detecting the tag/label and correlating presence of tag/label with presence of the one or more nucleic acid biomarker of the invention.
In one embodiment, tag and/or label may be incorporated during extension of the probe(s). In doing so, the amplification product(s) become tagged/labelled, and the assay may therefore comprise detecting the tag/label and correlating presence of tag/label with presence of amplification product, and hence the presence of one or more nucleic acid biomarker of the invention.
By way of example, in one embodiment, the amplification product may incorporate a tag/label (e.g. via a tagged/labelled dNTP such as biotin-dNTP) as part of the amplification process, and the assay may further comprise the use of a binding partner complementary to said tag (e.g. streptavidin) that includes a detectable tag/label (e.g. a fluorescent label, such as R-phycoerythrin). In this way, the amplified product incorporates a detectable tag/label (e.g. a fluorescent label, such as R-phycoerythrin).
In one embodiment, the probe(s) and/or the amplification product(s) may include a further tag/label (as the complement component) to allow capture of the amplification product(s).
By way of example, a “complement/anti-complement” pairing may be employed in which an anti-complement capture component binds to said further tag/label (complement component) and thereby permits capture of the probe(s) and/or amplification product(s). Examples of suitable “complement/anti-complement” partners have been described earlier in this specification, such as a complementary pair of nucleic acid sequences, a complementary antibody-antigen pair, etc. The anti-complement capture component may be attached (e.g. coated) on to a substrate or solid support—examples of suitable substrates/supports include membranes and/or beads (e.g. a magnetic or fluorescent bead). Capture methods are well known in the art. For example, LuminexR beads may be employed. Alternatively, the use of magnetic beads may be advantageous because the beads (plus captured, tagged/labelled amplification product) can easily be concentrated and separated from the sample, using conventional techniques known in the art.
Immobilisation provides a physical location for the anti-complement capture component (or probes), and may serve to fix the capture component/probe at a desired location and/or facilitate recovery or separation of probe. The support may be a rigid solid support made from, for example, glass, plastic or silica, such as a bead (for example a fluorescent or magnetic bead). Alternatively, the support may be a membrane, such as nylon or nitrocellulose membrane. 3D matrices are also suitable supports for use with the present invention—e.g. polyacrylamide or PEG gels. Immobilisation to a support/platform may be achieved by a variety of conventional means. By way of example, immobilisation onto a support such as a nylon membrane may be achieved by UV cross-linking. Alternatively, biotin-labelled molecules may be bound to streptavidin-coated substrates (and vice-versa), and molecules prepared with amino linkers may be immobilised on to silanised surfaces. Another means of immobilisation is via a poly-T tail or a poly-C tail, for example at the 3′ or 5′ end. Said immobilisation techniques apply equally to the probe component (and primer pair component, if present) of the present invention.
In one embodiment, the probes of the invention comprise a nucleic acid sequence tag/label (e.g. attached to each probe at the 5′ end of the defined sequence of the probe that binds to target/complement nucleic acid). In more detail, each of the probes is provided with a different nucleic acid sequence tag/label, wherein each of said tags/labels (specifically) binds to a complementary nucleic acid sequence present on the surface of a bead. Each of the different tags/labels binds to its complementary sequence counterpart (and not to any of the complementary sequence counterparts of the other tags), which is located on a uniquely identifiable bead. In this regard, the beads are uniquely identifiable, for example by means of fluorescence at a specific wavelength. Thus, in use, probes of the invention bind to target nucleic acid (if present in the sample). Thereafter, (only) the bound probes may be extended (in the 3′ direction) in the presence of one or more labelled dNTP (e.g. biotin labelled dNTPs, such as biotin-dCTPs).
The extended primers may be contacted with a binding partner counterpart to the labelled dNTPs (e.g. a streptavidin labelled fluorophore, such as streptavidin labelled R-phycoerythrin), which binds to those labelled dNTPs that have become incorporated into the extended primers. Thereafter, the labelled extended primers may be identified by allowing them to bind to their nucleic acid counterparts present on the uniquely identifiable beads. The latter may then be “called” (e.g. to determine the type of bead present by wavelength emission) and the nature of the primer extension (and thus the type of target/complement nucleic acid present) may be determined.
Typically, probes of the invention are oligonucleotides having sequence identity with a region of the one or more biomarker of the invention as disclosed herein. One or more probe may be immobilised on a solid support, and used to interrogate mRNA obtained from a test sample. If the mRNA from the test sample contains the one or more biomarker targeted by the immobilised probe, it will bind to the probe, and may then be detected. The biomarkers of the invention may also be detected using PCR, such as real time PCR.
Any oligonucleotide with the appropriate level of sequence identity with the one or more biomarker of the invention, or with one or more target sequences within said one or more biomarker of the invention may be used as a probe as described herein. Any oligonucleotide with the appropriate level of complementarity with the one or more biomarker of the invention, or with one or more target sequences within said one or more biomarker of the invention may be used as a probe as described herein. Exemplary sequences of the one or more biomarkers of the invention are given in SEQ ID NOs: 112 to 167 (see Tables 2 to 5 herein). Sequences of exemplary target regions within the one or more biomarkers of the invention are shown as underlined in the sequences of the Sequence Information section (as discussed herein). Exemplary probe nucleic acid sequences for the biomarkers disclosed herein are set out in Table 6 (SEQ ID NOs: 1 to 111 and 168 to 171) and are shown as double-underlined in the sequences of the Sequence Information section.
In embodiments wherein the one or more biomarker for TB is LOC400759/GBP1P1, the oligonucleotide probe typically comprises or is complementary to a nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NO: 1, 2 or 3.
In embodiments wherein the one or more biomarker for TB is PF4V1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 4 or 5.
In embodiments wherein the one or more biomarker for TB is ALPK1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 6 or 7.
In embodiments wherein the one or more biomarker for TB is HERC2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 8, 9 or 168 to 171.
In embodiments wherein the one or more biomarker for TB is LGALS3BP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 10 or 11.
In embodiments wherein the one or more biomarker for TB is BST1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 12 or 13.
In embodiments wherein the one or more biomarker for TB is SNX10, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 14 or 15.
In embodiments wherein the one or more biomarker for TB is CREG1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 16 or 17.
In embodiments wherein the one or more biomarker for TB is BAZ1A, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 18 or 19.
In embodiments wherein the one or more biomarker for TB is LYN, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 20 or 21.
In embodiments wherein the one or more biomarker for TB is TAPBP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 22 or 23.
In embodiments wherein the one or more biomarker for TB is SERPINB1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 24 or 25.
In embodiments wherein the one or more biomarker for TB is PSMB9, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 26 or 27.
In embodiments wherein the one or more biomarker for TB is WSB1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 28 or 29.
In embodiments wherein the one or more biomarker for TB is MVP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 30 or 31.
In embodiments wherein the one or more biomarker for TB is APBB1IP, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 32 or 33.
In embodiments wherein the one or more biomarker for TB is FYB, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 34 or 35.
In embodiments wherein the one or more biomarker for TB is MB21D1/C6orf150, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 36 or 37.
In embodiments wherein the one or more biomarker for TB is CPVL, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 38 or 39.
In embodiments wherein the one or more biomarker for TB is TICAM2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 40 or 41.
In embodiments wherein the one or more biomarker for TB is CD52, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 42 or 43.
In embodiments wherein the one or more biomarker for TB is KLRA1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 44 or 45.
In embodiments wherein the one or more biomarker for TB is DEFB128, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 46 or 47.
In embodiments wherein the one or more biomarker for TB is IL8, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 48 or 49.
In embodiments wherein the one or more biomarker for TB is GBP1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 50 or 51.
In embodiments wherein the one or more biomarker for TB is IRF1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 52 or 53.
In embodiments wherein the one or more biomarker for TB is MMP9, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 54 or 55.
In embodiments wherein the one or more biomarker for TB is CD96, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 56 or 57.
In embodiments wherein the one or more biomarker for TB is AIM2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 58 or 59.
In embodiments wherein the one or more biomarker for TB is CD274, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 60 or 61.
In embodiments wherein the one or more biomarker for TB is CDH23, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 62 or 63.
In embodiments wherein the one or more biomarker for TB is IFIT3, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 64 or 65.
In embodiments wherein the one or more biomarker for TB is IFITM3, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 66 or 67.
In embodiments wherein the one or more biomarker for TB is GK, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 68 or 69.
In embodiments wherein the one or more biomarker for TB is NELL2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 70 or 71.
In embodiments wherein the one or more biomarker for TB is S100A11, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 72 or 73.
In embodiments wherein the one or more biomarker for TB is SAMD9L, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 74 or 75.
In embodiments wherein the one or more biomarker for TB is STAT1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 76 or 77.
In embodiments wherein the one or more biomarker for TB is TLR6, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 78 or 79.
In embodiments wherein the one or more biomarker for TB is WARS, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 80 or 81.
In embodiments wherein the one or more biomarker for TB is DOCKS, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 82 or 83.
In embodiments wherein the one or more biomarker for TB is SIRPB2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 84 or 85.
In embodiments wherein the one or more biomarker for TB is ANKRD22, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 86 or 87.
In embodiments wherein the one or more biomarker for TB is ABCF2 (NM_005692.3), the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 88 or 89.
In embodiments wherein the one or more biomarker for TB is FNBP1L, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 90 or 91.
In embodiments wherein the one or more biomarker for TB is NCF1C, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 92 or 93.
In embodiments wherein the one or more biomarker for TB is TBC1D3B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 94 or 95.
In embodiments wherein the one or more biomarker for TB is SLC14A1, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 96 or 97.
In embodiments wherein the one or more biomarker for TB is CALCOCO2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 98 or 99.
In embodiments wherein the one or more biomarker for TB is GTF2B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 100 or 101.
In embodiments wherein the one or more biomarker for TB is HLA-B, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 102 or 103.
In embodiments wherein the one or more biomarker for TB is HLA-F, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 104 or 105.
In embodiments wherein the one or more biomarker for TB is MGST2, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 106 or 107.
In embodiments wherein the one or more biomarker for TB is SPAST, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 108 or 109.
In embodiments wherein the one or more biomarker for TB is WAC, the oligonucleotide probe typically comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of SEQ ID NOs: 110 or 111.
Use of a Data Analysis Algorithm
In one embodiment, comparison of the one or more biomarker or the biomarker profile to a reference or control comprises applying a decision rule, or using a decision tree, as described herein. The decision rule or decision tree can comprise a data analysis algorithm, such as a computer pattern recognition algorithm. Other suitable algorithms include, but are not limited to, logistic regression or a nonparametric algorithm that detects differences in the distribution of feature values (e.g., a Wilcoxon Signed Rank Test). The decision rule may be based upon one, two, three, four, five, 10, 20 or more features. In one embodiment, the decision rule or decision tree is based on hundreds or more of features. Applying the decision rule or decision tree may also comprise using a classification tree algorithm. For example, the control or reference biomarker profile may comprise at least three features or biomarkers, where the features are predictors in a classification tree algorithm. The data analysis algorithm predicts membership within a population (or class) with an accuracy of at least about 60%, at least about 70%, at least about 80% and at least about 90%.
Suitable algorithms are known in the art, some of which are reviewed in Hastie et al, supra. Such algorithms classify complex spectra from biological materials, such as a blood sample, to distinguish individuals as normal or as possessing biomarker expression levels characteristic of a particular disease state. While such algorithms may be used to increase the speed and efficiency of the application of the decision rule and to avoid investigator bias, one of ordinary skill in the art will realize that computer-based algorithms are not required to carry out the methods of the present invention.
Algorithms may be applied to the comparison of the one or more biomarker or the biomarker profiles, regardless of the method that was used to generate the data for the one or more biomarker or the biomarker profile. For example, suitable algorithms can be applied to biomarker profiles generated using gas chromatography, as discussed in Harper, “Pyrolysis and GC in Polymer Analysis” Dekker, New York (1985). Further, Wagner et al, Anal Chem 74: 1824-35 (2002) disclose an algorithm that improves the ability to classify individuals based on spectra obtained by static time-of-flight secondary ion mass spectrometry (TOF-SIMS). Additionally, Bright et al, J. Microbiol Methods 48: 127-38 (2002) disclose a method of distinguishing between bacterial strains with high certainty (79-89% correct classification rates) by analysis of MALDI-TOF-MS spectra. Dalluge, Fresenius J. Anal. Chem. 366: 701-11 (2000) discusses the use of MALDI-TOF-MS and liquid chromatography-electrospray ionization mass spectrometry (LC/ESI-MS) to classify profiles of biomarkers in complex biological samples.
Methods of Diagnosis
As described herein, the present invention provides a method for diagnosing TB in an individual, comprising determining the presence and/or amount of one or more biomarker for TB in a sample obtained from the individual, wherein the one or more biomarkers is selected from SNX10, CPVL, PF4V1, HERC2, LGALS3BP, BST1, BAZ1A, LYN, SERPINB1, WSB1, MVP, APBB1IP, MB21D1/C6orf150, TICAM2, CD52, KLRA1, DEFB128 and IL8. Any combination of biomarkers as disclosed herein may be used in a method according to the present invention.
The method may comprising obtaining a first biomarker profile from a first sample taken from the individual at a single initial point in time and multiple time points thereafter to monitor the efficacy of treatment and disease resolution; and comparing said individual's first biomarker profile to a reference or control biomarker profile, wherein said comparison determines the status of TB infection in the individual with an accuracy, sensitivity and/or specificity of at least about 90%, at least about 80%, at least about 70% or at least about 60%; and wherein the biomarker profiles comprise determining the presence and/or amount of one or more biomarker of the invention. Typically the accuracy, sensitivity and/or specificity is of at least about 80% or at least about 90%.
The method may comprise obtaining a first biomarker profile from a first sample from the individual; and comparing the individual's first biomarker profile to a reference or control biomarker profile obtained from a reference or control population, said comparison being capable of classifying the individual as belonging to or not belonging to the reference or control population, wherein the comparison determines the status of TB infection in the individual, and wherein the biomarker profiles comprise determining the presence and/or amount of one or more biomarker of the invention.
The method may comprise comparing a measurable characteristic of at least three biomarkers of the invention between (i) a first biomarker profile obtained from a first sample from the individual and (ii) a biomarker profile obtained from samples from a control or reference population; and classifying the individual as belonging to or not belonging to the control or reference population, wherein the comparison determines the status of TB infection in the individual, and wherein the measurable characteristic optionally comprises the presence and/or amount of the biomarker.
The method may comprise selecting at least two features from a set of biomarkers of the invention in a first biomarker profile generated from a first sample of the individual; and comparing the at least two features to a set of the same biomarkers in a biomarker profile generated from samples from a control or reference population, wherein the comparison is capable of classifying the individual as belonging to or not belonging to the control reference population with an accuracy, sensitivity and/or specificity of at least about 90%, at least about 80%, at least about 70% or at least about 60%, wherein the comparison determines the status of TB in the individual, and wherein the feature optionally comprises the presence and/or amount of the biomarker. Typically the accuracy, sensitivity and/or specificity is of at least about 80% or at least about 90%.
The method may comprise determining an abundance or a change in an abundance of at least three biomarkers contained in a first biomarker profile obtained from a first biological sample of the individual; and (b) comparing the abundance or the change in the abundance to an abundance or change in an abundance of said at least three biomarkers contained in biological samples from a control or reference population, wherein the comparison is capable of classifying the individual as belonging to or not belonging to the control or reference population; and wherein the comparison determines the status of TB in the individual.
The method may further comprise obtaining a second biomarker profile from a second sample taken from the individual; and comparing the individual's second biomarker profile to the control or reference biomarker profile; wherein the individual's second biomarker profile and the control or reference biomarker profile comprise features that are measurable characteristics of a biomarker of the invention, wherein the second comparison is capable of classifying the individual as belonging to or not belonging to the control or reference population, and wherein the second comparison determines the status TB infection in the individual. The biomarker profiles optionally comprise one or more of the biomarkers of the present invention, and the measurable characteristic optionally comprises the presence and/or amount of one or more biomarker of the invention.
The methods of the invention may be repeated at least once, at least twice, at least three times, at least four times, at least five times, or more. A separate biomarker profile can be obtained from the individual from a separate sample taken each time the method is repeated.
The methods of the invention may be used to diagnose, detect and/or predict TB, TB infection and/or infection with M. tuberculosis. The methods of the invention may be used to distinguish between active and latent TB, a TB infection and/or infection with M. tuberculosis. The methods of the invention may be used to distinguish between latent TB and the absence of TB. The methods of the invention may be used to identify an individual with an active TB infection and/or a latent TB infection. The methods of the invention may be used to identify an individual with an active TB infection and/or a latent TB infection and/or an individual uninfected with TB. The methods of the invention may be used to identify an individual with an early stage active TB infection and/or a late/later stage active TB infection. The methods of the invention may be used to distinguish between an early stage active TB infection and/or a late/later stage active TB infection. The methods of the invention may also be used to determine the status of TB, a TB infection and/or infection with M. tuberculosis in an individual. Determining the status of TB, a TB infection and/or infection with M. tuberculosis in an individual may comprise determining the progression or resolution of TB, a TB infection and/or infection with M. tuberculosis in the individual. Determining the status of TB, a TB infection and/or infection with M. tuberculosis in an individual may comprise determining the presence of active or latent TB, a TB infection and/or infection with M. tuberculosis in an individual. The methods of the invention may be used to determine whether an individual has been exposed to TB.
The methods of the invention may comprise applying a decision rule as described herein. Applying the decision rule may comprise using a data analysis algorithm, also as described herein. The data analysis algorithm may comprise at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 15, at least 20, at least 25, at least 50 or more input parameters. The data analysis algorithm may use any of the biomarkers of the invention, or combination of biomarkers of the invention as input parameters. Typically, the data analysis algorithm uses at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 15, at least 20, at least 25, at least 50 of the biomarkers of the invention (e.g. as listed in any one of Tables 2 to 5) as input parameters.
In a preferred embodiment, the features and/or biomarkers profile used in the methods of the invention are the one or more biomarkers of the present invention, as described herein, and preferably the methods relate to determining the presence and/or amount of the one or more biomarker. Similarly, the “measurable characteristic” in a method of the invention may be any quantitative or qualitative characteristic associated with one or more biomarker of the invention, and is preferably the presence and/or amount of said biomarker.
In a more preferred embodiment, the one or more biomarker of the invention is nucleic acids, selected from DNA or RNA, typically mRNA. The biomarker profile may comprise any measurable aspect of said nucleic acid biomarker, and is typically a measurable characteristic of an mRNA biomarker, such as the presence and/or amount of said mRNA biomarker. The one or more biomarker of the invention and/or the biomarker may comprise a measurable aspect of a nucleic acid biomarker that encodes a protein that is informative of the state of the immune system in response to TB, a TB infection and/or infection with M. tuberculosis in an individual.
As described herein, a method of the invention may comprise fractionation of the sample prior to determining the presence and/or amount of the one or more biomarker of the invention, or obtaining a biomarker profile. Typically, the method comprises at least one, at least two, at least three, at least four, at least five, or more separation methods as described herein. The at least one separation method may be selected from inflammatory cell separation, chemical extraction partitioning, ion exchange chromatography, gel electrophoresis, and any combination thereof.
The invention also provides the use of one or more biomarker for TB as defined herein in the manufacture of a diagnostic for TB. Said diagnostic may be for diagnosing active TB and/or latent TB and/or the absence of TB.
Kits and Devices
The invention also provides kits and devices that are useful in determining the status of TB, diagnosing or detecting TB, distinguishing between active and latent TB in an individual, distinguishing between early stage active TB and late/later stage active TB and/or to determine whether an individual has been exposed to TB. The kits and devices of the present invention comprise at least one biomarker of the invention and/or one or more agent for the detection of or for the determination of the amount of the one or more biomarker of the invention. Specific biomarkers and agents for the detection of said biomarkers useful in the present invention are set forth herein. The biomarkers of the kit or device can be used to generate biomarker profiles according to the present invention.
Generally, the biomarkers of the kit or biomarker will bind, with at least some specificity, to the biomarker molecules contained in the sample from which the biomarker profile is generated. Examples of classes of compounds of the kit or device include, but are not to, proteins (including antibodies of the invention), and fragments thereof, peptides, polypeptides, proteoglycans, glycoproteins, lipoproteins, carbohydrates, lipids, nucleic acids, organic and inorganic chemicals, and natural and synthetic polymers. The biomarker(s) and/or agent(s) for the detection of the one or more biomarker may be part of an array, or the biomarker(s) and/or agent(s) may be packaged separately and/or individually. The biomarker(s) and/or agent(s) may be immobilised on an inert support.
The kit or device may also comprise at least one internal standard to be used in generating the biomarker profiles of the present invention. Likewise, the internal standards can be any of the classes of compounds described above.
The kits and devices of the present invention also may contain reagents that can be used to detectably label biomarkers contained in the biological samples from which the biomarker profiles are generated. For this purpose, the kit or device may comprise a set of antibodies or functional fragments thereof that specifically bind at least two, three, four, five, 10, 20, 30, 40, 50 or more, up to all 55 of the biomarkers set forth in any one of Tables 2 to 6 that list biomarkers for use in the invention. The antibodies themselves may be detectably labelled. The kit or device also may comprise a specific biomarker binding component, such as an aptamer.
In a preferred embodiment, a kit or device of the invention comprises (i) one or more antibody specific for the one or more biomarker for tuberculosis; or (ii) one or more oligonucleotide specific for the one or more biomarker for tuberculosis. In a more preferred embodiment, the one or more oligonucleotide specific for the one or more biomarker for tuberculosis is an oligonucleotide is an oligonucleotide of the invention, more preferably one or more of SEQ ID NOs: 1 to 111 or 168 to 171.
If the biomarkers comprise a nucleic acid, the kit or device may provide one or more oligonucleotide probe that is capable of forming a duplex with the one or more biomarker or with a complementary strand of said one or more biomarker. The one or more oligonucleotide probe may be detectably labelled. Typically, the one or more oligonucleotide probe used in the methods of the invention is selected from one or more of the oligonucleotide described herein. In a preferred embodiment, the one or more oligonucleotide probe is selected from an oligonucleotide probe that comprises or is complementary to at least one nucleic acid sequence having at least 80% sequence identity to the nucleic acid sequence of any one or more of SEQ ID NOs: 1 to 111 or 168 to 171.
The kits and devices of the present invention may also include pharmaceutical excipients, diluents and/or adjuvants when the biomarker is to be used to raise an antibody. Examples of pharmaceutical adjuvants include, but are not limited to, preservatives, wetting agents, emulsifying agents, and dispersing agents. Prevention of the action of microorganisms can be ensured by the inclusion of various antibacterial and antifungal agents, for example, paraben, chlorobutanol, phenol sorbic acid, and the like. It may also be desirable to include isotonic agents such as sugars, sodium chloride, and the like.
The Following Examples Illustrate the Invention
Naive Cynomologous macaques (Macaca fascicularis) aged between 2-4 yrs from two separate breeding colonies i.e. of Mauritian or Chinese origin, from established United Kingdom or Chinese breeding facilities were challenged with were challenged with live M. tuberculosis using aerosol challenge Erdman strain K 01. They were confirmed as naïve in terms of prior exposure to mycobacterial antigens (M. tuberculosis infection or environmental mycobacteria), by negative tuberculin test while in their original breeding colony and prior to the start of the study using the gamma interferon (IFN-γ)-based Primagam test kit (Biocor; CSL). All procedures involving animals were approved by the Ethical Review Committee of the Centre for Emergency Preparedness and Response, Salisbury, United Kingdom.
Mono-dispersed bacteria in particles were generated using a three-jet Collison nebulizer (BGI) and in conjunction with a modified Henderson apparatus, delivered to the nose of each sedated primate via a modified veterinary anaesthesia mask. The challenge was performed on sedated animals placed within a “head-out” plethysmography chamber (Buxco, Wilmington, N.C.), to enable the aerosol to be delivered simultaneously with the measurement of the respiration rate. None of the animals had been used previously for experimental procedures.
Whole heparinised blood was obtained at three independent time points prior to challenge and at one, two, four and six weeks post M. tuberculosis challenge. Within 1 hour of collection, 1 ml of blood from each animal was mixed with 5 ml of Erythrocyte Lysis (EL) Buffer (Qiagen) followed by incubation on ice for 10-15 min. Peripheral blood leukocytes (PBLs) were recovered from erythrocyte-lysed blood by centrifugation at 400×g for 10 min at 4° C. and re-suspended in a further 2 ml of EL buffer. PBLS were again recovered by centrifugation as described above and processed for recovery of total RNA.
One ml of TRIzol® was added to the PBL pellet and then total RNA was extracted from the lysed PBL pellet according to the manufacturer's instructions, using aqueous-phase separation with chloroform isoamyl alcohol and the precipitation using 2-isopropanol. Recovered, dried RNA pellets were re-suspended in 10 μl of diethylpyrocarbonate (DECP) water (Invitrogen), then concentration and purity (A260/A280 ratio ≥1.8) assessed by spectrophotometry using a NanoDrop™ ND-1000 spectrophotometer (Thermo Scientific). Genomic DNA was removed prior to its use in further procedures using the DNase I kit (Qiagen), according to the manufacturer's instructions. The GeniSphere SenseAmp RNA amplification kit according to manufacturer's instructions. The resulting amplified cRNA was purified using RNeasy® Min-Elute Cleanup kit (Qiagen), again according to the manufacturer's protocol. The cRNA concentration and purity (A260/A280 ratio ≥1.8) was then assessed by spectrophotometry using a NanoDrop™ ND-1000 spectrophotometer.
Total amplified cRNAs were then labelled with Cy3 and hybridised to replicate Operon Human Genome AROS V4.0 slides (n=3/sample/time point), using established protocols. The slides were air-dried and scanned using an Affymetrix 480 microarray scanner, at a gain threshold of 65. Feature extraction was then conducted using the microarray quantification package BlueFuse™ (BlueGnome ltd.). Raw data were then exported and hybridisation fluorescence intensities quantified using the software analysis program Bluefuse™, using default background subtraction and normalisation methods, to remove data generated from poor-quality spots, hybridisation artifacts. All raw data were then processed further using the microarray analysis package Genespring 12.5.
Data output files from BlueFuse were imported into GeneSpring 12.5 (GX12) for differential gene expression and statistical analysis. Raw data was normalized to the 50th percentile followed by median baseline transformed to the corresponding animal pre-bleed. This was conducted to normalise data across all time points and assess differential gene expression of each gene entity, relative to a baseline i.e. pre-bleed level of expression prior to M. tuberculosis challenge. The mean value across three replicate samples slides for each feature was used for further analysis. Data were assessed for quality, then filtered on gene expression where entities in at least 100 percent of samples and in any one out of one conditions had normalised expression values within the cut-off-10.699 to 7.037. Statistically significant features were identified using one-way ANOVA analysis across all entities and time points, using the Benjamini-Hochberg False Discovery Rate (BH-FDR) at a cut-off p<0.05. To identify temporally, differentially expressed entities between time-points post-infection, fold-change cut-off analysis was conducted, all against the pre-bleed condition and where the minimum number of pairs was equal to 1 out of the 4 condition pairs i.e. weeks 1, 2, 4 or 6 and using the default cut-off setting >2.0.
Data outputs were also analysed using Artificial Neural Network Analysis (ANN). Normalised expression data was analysed using ANN based data mining approach (Lancashire L J et al (2010), Breast Cancer Res Treat. February; 120(1):83-93). This approach comprised a supervised learning approach where the data for a given single probe was used to classify a known sample. The classifier consisted of a multi-layer perceptron ANN, where weights were updated by a back propagation algorithm (Rumelhart D E et al (1986) Nature 323: 533-536). The ANN architecture utilised a constrained architecture to prevent over-fitting, having only 2 hidden nodes in the hidden layer. ANN training incorporated Monte Carlo Cross Validation, where, the data was randomly divided into three subsets; 60% for training the classifier, 20% for testing (to assess model performance during the training process) and 20% for validation (to independently test the model on data completely blind to the model). This process of random sample cross validation was utilised to prevent over-fitting of the data and assess how well the model would perform on a blind data set. This random re-sampling and training process was repeated 50 times to generate predictions and associated error values for each sample with respect to the validation (blind) data. Probes were ranked in ascending order based on predictive error for test data from the Monte Carlo Cross validation. Significant hits were identified by cross-comparison between ANOVA p value-based (lowest to highest) and ANN test error-based ranked order lists (lowest to highest) and further filtered using the heat map and cluster functions in Genespring 12.0, using default settings. Highly significant biomarker datasets were refined by cross comparison of entity lists obtained using either one way ANOVA (P≥0.05) or ANN analysis (top one thousand entities ranked on average test error). Fifty-five biomarkers were selected for further progression from these gene lists.
All fifty-five biomarkers and individual smaller panels of up to ten biomarkers each were used to interrogate previously published human datasets using the cluster algorithm of GeneSpring 12.5, using the unsupervised hierarchical Euclidean clustering setting on conditions and entities. Small, select panels of biomarkers more amenable to use on point of care diagnostic platforms were identified which exhibited the best sensitivity and specificity in discriminating active Tuberculosis patients from Latent Tuberculosis and controls in one analysis and also in discrimination Latent Tuberculosis from uninfected controls in a second tier analysis. These are given below in Table 1.
All TB 55 Panel; all Biomarkers combined from Tables 2-5
Active TB 8 Panel; LOC400759, PF4V1, ALPK1, HERC2, IRF1, MMP9, GBP1, CD96
Latent TB 5 Panel; HERC2, KLRAP1, PF4V1, DEFB128, IL8
aBerry MPR. et al (2010) Nature 466(7309): 973-977
bMaertzdorf J et al (2011) PLoS One 6(10): e26938
Table 2 lists the genes newly identified as biomarkers for TB using the above methods. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types (all white blood cells in a sample, monocytes, neutrophils, CD4 positive T cells, CD8 positive T cells, etc.) the change was observed. Where more than one indication is given, the first listed is the most preferred.
Table 3 lists further biomarkers for TB. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.
Table 4 lists the genes identified as biomarkers for latent TB using the above methods. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.
Table 5 lists further biomarkers for latent TB. Also given are the sequence identifiers for the identified genes and an indication of whether expression of the various genes are up or down regulated in TB compared with a control/reference population, and in what cell types the change was observed. Where more than one indication is given, the first listed is the most preferred.
Table 6 lists the various probes used to detect the various biomarkers of the invention
Whole blood samples were obtained from the following cohorts: (1) Caucasian controls-professional individuals recruited locally to the project team who constitute a low risk group, coming from non/low-TB endemic regions, such that their risk of having been exposed to TB is extremely low (CC); (2) Controls of Asian descent recruited from Hindu temples in London who tested negative for TB in skin and/or IFNγ tests and originate from high-incidence areas of TB (LC or NMRL CNTRL); (3) individuals of Asian descent recruited from Hindu temples in London and test positive for TB in Mantoux skin and/or IFNγ tests and diagnosed with latent TB (LTB or NMRL LTNT); (4) individuals with early stage active TB recruited at St. Thomas's and Royal Free hospitals in London (EATB); and (5) individuals of Asian descent recruited at the Jawaharlal Institute of Postgraduate Medical Education and Research (JIPMER), India, diagnosed with active TB (ATB).
Whole blood was obtained at a single time point in PaxGene or Tempus RNA stabilization tubes. Control and Latent TB Blood were collected using PaxGene tubes. Early Active and Active TB blood were collected using Tempus tubes. Blood collected in PaxGene tubes was mixed by inversion and incubated at Room Temperature (˜25° C.) for 2 hours before storing at −80° C. Blood collected in Tempus tubes was vortexed at full speed for 10 seconds and then stored at −20/−80° C. RNA was extracted from these respective tubes according to the manufacturer's instructions. Concentration and purity (A260/A280 ratio ≥1.8) were assessed by spectrophotometry using a NanoDrop ND-1000 spectrophotometer. The purity of the extracted RNA was analysed using Agilent Bioanalyzer according to the manufacturer's instructions. cDNA was synthesized from the extracted RNA using the Roche Transcriptor First Strand cDNA Synthesis Kit according to the manufacturer's instructions.
Quantitative real-time PCR analyses were performed using the Roche Lightcycler (LC) 480 in 384 well plate format. The LC 480 is a plate-based, highly adaptable and versatile real-time PCR system used for gene expression analysis and has been designed for enhanced throughput and efficiency without compromising sensitivity and specificity. Roche provide an online ‘target to assay’ design and build system, the ‘Realtime Ready configurator’, which can be used to generate quantitative real-time PCR (qPCR) assays in a number of plate formats. Assays for the biomarker targets of interest were designed using this system and the assay plates configured, tested and quality assured by Roche. It uses a dual-colour assay, 165-FAM labelled Universal Probe Library (UPL) probe system and Roche provide all platform-specific dedicated reagents. Assay plates are dispatched in desiccated format, each well containing a target-specific primer pair and assay-specific human LC 480 Universal Probe Library (UPL) probe, coated in the bottom of each of the 384 wells.
All assays were performed using default protocols according to the manufacturer's instructions. Four human reference genes were used throughout, for quantifying the expression of genes of interest. In short, synthesized cDNA was mixed with Roche LC480 probes master mix at a constant dilution and pipetted into the 384 well assay plates using a Qiagility™ robotic pipetter. This reduced manual handling minimised pipetting errors and ensured reagent distribution uniformity throughout the plate wells. Data outputs were quantified using the Lightcycler 480 software and then expressed as a numeric figure of the ratio of the fold-change difference of the target vs the mean of all four reference genes. All raw data were then exported and processed further using the microarray analysis package Genespring GX 12.5 ((GX 12.5).
Data output files from BlueFuse were imported into GX 12.5 for differential gene expression and statistical analysis. Averaged data were imported without further normalisation and then baseline transformed to the median of all samples. Data were assessed for quality, then filtered on gene expression where entities in at least 100 percent of samples and in any one out of one conditions had normalised expression values within the cut-off 0 to 329.0 where at least 1 feature out of all samples had values within range. Statistically significant features were identified using one-way ANOVA analysis across all entities and time points at a cut-off p<0.05. To identify differentially expressed entities between the groups T-tests (unpaired, unequal variance) were performed on the samples arranged by group at a p<0.05 using the cut-off fold-change setting >1.2. The outputs were visualised using the boxplot graphical output facility.
Table 7 lists the genes newly identified as biomarkers for TB in Example 1 (see Table 2 above). Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.
Table 8 provides the results of the qPCR analysis of the human cohort samples using the further biomarkers for TB listed in Table 3 above. Also given are the results oft-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.
Table 9 provides the results of the qPCR analysis of the human cohort samples using the genes identified as biomarkers for latent TB in Example 1 (see Table 4 above). Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.
Table 10 provides the results of the qPCR analysis of the human cohort samples using the further biomarkers for latent TB listed in Table 5 above. Also given are the results of t-tests comparing the expression of a given marker between different test/control groups. The final two columns give ANOVA p values illustrating the significance of the biomarkers as determined using the qPCR technique.
In Tables 7 to 10 and
Set out below are the nucleotide sequences of the TB biomarkers disclosed herein. Exemplary target regions within the biomarker sequences are underlined, and exemplary probe sequences are double underlined.
a
caggcccaa tgtgcctcat tgagaacact aatgggcgac tgatggcgaa tccagaagct
ctgaagatcc t
ttctgccat tacgcagcct gtggtggtgg tggcgactgt gggccgctag
cgcacaggaa aatcctacct gattaacaag ctggctcaga agaaaaaggg cttctctctg
ggctccacag tgcagtctca cactaaagga atctggatgt ggtgtatgcc ccatcccaag
aagccaggcc acatcctagt tctgctggac accgagggtc tgggagatgt agagaagggt
gacaaccaga atgactcctg gatcttcgcc ctggccgtcc tcctgaacag cacttccatg
tacaatagca taggaaccat taaccagcag gccatggacc aactgcagta tcctttgtga
ccca
gaacag caccaagtgg aacgtgtgaa agctgagtct gcacaggctt cagcaaaaat
gttgcagcaa atgc
aaagaa agaatgagca gatgatggaa cagaaggaga ggagttatca
agtccaacca aaatcttcac ctgatgagaa tgagaatgag gattcagctg actttgagag
c
ttcttccca gactttgtgt tgacactgag agatttctag cattacagaa agcgcttttg
gacaaaactg t
gataaaata aactaaatgg aggacttttt tttattggaa tagtttcatt
gttcctggag tgcacggaag tcacaactgg tctatcagtc cagacgggg
g cctttggtca
aatattcttc tgattacttc caagccccct ctgactacag atactacccc taccagt
cct
tccagactcc acaacacccc agcttcctct tccaggacaa gagggtgtcc tggtccctgg
tctacctccc caccatccag agctgctgga actacggctt ctcctgctcc tcggacgagc
tccctgtcct gggcctcacc aagtctggcg gctcagatcg
caccattgcc tacgaaaaca
aagccctgat gctctgcgaa gggctcttcg tggcagacgt caccgatttc gagggctgga
aggctgcgat tcccagtgcc ctggacacca acagctcgaa gagcacctcc tccttcccct
gcactgctga gtcccgagca gcggaacaag aactgcacag ccatctggga agcctttaaa
gtggcgctgg acaaggatcc ctgctccgtg ctgccctcag actatgacct ttttattaac
ttgtccaggc actctattcc cagagataag tccctgttc
t gggaaaatag ccacctcctt
gttaacagct ttgcagacaa cacccgtcgt tttatgcccc tgagcgatg
t tctgtatggc
agggttgcag atttcttgag ctggtgtcga cagaaaaatg actctggact cgattaccaa
tcctgcccta catcagaaga ctgtgaaaat aatcctgtgg attccttttg gaaaagggca
tccatccagt attccaagga
tagttctggg gtgatccacg tcatgctgaa tggttcagag
ccaacaggag cctatcccat caaaggtttt
tttgcagatt atgaaattcc aaacctccag
ctattttgaa gttcagattt cagatgaggc aacattttct tgagataatt acccaagttt
catccatgtt gaatggtaca aaatatttct gtgaaactaa caggaagata ttttca
gata
actaggataa cttgttgctt tgttacccag cctaattgaa gagtggcaga ggctactaca
aaaagc
aacc ttttcatttt cactaagagt ttaaaagcta ttgtattatt aaaaagtctt
tacaatgctt gtttcaaaga accaacagaa aaaaaagcta agaaaactga gaactaacat
taaaaaaatt aaatttagaa taagaatgat ttctttaatt tgtccttttt ttctttggtc
taaaacatta ttaaattttt gtaaatattt tgatttaatg tgtcttagat cctcattatt
ttaatacagg aaaagaaaag atttagtaat ttcttaccat gctaatatgt aa
agttcatg
ccatccaggc atttaagagc gatcctcatc ccttcagcaa tatgtatttg agttcacact
a
tttctgttt tacagcagtt ttgaaaaaca catactatgc caccaattgt catattattt
ttgcagcagt tactggcgtc cctgagggcc tccatcctcg ctcgggactg tgcggctgcg
gcggctattg tgttcttggt ggaccg
gttc ctgtatgggc tcgacgtctc tggaaaactt
ctgcaggtcg ccaaaggtct ccacaagttg cagcca
gcca cgccaattgc cccgcaggtg
gttattcgcc aagcccgaat ctccgtgaac tcaggaaaac ttttaaaagc agagtatatt
ctgagcagtc taataagcaa caatggagca acgggtacct ggctgtacag aaatgaaagt
gacaaggtcc tggtgcagtc ggtctgtata cagatcagag ggcagattct gcaaaagctg
gggatgtggt acgaagcagc agagttaata tgggcctcca ttgtaggata tttggcactt
cctcagccgg ataaaaaggg cctctccacg tcgctaggta tactggcaga catctttgtt
tccatgagca agaacgatta tgaaaagttt aaaaacaatc cacaaattaa tttgagcctg
ctgaaggagt ttgaccacca tttgctgtcc gctgcagaag cctgcaagct ggcagctgcc
ttcagtgcct atacgccgct cttcgtgctc acagctgtga atatccgtgg cacgtgttta
ttgtcctaca gtagttcaaa tgactgtcct ccagaattga aaaacttaca tctgtgtgaa
gccaaagagg cctttgagat tggcctcctc accaagagag atgatgagcc tgttactgga
aaacaggagc ttcacagctt tgtcaaagct gctttcggtc tcaccacagt gcacagaagg
ctccatgggg agacagggac ggtccatgca gcaagtcagc tctgtaagga agcaatgggg
aagctgtaca atttcagcac ttcctccaga agtcaggaca gagaagctct gtctcaagaa
gttatgtctg tgattgccca ggtgaaggaa catttacaag ttcaaagctt ctcaaatgta
gatgacagat cttatgttcc cgagagtttc gagtgcaggt tggataaact tatcttgcat
gggcaagggg atttccaaaa aatccttgac acctattcac agcaccatac ttcggtgtgt
gaagtatttg aaagtgattg tggaaacaac aaaaatgaac agaaagatgc aaaaacagga
gtctgcatca ctgctctaaa aacagaaata aaaaacatag atactgtgag tactactcaa
gaaaagccac attgtcaaag agacacagga atatcttcct ccctaatggg taagaatgtt
cagagggaac tcagaagggg aggaaggaga aactggaccc attctgatgc atttcgagtc
tccttggatc aagatgtgga gactgagact gagccatcgg actacagcaa tggtgaggga
gctgttttca acaagtctct gagtggcagc cagacttcca gtgcttggag caacttatca
gggtttagtt cctctgcaag ctgggaggaa gtgaattatc acgttgacga caggtcagcc
agaaaagagc ctggcaaaga acatctggtg gacactcagt gttccactgc cttgtctgag
gagctagaga atgacaggga aggcagagct atgcattcat tgcattcaca gcttcatgat
ctctctcttc aggaacccaa caatgacaat ttggagcctt ctcaaaatca gccacagcaa
cagatgccct tgacaccctt ctcgcctcat aataccccag gcattttctt ggcccctggt
gcagggcttc tagaaggagc tccagaaggt atccaggaag tcagaaatat gggacccaga
aatacttctg ctcactccag accctcatat cgttctgctt cttggtcttc tgattctggt
aggcccaaga atatgggcac acatccttca gtccaaaaag aagaagcctt tgaaataatt
gttgagtttc cagaaaccaa ctgcgatgtc aaagacaggc aggggaaaga gcagggagaa
gaaattagtg aaagaggcgc aggccctaca tttaaagcta gtccctcctg ggttgaccca
gaaggagaaa cagcagaaag cactgaagat gcacccttag actttcacag ggtcctgcac
aattctctgg gaaacatttc catgctgcca tgtagctcct tcacccctaa ttggcctgtt
caaaatcctg actccagaaa aagtggtggc ccagtcgcag agcagggcat cgaccctgat
gcctccacag tggatgagga ggggcaactg ctcgacagca tggatgttcc ctgcacaaat
gggcacggct ctcatagact gtgcattctg agacagccgc ctggtcagag ggcggagacc
cccaattcct ctgtaagcgg taacatcctc ttccctgtcc tcagcgagga ctgcactacc
acagaggaag gaaatcagcc tggaaacatg ctaaactgca gccagaactc cagctcatcc
tcagtgtggt ggctgaaatc acctgcattt tccagtggtt cttctgaggg ggacagccct
tggtcctatc tgaa
ttccag tgggagttct tgggtttcat tgccgggaaa gatgaggaaa
gagatccttg aggctcgcac cttg
caacct gatgactttg aaaagctgtt ggcaggagtg
tgaagccatt ttcgtagaga aatatccttc actataatca aataagtttt gtcccatcaa
ttccaaagat gtttccagtg gtgctcttga agaggaatga gtaccagttt taaattgccc
attggcattt gaaggtagtt gagtatgtgt tctttattcc tagaagccac tgtgcttggt
agagtgcatc actcaccaca gctgcctcct gagctgcctg agcctggtgc aaaaggattg
gcccccatta
tggtgcttct gaataaatct tgccaagata gacaaacaat gatgaaactc
agatggagct tcctactcac
gttgatttat gtctcacaat cctgggtatt gttaattcaa
catagggtga aactatttct gataaagaac ttttgaaaaa ctttttatac tctaaagtga
tactcagaac aaaagaaagt cataaaactc ctgaatttaa tttccccacc taagtcgaaa
cagtattatc aaaacacatg tgcacacaga ttattttttg gctccaaaac tggattgcaa
aagaaagagg agaagaatat tttgtgtgtt
cctggtattc ttttataagt aaagtttacc
caggcatgga ccagcttcag ccagggacaa aatcccctcc
caaaccactc tccacagctt
tgccagtcgt tctactcgcc acagtcatgg cccactgcaa gcagatgtat ttgtggaatt
gcttagtcct cgtagaaaac gcagaggcag gaaaagtgct aataatacac cagaaaatag
tcccaacttc cctaacttca
gagtcattgc cacaaagtca agtgaacagt caagatctgt
aaatattgct tcaaaacttt ctctccaaga
gagtgaatcc aaaagaagat gcagaaaaag
acaatctcca gagccatcgc ctgtgacact gggtcgaagg agttctggcc gacagggagg
agttcatgaa ttgtctgctt ttgaacaact tgttgtagaa ttggtacgac atgatgacag
ctggcctttt ttgaaacttg tttctaaaat ccaggtccca gactactatg acatcatcaa
aaagcccatt gccttaaata taattcgtga aaaagtgaat aagtgtgaat ataaattagc
atctgagttt attgatgaca ttgagttaat gttttcgaac tgctttgaat acaaccctcg
taacacaagt gaagcaaaag ctggaactag gcttcaagca ttttttcata ttcaggctca
aaagcttgga ctccacgtca
cacccagtaa tgtggaccaa gttagcacac caccggctgc
gaaaaagtca cgaatctgac tttgtccttc
taaaggatat atttgaagaa aaacaaattg
ccaacgtttg actacttaca gagcgtcctg gatgatttct acacagccac ggaagggcaa
taccagcagc agccttagag cacagggaga cccgtccatt tggcaggggt ggctgcctca
tttagagagg
aaaagtaacc atcactggtt gcacttatga tttcatgtgc ggggatcatc
tgccgtgcct ggatcctgaa
atagaggcta aattactcag gaagaacacc ctctaaatgg
gaaagtattc tgtactctta gatggattct ccactcagtt gcaacttgga cttgtcctca
gcagctggta atcttgctct gcttgacaac atctgagtgc agccgtttga gaagaaaaca
tctattctct ccaaaaatgc acccaactag ctctatgttt acaaatggac ataggactca
aagtttcaga gaccattgca atgaatcccc aataattgca gaactaaact catttataaa
gctaaaataa ccggatatat acatagcatg acatttcttt gtgctttggc ttacttgttt
aaaaaaaaaa aaaaactaat ccaacctgtt agattttgca ggtgaagtca gcagcttaaa
aatgtctttc ccagatttca atgatttttt tccccctacc tcccaaaatc tgagactgtt
aaaacatttt
tcttctatga acactgctca gacctgctag acatgccata ggagtggcgt
gcacatctct ctctcttcca
gcaggaggag cccgtgagca cgcacagctg ccctgtctgc
tcacccgaag gcaccgggct cacctggacc tcccaggaaa gggagaagag cctcagaaac
gaggatctac ccgggtccct gcctggccag tggggaaaca ccggtccccc aggcaccttc
acctaaccag agcggggatt
tccaccgccc ctcatgccgc cctttggagg aaagtgaaag
tgaaaggagg aagaggaggc ttcatggctg
aggaggtcgc agcgccatga agtccctgtc
tctgctcctc gctgtggctt tgggcctggc gaccgccgtc tcagcaggac ccgcggtgat
cgagtgttgg ttcgtggagg atgcgagcgg aaagggcctg gccaagagac ccggtgcact
gctgttgcgc cagggaccgg gggaaccgcc gccccggccg gacctcgacc ctgagctcta
tctcagtgta cacgaccccg cgggcgccct ccaggctgcc ttcaggcggt atccccgggg
cgcccccgca ccacactgcg agatgagccg cttcgtgcct ctccccgcct ctgcgaaatg
ggccagcggc ctgacccccg cgcagaactg cccgcgggcc ctggatgggg cttggctgat
ggtcagcata tccagcccag tcctcagcct ctccagcctc ttgcgaccac agccagagcc
tcagcaggag cctgttctca tcaccatggc aacagtggta ctgactgtcc tcacccacac
ccctgcccct cgagtgagac tgggacaaga tgctctgctg gacttgagct ttgcctacat
gccccccacc tccgaggccg cctcatctct ggctccgggt ccccctccct ttgggctaga
gtggcgacgc cagcacctgg gtaagggaca tctgctcctg gctgcaactc ctgggctgaa
tggccagatg ccagcagccc aagaaggggc cgtggcattt gctgcttggg atgatgatga
gccatggggc ccatggaccg gaaatgggac cttctggctg cctacagttc aaccctttca
ggagggcacc tatctggcca ccatacacct gccatacctg caaggacagg tcaccctgga
gcttgctgtg tacaaacccc ccaaagtgtc cctgatgcca gcaacccttg cacgggccgc
cccaggggag gcacccccgg aattgctctg ccttgtgtcc cacttctacc cttctggggg
cctggaggtg gagtgggaac tccggggtgg cccagggggc cgctctcaga aggccgaggg
gcagaggtgg ctctcggccc tgcgccacca ttccgatggc tctgtcagcc tctctgggca
cttgcagccg cccccagtca ccactgagca gcatggggca cgctatgcct gtcgaattca
ccatcccagc ctgcctgcct cggggcgcag cgctgaggtc accctggagg tagcaggtct
ttcagggccc tcccttgagg acagcgtagg ccttttcctg tctgcctttc ttctgcttgg
gctcttcaag gcactgggct gggctgctgt ctacctgtcc
acctgcaagg attcaaagaa
gaaagcagag tgagggcact cactgccatc ctgtggaagc caccatcatc
tctggcccaa
gcttctgtag tagctcccta aaataatacc ctatcatctg ctcctaatcc ctccaatctc
tctccactga gtggctggaa tgcttttttt tttttctttc acttatataa gggataattt
gtccacaagt catttgtgga agtgaatgaa gagggaacag aggcggcagc tgcc
acagca
ggcatcgcaa ctttctgcat gttgatgccc gaagaaaatt tcactgccga ccatccattc
cttt
tcttta ttcggcataa ttcctcaggt agcatcctat tcttggggag attttcttcc
ccttagaaga aagagactgt agcaatacaa aaatcaagct tagtgcttta ttacctgagt
ttttaataga gccaatatgt cttatatctt taccaataaa accactgttc agaaacaagt
ctttcatttt ctttgtaagt ttggctctgt tggctgttta cacccatgaa ttttggcatg
ggtatctatt tttctttttt acattgaaaa aaatccagtg gttgcttttg aatgcatcaa
gtaaagaaga agaaaagaat
acatccgatg cgtagattct tgaccatgta gtaatctata
aaattgctat atcctcctga tagccatggg
aaaacatgat aagatggtca tttattttgc
agttagaatt ttggaagcca caaaatagac agacaccctg actgttgaag ggaggtttaa
aaacagatat tcaattgaaa tgtaagagag caccccaatt gagagcccag gttacgaaga
caagcttgcc tcgcctgact tttctgtccc ttgttctgca ggattagtat tctgttacag
acctctagtt tttagactct tcaattaaag ggccaatggt tataacctgc attccctttt
ttgttcttct ttatgtataa tatatagttc atgtggcgct gcatgaaatc aagaagtggg
tgtcttagga taaaagatac caagagtcta caaaaataac catgtagtaa gataaactgc
cacgagcgca tctactgtgc actctctggt tcagctgctg atgcccaagc cgtggccgac
atggccgcct accagctgga gctccatggg atagaactgg aggaacctcc acttgttttg
gctgctgcaa atgtggtgag aaatatcagc tataaatatc gagaggactt gtctgcacat
ctcatggtag ctggctggga ccaacgtgaa ggaggtcagg tatatggaac cctgggagga
atgctgactc gacagccttt tgccattggt ggctccggca gcacctttat ctatggttat
gtggatgcag catataagcc aggcatgtct cccgaggagt gcaggcgctt caccacagac
gctattgctc tggccatgag ccgggatggc tcaagcgggg gtgtcatcta cctggtcact
attacagc
tg ccggtgtgga ccatcgagtc atcttgggca atgaactgcc aaaattctat
gatgagtgaa ccttccccag acttctcttt cttattttgt aataaactct ctagggccaa
aacctggtat ggtcattggg aaatgagtgc tcagggagat ggagcttagg
ggaggtgggt
gcttccctcc tagatgtcag catacactct ttcttctttt gtcccaggtc taaaacatct
cagagtatgg gacctgaaag atgatggaaa catgatgaaa gtattgaggg ggcatcagaa
ttgggtgtac agctgtgcat tctctcctga ctcttctatg ctgtgttcag tcggagccag
taaagcagtt ttcctttgga atatggataa atacaccatg atacggaaac tagaaggaca
tcaccatgat gtggtagctt gtgacttttc tcctgatgga gcattactgg ctactgcatc
ttatgatact cgagtatata tctgggatcc acataatgga gacattctga tggaatttgg
gcacctgttt cccccaccta ctccaatatt tgctggagga gcaaatgacc ggtgggtacg
atctgtatct tttagccatg atggactgca tgttgcaagc cttgctgatg ataaaatggt
gaggttctgg agaattgatg aggattatcc agtgcaagtt gcacctttga gcaatggtct
ttgctgtgcc ttctctactg atggcagtgt tttagctgct gggacacatg acggaagtgt
gtatttttgg gccactccac ggcaggtccc tagcctgcaa catttatgtc gcatgtcaat
ccgaagagtg atgcccaccc aagaagttca ggagctgccg attccttcca agcttttgga
gtttctctcg tat
cgtattt agaagattct gccttcccta gtagtaggga ctgacagaat
acacttaaca caaacctc
aa gctttactga cttcaattat ctgtttttaa agacgtagaa
gatttattta atttgatatg ttcttgtact gcattttgat cagttgagct tttaaaatat
tatttataga caatagaagt atttctgaac atatcaaata taaatttttt taaagatcta
actgtgaaaa catacatacc tgtacatatt tagatataag ctgctatatg ttgaatggac
ccttttgctt ttctgatttt tagttctgac atgtatatat tgcttcagta gagccacaat
atgtatcttt gctgtaaagt gcaaggaaat tttaaattct gggacactga gtt
agatggt
aaatactgac ttacgaaagt tgaattgggt gaggcgggca aatcacctga ggtcagcagt
ttg
agactag cctggcaaac atgatgaaac cctgtctcta ctaaaaatac aaaaaaaaaa
aaaattagcc aggcgtggtg gtgcacacct gtagtcctag ctacttggga ggctgaggca
ggagaattgc ttgaacccag gaggtggagg ttgcagtaag ccaagatcac accactgcac
tccaacctgg acaacagagc gagactccat ctcaaaaaaa aaaaaaaatt gtgttgcctc
acttccataa gaactcagcc cgcatcattc gcactgctgt ctttggcttt gagacctcgg
aagcgaaggg ccccgatggc atggccctgc ccaggccccg ggaccaggct gtcttccccc
aaaacgggct ggtggtcagc agtgtggacg tgcagtcagt ggagcctgtg gatcagagga
cccgggacgc cctgcaacgc agcgtccagc tggccatcga gatcaccacc aactcccagg
aagcggcggc caagcatgag gctcagagac tggagcagga agcccgcggc cggcttgagc
ggcagaagat cctggaccag tcagaagccg agaaagctcg caaggaactt ttggagctgg
aggctctgag catggccgtg gagagcaccg ggactgccaa ggcggaggcc gagtcccgtg
cggaggcagc ccggattgag ggagaagggt ccgtgctgca ggccaagcta aaagcacagg
ccttggccat tgaaacggag gctgagctcc agagggtcca gaaggtccga gagctggaac
tggtctatgc ccgggcccag ctggagctgg aggtgagcaa ggctcagcag
ctggctgagg
tggaggtgaa gaagttcaag cagatgacag aggccatagg ccccagcacc atcagggacc
ttgctgtggc tgggcctgag atgcaggtaa aactgctcca gtccctgggc ctgaaatcaa
ccctcatcac cgatggctcc actcccatca acctcttcaa cacagccttt gggctgctgg
ggatggggcc cgagggtcag cccctgggca gaagggtggc cagtgggccc agccctgggg
aggggatatc cccccagtct gctcaggccc
ctcaagctcc tggagacaac cacgtggtgc
ctgtactgcg ctaactcctg attaatacaa tggaagtttc
tgggcattta caatttcaac
gcgccccgca gagcagtcga gatgggtgag tcaagtgaag acatagacca aatgttcagc
actttgctgg gagagatgga tcttctgact cagagtttag gagttgacac tctccctcct
cctgacccta atccacccag agctgaattt aactacagtg tggggtttaa agatttaaat
gagtccttaa atgcactgga agaccaagat ttagatgctc tcatggcaga tctggtagca
gacataagtg aggctgagca gaggacaatc caggcacaga aagagtcctt gcagaatcaa
catcattcag catctctaca agcatcaatt ttcagtggtg cagcctctct tggttatgga
acaaatgttg ctgccactgg tatcagccaa tatgaggatg acttaccacc tccaccagcc
gatcctgtgt tagaccttcc actgccacca ccacctcctg aacctctctc tcaggaagag
gaagaagccc aagccaaggc tgataaaatt aagctggcgc tggaaaaact gaaggaggcc
aaggttaaga agctcgtcgt caaggtgcac
atgaatgata acagcacaaa gtcactgatg
gtggatgagc ggcagctggc ccgagatgtt ctggacaacc
ttttcgagaa aactcattgt
gactgcaatg tagactggtg tctttatgaa atctacccgg aactacaaat tgagaggttt
tttgaagacc atgaaaatgt tgttgaagtc ttatcagact ggacaagaga cacagaaaat
aaaatactat ttttggagaa agaggagaaa tatgctgtat ttaaaaaccc ccagaatttc
tacttggata acagaggaaa aaaagaaagc aaggaaacta atgagaaaat gaatgctaaa
aacaaggaat ccttacttga ggaaagtttc tgtggaacat ctatcattgt accagaactg
gaaggagctc tttatttgaa agaagatgga aagaaatcct ggaaaaggcg ctattttctt
ttacgggctt ctggaattta ttatgtaccc aaaggaaaga ctaagacatc tcgagatctg
gcgtgtttta tacagtttga aaatgtcaac atttactatg ggactcagca taaaatgaaa
tataaagcgc ccactgacta ttgctttgtt ttaaagcacc cccaaattca gaaggagtcc
cagtatatca agtatctctg ctgtgatgac acaagaaccc ttaaccagtg ggtcatggga
atacggatag ccaagtatgg gaagactctc tatgataact accagcgggc
tgtggcaaag
gctggacttg cctctcggtg gacaaacttg gggacagtca atgcagctgc accagctcag
ccatctacag gacctaaaac aggcaccacc cagcccaatg gacagattcc ccaggctaca
cattctgtca gtgctgttct ccaagaggcc cagagacatg ctgaaacatc gaaggataag
aagccagccc tcgggaacca ccacgacccg gcagtgcccc gggccccgca cgcccccaag
agaaatttaa actaacaggc cctattcaag tcatccatct tgcaaaagct tgttgtgatg
tcaaaggagg aaagaatgaa ctgagcttca agcaaggaga gcaaattgaa atcatccgca
tcacagacaa cccagaagga
aaatggttgg gcagaacagc aaggggttca tatggctata
ttaaaacaac tgctgtagag attgactatg
attctttgaa actgaaaaaa gactctcttg
gtgccccttc aagacctatt gaagatgacc aagaagtata tgatgatgtt gcagagcagg
atgatattag cagccacagt cagagtggaa gtggagggat attccctcca ccaccagatg
atgacattta tgatgggatt gaagaggaag atgctgatga tggctccaca ctacaggttc
aagagaagag taatacgtgg tcctggggga ttttgaagat gttaaaggga aaagatgaca
gaaagaaaag tatacgagag aaacctaaag tctctgactc agacaataat gaaggttcat
ctttccctgc tcctcctaaa caattggaca tgggagatga agtttacgat gatgtggata
cctctgattt ccctgtttca tcagcagaga tgagtcaagg aactaatgtt ggaaaagcta
agacagaaga aaaggacctt aagaagctaa aaaagcagga aaaagaagaa aaagacttca
ggaaaaaatt taaatatgat ggtgaaatta gagtcctata ttcaactaaa gttacaactt
ccataacttc taaaaagtgg ggaaccagag atctacaggt aaaacctggt gaatctctag
aagttataca aaccacagat gacacaaaag ttctctgcag aaatgaagaa gggaaatatg
gttatgtcct tcggagttac ctagcggaca atgatggaga gatctatgat gatattgctg
atggctgcat ctatgacaat gactagcact caactttggt cattctgctg tgttcattag
gtgccaatgt
gaagtctgga ttttaattgg catgttattg ggtatcaaga aaattaatgc
aaaaatatct gtggatataa ccctggcttt ggaatcaaaa agtagctggc ctgctagcac
ccaagaaggc ctgcgcattc aaaactggct ttcagcaaaa gttaggaagc aactacgact
aaagccattt
taccttgtac ccaagcatgc aaaggaagga aatggtttcc aagaagaaac
atggcggcta tccttctctc acatcgaaaa ggaaattttg aacaatcatg gaaaatctaa
aacgtgctgt gaaaacaaag aagagaaatg ttgcaggaaa gattgtttaa aactaatgaa
atacctttta gaacagctga aagaaaggtt taaagacaaa aaacatctgg ataaattctc
ttcttatcat gtgaaaactg ccttctttca
cgtatgtacc cagaaccctc aagacagtca
gtgggaccgc aaagacctgg gcctctgctt tgataactgc
gtgacatact ttcttcagtg
cctcaggaca gaaaaacttg agaattattt tattcctgaa ttcaatctat tctctagcaa
aagagttatg ccggcttcct caccgtgaat aagacttaca acagcaacct cttcttctgg
ttcttcccag ctcagataca gccagaagat gccccagtag ttctctggct acagggtggg
ccgggaggtt catccatgtt tggactcttt gtggaacatg ggccttatgt
tgtcacaagt
aacatgacct tgcgtgacag agacttcccc tggaccacaa cgctctccat gctttacatt
gacaatccag tgggcacagg cttcagtttt actgatgata cccacggata tgcagtcaat
gaggacgatg tagcacggga tttatacagt gcactaattc agtttttcca gatatttcct
gaatataaaa ataatgactt ttatgtcact ggggagtctt atgcagggaa atatgtgcca
gccattgcac acctcatcca ttccctcaac cctgtgagag aggtgaagat caacctgaac
ggaattgcta ttggagatgg
atattctgat cccgaatcaa ttataggggg ctatgcagaa
ttcctgtacc aaattggctt gttggatgag
aagcaaaaaa agtacttcca gaagcagtgc
catgaatgca tagaacacat caggaagcag aactggtttg aggcctttga aatactggat
aaactactag atggcgactt aacaagtgat ccttcttact tccagaatgt tacaggatgt
ccgggatctg cacacatctt tttgcagtta cctcttcata gccatgaacc
aaaacgttct
atgaggagca tgcaagtaag tcaagcctcc tattctgtta gtacttatta gaggaggaga
tggttttcat tgcatagtga cattttctta gccttaacgt tctgatagta gcttactact
cacttctctt tttcagtttt cataataagt attcattttt ttgccataat gcttcctgta
aagccaattt
tatatactaa taaaacatga actgcccact cttcatgcct gccaaacttg
gggcaattga tgctaaatgg
tatttttaaa ataaatgttt ttattcttta ctcttgaaaa
aaagctgcta ccaagacagc cacgaagatc ctaccaaaat gaagcgcttc ctcttcctcc
tactcaccat cagcctcctg gttatggtac agatacaaac tggactctca ggacaaaacg
acaccagcca aaccagcagc ccctcagcat ccagcaacat aagcggaggc attttccttt
tcttcgtggc
caatgccata atccacctct tctgcttcag ttgaggtgac acgtctcagc
cttagccctg tgccccctga
aacagctgcc accatcactc gcaagagaat cccctccatc
tttgggaggg
gttgatgcca gacatcacca ggttgtagaa gttgacaggc agtgccatgg
gggcaacagc caaaataggg
gggtaatgat gtaggggcca agcagtgccc agctgggggt
tggcctccgt gtcccttttg tggtggacat
ctgctcaatg acttttgagc agctggatct
cctgcttcgg caggtgagtg aggggatgga tggttccgcg
gactggcccc cgccccagga
gaaagagtgt gtggccgtgg caacgctgaa tcttctacga cttcagttgc atgctgccat
tagtcaccag gttgacccgg aattccttgg tttaggtctg ggcagcatcc tcctgaacag
cctgaagcag acggtggtga ccctggccag cagtgcgggc gtgctgagca ccgtgcagtc
ggccgcccag gccgtgctgc agagtggctg gtccgtgctg ctgcccaccg cggaggagcg
ggcccgggca ctctctgctc tcctgccctg cgcagtttca ggcaatgaag tgaacataag
tccaggtcgt cgattcatga ttgatcttct ggtgggcagc ttgatggctg atggagggtt
ggagtcagcc ttacacgcag ccattactgc agagatccag gatattgaag ccaaaaaaga
agcacagaag gaaaaagaaa ttgatgaaca ggaagcgaat gcctcaacat ttcatagaag
caggactcca ctggataaag accttattaa tacggggatc tgtgagtctt ctggcaaaca
gtgtttgcct ctggttcagc tcatacaaca gcttcttaga aacattgctt ctcagactgt
agccagattg aaagatgttg cccgtcggat ttcatcatgt ctggactttg agcaacacag
tcgtgaaaga tctgcttcat tggatttgtt actgcgtttt caacgtttgc ttattagtaa
actttatcca ggagaaagta ttggtcagac ctcagatatt tctagtccag agctaatgga
tgttggttcc ttgctgaaga agtacacagc cctcctgtgc acgcacattg gagatatact
gcctgtggcc gccagcattg cttctaccag ctggcggcac ttcgcggagg tggcttacat
tgtggaaggg gactttactg gtgttctcct tccagaacta gtagtttcta tagtgcttct
gctcagtaaa aatgctggtc tcatgcaaga ggctggagct gtacctctgc tgggtggcct
gttggaacat ctggatcggt tcaaccatct ggcaccagga aaggaacggg atgatcatga
agagttagcc tggcctggca taatggagtc attttttaca ggtcagaact gtagaaataa
tgaggaagtg acacttatac gcaaagctga tttggagaac cataataaag atggaggctt
ctggactgtg
attgacggga aggtgtatga tataaaggac ttccagacac agtcgttaac
aggaaatagt attcttgctc agtttgcagg ggaagaccca gtggtagctt tggaagctgc
tttgcagttt gaagacaccc gggaatccat gcacgcgttt tgtgttggcc agtatttgga
gcctgaccaa gaaatcgtca ccataccaga tctggggagt ctctcttcac ctctgataga
cacagagagg aatctgggcc tgcttctcgg attacacgct tcgtatttgg caatgagcac
accgctgtct cctgtcgaga ttgaatgtgc caaatggctt cagtcatcca tcttctctgg
aggcctgcag accagccaga tccactacag ctacaacgag gagaaagacg aggaccactg
cagctcccca gggggcacac ctgccagcaa atctcgactc tgctcccaca gacgggccct
gggggaccat tcccaggcat ttctgcaagc cattgcagac aacaacattc aggatcacaa
cgtgaaggac tttttgtgtc aaatagaaag gtactgtagg cagtgccatt tgaccacacc
gatcatgttt ccccccgagc atcccgtgga agaggtcggt cgcttgttgt tatgttgcct
cttaaaacat gaagatttag gtcatgtggc attatcttta gttcatgcag gtgcacttgg
tattgagcaa gtaaagcaca gaacgttgcc taagtcagtg gtggatgttt gtagagttgt
ctaccaagca aaatgttcgc tcattaagac tcatcaagaa cagggccgtt cttacaagga
ggtctgcgct cctgtcatcg aacgtttgag attcctcttt aatgaattga gacctgctgt
ttgtaatgac ctctctataa tgtctaagtt taaattgtta agttctttgc cccgttggag
gaggatagct caaaagataa ttcgagaacg aaggaaaaag agagttccta agaagccaga
atctacggat gatgaagaaa aaattggaaa cgaagagagt gatttagaag aagcttgcat
tttgcctcat agtccaataa atgtggacaa gagacccatt gcaattaaat cacccaagga
caaatggcag ccgctgttga gtactgttac aggtgttcac aaatacaagt ggttgaagca
gaatgtgcag ggtctttatc cgcagtctcc actcctcagt acaattgctg aatttgccct
taaagaagag ccagtggatg tggaaaaaat gagaaagtgc ctactaaaac agttggagag
agcagaggtt cgcctggaag ggatagatac aattttaaaa ctggcgagca agaatttctt
acttccat
ct gtgcagtatg cgatgttttg tggatggcaa agacttattc ctgagggaat
cgatataggg gaacctct
ta ctgattgttt aaaggatgtt gatttgatcc cgccttttaa
tcggatgctg ctggaagtca cctttggcaa gctgtacgct tgggctgtac agaacattcg
aaatgttttg atggatgcca gtgccaaatt taaagagctt ggtatccagc cggttcccct
gcaaaccatc accaatgaga acccgtcagg accgagcctg gggaccatcc cgcaagccca
cttcctcctg gtgatgctca gcatgctcac cctgcagcac ggcgcaaaca acctcgacct
tctgctcaat tccggcatgc tggccctcac gcagacggca ctgcgcctga ttggccccag
ttgtgacaac gttgaggaag atatgaatgc ttctgctcaa ggtgcttctg ccacagtttt
ccagcagtct cactggactg ggttgctatc agaaatggag ttggtaccca gtattcatcc
tggggttacg tgtgatggat gtcagatgtt tcctatcaat ggatccagat tcaaatgcag
aaactgtgat gactttgatt tttgtgaaac gtgtttcaag
accaaaaaac acaataccag
gcatacattt ggcagaataa atgaaccagg tcagtctgcg gtattttgtg
gccgttctgg
aaaacagctg aagcgttgcc acagcagcca gccaggcatg ctgctggaca gctggtcccg
catggtgaag agcctgaatg tgtcgtcctc cgtgaaccag gcatcccgtc tcattgacgg
cagcgagccc tgctggcagt catcggggtc gcaaggaaag cactggattc gtttggagat
tttcccagat gttcttgttc atagattaaa aatgatcgta gatcctgctg acagtagcta
catgccgtcc ctggttgtag tgtcaggtgg aaattccctg aataacctta ttgaactaaa
gacaatcaat attaaccctt ctgacaccac agtgcccctt ctgaatgact gcacagagta
tcacaggtat attgaaattg ctataaagca gtgcaggagc tcaggaatcg attgtaaaat
ccatggtctc atcctgctgg gacggatccg tgcagaagag gaagatttgg ctgcagttcc
tttcttagct tcggataatg aagaggagga ggatgagaaa ggcaacagcg gaagcctcat
tagaaagaag gctgctgggc tggaatcagc agctacgata agaaccaagg tgtttgtgtg
gggcctgaat gacaaggacc agctgggcgg gctgaaaggc tccaagataa aggttccttc
gttctctgag acactgtcag ctttgaatgt ggtacaggtg gctggtggat ctaaaagttt
gtttgcagtg actgtggaag ggaaggtgta tgcctgtgga gaagccacga atggccggct
ggggctgggc atttccagcg ggacggtgcc catcccacgg cagatcacag ctctcagcag
ctacgtggtc aagaaggtgg ctgttcactc aggtggccgg cacgcgacgg ctttaactgt
cgatggaaaa gtgttttcgt ggggcgaagg tgacgatgga aaacttggac acttcagcag
aatgaactgt gacaaaccaa ggctgatcga ggccctgaaa accaagcgta tccgggatat
cgcctgtggg agctcgcaca gcgcagccct cacatccagc ggagaactgt acacctgggg
cctcggcgag tacggccggc tgggacatgg ggataatacg acacagctaa agcccaaaat
ggtgaaagtc cttctcggtc acagagtaat ccaggttgca tgtgggagta gagacgcgca
gaccctggct ctgaccgatg aaggtttggt attttcctgg ggtgatggtg actttggaaa
cagagcaccc gtgcacagca gcatgttccg cttcctgggt gtgttgctgg gcattgccat
ccgaaccggg agtcccctga gcctcaacct tgccgagcct gtctggaagc agctggctgg
gatgagcctc accatcgcgg acctcagtga
ggttgataag gattttattc ctggactcat
gtacatccga gacaatgaag ccacctca a ggagtttgaa
gccatgagcc tgcccttcac
agtgccaagt gccagtggcc aggacattca gttgagctcc aagcacacac acatcaccct
ggacaaccgc gcggagtacg tgcggctggc gataaactat agactccatg aatttgatga
gcaggtggct gctgttcggg aaggaatggc ccgcgttgtg cctgttcccc tcctctctct
gttcaccggc tacgaactgg agacgatggt gtgtggcagc cctgacatcc cgctgcacct
tctcaagtcg gtggccacct ataaaggcat cgagccttcc gcatcgctga tccagtggtt
ctgggaggtg atggagtcct tctccaacac agagcgctct cttttccttc gcttcgtctg
gggccggacg aggctgccca ggaccatcgc cgacttccgg ggccgagact tcgtcatcca
ggtgttggat aaatacaacc ctccagacca cttcctccct gagtcctaca cctgtttctt
cttgctgaag ctgcccaggt attcctgcaa gcaggtgctg gaggagaagc tcaagtacgc
catccacttc tgcaagtcca tagacacaga tgactacgct cgcatcgcac ttacaggaga
gccagccgcc gacgacagca gcgacgattc agataacgag
gatgtcgact cctttgcttc
ggactctaca caagattatt taacaggaca ctaagatggg gaaacgtcct
cgtgagatga
gagcctgagc caggcagcag agcgctcgct gctgtgtaga ctgtaggctg cctggtgtgt
ctgatgagaa gcgtccgtcc tcgagccagg cgggaggagg gagtggagag actgactggc
cgtgatggga atgacagtga gaaggtccgc ctgtgcgcgt ggaacactgt ggacgctcga
cttccaaggg tcttctcacc cgtaatgctg cattacatgt aggactgtgt ttactaaagt
gtgtaaatgt ttatataaat accaaattgc agcatcccca aaatgaataa agccttttta
ggagtaaact gttattattt taccatgcag aagaaagact ggaagggatg taaacagact
tgtcaacatt gtagatcatc ccttttgaag atagatgaca aagatgaact cgtattttac
attcactttt attctcttgg
actctgtttc tcaatgttgg acctaagata ttgaagacag
gctggagccc agagccttca ttcaatctca
gatttatgaa aataattact ggattggatt
atcatatgat gaaagggaaa gtaagtggaa atggattgat aatggcacat ctcctggaat
taattctaca ataatgcgtt tttcttctgg gagaggagaa tgtgcatttt tgacctcaac
aagaatggca actattgatt
gcattcaaac gtacaattgt atctgtggga agagaataga
ctctattttc tctgattcgg tgtgcgccaa
gaagaaaagg tgaaaatgga atgttttctt
ctgcaacatg agctccgcag ccaggtcccg cctcacccgc gccacccgcc
aggagatgct
gttcttggcg ttgctgctcc tgccagttgt ggtcgccttc gccagagctg aagctgaaga
agatggggac ctgcagtgcc tgtgtgtgaa gaccacctcc caggtccgtc ccaggcacat
caccagcctg gaggtgatca aggccggacc ccactgcccc actgcccaac tcatagccac
gctgaagaat gggaggaaaa tttgcttgga tctgcaagcc ctgctgtaca agaaaatcat
taaggaacat ttggagagtt
agctactagc tgcctaagtg tgcactttca atctaactgt
gaaagaatct tctgatgttt gtattatcct
tcttatatta tattaacaaa ataaatcaag
ctcaaaaaa
t gcttcaataa agtaacaggc tattgcagga agaaatgcaa ggtaggagaa
agatatgaaa taggatgtc
t aagtgggaaa ttatgttgtg ctaatgatga agaagagaaa
aaaca
tgtgt catttaagaa gccacatcaa cattctggtg agaagctgag tgtgctgcag
gattacatca tctta
cccac catcaccatt ttcacagtct aa
aaagaactta gatgtcagtg cataaagaca tactccaaac ctttccaccc caaatttatc
aaagaactga gagtgattga gagtggacca cactgcgcca acacagaaat tattgtaaag
ctttctgatg
gaagagagct ctgtctggac cccaaggaaa actgggtgca gagggttgtg
gagaagtttt tgaagagggc tgagaattca taaaaaaatt cattctctgt ggtatccaag
aatcagtgaa gatgccagtg aaacttcaag caaatctact tcaacacttc atgtattgtg
tgggtctgtt gtagggttgc cagatgcaat acaagattcc tggttaaatt tgaatttcag
taaacaatga atagtttttc attgtaccat gaaatatcca gaacatactt atatgtaaag
tattatttat ttgaatctac aaaaaacaac aaataatttt taaatataag gattttccta
gatattgcac gggagaatat acaaatagca aaattgaggc caagggccaa gagaatatcc
gaactttaat ttcaggaatt gaatgggttt gctagaatgt gatatttgaa gcatcacata
aaaatgatgg gacaataaat tttgccataa agtcaaattt agctggaaat cctggatttt
tttctgttaa atctggcaac cctagtctgc tagccaggat ccacaagtcc ttgttccact
gtgccttggt ttctccttta tttctaagtg gaaaaagtat tagccaccat cttacctcac
agtgatgttg tgaggacatg tggaagcact ttaagttttt tcatcataac ataaattatt
ttcaagtgta acttattaac ctatttatta tttatgtatt tatttaagca tcaaatattt
gtgcaagaat ttggaaaaat agaagatgaa tcattgattg aatagttata aagatgttat
agtaaattta ttttatttta gatattaaat gatgttttat tagataaatt tcaatcaggg
tttttagatt aaacaaacaa acaattgggt acccagttaa attttcattt cagataaaca
acaaataatt ttttagtata agtacattat tgtttatctg
aaattttaat tgaactaaca
atcctagttt gatactccca gtcttgtcat tgccagctgt gttggtagtg
ctgtgttgaa
ttacggaata atgagttaga actattaaaa cagccaaaac tccacagtca atattagtaa
agccactaag tcaagctgaa atgagtcctg ctgcatctgc agccatcaga aatgatgtcg
caaagcaacg tgctgcacca aaagtctctc ctcatgttaa gcctgaacag aaacagatgg
tggcccagca ggaatctatc agagaagggt ttcagaagcg ctgtttgcca gttatggtac
tgaaagcaaa gaagcccttc acgtttgaga cccaagaagg caagcaggag atgtttcatg
ctacagtggc tacagaaaag gaattcttct ttgtaaaagt ttttaataca ctgctgaaag
ataaattcat tccaaagaga ataattataa
tagcaagata ttatcggcac agtggtttct
tagaggtaaa tagcgcctca cgtgtgttag atgctgaatc
tgaccaaaag gttaa
tgtcc
cgctgaacat tatcagaaaa gctggtgaaa ccccgaagat caacacgctt caaactcagc
ccctt
ggaac aattgtgaat ggtttgtttg tagtccagaa ggtaacagaa aagaagaaaa
gatccagtca cctctgaaca tgaactgaca tgtcaggctg agggctaccc caaggccgaa
gtcatctgga caagcagtga ccatcaagtc ctgagtggta agaccaccac caccaattcc
aagagagagg agaagctttt caatgtgacc agcacactga gaatcaacac
aacaactaat
gagattttct actgcacttt taggagatta gatcctgagg aaaaccatac agctgaattg
gtcatcccag aactacctct ggcacatcct ccaaatgaaa ggactcactt ggtaattctg
ggagccatct tattatgcct tggtgtagca ctgacattca tcttccgttt aagaaaaggg
agaatgatgg atgtgaaaaa atgtggcatc caagatacaa actcaaagaa gcaaagtgat
acacatttgg aggagacgta atccagcatt ggaacttctg atcttcaagc agggattctc
aacctgtggt ttaggggttc atcggggctg agcgtgacaa gaggaaggaa tgggcccgtg
ggatgcaggc aatgtgggac ttaaaaggcc caagcactga aaatggaacc tggcgaaagc
agaggaggag aatgaagaaa gatggagtca aacagggagc ctggagggag accttgatac
tttcaaatgc ctgaggggct catcgacgcc tgtgacaggg agaaaggata cttctgaaca
aggagcctcc aagcaaatca tccattgctc atcctaggaa gacgggttga gaatccctaa
tttgagggtc agttcctgca gaagtgccct ttgcctccac tcaatgcctc aatttgtttt
ctgcatgact gagagtctca gtgttggaac gggacagtat ttatgtatga gtttttccta
tttattttga gtctgtgagg tcttcttgtc atgtgagtgt ggttgtgaat gatttctttt
gaagatatat tgtagtagat gttacaattt tgtcgccaaa ctaaacttgc tgcttaatga
tttgctcaca tctagtaaaa catggagtat ttgtaaggtg cttggtctcc tctataacta
caagtataca ttggaagcat aaagatcaaa ccgttggttg cataggatgt cacctttatt
taacccatta atactctggt tgacctaatc ttattctcag acctcaagtg tctgtgcagt
atctgttcca
tttaaatatc agctttacaa ttatgtggta gcctacacac ataatctcat
ttcatcgctg taaccaccct gttgtgataa ccactattat tttacccatc gtacagctga
ggaagcaaac agattaagta acttgcccaa accagtaaat agcagacctc agactgccac
ccactgtcct tttataatac aatttacagc tatattttac tttaagcaat tcttttattc
ttcaaaatag ctcctcaaaa atttcatctg agttcaccta
tgcatggtcg gtggaaaaca
gcagcacgga ttcttgggtc cttctttcta agggtataaa ggaggataat
ggaactcagg
aaacacttat ctcccaaaat cacctcatca gcaattccac attacttaaa gatagagtca
agcttggtac agactacaga ctccacctct ctccagtcca aatcttcgat gatgggcgga
agttctcttg ccacattaga gtcggtccta acaaaatctt gaggagctcc accacagtca
aggtttttgc taaaccagaa atccctgtga ttgtggaaaa taactccacg gatgtcttgg
tagagagaag atttacctgc ttactaaaga atgtatttcc caaagcaaat atcacatggt
ttatagatgg aagttttctt catgatgaaa aagaaggaat atatattact aatgaagaga
gaaaaggcaa agatggattt ttggaactga agtctgtttt aacaagggta catagtaata
aaccagccca atcagacaac ttgaccattt ggtgtatggc tctgtctcca gtcccaggaa
ataaagtgtg gaacatctca tcagaaaaga tcacttttct cttaggttct gaaatttcct
caacagaccc tccactgagt gttacagaat ctacccttga cacccaacct tctccagcca
gcagtgtatc tcctgcaaga tatccagcta catcttcagt gacccttgta gatgtgagtg
ccttgaggcc aaacaccact cctcaaccca gcaattccag tatgactacc cgaggcttca
actatccctg gacctccagt gggacagata ccaaaaaatc agtttcacgg atacctagtg
aaacatacag ttcatccccg tcaggtgcag gctcaacact tcatgacaat gtctttacca
gcacagccag agcattttca gaagtcccca caactgccaa tggatctacg aaaactaatc
acgtccatat cactggtatt gtggtcaata agcccaaaga tggaatgtcc tggccagtga
ttgtagcagc tttactcttt tgctgcatga tattgtttgg tcttggagtg agaaaatggt
gtcagtacca aaaagaaata atggaaagac ctccaccttt caagccacca ccacctccca
tcaagtacac ttgcattcaa gagcccaacg aaagtgatct gccttatcat gagatggaga
ccctctagtc tcgtgagact ttgccccatg gcagaactct gctggaatcc tattgagaag
gtagacattg tgctttatta atatagtcgc tcttcagcca tgcctttgct gcagctgaaa
tggaagtcag aagtgagtga cctgttttcc cagcaactca ccctcttcca tctccaaacg
cctgaagctt aaccaagagt gagaggatat gtcatgttca cactcaatgc aattcgtagt
ggttttcttg cttatgtaag aagtacatat tagtctgcca tctttaaaaa aaaatacagt
attttcattt aaattctctg atggagggac aacaatggtt tcaactgtat gcccatgcct
gatcctctta tttgaacatc tatcaacatt gtaaactctt tgccaaaatc ctggggcttt
gctgcattcc ctaagataat tacaggaaaa agaaaatgta aaagtgctaa caaggctgcc
aagtaatgga gaagtatggt tagtcttcat attgaaattc tgttgcttat tttcatggaa
ggaaacagaa tactttgcac aggaaccaca ttttcaatcc tccttcactg tcttcctacc
atgttcagcc cagactcctg ccacatggac caggatgaag agggatcaaa gagataatta
gccaaaaacc cagtagccta gaagatacaa aactccactg gcctctaaaa ttatattagc
caagagtggt ttcatttgag tgccttcgtg tgtatgtcca tcaaactgga accaaactgt
tttgtaagta aacaggcagc ctaagcccaa ccctactttc taattccagt tattctcttt
ttcatctggg gatttacctg ttcatttaat ctgcctgttt tgatctgttt tgaaaaagat
aaagagcctc aaatcagacc agcactgatt aattaaccct gctcctacca atctttttta
aagcagttga agcagaatgt ataggtgtca gagaagaaac ctagtcagcc agacgtgctc
tgtattcagc aatagtttgt gaatgaataa attactaatc ctccttgtcg cttgaaacct
tcccacactc cctgctccag gagggaaaaa cagatgttgt tgacagatag agtgataggc
aaattctgtg tggactttag tcccaaaagg aaactttagt tcacttgcag tatgcttatc
cttgactgca catgagaatg ccttgtgcag agttatttgg agattatgtc tttttcttaa
acaccatggc tgtcacactt cagttcaatt aaatcagaat gtctgaggag tgagacacag
gcatcaacac tctcaaatga ttcacatgtt cagccaaagt tgagaaccat cgagcctgtg
gaagttcttt ctcatggctc agaatcttag gtaggtgctt aactcttgtg gtggccagcc
tccaagatga gccccagtgt tcttgcctcc tactattcac atctttatgt ggtcccctcc
aatgctgaat acagatgatt tgtgtaacct gaggccagga ttaaggggag gcaatcaatg
cacctaggga aaaaatttaa
ggaggtattc acactcaggg tcatgcactt gcacaatgtt
gagaatgagt accactctca ccattggtat
agccaaaaaa gcttggaagt gaccaaggct
aggtcacaaa atacactgtg gcttcttctt tgatctctct ttgaccatac tgacactggg
aaaagcccat tcccatgcca tgaagacacc aaggcagccc tattgagaaa tctacctgtc
gtggccgggc gcagtggctc acgcctgtaa tcccagcact ttgggaggcc gaggtgggtg
ttcatcgaca tcatggacat gcctaacacc aacaagtact cctttgatgg agccaaccct
gtgtggctgg atcccttctg tcggaacctg gagctggccg cccaggcgga gcatgaggat
gacctaccgg agaacctgag tgagatcgcc gacctgtgga acagccccac gcgcacccat
ggaacttttg ggcgtgagcc agcagctgtc aagcctgatg atgaccgata cctgcgggct
gccatccagg agtatgacaa cattgccaag ctgggccaga tcattcgtga ggggccaatc
aagggctcgc
tgctgaaggt ggtcctggag gattacctgc ggctcaaaaa gctctttgca
cagcggatgg tgcaaaaagc
ctcctcctgc cactcctcca tctctgagct gatacagact
gagctggacg aggagccagg agaccacagc ccagggcagg gtagcctgcg cttccgccac
aagccaccag tggagctcaa ggggcccgat gggatccatg tggtgcacgg cagcacgggc
acgctgctgg ccaccgacct caacagcctg cccgaggaag accagaaggg cctgggccgc
tcgctggaga cgctgaccgc tgccgaggcc actgccttcg agcgcaacgc ccgcacagaa
tccgccaaat ccacacccct gcacaaactt cgcgacgtga tcatggagac ccccctggag
atcacagagc tgtgactaga cagggaagcc ttgtgggtgt gagcagcacc catccaccgt
cccctcccag ggagcaaggg cagggacagg gccggtcggg ggggaccctc caaggccagg
ccttggggac aaccttggct tggccctggc agcccgcatc agctgctcag
atcccacttt
tgccagacgc tcattcagca tctgacctct accttcataa gatctgttat ttttataaga
aaaccaaaca aaaatgttaa gcatctaagg acaaggtaag gagggtcact ggggcccaag
agtctgggga ccagcttggc tcaggctgag ctgaaagagg ccaaacaggc cctcctccct
cccagctcca ccccgcaagc accatcccct ccggctaagc aggcgcaagg gaggcccagc
gcggacatcc cctgctggcc ggacacccga ctccagtcca agtctcgcta catttccgcc
acatccctct ctgctggacg tccaggtgga ggtggcatcc ccacgtggac aagaaagtca
atgtcaatga acaagcattc tctccatttc actggcttcc caaatgtgtg cccagcttat
aggttctgag accagtgtat caggtcaggg acttggacag gagtcagtgt ctggcttttt
cctctgagcc cagctgcctg gagagggtct cgctgtcact ggctggctcc taggggaaca
gaccagtgac cccagaaaag cataacacca
atcccagggc tggctctgca ctaagagaaa
attgcactaa atgaatctcg ttcccaaaga actaccccct
tttcagctga gccctgggga
ctgttccaaa gccagtgaaa tgtgaaggaa agtggggtcc ttcggggcga tgctccctca
gcctcagagg agctctaccc tgctccctgc tttggctgag gggcttggga aaaaaacttg
gcactttttc gtgtggatct tgccacattt ctgatcagag gtgtacacta acatttcccc
cgagctcttg gcctttgcat ttatttatac agtgccttgc tcggcgccca ccaccccctc
aagccccagc
agccctcaac aggcccaggg agggaagtgt gagcgccttg gtatgactta
aaattggaaa tgtcatctaa
ccattaagtc atgtgtgaac acataaggac gtgtgtaaat
atgtacattt gtctttttat aaaaagtaaa ttgtttataa ggggtgtggc ctttttagag
aaatctcata
agctggtacc actcaggaga agtttattct tccagatgac cagcagtaga
caaatggata ctgagcagag
tcttaggtaa aagtcttggg aaatatttgg gcattggtct
ggccaagtct acaatgtccc aatatcaagg acaaccaccc tagcttctta gtgaagacaa
tgtacagtta tccgttagat caagactaca cggtctatga gcaataatgt gatttctgga
cattgcccat gtataatcct cactgatgat ttcaagctaa agcaaaccac cttatacaga
gatctagaat ctctttatgt tctccagagg aaggtggaag aaaccatggg caggagtagg
aattgagtga taaacaattg ggctaatgaa
gaaaacttct cttattgttc agttcatcca
gattataact tcaatgggac actttagacc attagacaat tgacactgga ttaaacaaat
tcacataatg ccaaatacac aatgtattta tagcaacgta taatttgcaa agatggactt
taaaagatgc tgtgtaacta aactgaaata attcaattac ttattattta gaatgttaaa
ctacaaggca aaagtaagac aaatgcagaa tacaggagaa tctgaagcta gtggaaataa
agagatgatt gaagcactaa agcaatatgc tatggactat tcgaataaag ctcttgagaa
gggactgaat cctctgaatg catactccga tctcgctgag ttcctggaga cggaatgtta
tcagacacca
ttcaataagg aagtccctga tgctgaaaag caacaatccc atcagcgcta
ctgcaacctt cagaaatata atgggaagtc tgaagacact gctgtgcaac atggtttaga
gggtttgtcc ataagcaaaa aatcaactga caaggaagag atcaaagacc aaccacagaa
tgtatctgaa aatctgcttc cacaaaatgc accaaattat tggtatcttc aaggattaat
tcataagcag aatggagatc tgctgcaagc agccaaatgt tatgagaagg aactgggccg
cctgctaagg gatgcccctt caggcatagg cagtattttc ctgtcagcat ctgagcttga
ggatggtagt gaggaaatgg gccagggcgc agtcagctcc agtcccagag agctcctctc
taactcagag caactgaact
gagacagagg aggaaaacag agcatcagaa gcctgcagtg
gtggttgtga cgggtaggac gataggaaga
cagggggccc caacctggga ttgctgagca
gggaagcttt gcatgttgct ctaaggtaca tttttaaaga gttgtttttt ggccgggcgc
acatccgcag cgagacctcc gtgcccgacc atgtcgtctg gtccctgttc aacaccctct
tcatgaaccc ctgctgcctg ggcttcatag cattcgccta ctccgtgaag tctagggaca
ggaagatggt tggcgacgtg accggggccc
aggcctatgc ctccaccgcc aagtgcctga
acatctgggc cctgattctg ggcatcctca
tgaccattct gctcatcgtc atcccagtgc
tgatcttcca ggcctatgga tagatcagga ggcatcactg aggccaggag ctctgcccat
gacctgtatc ccacgtactc
caacttccat tcctcgccct gcccccggag ccgagtcctg
cttatggctg ctacttcgtc ccagcatttt cggggttata tgcaccttat tgggagccca
gcgcaagagg gataatctgt ggactcactc agttcaccaa taaatgccat attgcttttg
ctgcattaga agctgtttgt ttccaaactc gagagatttt ggatgccatg aatcgagact
gtggaattcc actcagtcat ttgcaggtag atggaggaat
gaccagcaac aaaattctta
tgcagctaca agcagacatt ctgtatatac cagtagtgaa gccctcaatg
cccgaaacca
ctgcactggg tgcggctatg gcggcagggg ctgcagaagg agtcggcgta tggagtctcg
aacccgagga tttgtctgcc gtcacgatgg agcggtttga acctcagatt aatgcggagg
aaagtgaaat tcgttattct acatggaaga aagctgtgat gaagtcaatg ggttgggtta
caactcaatc tccagaaagt ggtattccat aaaacctacc
aactcatgga ttcccaagat
gtgagctttt tacataatga aagaacccag caattctgtc tcttaatgca
atgacactat
tcatagactt tgattttatt tataagccac ttgctgcatg accctccaag tagacctgtg
gcttaaaata aagaaaatgc agcaaaaaga atgctataga aatatttggt ggtttttttt
ataacagtgg tgacacctgg gtccagaatt gtcaacagtg ccgctgcttg caaggggaag
ttgattgttg gcccctgcct tgcccagatg tggagtgtga attcagcatt ctcccagaga
atgagtgctg
cccgcgctgt gtcacagacc cttgccaggc tgacaccatc cgcaatgaca
tcaccaagac ttgcctggac gaaatgaatg tggttcgctt caccgggtcc tcttggatca
aacatggcac tgagtgtact ctctgccagt gcaagaatgg ccacatctgt tgctcagtgg
atccacagtg ccttcaggaa ctgtgaagtt aactgtctca tgggagattt ctgttaaaag
aatgttcttt cattaaaaga ccaaaaagaa gttaaaactt aaattgggtg atttgtgggc
agctaaatgc agctttgtta atagctgagt gaactttcaa ttatgaaatt tgtggagctt
gacaaaatca caaaaggaaa attactgggg caaaattaga cctcaagtct gcctctactg
tgtctcacat caccatgtag aagaatgggc gtacagtata
taccgtgaca tcctgaaccc
tggatagaaa gcctgagccc attggatctg tgaaagcctc tagcttcact
ggtgcagaaa
attttcctct agatcagaat cttcaagaat cagttaggtt cctcactgca agaaataaaa
gaccgcatga tgaagaaact ggacaccaac agtgatggtc agctagattt ctcagaattt
cttaatctga
ttggtggcct agctatggct tgccatgact ccttcctcaa ggctgtccct
tcccagaagc ggacctgagg
accccttggc cctggccttc aaacccaccc cctttccttc
cagcctttct gtcatcatct ccacagccca cccatcccct gagcacacta accacctcat
gcaggcccca
cctgccaata gtaataaagc aatgtcactt ttttaaaaca tgaaa
ataacagcaa gagggaacct ggcaaggaag ctattcctat aatccaggaa agagatgagg
aaggcttgga
ccaggtggta gtggtgtcag gtagtcaaat gctgggtata ttttgaagat
acaccccata ggatttgctc cacattgaat gtggaatgct ggaagagaga taaagtgtac
ctgtcacata ctttttgagt tttatttatt ttcttagaag taagtacaca aagagatgct
acctaggaga agggtattct tttcactatt ctttcaaatt ttctgtatgt tcgaacattt
tcatagtaga aagttggggg gaaaatctgt ttcataaaca tttcctcagc agcagtccag
tctattgcat tttaattggt tgtgatatca ttgttttatg caatacgttc tcaacaagta
tatcctccgg caaactgaac aaggaccaag tctgttctgc ctacagctct gcttcctcat
agctgctttc cagaacgtga ctcttgcaaa ttatcaagaa aggggaacta atctaaggga
tccagatcaa acagcctcat gaagacttat tttatgtttc taatataaag atagaagttt
tcagaaaagc cctgctacac agaggatcag agcaggggtg ggcctgctgg gctgcagctg
ggattctgag catcctttcc cggaggcacg gaaagtgagt gagtgagccc agtgaggaag
aagttgaagc tttgatatga gtaaacaagt atctctacct gaaatgatta aagactggac
caaagagcat gtgaaaaaat gggtaaatga agaccttaag attaatgagc aatacgggca
aattctgctc agtgaagaag taacaggatt agtcctgcag gaattaactg agaaggacct
tgtagaaatg gggctaccat ggggtccagc acttttgata aaacgttcat acaacaaatt
gaatagtaag tcccctgaaa gtgacaatca tgatccggga caattagata attcaaaacc
gtccaaaaca gaacaccaga aaaatccaaa acacaccaaa aaggaagaag aaaattcaat
gtcatctaat attgattatg atcccagaga gatcagagat atcaaacaag aagaatcaat
tcttatgaaa gaaaatgtgt tagatgaagt agcaaatgct aaacacaaga aaaagggtaa
gctaaaacct gaacaattga cttgtatgcc atatcctttt gatcagttcc atgacagcca
tcgctacata gaacattata ctctacaacc tgaaacagga gcactcaatc tcattgatcc
aatacatgag ttcaaagctc tcacaaacac agaaacagcc acggaagtgg acattaagat
gaaattcagc aatgaagtct tccgatttgc atcagcttgt atgaattcac gcaccaatgg
caccatccat tttggagtca aggacaaacc ccatggagaa attgttggtg tgaaaatcac
cagtaaggct gccttcattg accacttcaa tgtaatgatc aaaaagtatt ttgaagaaag
tgagatcaat gaagccaaga agtgtattcg ggagccaagg tttgtggaag tccttctgca
gaacaataca ccatctgaca gatttgtcat tgaagttgat actattccaa aacactctat
atgtaatgat aagtatttct acattcagat gcaaatttgt aaagataaaa tatggaaaca
aaaccaaaat ctttcactgt ttgtaagaga aggggctagc tctagggata tcctggccaa
ttccaagcaa cgggatgtag atttcaaggc atttttacaa aatttaaagt cactggtagc
atctagaaaa gaggctgaag aagagtatgg aatgaaggca atgaagaagg agagtgaagg
actaaagctg gttaaacttc tcataggaaa ccgagactca ctggataatt catactatga
ctggtacatt cttgtaacaa ataaatgcca tccaaaccaa ataaagcact tagatttttt
aaaagaaatt aaatggtttg ctgtgttgga gtttgatcct gaatctatga tcaatggagt
ggtcaaagct tacaaagaaa gtcgggtggc aaaccttcac tttccaaatc aatatgaaga
caagacaact aacatgtggg agaagatttc tactcttaat ctttaccaac agcccagctg
gattttctgc aacggcagat cagacctgaa aagcgagaca tataaacctc tagaaccaca
tttatggcag agagaaagag cttcagaagt caggaaacta attttatttc tcacagatga
aaatataatg acaagaggaa aatttttggt agtgtttcta ttactctctt cagtggaaag
cccaggagat ccactcattg aaactttctg ggctttctat caagctctca aaggaatgga
aaatatgttg tgtatctctg taaactcaca tatttatcaa cgatggaaag atctactaca
aacaagaatg aagatggaag atgaactaac aaaccacagt atttccactt taaatataga
actggtaaac agcactatcc ttaaactaaa atcggtgact cggtcatcaa gaaggttttt
gcccgcccgt ggatcttctt cagttatcct agagaaaaag aaagaggatg tcttgactgc
actggaaatc ctctgtgaaa atgagtgtac agagacagac atcgagaaag acaaatctaa
attcctggag
tttaagaaat caaaagaaga acacttttat cgaggtggca aagtatcctg
tgaaccctac acgaagaaag aactttctgc tgttactttc
cctgacatca ttcgcaatta
caaagtcatg gctgctgaga atattcctga gaatcccctg aagtatctgt
atccaaatat
tgacaaagac catgcctttg gaaagtatta ctccaggcca aaggaagcac cagagccaat
ggaacttgat ggccctaaag gaactggata tatcaagact gagttgattt ctgtgtctga
agttcaccct tctagacttc agaccacaga caacctgctc cccatgtctc ctgaggagtt
tgacgaggtg tctcggatag tgggctctgt agaattcgac agtatgatga acacagtata
gagcatgaat ttttttcatc ttctctggcg acagttttcc ttctcatctg tgattccctc
ctgctactct gttccttcac atcctgtgtt tctagggaaa tgaaagaaag gccagcaaat
tcgctgcaac ctgttgatag caagtgaatt tttctctaac tcagaaacat cagttactct
gaagggcatc atgcatctta ctgaaggtaa aattgaaagg cattctctga agagtgggtt
tcacaagtga aaaacatcca
gatacaccca aagtatcagg acgagaatga gggtcctttg
ggaaaggaga agttaagcaa catctagcaa
atgttatgca taaagtcagt gcccaactgt
tataggttgt tggataaatc agtggttatt tagggaactg cttgacgtag gaacggtaaa
tttctgtggg agaattctta catgttttct ttgctttaag tgtaactggc agttttccat
tggtttacct gtgaaatagt tcaaagccaa gtttatatac aattatatca gtcctctttc
gtgtgactac ccagaaagtt atagaggaag cccactaaag gactttcaca tgtctgaatt
atcctgcaac ataactctgc tgatcgtcac catcggtgcc accatgctgg tgttggctgt
gactgtgacc tccctctgca tctacttgga tctgccctgg tatctcagga tggtgtgcca
gtggacccag
actcggcgca gggccaggaa cataccctta gaagaactcc aaagaaacct
ccagtttcat gcttttattt catatagtga acatgattct gcctgggtga aaagtgaatt
ggtaccttac ctagaaaaag aagatataca gatttgtctt catgagagaa actttgtccc
tggcaagagc attgtggaaa atatcatcaa ctgcattgag aagagttaca agtccatctt
tgttttgtct cccaactttg tccagagtga gtggtgccat tacgaactct attttgccca
tcacaatctc tttcatgaag gatctaataa cttaatcctc atcttactgg aacccattcc
acagaacagc
attcccaaca agtaccacaa gctgaaggct ctcatgacgc agcggactta
tttgcagtgg cccaaggaga
aaagcaaacg tgggctcttt tgggctaaca ttagagccgc
tcgttttaca tatgcttata aaagaagtga tgtatcagta atgtatcaat aatcccagcc
cagtcaaagc accgccacct gtaggcttct gtctcatggt aattactggg
cctggcctct
gtaagcctgt gtatgttatc aatactgttt cttcctgtga gttccattat ttctatctct
tatgggcaaa gcattgtggg taattggtgc tggctaacat tgcatggtcg gatagagaag
tccagctgtg agtctctccc caaagcagcc ccacagtgga gcctttggct ggaagtccat
gggccaccct gttcttgtcc atggaggact ccgagggttc caagtatact cttaagaccc
actctgttta aaaatatata ttctatgtat gcgtatatgg aattgaaatg tcattattgt
aacctagaaa gtgctttgaa atattgatgt ggggaggttt attgagcaca agatgtattt
cagcccatgc cccctcccaa aaagaaattg ataagtaaaa gcttcgttat acatttgact
aagaaatcac ccagctttaa agctgctttt aacaatgaag attgaacaga gttcagcaat
tttgattaaa ttaagacttg ggggtgaaac tttccagttt actgaactcc agaccatgca
tgtagtccac tccagaaatc atgctcgctt cccttggcac accagtgttc tcctgccaaa
tgaccctaga ccctctgtcc tgcagagtca gggtggcttt tcccctgact gtgtccgatg
ccaaggagtc ctggcctccg cagatgcttc attttgaccc ttggctgcag tggaagtcag
cacagagcag
tgccctggct gtgtccctgg acgggtggac ttagctaggg agaaagtcga
ggcagcagcc ctcgaggccc tcacagatgt ctaggcaggc ctcatttcat cacgcagcat
gtgcaggcct ggaagagcaa agccaaatct cagggaagtc cttggttgat gtatctgggt
ctcctctgga gcactctgcc ctcctgtcac ccagtagagt aaataaactt ccttggctcc
taccacctcg aactttgaca gcgacaagaa gtggggcttc tgcccggacc aaggatacag
tttgttcctc
gtggcggcgc atgagttcgg ccacgcgctg ggcttagatc attcctcagt
gccggaggcg ctcatgtacc ctatgtaccg cttcactgag gggcccccct tgcataagga
cgacgtgaat ggcatccggc acctctatgg tcctcgccct gaacctgagc cacggcctcc
aaccaccacc acaccgcagc ccacggctcc cccgacggtc tgccccaccg gaccccccac
tgtccacccc tcagagcgcc ccacagctgg ccccacaggt cccccctcag ctggccccac
aggtcccccc actgctggcc cttctacggc cactactgtg cctttgagtc cggtggacga
tgcctgcaac gtgaacatct tcgacgccat cgcggagatt gggaaccagc tgtatttgtt
caaggatggg aagtactggc gattctctga gggcaggggg agccggccgc agggcccctt
ccttatcgcc gacaagtggc ccgcgctgcc ccgcaagctg gactcggtct ttgaggagcg
gctctccaag aagcttttct tcttctctgg gcgccaggtg tgggtgtaca caggcgcgtc
ggtgctgggc ccgaggcgtc tggacaagct gggcctggga gccgacgtgg cccaggtgac
cggggccctc cggagtggca gggggaagat gctgctgttc agcgggcggc gcctctggag
gttcgacgtg aaggcgcaga tggtggatcc ccggagcgcc agcgaggtgg accggatgtt
ccccggggtg cctttggaca cgcacgacgt cttccagtac cgagagaaag cctatttctg
ccaggaccgc
ttctactggc gcgtgagttc ccggagtgag ttgaaccagg tggaccaagt
gggctacgtg acctatgaca
tcctgcagtg ccctgaggac tagggctccc gtcctgcttt
ggcagtgcca tgtaaatccc cactgggacc aaccctgggg aaggagccag tttgccggat
gacagccata cactgcttcc cttatgtgaa gaagcgcatc
cctgtcatgt accagcacca
cactgacctg aaccccatcg aggtggccat tgacgagatg agtaagaagg tggcggagct
ccggcagctg tgctcctcgg ccgaggtgga catgatcaaa ctgcagctca aactccaggg
cagcgtgagt gttcaggtca atgctggccc actagcatat gcgcgagctt tcttagatga
tacaaacaca aagcgatatc ctgacaataa agtgaagctg cttaaggaag ttttcaggca
atttgtggaa gcttgcggtc aagccttagc ggtaaacgaa cgtctgatta aagaagacca
gctcgagtat caggaagaaa tgaaagccaa ctacagggaa atggcgaagg agctttctga
aatcatgcat gagcagctgg gatgatctgc cccctggagg agaagacgag cgtcttaccg
aattcccttc
acatcttcaa cgccatcagt gggactccaa caagcacaat ggttcacggg
atgaccagct cgtcttcggt
cgtgtgatta catctcatgg cccgtgtgtg gggacttgct
ttgtcatttg caaactcagg atgctttcca aagccaatca ctggggagac cgagcacagg
gacggtcccc ctggacccat caggtggttc cagggagctg gtctgagccg ggaggccatt
tacaactttg gaggcatctc ccaccccaag gagacagcgg tgcaggcctc caacaatgac
ttcagcattc
ttctgcaaaa cgtctccagt gaggatgcag gcacctatta ctgtgtaaag
tttcagagga aacccaacag
gcaatacctg tctggacagg gcaccagcct gaaagtgaaa
gcaaaatcta cctcttccaa agaggcagaa ttcaccagtg aacctgcaac tgagatgtct
ccaacaggcc tcctggttgt gttcgcacct gtggtcctgg ggctgaaggc aattaccttg
gctgcactcc tactggccct ggctacctct cggaggagcc ctgggcaaga agatgtcaag
accacaggcc cagcaggagc catgaacacc ttagcatgga gcaagggtca agagtgaggg
gtcagcccca gagtgaggac cctctgagtt ggagaggagc cagggctcct caaccatttc
cctacctcca gtcccagcct ctaggtgccc ccaggcctca tgacaaactc ctagatccct
acatctggtt ttggtccacc tagtgaaatt cccttctttg caccgggctt ccctctaaaa
tgtctccctt tctctttttg gcctgttcaa gacctccttg cttttcagtc cctggctcag
tctctcctca acacccttgc ccctgctgca gccctttctg gtgcgccctg cccctttccc
cacctcgcta catccttctt ggcctccaac atccaactca gagtcttctt cccaggagat
gtctgtaaga atctctgaac tcaaccagcc agaccatctg tgcccctcca tctacacctt
tctccccact ccttcctgcc ttccttccat ccccctcatg gctggcttgg gcaggtataa
tattagaatg caggttcagc aactataaca aagctcttaa ataacagtgg cttaaaccag
tggaaatcaa
ccagaaagtt gaccatcagc aggccaagca atacagagac tccctggtat
tgagacccag gattcactga tctcattgct accaggtcca ccttctaggc agccagactg
gaaaagaggg caggaaaggg gagcaggacc ctccccttta agtgcacagt caggaacttg
tgggccattt aacctggatg ccaccatttt atggggataa tgatgcttac catggt
taat
gttttggaag agctttttat ttatagcatt gtttactcag
tcaagttcac catggccgta
atccttctaa gggaaacact aaagttgttg tagtctccac ttcagtcaga
aactgatgtt
tcagctaggc acagtggtac atgcctgtaa tcccagctac ttgggaggct gaggtgggag
gatcacttga actcaggagt ttgagagcag ccagggcaac acagcgagac cctgtctcaa
tgaggaagat cattgggcga tacggtctca ctgggaaaca acaggtgagc ccaatccgga
acttgtcaga cgggcagaag tgccgagtgt gtctggcctg gctggcctgg cagaaccccc
acatgctctt cctggatgaa cccaccaatc acctggatat cgagaccatc gacgccctgg
cagatgccat caatgagttt gagggtggta tgatgctggt
cagccatgac ttcagactca
ttcagcaggt tgcacaggaa atttgggtct gtgagaagca gacaatcacc
aagtggcctg
gagacatcct ggcttacaag gagcacctca agtccaagct ggtggatgag gagccccagc
tcaccaagag gacccacaac gtgtgcaccc tgacattggc atctctgcca aggccatgag
catcatgaac tcgtttgtaa acgacgtgtt tgagcagctg gcgtgtgagg ctgcccggct
ggcccagtac
tcgggccgga ccaccctgac atcccgagaa gtccagacgg ctgtgcgtct
gctgctgcct ggggagctgg ccaagcacgc tgtgtctgag ggcaccaagg ctgtcaccaa
gtacaccagc tccaagtgac ccagggcctg acaaaaataa agggtgaact gttaaaaaaa
atattttaga agctgtgtga ctgctttaat aactttttcc cagtgttatt tgaatcatac
tacccgttat actaaagctg aatgacaatt gtgtgaaagt
tactgccttc ataagatcaa
gtcaccactg ttacacagct gacatatagt gtattacctt tgcagctagt
aaactataaa
gtttagatat tgaatctcgt tacagggtta tttatataat gtgacattat tcagtactga
cagactacat gaagtagttt taaaatctag tgctattttt attttaaagg ttagcaatga
ggaggaaatg
tgatctggct gtgtttgtct tctgtacaaa gcctgaagtg cttatggttt
tttggctaac
agccacagag ggcaaagttt aagactttct tgtaaggact aactgttctt
ttcaagctac tgtttgtttt tctaaaagca ggatttgctt ccgtaggagg caagttcctt
gatgtggaat agtgcaacct gtatatgggt tattataata ggaaagacat ttgtacttgc
acagtttaaa tcattcttaa attttgaaca tgtgaattgt cccaaaaaat ctttaatttt
aagatggcaa gagtaccgcg acagacatca ccggccccat catcctgcag acgtaccgcg
ccattgccga ctacgagaag acctcgggct ccgagatggc tctgtccacg ggggacgtgg
tggaggtcgt ggagaagagc gagagcggtt
ggtggttctg tcagatgaaa gcaaagcgag
gctggatccc agcatccttc ctcgagcccc tggacagtcc
tgacgagacg gaagaccctg
agcccaacta tgcaggtgag ccatacgtcg ccatcaaggc ctacactgct gtggaggggg
acgaggtgtc cctgctcgag ggtgaagctg ttgaggtcat tcacaagctc ctggacggct
ggtgggtcat caggaaagac
gacgtcacag gctactttcc gtccatgtac ctgcaaaagt
cggggcaaga cgtgtcccag gcccaacgcc
agatcaagcg gggggcgccg ccccgcaggt
cagggtgtgc ccagcccggc cctggctcag ggaggacctc agggttcctg gagattcctg
cagtggaact ccatgccccg cctcccaacg gacctggacg tagagggccc ttggttccgc
cattatgatt tcagacagag ctgctgggtc cgtgccatat cccaggagga ccagctggcc
ccctgctggc aggctgaaca ccctgcggag cgggtgagat cggctttcgc tgcacccagc
actgattccg accagggcac ccccttcaga gctagggacg aacagcagta tgctcccacc
tcagggcctt
gcctctgcgg cctccacttg gaaagttctc agttccctcc
aggcttctag
aagcatctgg gccagggctc atggctggat aatttcccta ggcttaacaa cccaagcaag
cttcgcgtcc tcgttttatt tttggttaaa cttatgaaaa tgtattaaga aagagtgcag
ctcgagagag attcagagat ggaacacacc agaccccaga tcacaaagcc aaccatgccc
agcccctccc agcaccccca gccccacgac catcgttctg aattctgacg acaccgtgag
acattatagc cagaatctgt atcacagagg tgcaagctga cagcagagct cagtccccac
ttcctgcaaa caatggcctg caccctatcc cttgtgtgtg
tgacattctc tcatgggaca
atgttggggt ttttcagact gacaggactg caagagggag aaaggaattt
tgtcaatcaa
aattattctg tattgcaact tttctcagag attgcaaagg attttttagg tagagattat
ttttccttat gaaaaatgat ctgttttaaa tgagataaaa taggagaagt tcctggctta
acctgttctt acatattaaa gaaaagttac ttactgtatt tatgaaatac tcagcttagg
catttttact ttaaccccta aattgatttt gtaaatgcca caaatgcata gaattgttac
caacctccaa agggctcttt aaaatcatat tttttattca tttgaggatg tcttataaag
actgaaggca aaggtcagat tgcttacggg tgttattttt ataagttgtt gaattcctta
atttaaaaaa gctcattatt ttttgcacac
tcacaatatt ctctctcaga aatcaatggc
atttgaacca ccaaaaagaa ataaagggct gagtgcggtg
gctcacgcct gtaatcccag
cactttgggg agcccaggcg ggcagattgc ttgaacccag gagttcaaga ccagcctggg
cagcatggtg aaaccctgta tctacaaaaa atacaaaaat tagccaggca tggtggtggg
tgcctgtagt tccagctact tgggaggctg aggtgggaaa atgacttgag cccaggagga
ggaggctgca gtgagctaag attgcaccac tgcactccaa cctgggcgac aagagtgaaa
ctgtgtctct caaaaaaaaa aaaaaacaaa caaaaacaaa aacaaaacaa aacaaaacaa
tgactcacta cagcctctgc ctcccaggtt caagcgattc tcatgcctta gcctcccaag
tagctgggat tacaggtgtg tgccgcaatg cccagctaat ttttgtgttt ttagtagaga
cagggtttca ctatgttggc caggctggtc tcgaactcct ggcctcacgt gatctgtcca
cttcagccgt ccaaagtgct gggattacaa gcgtgagcca ctgtgctggg cccgaggccc
tgacttcttg ctgtaacttt ccatgcattt tttttaaaag gagcagtgtg gattttcgca
ccctttgtga actaagttca atgcgctcta tccaaatttg cctaattgaa ctataagaaa
gtaataattc
cattttctat cccctcaggg actgaacaaa tggaaataac tcccaggcag
tatcaggtgg tcactacaga
gacttccaca aaaacttttg aatgatgtga aacacgatgt
catgaataag ggttgagcca actatagctc tgtgttccta
ctgggctttc cctaatgtgg
ttgggagtta tgccctagac taactgtatt gtcctagtca cagctccttg
ctttgatttc
atccttgata aaatgaagat gaaacttaca ctacttctcc aagccttttg ctgtcttaag
aataagacct gagattaaca ctaaccctag aatagaaatg taatagggag atggtaataa
aggagttttt ctggcacata ccctccctac agaatttctg ttgctcccca gatccagtga
agaattgcag tttcatttat tttgtaccag tcagctctta attaagtaca tgaatggaga
ggaacagtgg tgcacataat ccaaatcagt gaataccatt ttctggtgaa ttacccaccc
ctttgcccct gctaccccga gggttaccat gattgtcaac agcagcagga gcccttccac
gtgccgtatc acgaatttct aagaaagaaa ttggtcggtg ttttaaactt attttgaaag
cgctagaaac cagtgtggat ttgattacaa ctggggactt catgtccagg ttctgttcca
acctttgtct
tcctaaacaa gtacagatgg cagctacaca tatagcccgt aaagctgtgg
aattggactt ggttcctggg aggagcccca
tctctgtggc agcggcagct atttacatgg
cctcacaggc atcagctgaa aagaggaccc aaaaagaaat
tggagatatt gctggtgttg
ctgatgttac aatcagacag tcctatagac tgatctatcc tcgagcccca gatctgtttc
ctacagactt caaatttgac accccagtgg acaaactacc acagctataa attgaggcag
ctaacgtcaa attcttgaat acaaaacttt gcctgttgta catagcctat acaaaatgct
gggttgagcc tttcatgagg aaaaacaaaa gacatggtac gcattccagg gctgaatact
cagtccaccg tccccatcgt gggcattgtt gctggcctgg ctgtcctagc agttgtggtc
atcggagctg tggtcgctgc tgtgatgtgt aggaggaaga gttcaggtgg aaaaggaggg
agctactctc aggctgcgtg cagcgacagt gcccagggct ctgatgtgtc tctcacagct
tgaaaagcct
gagacagctg tcttgtgagg gactgagatg caggatttct tcacgcctcc
cctttgtgac ttcaagagcc tctggcatct ctttctgcaa aggcacctga atgtgtctgc
gtccctgtta
gcataatgtg aggaggtgga gagacagccc acccttgtgt ccactgtgac
ccctgttccc atgctgacct
gtgtttcctc cccagtcatc tttcttgttc cagagaggtg
gggctggatg tctccatctc tgtctcaact ttacgtgcac tgagctgcaa cttcttactt
gaatggctgc gacatggggc ccgacggacg cctcctccgc gggtatcacc agcacgcgta
cgacggcaag gattacatct ccctgaacga ggacctgcgc tcctggaccg cggcggacac
cgtggctcag
atcacccagc gcttctatga ggcagaggaa tatgcagagg agttcaggac
ctacctggag ggcgagtgcc
tggagttgct ccgcagatac ttggagaatg ggaaggagac
gctacagcgc gcagatcctc caaaggcaca cgttgcccac caccccatct ctgaccatga
ggccaccctg aggtgctggg ccctgggctt ctaccctgcg
gagatcacgc tgacctggca
gcgggatggg gaggaacaga cccaggacac agagcttgtg gagaccaggc
ctgcagggga
tggaaccttc cagaagtggg ccgctgtggt ggtgcctcct ggagaggaac agagatacac
atgccatgtg cagcacgagg ggctgcccca gcccctcatc ctgagatggg agcagtctcc
ccagcccacc atccccatcg tgggcatcgt tgctggcctt gttgtccttg gagctgtggt
cactggagct gtggtcgctg ctgtgatgtg gaggaagaag agctcagata gaaacagagg
gagctactct caggctgcag cctactcagt ggtcagcgga aacttgatga taacatggtg
gtcaagctta tttctcctgg gggtgctctt ccaaggatat ttgggctgcc tccggagtca
cagtgtcttg ggccgccgga aggtgggtga catgtggatc ttgttttttt tgtggctgtg
gacatctttc aacactgcct tcttggcctt gcaaagcctt cgctttggct tcggctttag
gaggggcagg agcttccttc ttcgttcttg gcaccatctt atgaaaaggg tccagattaa
gatttttgac tgagtcattc taaagtaagt tgcaagaccc atgatactag accactaaat
acttcatcac acacctccta agaataagaa ccaacattat cacaccaaag aaaataaata
gaaggtcagc attcaaagtc aagaagcgcc atttatcttc ccgtgcgctc tacaaatagt
tccgtgagaa agatggccgg gaactcgatc ctgctggctg ctgtctctat tctctcggcc
tgtcagcaaa gttattttgc tttgcaagtt ggaaaggcaa gattaaaata caaagttacg
cccccagcag
tcactgggtc accagagttt gagagagtat ttcgggcaca acaaaactgt
gtggagtttt atcctatatt
cataattaca ttgtggatgg ctgggtggta tttcaaccaa
gtttttgcta cttgtctggg tctggtgtac atatatggcc gtcacctata cttctgggga
tattcagaag ctgctaaaaa
acggatcacc ggtttccgac tgagtctggg gattttggcc
ttgttgaccc tcctaggtgc cctgggaatt
gcaaacagct ttctggatga atatctggac
ctcaatattg ccaagaaact gaggcggcaa ttctaacttt ttctcttccc tttaatgctt
gcagaagctg ttcccaccat gaaggtaata tggtatcatt tgttaaataa aaataaagtc
aaggtgaaca gtgtgaaaga gctagacgcc ttcaagctaa aatgatgact aatttggtta
tggccaagga ccgcttacaa cttctagaga agatgcaacc agttttgcca ttttccaagt
cacaaacgga cgtctataat gacagtacta acttggcatg ccgcaatgga catctccagt
cagaaagtgg
agctgttcca aaaagaaaag accccttaac acacactagt aattcactgc
ctcgttcaaa aacagttatg aaaactggat ctgcaggcct ttcaggccac catagagcac
ctagttacag tggtttatcc atggtttctg gagtgaaaca gggatctggt cctgctccta
ccactcataa gggtactccg aaaacaaata ggacaaataa accttctacc cctacaactg
ctactcgtaa gaaaaaagac ttgaagaatt
ttaggaatgt ggacagcaac cttgctaacc
ttataatgaa tgaaattgtg gacaatggaa cagctgttaa
atttgatgat atagctggtc
aagacttggc aaaacaagca ttgcaagaaa ttgttattct tccttctctg aggcctgagt
tgttcacagg gcttagagct cctgccagag ggctgttact ctttggtcca cctgggaatg
ggaagacaat gctggctaaa gcagtagctg cagaatcgaa tgcaaccttc tttaatataa
gtgctgcaag tttaacttca aaatacgtgg gagaaggaga gaaattggtg agggctcttt
ttgctgtggc tcgagaactt caaccttcta taatttttat agatgaagtt gatagccttt
tgtgtgaaag aagagaaggg gagcacgatg ctagtagacg cctaaaaact gaatttctaa
tagaatttga tggtgtacag tctgctggag atgacagagt acttgtaatg ggtgcaacta
ataggccaca agagcttgat gaggctgttc tcaggcgttt catcaaacgg gtatatgtgt
ctttaccaaa tgaggagaca agactacttt tgcttaaaaa tctgttatgt aaacaaggaa
ataattgcac atggttttga gaacaggaac tgtaaatctg ttgcccaatc ttaacatttt
tgagctgcat ttaagtagac tttggaccgt taagctgggc aaaggaaatg acaaggggac
ggggtctgtg agagtcaatt caggggaaag atacaagatt gatttgtaaa acccttgaaa
tgtagatttc ttgtagatgt atccttcacg ttgtaaatat
gttttgtaga gtgaagccat
gggaagccat gtgtaacaga gcttagacat ccaaaactaa tcaatgctga
ggtggctaaa
tacctagcct tttacatgta aacctgtctg caaaattagc ttttttaaaa aaaaaaaaaa
aaaaattggg ggggttaatt tatcattcag aaatcttgca ttttcaaaaa ttcagtgcaa
gcgccaggcg atttgtgtct aaggatacga ttttgaacca tatgggcagt gtacaaaata
tgaaacaact gtttccacac ttgcacctga tcaagagcag tgcttctcca tttgttttgc
agagaaatgt ttttcatttc ccgtgtgttt ccatttcctt ctgaaattct gattttatcc
atttttttaa ggctcctctt tatctccttt cttaaggcac tgttgctatg gcacttttct
ataacctttt cattcctgtg tacagtagct taaaattgca gtgattgagc ataacctact
tgtttgtata aattattgaa atccatttgc accctgttaa gaatggactt aaaagtacta
ctggacaggc atgtgtgctc aaagtacatt gattgctcaa atataaggaa atggcccaat
gaacgtggtt gtgggagggg aaagaggaaa cagagctagt cagatgtgaa ttgtatctgt
tgtaataaac atgttaaaac aaacaaaaat tgttattttt cttttccttc ggtcagtgca
cattagcatt tgaactacct ggggattctt tatcagaact gttcttgttg aatatttata
cttaattgaa ataattcctt aagggaggtt ttgtttaaaa cgtattaaca ggaaattgtg
tatgagatat ttaatgaaat aagaaattca acaagaatga ttaagtcact tcccaagtgg
ttgtcatttg ttaaaccctg gtttacctgt cttgctatta tgacatttca tttggaagga
tgtttgtgtt gtagctaact gttcaagtct
ggtgctgact gctgttctta gccatcacaa
aacgctaaat ttgtgtaatt ggagcttcct gctgttatct
ggaaatagca ggaaagcgca
gctttgtata ttgtttccta aagtatatta aaataaaaaa agaaactatt gctactataa
aattaccttg actttttttt tcctttgctg aaatattagt cacatagcct tagcttcaca
ctgccagtaa tgtatcaaat cacaagggtt tccgcatgaa aaaaatcttt tcttccccca
caaaaaaacc tttaccatca aaatcttgcc atctgattta gaaaggtgtt tcttcttctt
Number | Date | Country | Kind |
---|---|---|---|
1408100 | May 2014 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2015/051349 | 5/7/2015 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/170108 | 11/12/2015 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7598080 | Deirmengian | Oct 2009 | B2 |
7666596 | Halloran | Feb 2010 | B2 |
10519216 | Sonntag | Dec 2019 | B2 |
20060134663 | Harkin | Jun 2006 | A1 |
20090142301 | Bevec | Jun 2009 | A1 |
20110196614 | Banchereau | Aug 2011 | A1 |
20120142544 | Hare | Jun 2012 | A1 |
20130316933 | Osta Pinzolas | Nov 2013 | A1 |
20180028608 | Yu | Feb 2018 | A1 |
20190209576 | Beeharry | Jul 2019 | A1 |
Number | Date | Country |
---|---|---|
2009158521 | Dec 2009 | WO |
2013177502 | Nov 2013 | WO |
2013190321 | Dec 2013 | WO |
2014019977 | Feb 2014 | WO |
2014067943 | May 2014 | WO |
2014093872 | Jun 2014 | WO |
2015170108 | Nov 2015 | WO |
Entry |
---|
Berry, M.P.R., et al., “An Interferon-Inducible Neutrophil-Driven Blood Transcriptional Signature in Human Tuberculosis,” Nature Letters 466(7309):973-977, Aug. 2010. |
Fu, T., and J. Xie, “Progress on the Biomarkers for Tuberculosis Diagnosis,” Critical Reviews™ in Eukaryotic Gene Expression 21(4):379-391, 2011. |
Great Britain Search Report Under Section 17(5), dated Jan. 13, 2015, issued in corresponding Application No. GB1408100.4, filed May 7, 2014, 3 pages. |
International Preliminary Report on Patentability dated Jul. 11, 2016, issued in corresponding International Application No. PCT/GB2015/051349, filed May 7, 2015, 23 pages. |
International Search Report and Written Opinion dated Aug. 20, 2015, issued in corresponding International Application No. PCT/GB2015/051349, filed May 7, 2015, 27 pages. |
Lesho, E., et al., “Transcriptional Responses of Host Peripheral Blood Cells to Tuberculosis Infection,” Tuberculosis 91(5):390-399, Sep. 2011. |
Operon Biotechnologies GmbH, “Operon Microarray Slides—OpArrays™,” Jan. 9, 2006, <http://178.250.165.133/ex/downloads/flyer/EU_OpArray_Flyer.pdf> [retrieved Mar. 24, 2016], 1 page. |
Second Written Opinion dated Apr. 12, 2016, issued in corresponding International Application No. PCT/GB2015/051349, filed May 7, 2015, 9 pages. |
Stanton, L.-A., et al., “Immunophenotyping of Macrophages in Human Pulmonary Tuberculosis and Sarcoidosis,” International Journal of Experimental Pathology 84(6):289-304, Dec. 2003. |
Stern, J.N.H., et al., “Molecular Signatures Distinguishing Active From Latent Tuberculosis in Peripheral Blood Mononuclear Cells, After In Vitro Antigenic Stimulation With Purified Protein Derivative of Tuberculin (PPD) or Candida: A Preliminary Report,” Immunologic Research 45(1):1-12, Oct. 2009. |
Office Action dated Mar. 27, 2018, issued in EP Application No. 15722572.3, filed Nov. 23, 2016, 3 pages. |
Mukundan, H., et al., “Rapid Detection of Mycobacterium tuberculosis Biomarkers Using a Waveguide-Based Biosensor,” Tuberculosis 92(5):407-416, Sep. 2012. |
“DNA Arrays : Order Minimum : 500 Arrays : Contact MI for Pricing,” © Microarrays Inc 2020, Huntsville, Ala., [retrieved Oct. 22, 2020] <http://www.microarrays.com/dna-arrays.php>, 1 page. |
“Human MI ReadyArray Genelist,” © Microarrays Inc 2020, Huntsville, Ala., [retrieved Oct. 22, 2020] <http://www.microarrays.com/docs/HS1100_Human_MI_ReadyArray_genelist.xls>. |
Examination Report dated May 6, 2020, issued in IN Application No. 201617037590, filed Mar. 11, 2016, 8 pages. |
Extended European Search Report dated Jul. 30, 2020, issued in EP Application No. 20170712.2, filed May 7, 2015, 12 pages. |
Examination Report dated Aug. 3, 2020, issued in AU Application No. 2015257483, filed May 7, 2015, 5 pages. |
“Anti-GBP1 Antibody [EPR8285] ab131255,” abcam Product Datasheet, © 1998-2021 Abcam plc, Boston, MA, <https://www.abcam.com/gbp1-antibody-epr8285-ab131255.html> [retrieved May 24, 2021], 6 pages. |
“Anti-SNX10 Antibody Produced in Mouse,” Sigma-Aldrich Product Datasheet, © 2021 Merck KGaA, Darmstadt, Germany <https://www.sigmaaldrich.com/catalog/product/sigma/sab1407388?lang=en®ion-AU> [retrieved May 24, 2021], 5 pages. |
Examination Report No. 2 dated May 24, 2021, issued in Australian Application No. 2015257483, filed May 7, 2015, 4 pages. |
Number | Date | Country | |
---|---|---|---|
20170073737 A1 | Mar 2017 | US |