Embodiments of the present disclosure relate generally to detection and diagnosis of tuberculosis (TB) infections. More particularly, the present disclosure provides novel immunogenic biomarkers associated with TB infections for the diagnosis and treatment of TB.
Active tuberculosis (TB) is a disease caused by uncontrolled infection with Mycobacterium tuberculosis (Mtb). It predominantly affects the respiratory tract and is typically transmitted through infectious droplets generated by coughing. The disease remains a major global public health problem, ranking alongside HIV infection as a leading cause of death from infectious disease worldwide. In 2015, an estimated 10.4 million new cases occurred globally, with around 1.4 million TB-associated deaths; for the first time in decades, these numbers reflect an increase in incident cases compared with the preceding year. Rapid TB diagnosis and treatment are cornerstones of TB control and essential for reduction of morbidity, mortality and transmission.
Antibody (Ab) detection assays can be adapted for the development of rapid and inexpensive tests that require neither laboratory infrastructure nor specific training. Prior serological tests for the diagnosis of TB have been insufficiently sensitive and specific for several reasons. Importantly, the Ab profiles of TB patients are heterogeneous, and tests that are based on a limited number of antigens, often only one or two, are insufficient to capture the diversity of TB cases. For example, a strong Ab response to the 38 kDa protein is elicited almost exclusively in the subgroup of advanced, HIV-negative pulmonary TB patients, so assays based on this antigen are limited in diagnostic scope. Furthermore, several antigens appear to lack specificity for TB. Because of the potential to turn Ab detection assays into simple dipstick formats, TB serology, despite its known limitations, remains a field of study that is worth pursuing further, and new biomarker targets need to be identified. The simultaneous use of multiple, more recently identified Mtb proteins in the form of multiplex microbead immunoassays has already shown promising improvements in accuracy for TB serodiagnosis in regional case-control studies. Although the World Health Organization recognizes the limitations of currently available serologic tests and in fact cautions against using them, it vigorously encourages further research to meet the need for reliable, simple tests for TB in endemic regions. Because Ab detection is amenable to use in a dipstick format incorporating a diversity of antigens, the pursuit of Ab targets that are valid biomarkers of TB is worthwhile.
Discovery of potential biomarkers requires high-throughput methods for proteome-wide screening of antibody reactivity. In situ protein arrays have made high-throughput protein microarrays, and the translational studies that rely on them, more accessible. Instead of requiring purified protein for printing, the in situ protein microarray relies on printing expression plasmids that encode libraries of genes. After in situ transcription and translation, the proteins “self-assemble” on the array surface with the aid of ribosomes and chaperones, thereby enhancing natural protein folding and post-translational modification. Among the in situ protein microarray methods, the Nucleic Acid Programmable Protein Array (NAPPA) represents a platform for biomarker discovery in cancer, autoimmune diseases, and infectious disease. Membrane proteins express and display well with NAPPA, with an efficiency that exceeds 90%. Because membrane proteins comprise a large portion of antigens eliciting a human humoral immune response to TB, this method could identify novel valuable Ab targets for TB serodiagnosis that might not be discovered with the conventional protein array platform that is based on printing prefabricated proteins, typically generated in E. coli, on glass slides.
Diagnosis of TB can be challenging because the clinical presentations are manifold and dependent on the immune status of the host. Furthermore, the differential diagnosis can be broad, so diagnostic confirmation is desired. The gold standard tests for detecting Mtb, usually in a respiratory sample, are culture or nucleic acid amplification (NAA). Both require a certain degree of laboratory infrastructure and/or equipment, which is often not available in endemic settings because these are typically resource-limited. Thus, there is an urgent need for simple point-of-care (POC) TB tests that are based on the use of easily accessible, non-sputum-based body fluids, such as blood, and that can detect the different forms of TB, pulmonary and extrapulmonary, in various hosts. In the absence of such POC tests, a simple triage method to identify those symptomatic TB suspects who are in need of further confirmatory testing would be desirable but remains a further unmet need among the current TB diagnostic armamentarium.
Embodiments of the present disclosure include a method of diagnosing a subject as having a TB infection. In accordance with these embodiments, the method includes performing an assay on a biological sample obtained from a subject, and measuring or detecting at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0813c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28). Measurement or detection of the at least one TB biomarker or fragment thereof can indicate that the subject has a TB infection.
Embodiments of the present disclosure also include a panel of biomarkers for diagnosing a subject as having a TB infection. In accordance with these embodiments, the panel includes at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0813c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28), wherein the measurement or detection of the at least one TB biomarker or fragment thereof indicates that the subject has a TB infection.
This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. In case of conflict, the present document, including definitions, will control. Preferred methods and materials are described below, although methods and materials similar or equivalent to those described herein can be used in practice or testing of the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. The materials, methods, and examples disclosed herein are illustrative only and not intended to be limiting.
The terms “comprise(s),” “include(s),” “having,” “has,” “can,” “contain(s),” and variants thereof, as used herein, are intended to be open-ended transitional phrases, terms, or words that do not preclude the possibility of additional acts or structures. The singular forms “a,” “an” and “the” include plural references unless the context clearly dictates otherwise. The present disclosure also contemplates other embodiments “comprising,” “consisting of” and “consisting essentially of,” the embodiments or elements presented herein, whether explicitly set forth or not.
The modifier “about” used in connection with a quantity is inclusive of the stated value and has the meaning dictated by the context (for example, it includes at least the degree of error associated with the measurement of the particular quantity). The modifier “about” should also be considered as disclosing the range defined by the absolute values of the two endpoints. For example, the expression “from about 2 to about 4” also discloses the range “from 2 to 4.” The term “about” may refer to plus or minus 10% of the indicated number. For example, “about 10%” may indicate a range of 9% to 11%, and “about 1” may mean from 0.9-1.1. Other meanings of “about” may be apparent from the context, such as rounding off, so, for example “about 1” may also mean from 0.5 to 1.4.
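By way of non-limiting illustration, the plus-or-minus 10% reading of “about” can be sketched in a few lines of Python. The function name `about_range` and the default tolerance argument are hypothetical and are introduced here only for illustration; the other meanings of “about” noted above (for example, rounding) are not modeled.

```python
def about_range(value, tolerance=0.10):
    """Return (low, high) for 'about value', read as value +/- tolerance * |value|.

    Illustrative only: this models the plus-or-minus 10% reading of "about"
    and ignores the context-dependent readings such as rounding.
    """
    delta = abs(value) * tolerance
    return value - delta, value + delta


if __name__ == "__main__":
    # "about 10%" read as 9% to 11%; "about 1" read as 0.9 to 1.1
    print(about_range(10))
    print(about_range(1))
```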
The use of the terms “a” and “an” and “the” and “at least one” and similar referents in the context of describing the invention (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The use of the term “at least one” followed by a list of one or more items (for example, “at least one of A and B”) is to be construed to mean one item selected from the listed items (A or B) or any combination of two or more of the listed items (A and B), unless otherwise indicated herein or clearly contradicted by context. The terms “comprising,” “having,” “including,” and “containing” are to be construed as open-ended terms (i.e., meaning “including, but not limited to,”) unless otherwise noted. Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.
“Isolated polynucleotide” as used herein may mean a polynucleotide (e.g., of genomic, cDNA, or synthetic origin, or a combination thereof) that, by virtue of its origin, is not associated with all or a portion of a polynucleotide with which the “isolated polynucleotide” is found in nature; is operably linked to a polynucleotide that it is not linked to in nature; or does not occur in nature as part of a larger sequence.
“Nucleic acid” or “oligonucleotide” or “polynucleotide” as used herein means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a nucleic acid also encompasses the complementary strand of a depicted single strand. Many variants of a nucleic acid may be used for the same purpose as a given nucleic acid. Thus, a nucleic acid also encompasses substantially identical nucleic acids and complements thereof. A single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions. Thus, a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions.
Nucleic acids may be single stranded or double stranded, or may contain portions of both double stranded and single stranded sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, isocytosine and isoguanine. Nucleic acids may be obtained by chemical synthesis methods or by recombinant methods.
“Polypeptide” and “isolated polypeptide” as used herein refer to a polymer of amino acids or amino acid derivatives that are connected by peptide bonds. An isolated polypeptide is a polypeptide that is isolated from a source. An isolated polypeptide can be at least 1% pure, at least 5% pure, at least 10% pure, at least 20% pure, at least 40% pure, at least 60% pure, at least 80% pure, or at least 90% pure, as determined by one or more protein biochemistry techniques (e.g., SDS-PAGE).
“Subject” and “patient” as used herein interchangeably refer to any vertebrate, including, but not limited to, a mammal (e.g., cow, pig, camel, llama, horse, goat, rabbit, sheep, hamster, guinea pig, cat, dog, rat, or mouse), a non-human primate (for example, a monkey, such as a cynomolgus or rhesus monkey, a chimpanzee, etc.), and a human. In some embodiments, the subject may be a human or a non-human. The subject or patient may be undergoing other forms of treatment.
“Treat,” “treated,” or “treating,” as used herein, refer to a therapeutic method wherein the object is to slow down (lessen) an undesired physiological condition, disorder or disease, or to obtain beneficial or desired clinical results. In some aspects of the present disclosure, beneficial or desired clinical results include, but are not limited to, alleviation of symptoms; diminishment of the extent of the condition, disorder or disease; stabilization (i.e., not worsening) of the state of the condition, disorder or disease; delay in onset or slowing of the progression of the condition, disorder or disease; amelioration of the condition, disorder or disease state; and remission (whether partial or total), whether detectable or undetectable, or enhancement or improvement of the condition, disorder or disease. Treatment also includes prolonging survival as compared to expected survival if not receiving treatment.
“Variant” used herein with respect to a nucleic acid means (i) a portion or fragment of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto.
“Variant” with respect to a peptide or polypeptide means a peptide or polypeptide that differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retains at least one biological activity. Variant may also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity. A conservative substitution of an amino acid, i.e., replacing an amino acid with a different amino acid of similar properties (e.g., hydrophilicity, degree and distribution of charged regions), is recognized in the art as typically involving a minor change. These minor changes may be identified, in part, by considering the hydropathic index of amino acids, as understood in the art. The hydropathic index of an amino acid is based on a consideration of its hydrophobicity and charge. It is known in the art that amino acids of similar hydropathic indexes may be substituted and still retain protein function. In one aspect, amino acids having hydropathic indexes of ±2 are substituted. The hydrophilicity of amino acids may also be used to reveal substitutions that would result in proteins retaining biological function. A consideration of the hydrophilicity of amino acids in the context of a peptide permits calculation of the greatest local average hydrophilicity of that peptide. Substitutions may be performed with amino acids having hydrophilicity values within ±2 of each other. Both the hydrophobicity index and the hydrophilicity value of amino acids are influenced by the particular side chain of that amino acid. Consistent with that observation, amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties.
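As a non-limiting illustration, the ±2 hydropathic index criterion can be sketched as a simple lookup. The present disclosure does not name a particular hydropathy scale; the sketch below assumes the widely used Kyte-Doolittle scale, and the names `KYTE_DOOLITTLE` and `within_hydropathic_window` are hypothetical. Meeting this criterion alone does not establish that a substitution preserves biological activity.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the disclosure names none).
KYTE_DOOLITTLE = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5,
    "M": 1.9, "A": 1.8, "G": -0.4, "T": -0.7, "S": -0.8,
    "W": -0.9, "Y": -1.3, "P": -1.6, "H": -3.2, "E": -3.5,
    "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def within_hydropathic_window(original, replacement, window=2.0):
    """Return True if two residues differ by at most `window` on the scale."""
    difference = abs(KYTE_DOOLITTLE[original] - KYTE_DOOLITTLE[replacement])
    return difference <= window


if __name__ == "__main__":
    # Isoleucine -> valine differs by 0.3, so it falls inside the window.
    print(within_hydropathic_window("I", "V"))
    # Isoleucine -> aspartate differs by 8.0, so it falls outside the window.
    print(within_hydropathic_window("I", "D"))
```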
“Vector” is used herein to describe a nucleic acid molecule that can transport another nucleic acid to which it has been linked. One type of vector is a “plasmid”, which refers to a circular double-stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome. Certain vectors can replicate autonomously in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) can be integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as “recombinant expression vectors” (or simply, “expression vectors”). In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. “Plasmid” and “vector” may be used interchangeably as the plasmid is the most commonly used form of vector. However, other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions, can be used. In this regard, RNA versions of vectors (including RNA viral vectors) may also find use in the context of the present disclosure.
Before any embodiments of the present disclosure are explained in detail, it is to be understood that the present disclosure is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the accompanying drawings. The present disclosure is capable of other embodiments and of being practiced or of being carried out in various ways.
Other aspects of the invention will become apparent by consideration of the detailed description and accompanying drawings.
Embodiments of the present disclosure include the generation of a novel Mtb protein microarray based on the NAPPA platform, the High Density-NAPPA (HD-NAPPA), which was used in a multiplex version (M-HD-NAPPA) for high-throughput screening and in a single protein version for deconvolution and validation. This platform entailed printing plasmids containing cDNAs encoding the Mtb proteome comprising ~4000 proteins into silicon nano-wells. In accordance with these embodiments, sera from HIV-uninfected and HIV-coinfected TB patients and controls from different geographic regions (US and South Africa (SA)) were screened and proteins with previously unknown value for TB serodiagnosis were identified.
Embodiments of the present disclosure include the generation and validation of a new whole-proteome Mtb HD-NAPPA and demonstrate its value for detecting novel biomarkers for TB serodiagnosis. Embodiments of the present disclosure demonstrate the feasibility, efficiency, and accuracy of multiplexing proteins into a single spot for expedited high-throughput screening for Ab responses to the Mtb proteome. In accordance with these embodiments, multimarker panels were established to distinguish TB patients from non-infected or latently infected subjects, with and without HIV coinfection, across two geographic regions. With this initial evaluation, 8 proteins were identified that show potential as TB diagnostic biomarkers. HD-NAPPA provides a higher signal-to-noise ratio for Ab biomarker discovery as compared with flat-glass NAPPA. Using M-HD-NAPPA arrays, where the screen utilizes multiplexing of targets, can further accelerate Ab biomarker screening. To perform serum Ab profiling over the whole Mtb proteome (around 4000 genes), flat-glass based NAPPA requires two slides per sample. In contrast, HD-NAPPA requires only a half slide and the M-HD-NAPPA (using a 3-target multiplex per spot) requires only a quarter of a slide. Thus, the capacity to process 8 times more samples per slide than flat-glass based NAPPA would not only accelerate Ab discovery, but also result in significant reagent cost savings.
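The 8-fold capacity figure follows directly from the slide requirements stated above, as the following short sketch shows. The dictionary and function names are hypothetical; only the slides-per-sample values come from the present disclosure.

```python
from fractions import Fraction

# Slides needed to profile one serum sample against the whole Mtb proteome,
# as stated in the description above.
SLIDES_PER_SAMPLE = {
    "flat-glass NAPPA": Fraction(2),
    "HD-NAPPA": Fraction(1, 2),
    "M-HD-NAPPA": Fraction(1, 4),
}


def fold_capacity(platform, baseline="flat-glass NAPPA"):
    """Samples processed per slide on `platform`, relative to `baseline`."""
    return SLIDES_PER_SAMPLE[baseline] / SLIDES_PER_SAMPLE[platform]


if __name__ == "__main__":
    for name in SLIDES_PER_SAMPLE:
        print(name, fold_capacity(name))
```

Exact fractions are used instead of floating-point values so that the ratios print as whole numbers.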
The possibility that low protein expression levels, from one of the three proteins in the mix, could mask detection was analyzed. Results showed that 100% of high and medium signal intensity responses and 91.5% of low-signal intensity responses were detected when proteins were mixed in all possible combinations. Printing 4,045 plasmids required creating two glass arrays, termed TB array 01 and TB array 02. With these arrays of individually-printed Mtb plasmids, expression of Mtb proteins was demonstrated by detection of the fusion partner with anti-GST staining. As shown in
It was also investigated whether some or all of the observed regional differences in Ab responses are driven by regional differences in disease state—with TB patients from resource-limited TB endemic settings typically being diagnosed at more advanced stages than those living in the US, or whether the regional differences could be driven in part by infection with different Mtb strains. Embodiments of the present disclosure therefore included individual panels for the four subject subgroups, depending on the geographic region (US or SA) and HIV status (HIV+/−). The eight candidate immunoreactive Mtb proteins identified have varied characteristics. Four of these proteins are secreted and have been identified in Mtb culture filtrates (CFPs; Rv0054, Rv0831c, Rv2031c and Rv0222), with three of these (Rv0054, Rv0831c, Rv2031c) also identified in the cell membrane and two (Rv0831c, Rv2031c) in the cell wall. One (Rv0948c) has only been associated with the Mtb membrane fraction. The cellular location for two of the proteins (Rv3405c and Rv3544c) has not been identified.
Embodiments of the present disclosure include a method of diagnosing a subject as having a TB infection. In accordance with these embodiments, the method includes performing an assay on a biological sample obtained from a subject, and measuring or detecting at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0813c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28). Measurement or detection of the at least one TB biomarker or fragment thereof can indicate that the subject has a TB infection.
In some embodiments, a biological sample includes a fluid sample from a subject having or suspected of having TB. The sample may be derived from any suitable source. In some cases, the sample may comprise a liquid, fluent particulate solid, or fluid suspension of solid particles. In some cases, the sample may be processed prior to the analysis described herein. For example, the sample may be separated or purified from its source prior to analysis; however, in certain embodiments, an unprocessed sample may be assayed directly. In a particular example, the biological sample is a human bodily substance (e.g., bodily fluid, blood such as whole blood, serum, plasma, urine, saliva, sweat, sputum, semen, mucus, lacrimal fluid, lymph fluid, amniotic fluid, interstitial fluid, lung lavage, cerebrospinal fluid, feces, tissue, organ, or the like). Tissues may include, but are not limited to skeletal muscle tissue, liver tissue, lung tissue, kidney tissue, myocardial tissue, brain tissue, bone marrow, cervix tissue, skin, and the like. The sample may be a liquid sample or a liquid extract of a solid sample. In certain cases, the source of the sample may be an organ or tissue, such as a biopsy sample, which may be solubilized by tissue disintegration/cell lysis. In some embodiments, the biological sample is at least one of whole blood, serum, plasma, urine, saliva, sweat, sputum, semen, mucus, lacrimal fluid, lymph fluid, amniotic fluid, interstitial fluid, lung lavage, cerebrospinal fluid, and feces.
In some embodiments, methods of diagnosing a subject as having or not having TB can be carried out using any suitable diagnostic test, such as, but not limited to, an immunoassay. Methods of determining the presence or amount of (detecting or measuring) a TB biomarker include, but are not limited to, immunoassays, such as sandwich immunoassays (e.g., monoclonal-monoclonal sandwich immunoassays and monoclonal-polyclonal sandwich immunoassays, including enzyme detection (enzyme immunoassay (EIA) or enzyme-linked immunosorbent assay (ELISA))), competitive inhibition immunoassays (e.g., forward and reverse), enzyme multiplied immunoassay techniques (EMIT), a competitive binding assay, bioluminescence resonance energy transfer (BRET), one-step antibody detection assays, homogeneous assays, heterogeneous assays, capture on the fly assays, and the like.
In some embodiments, assays used to measure or detect a TB biomarker as described herein can be associated with percentages of sensitivity and specificity. Sensitivity of an assay as used herein refers to the proportion of subjects for whom the outcome is positive that are correctly identified as positive. Specificity of an assay as used herein refers to the proportion of subjects for whom the outcome is negative that are correctly identified as negative. In some embodiments, the immunoassay has a sensitivity of at least 80.0% and a specificity of at least 50.0%. In other embodiments, the immunoassay has a specificity of at least 80.0% and a sensitivity of at least 50.0%.
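By way of non-limiting example, the sensitivity and specificity definitions above, and the two threshold combinations recited for the immunoassay, can be sketched as follows. The function names and the confusion-matrix counts are hypothetical and are used only to illustrate the arithmetic.

```python
def sensitivity(true_positives, false_negatives):
    """Proportion of truly positive subjects that the assay calls positive."""
    return true_positives / (true_positives + false_negatives)


def specificity(true_negatives, false_positives):
    """Proportion of truly negative subjects that the assay calls negative."""
    return true_negatives / (true_negatives + false_positives)


def meets_recited_thresholds(sens, spec):
    """Check either recited combination: (>=80% sens, >=50% spec) or the reverse."""
    return (sens >= 0.80 and spec >= 0.50) or (spec >= 0.80 and sens >= 0.50)


if __name__ == "__main__":
    # Hypothetical counts: 50 TB-positive and 50 TB-negative subjects.
    sens = sensitivity(true_positives=42, false_negatives=8)
    spec = specificity(true_negatives=30, false_positives=20)
    print(sens, spec, meets_recited_thresholds(sens, spec))
```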
Embodiments of the present disclosure also include a panel of biomarkers for diagnosing a subject as having a TB infection. In accordance with these embodiments, the panel includes at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0813c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28), wherein the measurement or detection of the at least one TB biomarker or fragment thereof indicates that the subject has a TB infection.
A biomarker panel can refer to a set of biomarkers that can be used alone, together, or in subcombinations to indicate the status of a human subject with respect to a condition, status, or state of being of the human subject. The biomarkers within the panel of biomarkers can include those TB biomarkers discussed herein. It will be appreciated that the specific identity of biomarkers within the panel and the number of distinct biomarkers within the panel can depend on the particular use to which the biomarker panel is put and the stringency that the results of the panel must meet for the particular application. In some embodiments, a TB biomarker panel can include TB biomarkers, such as those described here, as well as other various biomarkers that may or may not be used to measure or detect a TB biomarker. In some embodiments, the biomarker panel may include 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or as many as 40 biomarkers. In some embodiments, the biomarker panel may include 10 or fewer biomarkers, depending on various factors such as the characteristics of the subject from which the biological sample was obtained and is to be assayed. In other embodiments, the biomarker panel may include 2, 3, 6 or 8 biomarkers. In some embodiments, the biomarker panel may be optimized from a candidate pool of biomarkers. By way of non-limiting example, the biomarker panel may be optimized for determining whether a subject has a specific disease, such as TB.
The following examples are illustrative of disclosed methods. In light of the present disclosure, those of skill in the art will recognize that variations of these examples and other examples of the disclosed method would be possible without undue experimentation.
M-HD-NAPPA—Concept Validation.
A test set of 96 Mtb proteins was first generated, consisting of proteins identified by preliminary studies using a serum pool from TB+ patients, as well as proteins reported in the TB serology literature. These test proteins were used to create multiplex mixes of 3 proteins per spot to validate the concept of M-HD-NAPPA. To confirm detection of a positive responder even when it was mixed with nonresponders (i.e., that the nonresponders did not dilute the responder signal too much), it was ensured that all possible combinations of positive responders and nonresponders, as determined by reactivity to the serum pool, were created. To ensure that selection did not create any bias, 3 random plates were selected from the Mtb collection set to add an additional 288 Mtb genes. The gene mixtures, as well as the single genes, were printed on HD-NAPPA arrays. After expression, the arrays were probed with the TB+ pooled serum, sera from each of the individuals that comprise the pool, and anti-GST with signals normalized as described earlier (
M-HD-NAPPA Screen. Following the successful demonstration of protein reactivity with M-HD-NAPPA, a protein display quality control (QC) test was performed with the 4045 Mtb M-HD-NAPPA array to assess the intra- and intersubarray correlation (protein display repeatability). The anti-GST reactivity showed that almost all of the spots exhibited a yellow to red color revealing overall fluorescence intensity higher than 1×10^6 arbitrary intensity units (a.u., the cutoff of the successful protein display;
After assuring the quality and reproducibility of reactivity using the M-HD-NAPPA, samples were randomized to assure a mix of subject category samples during each run and day to ensure minimal run-to-run bias. All of the samples were analyzed as 4 paired subgroups according to the region (US/SA), HIV (HIV±) and TB status (TB±) for determining sensitivity and specificity. Using these groupings, spots were selected for deconvolution analysis. A representative image is shown in
Overall, 95 (US/HIV−), 47 (US/HIV+), 15 (SA/HIV−) and 30 (SA/HIV+) multispots reactive with IgG and 41 (US/HIV−), 24 (US/HIV+), 21 (SA/HIV−) and 41 (SA/HIV+) multispots reactive with IgA with more than 10% sensitivity at 70% specificity were identified. When combining all groups, 202 multispots in IgG and 144 multispots in IgA analysis showed higher than 5% sensitivity. Taken together, 163 multispots from the subgroup analysis and 259 multispots from the merged group analysis, together with 2 extra hits from visual analysis resulted in a total of 272 multispots (792 individual genes) selected for deconvolution.
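As a non-limiting illustration, the selection rule of more than 10% sensitivity at 70% specificity can be sketched as shown below. The present disclosure does not state how the intensity cutoff was derived; the sketch assumes that the cutoff is the lowest control intensity at or below which at least 70% of the controls fall. The function name and all intensity values are hypothetical.

```python
def sensitivity_at_specificity(case_values, control_values, specificity_pct=70):
    """Return (sensitivity, cutoff) for one spot at a fixed specificity.

    Assumed rule: the cutoff is the smallest control value such that at least
    `specificity_pct` percent of controls are at or below it; a case counts as
    reactive only if its value is strictly above the cutoff.
    """
    ordered = sorted(control_values)
    # Integer ceiling of specificity_pct * n / 100, avoiding float rounding.
    needed = -(-specificity_pct * len(ordered) // 100)
    cutoff = ordered[max(needed, 1) - 1]
    reactive = sum(1 for value in case_values if value > cutoff)
    return reactive / len(case_values), cutoff


if __name__ == "__main__":
    # Hypothetical normalized intensities for one multispot.
    controls = [100, 120, 130, 150, 160, 180, 200, 260, 300, 400]
    cases = [150, 210, 190, 500, 120, 170, 140, 160, 180, 130]
    sens, cutoff = sensitivity_at_specificity(cases, controls)
    print(sens, cutoff, sens > 0.10)
```

In this hypothetical example, two of ten cases exceed the cutoff, so the spot would pass the 10% sensitivity criterion and be carried forward for deconvolution.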
HD-NAPPA—Deconvolution.
In addition to the 792 candidate proteins identified in the M-HD-NAPPA screen, the 96 Mtb proteins were included (from unpublished data and published studies) to create 870 individual arrays for printing onto HD-NAPPA. Quality control assessments of these arrays are presented in
HD-NAPPA—Validation.
Arrays with the same genes generated for the deconvolution were also used for validation experiments with 124 biologically independent samples (Table 1). For biomarker candidate analysis, sensitivity and specificity were used as the first criteria. In addition, an odds ratio higher than 1.5 and an AUC value higher than 0.55 were required. From the combined criteria, 34 IgG hits and 8 IgA hits (
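By way of non-limiting example, the odds ratio and AUC criteria used for candidate analysis can be computed as sketched below. The function names, counts, and intensity values are hypothetical, and the AUC is computed here with the rank-based (Mann-Whitney) formulation, which is an assumption because the present disclosure does not specify the method of calculation.

```python
def odds_ratio(tp, fn, fp, tn):
    """Odds of a positive call in cases divided by the odds in controls.

    Raises ZeroDivisionError if fn or fp is zero; a continuity correction
    would be needed in that situation and is not applied here.
    """
    return (tp * tn) / (fn * fp)


def auc(case_values, control_values):
    """Probability that a random case scores above a random control (ties = 0.5)."""
    wins = 0.0
    for case in case_values:
        for control in control_values:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_values) * len(control_values))


def passes_criteria(or_value, auc_value):
    """Apply the recited thresholds: odds ratio > 1.5 and AUC > 0.55."""
    return or_value > 1.5 and auc_value > 0.55


if __name__ == "__main__":
    # Hypothetical counts and intensities for a single candidate antigen.
    or_value = odds_ratio(tp=12, fn=28, fp=6, tn=34)
    auc_value = auc([3, 5, 7, 9], [2, 4, 6, 8])
    print(or_value, auc_value, passes_criteria(or_value, auc_value))
```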
a. Because the deconvolution of positive reactions was the prime goal of this experiment, we focused these analyses predominantly on TB+ samples from the multiplex HD-NAPPA screening.
b. Consisting of biologically independent samples.
c. Consisting of the original screening and validation samples (n = 244) and heretofore untested samples (n = 41).
a. Information on the cellular location and functions of the proteins can be accessed through the TubercuList Database (http://tuberculist.epfl.ch/) under the gene name.
ELISA Verification. To verify the performance of the Mtb proteins identified through the M-HD-NAPPA workflow, the rapid antigenic protein in situ display ELISA was used. All 244 available sera from discovery and validation were tested, together with an additional 41 samples (Table 2). An anti-GST quality control expression assessment of all targets was performed prior to performing ELISA (
Multimarker Panels.
Optimal markers under BIC for the US/HIV− subgroup were Rv2031c, Rv0831c and Rv0948c. This classifier had an AUC of 0.807 under leave-one-out cross validation. For the US/HIV+ subgroup, the optimal markers were Rv0054 and Rv0948c, and the classifier had an AUC of 0.782. For the SA/HIV− subgroup, the optimal markers were Rv2853, Rv0054, Rv0831c, Rv3544c and Rv0222. The AUC under cross validation for the SA/HIV− subgroup was 0.868. Finally, for the SA/HIV+ subgroup, only Rv3405c was selected for use in the classifier, which had an AUC of 0.723 under cross validation. The ROC curves for each classifier are shown in
Mtb Plasmid Construction and DNA Preparation.
In the present disclosure, 3295 Mtb H37Rv and 437 CDC 1551 genes were obtained in entry vectors from the Pathogen Functional Genomics Center. Primers were designed and obtained for the missing ~800 H37Rv genes (Integrated DNA Technologies, Coralville, Iowa), and PCR amplification from genomic Mtb H37Rv DNA was performed to create entry clones for these missing genes as described. After two rounds of PCR amplification and transfer of clones to the pANT7-cGST expression vector, which encodes Glutathione-S-Transferase (GST) as a C-terminal fusion partner for the target gene, a final sequence-verified gene set was obtained, comprising 3646 H37Rv and 399 CDC 1551 clones (4045 total) for array construction. The reduction in clone numbers resulted from failure either to produce a PCR product or to create a verified expression clone. Purified plasmid DNA was prepared with a high-throughput alkaline lysis miniprep protocol as described. For positive controls, several genes were used that encode antigens of the Epstein-Barr virus (EBV), a virus that infects over 95% of individuals by adulthood (22), specifically the Epstein-Barr Nuclear Antigen (EBNA), EBV Small capsomere-interacting protein (BFRF3) and EBV_EBNA2, as well as other viral genes, specifically H1N1_Nucleoprotein, H3N2_Nucleoprotein and HCMV2_Viral transcription factor IE2 (UL122). For negative gene controls, a plasmid encoding GST without any fusion partner was used.
HD-NAPPA Array Fabrication.
The HD-NAPPA array fabrication included three main processes: nanowell slide fabrication, plasmid plate and printing mixture preparation, and piezoelectric printing. The nanowell slide fabrication was performed as reported. The plasmid plate was constructed as reported for the HD-NAPPA, with modifications for the multiplex version to allow higher-throughput evaluation. Three unique genes were admixed into one well, resulting in three unique proteins displayed in each spot. Although this added the need to deconvolute reactive spots by reassessing the same screening samples with a new microarray containing only individual proteins per spot, it allowed faster screening overall. The printing master mix (MM) was composed of polyclonal anti-GST Ab (GE Healthcare), bovine serum albumin (BSA, Sigma-Aldrich), BS3 cross linker (Pierce) and DEPC-treated water. To control for secondary Ab reactivity, purified mouse IgG, human IgG and human IgA were also printed in MM at concentrations from 40 to 200 ng/μl in each subarray. Negative controls consisted of MM spots without any plasmid and the plasmid encoding only the fusion partner GST. The HD-NAPPA print was performed on an AU302 piezoelectric dispensing system (Engineering Arts LLC, Tempe, Ariz., USA) by depositing MM (1200 pL/well) and plasmid(s) (100 ng/μl, 300 pL/well) sequentially using 16 individual noncontact dispensing heads. The HD-NAPPA slides were stored in an argon-filled container at room temperature until the day of use, when proteins were expressed.
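The multiplexing step can be pictured with a short Python sketch that groups genes three to a well and computes the plasmid mass dispensed per well from the stated concentration and volume. The sequential grouping, the well naming and the function names are hypothetical; the disclosure does not describe how genes were assigned to wells.

```python
from typing import Dict, List, Sequence


def pool_genes(gene_ids: Sequence[str], genes_per_well: int = 3) -> Dict[str, List[str]]:
    """Group genes into multiplex wells in input order; the last well may be short."""
    if genes_per_well < 1:
        raise ValueError("genes_per_well must be at least 1")
    wells: Dict[str, List[str]] = {}
    for start in range(0, len(gene_ids), genes_per_well):
        well_id = f"W{start // genes_per_well + 1:04d}"  # hypothetical well naming
        wells[well_id] = list(gene_ids[start:start + genes_per_well])
    return wells


def dispensed_mass_pg(conc_ng_per_ul: float, volume_pl: float) -> float:
    """Mass in picograms: 1 ng/ul x 1 pL = 1e-6 ng = 1e-3 pg."""
    return conc_ng_per_ul * volume_pl / 1000.0


# Seven placeholder gene names (invented) give two full wells and one short well.
wells = pool_genes([f"gene{i:03d}" for i in range(1, 8)])

# 100 ng/ul dispensed at 300 pL per well, as stated in the text.
plasmid_pg = dispensed_mass_pg(100.0, 300.0)
```

Under these figures roughly 30 pg of plasmid solution solids is deposited per well; whether that mass is shared among the three admixed plasmids is not stated in the text.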
Protein Expression on M-HD-NAPPA.
Arrays were blocked with SuperBlock (Thermo Fisher Scientific, Rockford, Ill.) prior to expression to reduce nonspecific binding, rinsed with DI water and centrifuged dry. The nanowells were filled with a human cell-free expression system (In Vitro Transcription and Translation coupled system; IVTT; Thermo Fisher Scientific), and a custom micro-reactor device was used for protein expression. After the wells were sealed with a polystyrene membrane under 200 PSI pressure, the reactor was incubated for 2 h at 30° C. for expression and for 0.5 h at 15° C. for protein capture, followed by blocking with 5% skim milk in phosphate buffered saline with 0.2% Tween 20 (PBST) for 30 min. Anti-GST murine monoclonal Ab (mAb; Cell Signaling Technology, Danvers, Mass.) was used to assess protein display, followed by detection with Alexa 647-labeled goat anti-mouse IgG (H+L) secondary Ab (A-21235, Thermo Fisher Scientific).
Subjects and Samples.
Serum samples were obtained in cross-sectional studies from patients with Mtb culture-proven TB before or within the first 7 days of antituberculous treatment initiation and from asymptomatic controls (Table 5). Subjects were enrolled in two different settings: public hospitals in New York City, United States, and Edendale Hospital in KwaZulu-Natal, South Africa (SA). Subjects provided written informed consent prior to enrollment and blood draw. Serum was obtained by collecting peripheral venous blood into BD Vacutainer Serum Separation Tubes (SST; Becton, Dickinson and Company, New Jersey) that do not contain any additives. Within 1-3 h after blood draw, the samples were centrifuged at room temperature for 10 min at 3000 rpm, and serum was aliquoted and stored at −80° C. until further use.
aSubjects emigrated from various TB endemic regions, including Asia, South America and Africa;
The studies were approved by the Institutional Review Boards of Arizona State University; the Albert Einstein College of Medicine, New York; and the University of KwaZulu-Natal, SA. The samples were divided into four subgroups according to region (US, SA) and HIV status (HIV+/HIV−; Table 1). Prior to performing assays, the samples in each subgroup were randomized into two even sets: one set for the screening/deconvolution array and one independent set for the validation array (Table 1).
M-HD-NAPPA—Concept Validation.
In order to evaluate the M-HD-NAPPA array screening workflow, 96 Mtb genes were selected from initial individual-gene glass-slide NAPPA results (data not shown but available upon request) and the scientific literature to create a gene set for validating immunodetection of individual proteins within a triple protein mix. In addition, 288 Mtb clones were randomly selected and printed as individual genes as well as triple gene mixes on the HD-NAPPA slides. Ab binding was performed with a pooled sample set from 3 HIV−, TB+ subjects that had documented Ab reactivity to various proteins in prior studies, as well as with mAb anti-GST to assess protein display level. During scanning of the silicon slides, the scanner parameters were adjusted to focus signal detection within the wells, which were 75 μm deeper than the surface of flat glass-slide-based NAPPA.
M-HD-NAPPA—Discovery Screen.
A multiplex Mtb array was created containing all 4045 genes, spread among 1431 multiplexed Mtb gene spots along with the 96 individual Mtb genes and 7 viral single gene controls. There were four identical subarrays printed on each of the M-HD-NAPPA slides.
M-HD-NAPPA arrays were expressed for probing against 120 subject samples (Table 1) to identify Mtb antibody binding proteins (
HD-NAPPA—Deconvolution.
Overall, 272 multiplex spots (792 single genes) were found to show differential responses between TB positive and negative subgroups. These 792 genes were printed as single genes on the HD-NAPPA. In addition, those of the initial 96 individual Mtb genes (from unpublished prior studies and the literature) that were not among the 792 genes (62 genes overlapped) were included together with the controls, resulting in a final 870 single-gene HD-NAPPA, of which 8 subarrays fitted on each slide. To identify the specific protein targets, the same subject samples used for the M-HD-NAPPA screen were tested and the slides were processed as described earlier (
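The bookkeeping behind this deconvolution step can be sketched in Python as follows: every gene from a reactive multiplex spot is listed once, and additional genes of interest are appended only if they are not already present. The function name, the well identifiers and the placeholder gene names are hypothetical and serve only to illustrate the de-duplication.

```python
from typing import Dict, Iterable, List


def single_gene_list(well_map: Dict[str, List[str]],
                     reactive_wells: Iterable[str],
                     extra_genes: Iterable[str] = ()) -> List[str]:
    """Genes to print individually: members of reactive wells plus extra genes,
    each listed once, in first-seen order."""
    selected: List[str] = []
    seen = set()
    for well in reactive_wells:
        for gene in well_map[well]:
            if gene not in seen:
                seen.add(gene)
                selected.append(gene)
    for gene in extra_genes:
        if gene not in seen:  # overlapping genes are not printed twice
            seen.add(gene)
            selected.append(gene)
    return selected


# Toy layout (invented): three multiplex wells, two of them reactive.
well_map = {"W1": ["geneA", "geneB", "geneC"],
            "W2": ["geneD", "geneE", "geneF"],
            "W3": ["geneG", "geneH"]}
to_print = single_gene_list(well_map, ["W1", "W3"], extra_genes=["geneB", "geneI"])
```

In this toy case geneB is already covered by a reactive well, so only geneI is added from the extra set.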
HD-NAPPA—Validation.
Individual HD-NAPPA arrays as described above were created for deconvolution as well as validation with biologically independent sample sets (n=124;
RAPID ELISA.
Rapid antigenic protein in situ display (RAPID) ELISA was used as described to verify the selected candidate proteins according to the three criteria described above (
HD-NAPPA—Validation.
Each array image was visually inspected spot by spot to exclude artifacts. The data were median normalized, and the sensitivity and specificity were calculated at a cutoff of 1.4. The odds ratio of a positive response was calculated using Firth's penalized likelihood logistic regression. Finally, the area under the receiver operating characteristic (ROC) curve (AUC), a measure of marker performance across a range of cutoff values, was calculated. The AUC threshold was set at 0.55, which identified the antigens likely to be positive in the TB groups. Only those genes that passed both deconvolution and validation with the second set of samples were taken as possible biomarker candidates.
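The per-marker statistics named above can be illustrated with a minimal Python sketch. Several points are assumptions rather than details from the disclosure: normalization is taken to mean division by the array median, a sample is called reactive when its value exceeds the cutoff, and the odds ratio uses a simple 0.5 continuity correction as a stand-in for Firth's penalized likelihood logistic regression, which is a different and more involved estimator.

```python
from statistics import median
from typing import List, Sequence, Tuple

CUTOFF = 1.4  # cutoff on median-normalized signal, as stated in the text


def median_normalize(raw: Sequence[float]) -> List[float]:
    """Divide every spot intensity by the array median (assumed scheme)."""
    m = median(raw)
    if m <= 0:
        raise ValueError("array median must be positive")
    return [v / m for v in raw]


def sens_spec(pos: Sequence[float], neg: Sequence[float],
              cutoff: float = CUTOFF) -> Tuple[float, float]:
    """Sensitivity and specificity, calling a sample reactive when value > cutoff."""
    sens = sum(v > cutoff for v in pos) / len(pos)
    spec = sum(v <= cutoff for v in neg) / len(neg)
    return sens, spec


def auc(pos: Sequence[float], neg: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly (ties 0.5)."""
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def corrected_odds_ratio(pos: Sequence[float], neg: Sequence[float],
                         cutoff: float = CUTOFF) -> float:
    """Odds ratio with a 0.5 continuity correction; NOT Firth's method."""
    a = sum(v > cutoff for v in pos)  # reactive TB-positive samples
    b = len(pos) - a                  # non-reactive TB-positive samples
    c = sum(v > cutoff for v in neg)  # reactive TB-negative samples
    d = len(neg) - c                  # non-reactive TB-negative samples
    return ((a + 0.5) * (d + 0.5)) / ((b + 0.5) * (c + 0.5))
```

The pairwise AUC used here equals the Mann-Whitney statistic divided by the number of positive/negative pairs, which is why no explicit ROC curve needs to be traced.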
Because of the high level of heterogeneity of responses within the subject subcategories, the candidate biomarkers were also analyzed with the deconvolution and validation array data combined. Briefly, the normalized deconvolution and validation data within each subgroup were combined into 4 paired subgroups and processed with the same criteria as the validation array analysis. Those genes with a sensitivity higher than 20%, an odds ratio >1.5 and an AUC value >0.55 in the combined analysis were selected as the biomarkers for ELISA verification testing.
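Applied to precomputed statistics, the three thresholds of the combined analysis amount to a simple filter, sketched below in Python. The record type, the function name and the example values are hypothetical; only the threshold values come from the text.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class MarkerStats:
    gene: str
    sensitivity: float  # fraction, e.g. 0.25 for 25%
    odds_ratio: float
    auc: float


def select_for_elisa(stats: Iterable[MarkerStats], min_sens: float = 0.20,
                     min_or: float = 1.5, min_auc: float = 0.55) -> List[str]:
    """Keep genes that exceed all three thresholds (strict inequalities)."""
    return [s.gene for s in stats
            if s.sensitivity > min_sens and s.odds_ratio > min_or and s.auc > min_auc]


# Invented example values; none of these numbers are results from the study.
candidates = [
    MarkerStats("geneA", 0.31, 2.4, 0.63),
    MarkerStats("geneB", 0.18, 3.0, 0.70),  # fails the sensitivity threshold
    MarkerStats("geneC", 0.25, 1.2, 0.58),  # fails the odds ratio threshold
]
selected = select_for_elisa(candidates)
```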
ELISA.
ROC curve analysis was used to assess the performance of each protein tested via ELISA for discriminating TB positive from TB negative patients in each of the four patient subgroups. The pROC R package was used to conduct the analysis. For each protein, several ROC statistics were measured, including the AUC, the sensitivity at 80% specificity and the specificity at 80% sensitivity. The p value was calculated for the Wilcoxon rank sum test of no difference between the TB positive and TB negative patients. P values were not adjusted for multiple testing and should not be interpreted as strict statistical p values because of the protein selection process and sample re-use. Multiprotein panels were developed to classify TB positive and TB negative patients in each subgroup. The classifier for each subgroup was a logistic regression model. All possible logistic regression models were evaluated using the Bayesian Information Criterion (BIC) to identify the best set of proteins for each subgroup. This analysis was conducted using the bestglm R package and Morgan-Tatar search. For each sample, the fitted (noncalibrated) probability of TB positivity was calculated. A cross-validated probability was also calculated using leave-one-out cross validation. ROC curves were generated using both the fitted and cross-validated probabilities, and ROC statistics were calculated, including the AUC, the specificity at 80% sensitivity and the sensitivity at 80% specificity.
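The panel-selection idea can be illustrated with a minimal pure-Python sketch: every non-empty marker subset is fitted by logistic regression, the subset with the lowest BIC is kept, and leave-one-out probabilities are then computed for that subset. This is not the bestglm or pROC implementation. The function names, the small ridge term and the choice to keep the marker set fixed during cross validation are assumptions of the sketch, and perfectly separable data would need a penalized fit that is not shown here.

```python
import math
from itertools import combinations


def _sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def _solve(a, b):
    # Gaussian elimination with partial pivoting for a small dense system.
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def predict(beta, row):
    # beta[0] is the intercept; row holds the marker values only.
    return _sigmoid(beta[0] + sum(b * v for b, v in zip(beta[1:], row)))


def fit_logistic(rows, labels, ridge=1e-6, max_iter=50):
    # Newton-Raphson fit. The tiny ridge term only keeps the Hessian
    # invertible; it is an assumption of this sketch.
    k = len(rows[0]) + 1
    beta = [0.0] * k
    xs = [[1.0] + list(r) for r in rows]
    for _ in range(max_iter):
        p = [predict(beta, r) for r in rows]
        grad = [sum((y - pi) * x[j] for x, y, pi in zip(xs, labels, p))
                - ridge * beta[j] for j in range(k)]
        hess = [[sum(pi * (1 - pi) * x[i] * x[j] for x, pi in zip(xs, p))
                 + (ridge if i == j else 0.0) for j in range(k)] for i in range(k)]
        step = _solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < 1e-8:
            break
    return beta


def bic(beta, rows, labels):
    # BIC = -2 log-likelihood + (number of parameters) * ln(n).
    eps = 1e-12
    ll = 0.0
    for r, y in zip(rows, labels):
        p = min(max(predict(beta, r), eps), 1 - eps)
        ll += math.log(p) if y else math.log(1 - p)
    return -2.0 * ll + len(beta) * math.log(len(labels))


def _rows(data, subset, idx):
    return [[data[m][i] for m in subset] for i in idx]


def best_subset(data, labels):
    # Exhaustive search over all non-empty marker subsets (2^k - 1 fits).
    n = len(labels)
    best = None
    for size in range(1, len(data) + 1):
        for subset in combinations(sorted(data), size):
            rows = _rows(data, subset, range(n))
            score = bic(fit_logistic(rows, labels), rows, labels)
            if best is None or score < best[0]:
                best = (score, subset)
    return best[1]


def loocv_probabilities(data, labels, subset):
    # Refit without sample i, then predict sample i (marker set held fixed).
    n = len(labels)
    probs = []
    for i in range(n):
        train = [j for j in range(n) if j != i]
        beta = fit_logistic(_rows(data, subset, train), [labels[j] for j in train])
        probs.append(predict(beta, [data[m][i] for m in subset]))
    return probs
```

Here `data` is assumed to be a dict mapping each marker name to a list of ELISA readings, and `labels` a list of 0/1 TB status values in the same sample order. The cross-validated probabilities returned by `loocv_probabilities` can then be fed to any ROC routine to obtain a cross-validated AUC.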
Various features and advantages of the invention are set forth in the following claims.
All references, including publications, patent applications, and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.
Preferred embodiments of this invention are described herein, including the best mode known to the inventors for carrying out the invention. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the invention to be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.
For reasons of completeness, various aspects of the invention are set out in the following numbered clauses, as well as the following claims:
Clause 1. A method of diagnosing a subject as having tuberculosis (TB), the method comprising: performing an assay on a biological sample obtained from a subject; and measuring or detecting at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0831c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28); wherein the measurement or detection of the at least one TB biomarker or fragment thereof indicates that the subject has a TB infection.
Clause 2. The method of clause 1, wherein the biological sample is at least one of whole blood, serum, plasma, urine, saliva, sweat, sputum, semen, mucus, lacrimal fluid, lymph fluid, amniotic fluid, interstitial fluid, lung lavage, cerebrospinal fluid, and feces.
Clause 3. The method of clause 1 or 2, wherein the assay is an immunoassay.
Clause 4. The method of any of clauses 1-3, wherein the immunoassay has a sensitivity of at least 80.0% and a specificity of at least 50.0%.
Clause 5. The method of any of clauses 1-4, wherein the immunoassay has a specificity of at least 80.0% and a sensitivity of at least 50.0%.
Clause 6. The method of any of clauses 1-5, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0831c and Rv2031c (HspX/acr).
Clause 7. The method of any of clauses 1-6, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0054 and Rv0948c.
Clause 8. The method of any of clauses 1-7, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv2853, Rv0054, Rv0831c, Rv3544c, and Rv0222.
Clause 9. The method of any of clauses 1-8, wherein the at least one TB biomarker or fragment thereof is Rv3405c.
Clause 10. The method of any of clauses 1-9, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0948c, Rv2853, Rv3405c, and Rv3544c.
Clause 11. The method of any of clauses 1-10, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0054, Rv2853, and Rv3405c.
Clause 12. A panel of biomarkers for diagnosing a subject as having tuberculosis (TB), the panel comprising: at least one TB biomarker or fragment thereof selected from the group consisting of Rv0054 (ssb), Rv0831c, Rv2031c (HspX/acr), Rv0222 (echA1), Rv0948c, Rv2853 (PE_PGRS48), Rv3405c, and Rv3544c (fadE28); wherein the measurement or detection of the at least one TB biomarker or fragment thereof indicates that the subject has a TB infection.
Clause 13. The panel of clause 12, wherein the at least one TB biomarker or fragment thereof is measured or detected using an immunoassay.
Clause 14. The panel of clause 12 or 13, wherein the immunoassay has a sensitivity of at least 80.0% and a specificity of at least 50.0%.
Clause 15. The panel of any of clauses 12-14, wherein the immunoassay has a specificity of at least 80.0% and a sensitivity of at least 50.0%.
Clause 16. The panel of any of clauses 12-15, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0831c and Rv2031c (HspX/acr).
Clause 17. The panel of any of clauses 12-16, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0054 and Rv0948c.
Clause 18. The panel of any of clauses 12-17, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0054 and Rv0948c.
Clause 19. The panel of any of clauses 12-18, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv2853, Rv0054, Rv0831c, Rv3544c, and Rv0222.
Clause 20. The panel of any of clauses 12-19, wherein the at least one TB biomarker or fragment thereof is Rv3405c.
Clause 21. The panel of any of clauses 12-20, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0948c, Rv2853, Rv3405c, and Rv3544c.
Clause 22. The panel of any of clauses 12-21, wherein the at least one TB biomarker or fragment thereof is selected from the group consisting of Rv0054, Rv2853, and Rv3405c.
This application claims the benefit of U.S. Provisional Patent Application No. 62/592,237, filed on Nov. 29, 2017, the entire content of which is fully incorporated herein by reference.
This invention was made with government support under Federal Grant Nos. R01 AI096213, AI05684, R01 AI117927, and K23 AI067665, awarded by the National Institutes of Health (NIH). The government has certain rights to this invention.
Number | Date | Country
---|---|---
62592237 | Nov 2017 | US