This disclosure relates to biomarkers for prediction and/or diagnosis of sepsis and methods of their use, including methods for treating sepsis in a subject.
Sepsis occurs when the body's response to infection damages its own tissues and organs, leading to shock, multi-organ failure, and potentially death. In the United States, the incidence of severe sepsis is about 300 cases per 100,000 population with a mortality rate estimated between 28 and 50 percent (Ullah et al., Pakistan J. Med. Sci. 32:688-693, 2016). Diagnosis of sepsis is difficult during early stages (e.g., pre-symptomatic phase), which makes it challenging to intervene therapeutically until after the onset of symptoms. Since progression of the condition, once it takes hold, is rapid and often aggressive, effective and early intervention is critical to control the development of sepsis. Among clinical parameters, C-reactive protein (CRP) and procalcitonin (PCT) are established markers to reflect infection or inflammation (Faix, Crit. Rev. Clin. Lab. Sci. 50:23-36, 2013). However, due to the lack of specificity, the use of these markers for sepsis diagnosis is limited. For example, both CRP and PCT levels are elevated in sepsis patients as well as in stress, severe trauma, and surgery patients (Matson et al., Anaesth. Intensive Care 19:182-186, 1991; Yentis et al., Intensive Care Med. 21:602-605, 1995; Aabenhus and Jensen, Prim. Care Respir. J. 20:360-367, 2011; Schuetz et al., BMC Med. 9:107, 2011).
The current gold standard for sepsis diagnosis is positive identification of pathogenic bacteria through culture. Unfortunately, such culture-based methods typically take at least 2-5 days to complete, depending on the causative pathogen and often provide negative or inconclusive results (Kim and Weinstein, Clin. Microbial. Infect. 19:513-520, 2013; Ruiz-Giardin et al., Int. J Infect. Dis. 41:6-10, 2015). Patients are often treated with antibiotics prior to definitive identification of the infective agent, which can result in inappropriate or inadequate treatment. The delay of proper intervention likely contributes to the high sepsis mortality rate.
There is a need to accurately identify patients who have sepsis, are at risk of developing sepsis, or who are in the early or pre-symptomatic phase of sepsis, so that appropriate treatment can be initiated as early as possible.
In some embodiments, disclosed herein are methods of detecting a plurality of markers, such as detecting or measuring expression of a set of at least six genes in a sample from a subject having sepsis, suspected to have sepsis, or at risk of sepsis. The methods include detecting or measuring expression of a set of at least six genes, wherein the set includes CCR1, HLA-DPB1, BATF, C3AR1, ARHGEFI8, and C9orf95; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf103; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CCR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; CD177, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, ARHGEFI8, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, C3AR1, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and SEPHS2; FCER1G, HLA-DPB1, BAIT, C3AR1, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf103; FES, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FES, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; or C3AR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95. In some examples, the expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SEPHS2 is increased compared to a control, and/or expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to a control.
Also provided are methods of diagnosing sepsis in a subject including detecting or measuring expression of the set of at least six genes in a sample from a subject, wherein the subject is diagnosed with sepsis when expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SEPHS2 is increased compared to a control, and/or expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to a control. Further provided are methods of treating a subject with sepsis including detecting or measuring expression of the set of at least six genes in a sample from a subject, wherein expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SEPHS2 is increased compared to a control, and/or expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to a control and administering one or more antibiotics to the subject.
In other embodiments, disclosed herein are methods of detecting a plurality of markers, such as detecting or measuring expression of a set of at least three genes in a sample from a subject having sepsis or suspected to have sepsis. In some examples, the methods include detecting or measuring expression of a set of at least three (such as at least 3, 4, 5, or more) genes, wherein the set includes a set of 3, 4, or 5 genes provided in Table 16, for example, LCN2, SLC2A3, and BMX; LCN2, SLC2A3, and GRB10; LCN2, PFKFB3, and GRB10; LCN2, PFKFB3, and BMX; IL1R2, HK3, and BMX; LCN2, HK3, and BMX; LCN2, HK3, and GRB10; GZMA, HK3, and BMX; FCAR, PFKFB2, and BMX; LCN2, PFKFB3, and IL18R1; LCN2, PFKFB3, GRB10, and ST6GALNAC3; LCN2, SLC2A3, BMX, and LGALS2; IL1R2, SLC2A3, BMX, and TCN1; LCN2, SLC2A3, GRB10, and ST6GALNAC3; FCAR, PFKFB2, BMX, and CEACAM1; IL1R2, HK3, BMX, and CD24; IL1R2, PFKFB3, BMX, and CD24; BMX, SLC2A3, GRB10, and CD24; IL1R2, HK3, BMX, and CEACAM1; GZMA, SLC2A3, BMX, and CD24; LCN2, PFKFB3, GRB10, ST6GALNAC3, and RNASE3; LCN2, PFKFB3, GRB10, RNASE2, and ST6GALNAC3; IL1R2, PFKFB3, GRB10, CD24, and ST6GALNAC3; GZMA, PFKFB3, GRB10, ST6GALNAC3, and CD24; HK3, PFKFB3, GRB10, ST6GALNAC3, and CD24; IL1R2, PFKFB3, GRB10, ST6GALNAC3, and TCN1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and TCN1, LCN2, PFKFB3, GRB10, ST6GALNAC3, and DACH1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and CD24; or SLC2A3, HK3, BMX, SPOCD1, and LGALS2, wherein expression of the three or more genes is altered compared to a control.
In other examples, the methods include detecting or measuring expression of a set of at least six genes (such as 6, 7, 8, or more), wherein the set includes a set of 6, 7, or 8 genes provided in Table 17, for example, RPS6KA3, BCL6, TPM3, GNG10, STOM, and MPO; RPS6KA3, MAVS, TPM3, BCL6, STOM, and MPO; RPS6KA3, BCL6, TPM3, BMX, STOM, and MPO; RPS6KA3, LTF, TPM3, BCL6, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, CD55, STOM, and MPO; RPS6KA3, BCL6, TPM3, GNG10, STOM, and PDGFC; RPS6KA3, LTF, TPM3, BCL6, STOM, and CYP1B1; LTF, BCL6, TPM3, CD55, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, TLR8, STOM, and MPO; RPS6KA3, BCL6, TPM3, HLA-DRA, GNG10, and MPO; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, CYP27A1, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, GAS7, TPM3, BCL6, RRBP1, MPO, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and LGALS2; RPS6KA3, GAS7, TPM3, BCL6, GNG10, MPO, and YOD1; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, LILRA2, CYP27A1, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and YOD1; RPS6KA3, BCL6, TPM3, CD55, STOM, MPO, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LILRA4; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and TCN1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE3; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and NOV; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE2; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and FAM105A; RPS6KA3, BCL6, TPM3, HLA-DRA, RRBP1, MPO, ERO1L, and YOD1; or LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and C14orf101, wherein expression of the six or more genes is altered compared to a control. An additional set of six or more genes that can be used in the disclosed methods includes LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14, wherein expression of the six or more genes is altered compared to a control.
Also provided are methods of diagnosing sepsis in a subject, including detecting or measuring expression of the set of at least three or at least six genes (such as the sets of genes in Tables 15 and 16) or the set of at least six genes including LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 in a sample from a subject, wherein the subject is diagnosed with sepsis when expression of the set of at least three genes or at least six genes is altered compared to a control. Further provided are methods of treating a subject with sepsis including detecting or measuring expression of the set of at least three genes or at least six genes (such as the sets of genes in Tables 15 and 16) or the set of at least six genes including LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 in a sample from a subject, wherein expression is altered compared to a control, and administering one or more antibiotics to the subject. In some examples, the subject does not exhibit symptoms of sepsis (for example, is pre-symptomatic or pre-clinical for sepsis). Thus, in some examples, the methods including predicting risk of developing sepsis in a subject.
Also disclosed herein are solid supports including a solid support (such as an array or bead) including binding agents specific for a set of markers disclosed herein. In some embodiments, the solid support includes at least one probe, primer, and/or antibody specific for a set including each of CCR1, HLA-DPB1, BATF, C3AR1, ARHGEF18, and C9orf95; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf103; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CCR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; CD177, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, C3AR1, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and SEPHS2; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf103; FES, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FES, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; or C3AR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95. In other embodiments, the solid support includes at least one probe, primer, and/or antibody specific for each gene in a set including 3, 4, or 5 genes provided in Table 16, for example, LCN2, SLC2A3, and BMX; LCN2, SLC2A3, and GRB10; LCN2, PFKFB3, and GRB10; LCN2, PFKFB3, and BMX; IL1R2, HK3, and BMX; LCN2, HK3, and BMX; LCN2, HK3, and GRB10; GZMA, HK3, and BMX; FCAR, PFKFB2, and BMX; LCN2, PFKFB3, and IL18R1; LCN2, PFKFB3, GRB10, and ST6GALNAC3; LCN2, SLC2A3, BMX, and LGALS2; IL1R2, SLC2A3, BMX, and TCN1; LCN2, SLC2A3, GRB10, and ST6GALNAC3; FCAR, PFKFB2, BMX, and CEACAM1; IL1R2, HK3, BMX, and CD24; IL1R2, PFKFB3, BMX, and CD24; BMX, SLC2A3, GRB10, and CD24; IL1R2, HK3, BMX, and CEACAM1; GZMA, SLC2A3, BMX, and CD24; LCN2, PFKFB3, GRB10, ST6GALNAC3, and RNASE3; LCN2, PFKFB3, GRB10, RNASE2, and ST6GALNAC3; IL1R2, PFKFB3, GRB10, CD24, and ST6GALNAC3; GZMA, PFKFB3, GRB10, ST6GALNAC3, and CD24; HK3, PFKFB3, GRB10, ST6GALNAC3, and CD24; IL1R2, PFKFB3, GRB10, ST6GALNAC3, and TCN1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and TCN1; LCN2, PFKFB3, GRB10, ST6GALNAC3, and DACH1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and CD24; or SLC2A3, HK3, BMX, SPOCD1, and LGALS2. In other embodiments, the solid support includes at least one probe, primer, and/or antibody specific for a set including each gene in a set including 6, 7, or 8 genes provided in Table 17, for example, RPS6KA3, BCL6, TPM3, GNG10, STOM, and MPO; RPS6KA3, MAVS, TPM3, BCL6, STOM, and MPO; RPS6KA3, BCL6, TPM3, BMX, STOM, and MPO; RPS6KA3, LIT, TPM3, BCL6, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, CD55, STOM, and MPO; RPS6KA3, BCL6, TPM3, GNG10, STOM, and PDGFC; RPS6KA3, LTF, TPM3, BCL6, STOM, and CYP1B1; LTF, BCL6, TPM3, CD55, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, TLR8, STOM, and MPO; RPS6KA3, BCL6, TPM3, HLA-DRA, GNG10, and MPO; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, CYP27A1, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, GAS7, TPM3, BCL6, RRBP1, MPO, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and LGALS2; RPS6KA3, GAS7, TPM3, BCL6, GNG10, MPO, and YOD1; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, LILRA2, CYP27A1, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and YOD1; RPS6KA3, BCL6, TPM3, CD55, STOM, MPO, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LILRA4; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and TCN1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE3; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and NOV; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE2; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and FAM105A; RPS6KA3, BCL6, TPM3, HLA-DRA, RRBP1, MPO, ERO1L, and YOD1; or LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and C14orf101. In other embodiments, the solid support includes at least one probe, primer, and/or antibody specific for a set of genes including each LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14
Kits that include a disclosed set of specific binding agents (such as a set of primers or probes) are also provided. Such a kit can further include other reagents, such as a buffer, such as a hybridization buffer. Such sets of specific binding agents and kits can be used to perform steps of the disclosed methods.
The foregoing and other features of the disclosure will become more apparent from the following detailed description, which proceeds with reference to the accompanying figures.
It is a challenge to develop large multi-feature diagnostic panels for clinical use. Most of the clinical gene panels in current use are relatively small. For example, recently developed influenza and tuberculosis panels each utilize six analytes. Developing large multi-feature panels is expensive and time consuming, including the need to optimize measurements for individual features. In addition, some of the features may show different results due to measurement platform differences. The inventors therefore in some embodiments have reduced the number of features in sepsis diagnosis and/or prognosis panels without sacrificing performance in order to further develop panels for clinical use.
Unless otherwise noted, technical terms are used according to conventional usage. Definitions of common terms in molecular biology may be found in Lewin's Genes X, ed. Krebs et al., Jones and Bartlett Publishers, 2009 (ISBN 0763766321); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Publishers, 1994 (ISBN 0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by Wiley, John & Sons, Inc., 1995 (ISBN 0471186341); and George P. Rédei, Encyclopedic Dictionary of Genetics, Genomics, Proteomics and Informatics, 3rd Edition, Springer, 2008 (ISBN: 1402067534), and other similar references.
Unless otherwise explained, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. The singular terms “a,” “an,” and “the” include plural referents unless the context clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly indicates otherwise. Hence “comprising A or B” means including A, or B, or A and B. It is further to be understood that all base sizes or amino acid sizes, and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are approximate, and are provided for description.
Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety, as are the GenBank Accession numbers (for the sequence present on Nov. 5, 2018). In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
In order to facilitate review of the various embodiments of the disclosure, the following explanations of specific terms are provided:
Altered expression: A difference, such as an increase or decrease, in the conversion of the information encoded in a gene (for example, one or more of those in Tables 5, 6, 11, 15, and/or 16) into messenger RNA, the conversion of mRNA to a protein, or both. In some examples, the difference is relative to a control or reference value (or range of values), such as the average expression value of a group of healthy subjects, such as subjects who do not have sepsis. Detecting altered expression can include measuring a change in gene or protein expression, such as an increase of at least 20%, at least 50%, at least 75%, at least 90%, at least 100%, at least 200%, at least 300% at least 400%, or at least 500%, or a decrease of at least 20%, at least 50%, at least 75%, at least 90%, or at least 95%.
Array: An arrangement of nucleic acids (such as DNA or RNA) or proteins (such as antibodies) in assigned locations on a matrix or substrate. In some examples, the nucleic acid molecules or proteins are attached covalently to the matrix or substrate.
Control: A sample or standard used for comparison with an experimental sample, such as a sample from a healthy subject, for example, a subject who does not have sepsis. In some embodiments, the control is a historical control or standard reference value or range of values (e.g., a previously tested control sample, such as a group of healthy subjects, or group of samples that represent baseline or normal values, such as the level of expression of one or more genes listed in Tables 5, 6, 11, 15, and/or 16). Laboratory standards and values can be set based on a known or determined population value and can be supplied in the format of a graph or table that permits comparison of measured, experimentally determined values.
Detecting or measuring expression: Determining the level expression in either a qualitative, semi-quantitative, or quantitative manner by detection of nucleic acid molecules (e.g., DNA, RNA, and/or mRNA) or proteins. Exemplary methods include microarray analysis, PCR (such as RT-PCR, real-time PCR, or qRT-PCR), Northern blot, Western blot, ELISA, and mass spectrometry.
Sample (or biological sample): A biological specimen containing nucleic acids (for example, DNA, RNA, and/or mRNA), proteins, or combinations thereof, obtained from a subject. Examples include, but are not limited to, peripheral blood, serum, plasma, urine, saliva, tissue biopsy, fine needle aspirate, surgical specimen, and autopsy material. In some examples, a sample includes blood, serum, or plasma.
Sepsis: A condition where a subject has an infection, and the subject's immune response to the infection damages the subject's own tissue(s). As used herein, “sepsis” can refer to sepsis, severe sepsis, or septic shock. Sepsis is typically diagnosed by presence of infection in combination with one or more of altered body temperature (e.g., above 101° F. or below 98.6° F.), increased respiratory rate (e.g., >20 breaths/minute), and increased heart rate (e.g., >90 beats per minute). Severe sepsis includes sepsis and one or more of altered mental state, low blood pressure (e.g., <100 mm Hg systolic pressure), decreased platelet count, respiratory distress, abnormal heart function, and abdominal pain. Septic shock is sepsis or severe sepsis with low blood pressure due to sepsis that does not improve after treatment (such as fluid support). Sepsis, severe sepsis, and septic shock are life-threatening conditions that are usually treated with antibiotics, intravenous fluids, and other supportive measures, such as oxygen, mechanical ventilation and/or dialysis.
Subject: A living multi-cellular vertebrate organism, a category that includes human and non-human mammals. In one example, a subject has or is suspected to have sepsis or is at risk for sepsis. In other example, a subject does not exhibit symptoms of sepsis.
Provided herein are biomarker panels and methods of their use. In some embodiments, the biomarker panels are used in methods of predicting development of sepsis in a subject and/or diagnosing sepsis in a subject. In embodiments, the methods include measuring the expression of three or more (such as 3, 4, 5, 6, 7, 8, 9, 10, or more) genes in a sample from a subject. The methods also include determining whether expression of the three or more genes is altered (such as increased or decreased) compared to a control. In some examples, altered expression of the three or more genes indicates that the subject is predicted to develop or has developed sepsis.
In some embodiments, the methods disclosed herein can predict that a subject will develop sepsis, for example, can diagnose a subject with sepsis when they are pre-symptomatic or pre-clinical (e.g., do not exhibit symptoms of sepsis). In some examples, the methods can diagnose a subject as having sepsis when they do not exhibit symptoms of sepsis, such as at least 1 day (for example, at least 1, 2, 3, 4, 5, or more days) prior to diagnosis of sepsis using current clinical criteria. Thus, in some embodiments, the methods utilize a sample that is from a subject who does not exhibit symptoms of sepsis. Subjects and samples are discussed in greater detail below.
A. ISB6 Panels
Disclosed herein are methods that include measuring expression of six or more (for example, 6, 7, 8, 9, 10, or more) genes in a sample from a subject having or suspected to have sepsis and determining whether expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 5. In other embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 6, for example, CCR1, HLA-DPB1, BATF, C3AR1, ARHGEF18, and C9orf95; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf103; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CCR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; CD177, HILA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; EMR1, HLA-DPB1, BAIT, C3AR1, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, C3AR1, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and SEPHS2; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf103; FES, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FES, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; or C3AR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95.
In some examples, expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SPEHS2 is increased compared to the control. In other particular examples, expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to the control.
Also disclosed herein are methods of diagnosing sepsis in a subject that include measuring expression of six or more (for example, 6, 7, 8, 9, 10, or more) genes in a sample from a subject and diagnosing the subject with sepsis when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 5. In other embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 6, for example, CCR1, HLA-DPB1, BATF, C3AR1, ARHGEF18, and C9orf95; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf103; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9or195; CCR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; CD177, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, C3AR1, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and SEPHS2; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf103; FES, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FES, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; or C3AR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95.
In some examples, the subject is diagnosed with sepsis when expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SPEHS2 is increased compared to the control. In other particular examples, expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to the control.
Also disclosed herein are methods of treating sepsis in a subject that include measuring expression of six or more (for example, 6, 7, 8, 9, 10, or more) genes in a sample from a subject and administering one or more treatments for sepsis (such as administering one or more antibiotics) when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 5. In other embodiments, the set of six or more genes includes or consists of a set of genes provided in Table 6, for example, CCR1, HLA-DPB1, BATF, C3AR1, ARHGEF18, and C9orf95; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf103; CCR1, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CCR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; CD177, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; CD63, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; CD63, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; EMR1, HLA-DPB1, BAIT, C3AR1, MTCH1, and C9orf95; EMR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, C3AR1, MTCH1, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf95; FCER1G, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and SEPHS2; FCER1G, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, FCER1G, MTCH1, and C9orf95; FES, HLA-DPB1, PLAC8, GNA15, ARHGEF18, and C9orf103; FES, HLA-DPB1, PLAC8, GNA15, MTCH1, and C9orf95; FES, HLA-DPB1, BATF, C3AR1, MTCH1, and C9orf95; or C3AR1, HLA-DPB1, BATF, GNA15, MTCH1, and C9orf95. In particular examples, expression of CCR1, C3AR1, C9orf95, BATF, C9orf103, GNA15, CD177, CD63, EMR1, FCER1G, FES, PLAC8, and/or SPEHS2 is increased compared to the control. In other particular examples, expression of HLA-DPB1, ARHGEF18, and/or MTCH1 is decreased compared to the control.
In some embodiments of the methods (such as methods of measuring expression of the six or more genes, methods of diagnosing a subject with sepsis, and/or methods of treating sepsis) further include measuring expression of RPGRIP1 in the sample and determining whether expression of RPGRIP1 is altered compared to a control. In some examples, expression of RPGRIP1 is decreased compared to a control. In other examples, the subject is diagnosed with sepsis when expression of RPGRIP1 is decreased compared to a control.
In other embodiments, the methods further include measuring expression of one or more control genes (such as 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 control genes), such as one or more housekeeping or internal control genes. In some examples, a control gene is expressed at a constant level among different tissues and/or is unaffected by sepsis or an experimental treatment and can be used to normalize patterns of gene expression. Exemplary control or housekeeping genes include GAPDH, β-action, 18S ribosomal RNA, tubulin, BRK1, RNF181, GPR155, SUPT4H1, and FAM74A4.
B. ISB 19-Derived Panels
Disclosed herein are methods that include measuring expression of three or more (for example, 3, 4, 5, 6, or more) genes in a sample from a subject having or suspected to have sepsis or at risk of sepsis and determining whether expression of the three or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the set of three or more genes includes or consists of a three or more genes selected from the set provided in Table 12 as the “ISB 19” panel. In other embodiments, the set of three or more genes includes or consists of a set of 3, 4, or 5 genes provided in Table 16, for example, LCN2, SLC2A3, and BMX; LCN2, SLC2A3, and GRB10; LCN2, PFKFB3, and GRB10; LCN2, PFKFB3, and BMX; IL1R2, HK3, and BMX; LCN2, HK3, and BMX; LCN2, HK3, and GRB10; GZMA, HK3, and BMX; FCAR, PFKFB2, and BMX; LCN2, PFKFB3, and IL18R1; LCN2, PFKFB3, GRB10, and ST6GALNAC3; LCN2, SLC2A3, BMX, and LGALS2; IL1R2, SLC2A3, BMX, and TCN1; LCN2, SLC2A3, GRB10, and ST6GALNAC3; FCAR, PFKFB2, BMX, and CEACAM1; IL1R2, HK3, BMX, and CD24; IL1R2, PFKFB3, BMX, and CD24; BMX, SLC2A3, GRB10, and CD24; IL1R2, HK3, BMX, and CEACAM1; GZMA, SLC2A3, BMX, and CD24; LCN2, PFKFB3, GRB10, ST6GALNAC3, and RNASE3; LCN2, PFKFB3, GRB10, RNASE2, and ST6GALNAC3; IL1R2, PFKFB3, GRB10, CD24, and ST6GALNAC3; GZMA, PFKFB3, GRB10, ST6GALNAC3, and CD24; HK3, PFKFB3, GRB10, ST6GALNAC3, and CD24; IL1R2, PFKFB3, GRB10, ST6GALNAC3, and TCN1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and TCN1; LCN2, PFKFB3, GRB10, ST6GALNAC3, and DACH1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and CD24; or SLC2A3, HK3, BMX, SPOCD1, and LGALS2.
In some examples, expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, expression of one or more of LCN2, SLC2A3, BMX, GRB10, PFKFB3, IL1R2, HK3, FCAR, PFKFB2, IL18R1, ST6GALNAC3, TCN1, CEACAM1, CD24, RNASE3, RNASE2, DACH1 and/or SPOCD1 is increased compared to the control. In other particular examples, expression of GZMA and/or LGALS2 is decreased compared to the control.
Also disclosed herein are methods of diagnosing or predicting sepsis in a subject that include measuring expression of three or more (for example, 3, 4, 5, 6, or more) genes in a sample from a subject and diagnosing the subject with sepsis or predicting risk of sepsis when the expression of the three or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the three or more genes includes or consists of three or more genes selected from the set provided in Table 12 as the “ISB 19” panel. In other embodiments, the three or more genes includes or consists of a set of 3, 4, or 5 genes provided in Table 16, for example, LCN2, SLC2A3, and BMX; LCN2, SLC2A3, and GRB10; LCN2, PFKFB3, and GRB10; LCN2, PFKFB3, and BMX; IL1R2, HK3, and BMX; LCN2, HK3, and BMX; LCN2, HK3, and GRB10; GZMA, HK3, and BMX; FCAR, PFKFB2, and BMX; LCN2, PFKFB3, and IL18R1; LCN2, PFKFB3, GRB10, and ST6GALNAC3; LCN2, SLC2A3, BMX, and LGALS2; IL1R2, SLC2A3, BMX, and TCN1; LCN2, SLC2A3, GRB10, and ST6GALNAC3; FCAR, PFKFB2, BMX, and CEACAM1; IL1R2, HK3, BMX, and CD24; IL1R2, PFKFB3, BMX, and CD24; BMX, SLC2A3, GRB10, and CD24; IL1R2, HK3, BMX, and CEACAM1; GZMA, SLC2A3, BMX, and CD24; LCN2, PFKFB3, GRB10, ST6GALNAC3, and RNASE3; LCN2, PFKFB3, GRB10, RNASE2, and ST6GALNAC3; IL1R2, PFKFB3, GRB10, CD24, and ST6GALNAC3; GZMA, PFKFB3, GRB10, ST6GALNAC3, and CD24; HK3, PFKFB3, GRB10, ST6GALNAC3, and CD24; IL1R2, PFKFB3, GRB10, ST6GALNAC3, and TCN1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and TCN1; LCN2, PFKFB3, GRB10, ST6GALNAC3, and DACH1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and CD24; or SLC2A3, HK3, BMX, SPOCD1, and LGALS2.
In some examples, the subject is diagnosed with sepsis or predicted to develop sepsis when expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, expression of one or more of LCN2, SLC2A3, BMX, GRB10, PFKFB3, IL1R2, HK3, FCAR, PFKFB2, IL18R1, ST6GALNAC3, TCN1, CEACAM1, CD24, RNASE3, RNASE2, DACH1 and/or SPOCD1 is increased compared to the control. In other particular examples, expression of GZMA and/or LGALS2 is decreased compared to the control.
Also disclosed herein are methods of treating sepsis in a subject that include measuring expression of three or more (for example, 3, 4, 5, 6, or more) genes in a sample from a subject and administering one or more treatments for sepsis (such as administering one or more antibiotics) when the expression of the three or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the three or more genes includes or consists of three or more genes selected from the set provided in Table 12 as the “ISB 19” panel. In other embodiments, the three or more genes includes or consists of a set of 3, 4, or 5 genes provided in Table 16, for example, LCN2, SLC2A3, and BMX; LCN2, SLC2A3, and GRB10; LCN2, PFKFB3, and GRB10; LCN2, PFKFB3, and BMX; IL1R2, HK3, and BMX; LCN2, HK3, and BMX; LCN2, HK3, and GRB10; GZMA, HK3, and BMX; FCAR, PFKFB2, and BMX; LCN2, PFKFB3, and IL18R1; LCN2, PFKFB3, GRB10, and ST6GALNAC3; LCN2, SLC2A3, BMX, and LGALS2; IL1R2, SLC2A3, BMX, and TCN1; LCN2, SLC2A3, GRB10, and ST6GALNAC3; FCAR, PFKFB2, BMX, and CEACAM1; IL1R2, HK3, BMX, and CD24; IL1R2, PFKFB3, BMX, and CD24; BMX, SLC2A3, GRB10, and CD24; IL1R2, HK3, BMX, and CEACAM1; GZMA, SLC2A3, BMX, and CD24; LCN2, PFKFB3, GRB10, ST6GALNAC3, and RNASE3; LCN2, PFKFB3, GRB10, RNASE2, and ST6GALNAC3; IL1R2, PFKFB3, GRB10, CD24, and ST6GALNAC3; GZMA, PFKFB3, GRB10, ST6GALNAC3, and CD24; HK3, PFKFB3, GRB10, ST6GALNAC3, and CD24; IL1R2, PFKFB3, GRE 10, ST6GALNAC3, and TCN1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and TCN1; LCN2, PFKFB3, GRB10, ST6GALNAC3, and DACH1; SLC2A3, PFKFB3, GRB10, ST6GALNAC3, and CD24; or SLC2A3, HK3, BMX, SPOCD1, and LGALS2. In particular examples, expression of one or more of LCN2, SLC2A3, BMX, GRB10, PFKFB3, IL1R2, HK3, FCAR, PFKFB2, IL18R1, ST6GALNAC3, TCN1, CEACAM1, CD24, RNASE3, RNASE2, DACH1 and/or SPOCD1 is increased compared to the control. In other particular examples, expression of GZMA and/or LGALS2 is decreased compared to the control. In some embodiments, the subject does not exhibit symptoms of sepsis.
In other embodiments, the methods further include measuring expression of one or more control genes (such as 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 control genes), such as one or more housekeeping or internal control genes. In some examples, a control gene is expressed at a constant level among different tissues and/or is unaffected by sepsis or an experimental treatment and can be used to normalize patterns of gene expression. Exemplary control or housekeeping genes include GAPDH, β-action, 18S ribosomal RNA, tubulin, BRK1, RNF181, GPR155, SUPT4H1, and FAM74A4.
C. ISB63-Derived Panels
Disclosed herein are methods that include measuring expression of six or more (for example, 6, 7, 8, 9, or more) genes in a sample from a subject having or suspected to have sepsis and determining whether expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the six or more genes includes or consists of six or more genes selected from the set provided in Table 12 as the “ISB 63” panel. In other embodiments, the six or more genes includes or consists of a set of 6, 7, or 8 genes provided in Table 17, for example, RPS6KA3, BCL6, TPM3, GNG10, STOM, and MPO; RPS6KA3, MAVS, TPM3, BCL6, STOM, and MPO; RPS6KA3, BCL6, TPM3, BMX, STOM, and MPO; RPS6KA3, LTF, TPM3, BCL6, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, CD55, STOM, and MPO; RPS6KA3, BCL6, TPM3, GNG10, STOM, and PDGFC; RPS6KA3, LTF, TPM3, BCL6, STOM, and CYP1B1; LTF, BCL6, TPM3, CD55, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, TLR8, STOM, and MPO; RPS6KA3, BCL6, TPM3, HLA-DRA, GNG10, and MPO; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, CYP27A1, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, GAS7, TPM3, BCL6, RRBP1, MPO, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and LGALS2; RPS6KA3, GAS7, TPM3, BCL6, GNG10, MPO, and YOD1; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, LILRA2, CYP27A1, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and YOD1; RPS6KA3, BCL6, TPM3, CD55, STOM, MPO, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LILRA4; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and TCN1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE3; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and NOV; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE2; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and FAM105A; RPS6KA3, BCL6, TPM3, HLA-DRA, RRBP1, MPO, ERO1L, and YOD1; or LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and C14orf101.
In some examples, expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, expression of one or more of RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, BMX, LTF, PDGFC, CD55, CYP1B1, TLR8, MLLT1, YOD1, GAS7, RRBP1, LILRA2, IL17RA, LILRA4, TCN1, RNASE3, RNASE2, FAM105A, ERO1L, and/or C14orf101 is increased compared to the control. In other particular examples, expression of one or more of MAVS, HLA-DRA, CYP27A1, LGALS2, and/or NOV is decreased compared to the control.
Also disclosed herein are methods of measuring expression of a modified set of six or more ISB63 genes comprising LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 in a sample from a subject having or suspected to have sepsis and determining whether expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In particular examples, expression of one or more of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 is increased compared to the control.
Also disclosed herein are methods of diagnosing or predicting sepsis in a subject that include measuring expression of six or more (for example, 6, 7, 8, 9, or more) genes in a sample from a subject and diagnosing the subject with sepsis or predicting risk of sepsis when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the six or more genes includes or consists of six or more genes selected from the set provided in Table 12 as the “ISB 63” panel. In other embodiments, the six or more genes includes or consists of a set of 6, 7, or 8 genes provided in Table 17, for example, RPS6KA3, BCL6, TPM3, GNG10, STOM, and MPO; RPS6KA3, MAVS, TPM3, BCL6, STOM, and MPO; RPS6KA3, BCL6, TPM3, BMX, STOM, and MPO; RPS6KA3, LTF, TPM3, BCL6, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, CD55, STOM, and MPO; RPS6KA3, BCL6, TPM3, GNG10, STOM, and PDGFC; RPS6KA3, LTF, TPM3, BCL6, STOM, and CYP1B1; LTF, BCL6, TPM3, CD55, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, TLR8, STOM, and MPO; RPS6KA3, BCL6, TPM3, HLA-DRA, GNG10, and MPO; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, CYP27A1, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, GAS7, TPM3, BCL6, RRBP1, MPO, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and LGALS2; RPS6KA3, GAS7, TPM3, BCL6, GNG10, MPO, and YOD1; LIT, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, LILRA2, CYP27A1, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and YOD1; RPS6KA3, BCL6, TPM3, CD55, STOM, MPO, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LILRA4; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and TCN1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE3; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and NOV; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE2; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and FAM105A; RPS6KA3, BCL6, TPM3, HLA-DRA, RRBP1, MPO, ERO1L, and YOD1; or LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and C14orf101.
In some examples, the subject is diagnosed with sepsis or predicted to develop sepsis when expression of one or more of the genes in the set is increased compared to a control and/or expression of one or more of the genes in the set is decreased compared to a control. In particular examples, the subject is diagnosed with sepsis when expression of one or more of RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, BMX, LTF, PDGFC, CD55, CYP1B1, TLR8, MLLT1, YOD1, GAS7, RRBP1, LILRA2, IL17RA, LILRA4, TCN1, RNASE3, RNASE2, FAM105A, ERO1L, and/or C14orf101 is increased compared to the control. In other particular examples, expression of one or more of MAVS, HLA-DRA, CYP27A1, LGALS2, and/or NOV is decreased compared to the control.
Also disclosed herein are methods of diagnosing or predicting sepsis in a subject that include measuring expression of a modified set of six or more ISB63 genes comprising LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 in a sample from a subject and diagnosing the subject with sepsis or predicting risk of sepsis when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In particular examples, expression of one or more of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 is increased compared to the control.
Also disclosed herein are methods of treating sepsis in a subject that include measuring expression of six or more (for example, 6, 7, 8, 9, or more) genes in a sample from a subject and administering one or more treatments for sepsis (such as administering one or more antibiotics) when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In some embodiments, the six or more genes includes or consists of six or more genes selected from the set provided in Table 12 as the “ISB 63” panel. In other embodiments, the six or more genes includes or consists of a set of 6, 7, or 8 genes provided in Table 17, for example, RPS6KA3, BCL6, TPM3, GNG10, STOM, and MPO; RPS6KA3, MAVS, TPM3, BCL6, STOM, and MPO; RPS6KA3, BCL6, TPM3, BMX, STOM, and MPO; RPS6KA3, LTF, TPM3, BCL6, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, CD55, STOM, and MPO; RPS6KA3, BCL6, TPM3, GNG10, STOM, and PDGFC; RPS6KA3, LTF, TPM3, BCL6, STOM, and CYP1B1; LTF, BCL6, TPM3, CD55, STOM, and PDGFC; RPS6KA3, BCL6, TPM3, TLR8, STOM, and MPO; RPS6KA3, BCL6, TPM3, HLA-DRA, GNG10, and MPO; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, CYP27A1, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, and YOD1; RPS6KA3, GAS7, TPM3, BCL6, RRBP1, MPO, and YOD1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and LGALS2; RPS6KA3, GAS7, TPM3, BCL6, GNG10, MPO, and YOD1; LTF, MLLT1, TPM3, BCL6, GNG1C, PDGFC, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, LILRA2, CYP27A1, and YOD1; RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, MPO, and YOD1; RPS6KA3, BCL6, TPM3, CD55, STOM, MPO, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LGALS2; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and LILRA4; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and IL17RA; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and TCN1; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE3; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and NOV; RPS6KA3, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and RNASE2; LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and FAM105A; RPS6KA3, BCL6, TPM3, HLA-DRA, RRBP1, MPO, ERO1L, and YOD1; or LTF, MLLT1, TPM3, BCL6, GNG10, PDGFC, YOD1, and C14orf101. In particular examples, expression of one or more of RPS6KA3, BCL6, TPM3, GNG10, STOM, MPO, BMX, LTF, PDGFC, CD55, CYP1B1, TLR8, MLLT1, YOD1, GAS7, RRBP1, LILRA2, IL17RA, LILRA4, TCN1, RNASE3, RNASE2, FAM105A, ERO1L, and/or C14orf101 is increased compared to the control. In other particular examples, expression of one or more of MAVS, HLA-DRA, CYP27A1, LGALS2, and/or NOV is decreased compared to the control. In some embodiments, the subject does not exhibit symptoms of sepsis.
Also disclosed herein are methods of treating sepsis in a subject that include measuring expression of a modified set of six or more ISB63 genes comprising LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 in a sample from a subject and administering one or more treatments for sepsis (such as administering one or more antibiotics) when the expression of the six or more genes is altered (for example, increased or decreased) compared to a control. In particular examples, expression of one or more of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 is increased compared to the control.
In other embodiments, the methods further include measuring expression of one or more control genes (such as 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 control genes), such as one or more housekeeping or internal control genes. In some examples, a control gene is expressed at a constant level among different tissues and/or is unaffected by sepsis or an experimental treatment and can be used to normalize patterns of gene expression. Exemplary control or housekeeping genes include GAPDH, β-action, 18S ribosomal RNA, tubulin, BRK1, RNF181, GPR155, SUPT4H1, and FAM74A4.
D. Subjects and Samples
The methods disclosed herein utilize a sample from a subject. In some embodiments, a subject has or is suspected to have sepsis. In other embodiments, a subject does not exhibit symptoms of sepsis (e.g., does not have sepsis and/or has sepsis, but is pre-symptomatic). In some examples, the subject may be at risk for development of sepsis, for example, a subject with an infection (e.g., a bacterial, viral, fungal, or parasitic infection), a subject who has experienced trauma (e.g., wound, injury, or burn), a subject who has undergone a surgical procedure, a subject who has received an organ transplant, or a subject with an invasive device (e.g., a catheter or breathing tube). In some examples, the subject has an infection with Enterococcus, Pseudomonas aeruginosa, Staphylococcus aureus, or Enterobacteriaceae. In additional examples, a subject at risk for sepsis may be younger than 1 year or older than 65, have diabetes, and/or have a compromised immune system (e.g., subjects with AIDS (or HIV positive), subjects with severe combined immunodeficiency, subjects who have had transplants and who are taking immunosuppressants, subjects who have cancer or who are receiving chemotherapy for cancer, subjects who do not have a spleen, subjects with end stage kidney disease, and subjects who have been taking corticosteroids or other immune suppressing therapy). In particular examples, a subject at risk for sepsis includes a subject with pneumonia, an abdominal infection (e.g., peritonitis), a kidney infection or urinary tract infection, a subject who has undergone surgery, and/or a subject who has experienced trauma.
In some examples, the subject has experienced a risk factor for development of sepsis (including but not limited to infection, surgery, or trauma) within 1-14 days of collecting a sample from the subject. For example, one or more samples is collected from a subject 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 days after undergoing surgery or experiencing a trauma. In other examples, one or more samples are collected from a subject prior to a risk factor for developing sepsis (including but not limited to surgery, placement of an invasive device, or immunosuppressant or chemotherapy), such as 1-14 days prior to the risk factor. In some examples, one or more samples are collected from the subject 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 days before the risk factor. Samples may be collected from a subject both before (such as 1-14 days before) and after (such as 1-14 days after) the risk factor, in some examples.
Samples that can be used in the disclosed methods include biological specimens containing nucleic acids (for example, DNA, RNA, and/or mRNA), proteins, or combinations thereof, obtained from a subject. Examples include, but are not limited to, peripheral blood, peripheral blood mononuclear cells (PBMCs), serum, plasma, urine, saliva, sputum, wound secretions, pus, tissue biopsy, fine needle aspirate, surgical specimen, and autopsy material. In particular examples, the sample is peripheral blood from a subject. In some examples, samples are used directly in the methods provided herein. In other examples, samples are manipulated prior to analysis using the disclosed methods, such as through concentrating, filtering, centrifuging, diluting, desalting, denaturing, reducing, alkylating, proteolyzing, or combinations thereof. In some examples, components of the samples (for example, nucleic acids and/or proteins) are isolated or purified from the sample prior to analysis using the disclosed methods, such as isolating cells, proteins, and/or nucleic acid molecules from the samples.
In some embodiments, one or more samples (such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more samples) are collected from a subject. In some examples, multiple samples are collected and tested, for example, to detect changes in gene expression prior to development of symptoms of sepsis. The samples may be collected about every 2 hours, about every 4 hours, about every 8 hours, about every 12 hours, about every 18 hours, about once per day, about every other day, or combinations thereof. The samples may be collected prior to diagnosis with sepsis, after diagnosis with sepsis, or a combination thereof. In some examples, samples are collected from a subject who does not exhibit symptoms of sepsis, such as at least 1 day (for example, at least 1, 2, 3, 4, 5, or more days) prior to diagnosis of sepsis using current clinical criteria. In some examples, one or more samples are collected from a subject after an event that increases their risk for developing sepsis, including but not limited to surgery, trauma or other wound, burn(s), infection (such as pneumonia, meningitis, urinary tract infection, peritonitis, or skin infection), or presence of an invasive device (such as a catheter or ventilator).
E. Detecting Gene Expression
As described herein, expression of the sepsis biomarkers disclosed herein (e.g., one or more panels provided herein, including in any one of Tables 5, 6, 12, 16, and 17) can be measured or detected using any one of a number of methods. Detecting or measuring expression of nucleic acid molecules (e.g., mRNA, cDNA) or protein is contemplated herein. The detection methods can be qualitative, semi-quantitative, or quantitative.
1. Methods for Detecting mRNA
Gene expression can be evaluated by detecting RNA (e.g., mRNA or cDNA) encoding the genes of interest. Thus, the disclosed methods can include evaluating mRNA(s) encoding the genes or sets of genes provided herein (including in any one of Tables 5, 6, 12, 16, and 17). In other examples, mRNA encoding the gene(s) of interest is reverse transcribed to cDNA and the cDNA is measured or detected. In some examples, the mRNA (or cDNA) is quantified. The amount of the mRNA (or cDNA) can be assessed in a sample from a subject and optionally in a control sample (such as a sample from a healthy subject). The amounts of mRNA (or cDNA) can be compared to levels of the mRNA (or cDNA) found in sample(s) from healthy subjects or other controls (such as a standard value or reference value). A significant increase or decrease in the amount can be evaluated using statistical methods. An alteration in the amount of the mRNA or cDNA in a sample from the subject relative to a control, such as an increase or decrease in expression, indicates whether the subject has sepsis or is likely to develop sepsis, as described herein.
RNA can be isolated from a sample from a subject, for example using commercially available kits, such as those from QIAGEN®. General methods for mRNA extraction are disclosed in, for example, including Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997). In some examples, RNA can also be extracted from paraffin embedded tissues (e.g., see Rupp and Locker, Lab Invest. 56:A67, 1987 and De Andres et al., BioTechniques 18:42044, 1995). Total RNA from cells (such as those obtained from a subject) can be isolated using QIAGEN® RNeasy mini-columns. Other commercially available RNA isolation kits include MASTERPURE® Complete DNA and RNA Purification Kit (EPICENTRE® Madison, Wis.), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from biological samples can also be isolated, for example, by cesium chloride density gradient centrifugation.
Methods of measuring or detecting gene expression include methods based on hybridization analysis of polynucleotides, methods based on sequencing of polynucleotides, and proteomics-based methods. In some examples, mRNA (or cDNA) expression in a sample is quantified using northern blotting or in situ hybridization (Parker & Barnes, Methods in Molecular Biology 106:247-283, 1999); RNAse protection assays (Hod, Biotechniques 13:852-4, 1992); and/or PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR) (Weis et al., Trends in Genetics 8:263-4, 1992). Alternatively, antibodies can be employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for sequencing-based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene expression analysis by massively parallel signature sequencing (MPSS).
In one example, RT-PCR can be used. Generally, the first step in gene expression profiling by RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its exponential amplification in a PCR reaction. Two commonly used reverse transcriptases are avian myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the goal of expression profiling. For example, extracted RNA can be reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, Calif, USA), or other commercially available kits, following the manufacturer's instructions. The derived cDNA can then be used as a template in the subsequent PCR reaction.
Although the PCR step can use a variety of thermostable DNA-dependent DNA polymerases, it typically employs the Taq DNA polymerase. TaqMan® PCR typically utilizes the 5′-nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its target amplicon, but any enzyme with equivalent 5′ nuclease activity can be used. Two oligonucleotide primers are used to generate an amplicon typical of a PCR reaction. A third oligonucleotide, or probe, is designed to detect nucleotide sequence located between the two PCR primers. The probe is not extendable by Taq DNA polymerase enzyme, and is labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye is quenched by the quenching dye when the two dyes are located close together as they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resultant probe fragments dissociate in solution, and signal from the released reporter dye is free from the quenching effect of the second fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative interpretation of the data.
To minimize errors and the effect of sample-to-sample variation, RT-PCR can be performed using an internal standard. The ideal internal standard is expressed at a constant level among different tissues, and/or is unaffected by the experimental treatment. RNAs commonly used to normalize patterns of gene expression are mRNAs for the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH), beta-actin, tubulin, and 18S ribosomal RNA. Additional internal control genes include BRK1, RNF181, GPR155, SUPT4H1, and FAM74A4
A variation of RT-PCR is real time quantitative RT-PCR (qRT-PCR), which measures PCR product accumulation through a dual-labeled fluorogenic probe (e.g., TAQMAN® probe). Real time PCR is compatible both with quantitative competitive PCR, where internal competitor for each target sequence is used for normalization, and with quantitative comparative PCR using a normalization gene contained within the sample, or a housekeeping gene for RT-PCR (see Held et al., Genome Research 6:986 994, 1996). Quantitative PCR is also described in U.S. Pat. No. 5,538,848. Related probes and quantitative amplification procedures are described in U.S. Pat. Nos. 5,716,784 and 5,723,591. Instruments for carrying out quantitative PCR in microtiter plates are available from PE Applied Biosystems, 850 Lincoln Centre Drive, Foster City, Calif. 94404 under the trademark ABI PRISM® 7700.
An alternative quantitative nucleic acid amplification procedure is described in U.S. Pat. No. 5,219,727. In this procedure, the amount of a target sequence in a sample is determined by simultaneously amplifying the target sequence and an internal standard nucleic acid segment. The amount of amplified DNA from each segment is determined and compared to a standard curve to determine the amount of the target nucleic acid segment that was present in the sample prior to amplification.
In some embodiments of this method, the expression of a housekeeping gene or internal control can also be evaluated. These terms include any constitutively or globally expressed gene whose presence enables an assessment of a sepsis-associated mRNA levels. Such an assessment includes a determination of the overall constitutive level of gene transcription and a control for variations in RNA recovery. Internal control (or internal reference) also refers to genes that show little or minimal change between different conditions (such as presence or absence of sepsis). In one example, an internal reference gene is one that shows <1.1-fold change and p-value >0.05 between sepsis and control (no sepsis) samples. Exemplary housekeeping genes include but are not limited to GAPDH, 18S ribosomal RNA, β-actin and tubulin. Exemplary internal control genes include but are not limited to BRK1, RNF181, GPR155, SUPT4H1, and FAM74A4. In particular examples, BRK1 and RNF181 are used as internal controls.
In other examples, gene expression is identified or confirmed using a microarray technique. Thus, the expression profile can be measured in a sample from a subject using microarray technology. In this method, nucleic acid sequences from one or more of the sepsis panels disclosed herein, including but not limited to those included in any one of Tables 5, 6, 12, 16, and 17 (including cDNAs and/or oligonucleotides) are plated, or arrayed, on a microchip substrate. The arrayed sequences are then hybridized with specific DNA probes from cells or tissues of interest. Just as in the RT-PCR method, the source of mRNA typically is total RNA isolated from a sample from a subject, and optionally from corresponding non-sepsis (e.g., healthy) samples.
In a specific embodiment of the microarray technique, PCR amplified inserts of cDNA clones are applied to a substrate in a dense array. At least probes specific for one or more panels disclosed herein, such as those in any one of Tables 5, 6, 12, 16, and 17 (and in some examples one or more housekeeping and/or internal control genes) are applied to the substrate, and the array can include, consist essentially of, or consist of these nucleic acids. The microarrayed nucleic acids are suitable for hybridization under stringent conditions. Fluorescently labeled cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse transcription of RNA extracted from a sample of interest (such as a sample from a subject with sepsis, suspected to have sepsis, or at risk of sepsis). Labeled cDNA probes applied to the chip hybridize with specificity to each spot of DNA on the array. After stringent washing to remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by another detection method. Quantitation of hybridization of each arrayed element allows for assessment of corresponding mRNA abundance. With dual color fluorescence, separately labeled cDNA probes generated from two sources of RNA are hybridized pairwise to the array. The relative abundance of the transcripts from the two sources corresponding to each specified gene is thus determined simultaneously. The miniaturized scale of the hybridization affords a convenient and rapid evaluation of the expression pattern for the panels disclosed herein. Microarray analysis can be performed by commercially available equipment, following manufacturer's protocols.
Serial analysis of gene expression (SAGE) allows the simultaneous and quantitative analysis of a large number of gene transcripts, without the need of providing an individual hybridization probe for each transcript. First, a short sequence tag (about 10-14 base pairs) is generated that contains sufficient information to uniquely identify a transcript, provided that the tag is obtained from a unique position within each transcript. Then, many transcripts are linked together to form long serial molecules, that can be sequenced, revealing the identity of the multiple tags simultaneously. The expression pattern of any population of transcripts can be quantitatively evaluated by determining the abundance of individual tags, and identifying the gene corresponding to each tag (see, for example, Velculescu et al., Science 270:484-7, 1995; and Velculescu et al., Cell 88:243-51, 1997, herein incorporated by reference).
In situ hybridization (ISH) is another method for detecting and comparing expression of the panels described herein, including but not limited to those disclosed in any one of Tables 5, 6, 12, 16, and 17. ISH applies and extrapolates the technology of nucleic acid hybridization to the single cell level, and, in combination with the art of cytochemistry, immunocytochemistry and immunohistochemistry, permits the maintenance of morphology and the identification of cellular markers to be maintained and identified, and allows the localization of sequences to specific cells within populations, such as tissues and blood samples. ISH is a type of hybridization that uses a complementary nucleic acid to localize one or more specific nucleic acid sequences in a portion or section of tissue (in situ), or, if the tissue is small enough, in the entire tissue (whole mount ISH).
Sample cells or tissues can be treated to increase their permeability to allow one or more gene-specific probes to enter the cells. The one or more probes are added to the treated cells, allowed to hybridize at pertinent temperature, and excess probe is washed away. The probe can be labeled, for example with a radioactive, fluorescent or antigenic tag, so that the probe's location and in some examples quantity, in the tissue can be determined, for example using autoradiography, fluorescence microscopy or immunoassay. Probes can be designed based on the known sequences of the genes (such as the GenBank accession numbers provided herein) such that the probes specifically bind the gene of interest.
In situ PCR is the PCR based amplification of the target nucleic acid sequences prior to ISH. For detection of RNA, an intracellular reverse transcription step is introduced to generate complementary DNA from RNA templates prior to in situ PCR. This enables detection of low copy RNA sequences.
Prior to in situ PCR, cells or tissue samples can be fixed and permeabilized to preserve morphology and permit access of the PCR reagents to the intracellular sequences to be amplified. PCR amplification of target sequences is next performed either in intact cells held in suspension or directly in cytocentrifuge preparations or tissue sections on glass slides. In the former approach, fixed cells suspended in the PCR reaction mixture are thermally cycled using conventional thermal cyders. After PCR, the cells are cytocentrifuged onto glass slides with visualization of intracellular PCR products by ISH or immunohistochemistry. In situ PCR on glass slides is performed by overlaying the samples with the PCR mixture under a coverslip which is then sealed to prevent evaporation of the reaction mixture. Thermal cycling is achieved by placing the glass slides either directly on top of the heating block of a conventional or specially designed thermal cycler or by using thermal cycling ovens.
Detection of intracellular PCR products can be achieved by ISH with PCR-product specific probes, or direct in situ PCR without ISH through direct detection of labeled nucleotides (such as digoxigenin-11-dUTP, fluorescein-dUTP, 3H-CTP or biotin-16-dUTP), which have been incorporated into the PCR products during thermal cycling.
Gene expression can also be detected and quantitated using the nCounter® technology developed by NanoString (Seattle, Wash.; see, for example, U.S. Pat. Nos. 7,473,767; 7,919,237; and 9,371,563, which are herein incorporated by reference in their entirety). The nCounter® analysis system utilizes a digital color-coded barcode technology that is based on direct multiplexed measurement of gene expression. The technology uses molecular “barcodes” and single molecule imaging to detect and count hundreds of unique transcripts in a single reaction. Each color-coded barcode is attached to a single target-specific probe corresponding to a gene of interest. Mixed together with controls, they form a multiplexed CodeSet.
Each color-coded barcode represents a single target molecule. Barcodes hybridize directly to target molecules and can be individually counted without the need for amplification. The method includes three steps: (1) hybridization; (2) purification and immobilization; and (3) counting. The technology employs two approximately 50 base probes per mRNA that hybridize in solution. The reporter probe carries the signal; the capture probe allows the complex to be immobilized for data collection. After hybridization, the excess probes are removed and the probe/target complexes are aligned and immobilized in the nCounter® cartridge. Sample cartridges are placed in the digital analyzer for data collection. Color codes on the surface of the cartridge are counted and tabulated for each target molecule. This method is described in, for example, U.S. Pat. No. 7,919,237; and U.S. Patent Application Publication Nos. 20100015607; 20100112710; 20130017971, which are herein incorporated by reference in their entirety.
2. Arrays for Profiling Gene Expression
In particular embodiments, arrays (such as a solid support, for example a multi-well plate, a membrane, a bead, or flow cell) are provided that can be used to evaluate gene expression, for example to determine if a subject has or is likely to develop symptoms of sepsis. Such arrays can include a set of specific binding agents (such as nucleic acid probes and/or primers) specific for genes of one or more panels described herein, including but not limited to those in any one of Tables 5, 6, 12, 16, and 17. When describing an array that consists essentially of probes or primers specific for a panel provided herein, including those in any one of Tables 5, 6, 12, 16, and 17, such an array includes probes or primers specific for the genes of the panel, and can further include control probes or primers, such as 1-10 control probes or primers (for example to confirm the incubation conditions are sufficient). In some examples, the array may further comprise additional, such as 1, 2, 3, 4 or 5 additional probes for other genes. In some examples, the array includes 1-10 housekeeping- and/or internal control-specific probes or primers. In one example, an array is a multi-well plate (e.g., 98 or 364 well plate).
In one example, the array includes, consists essentially of, or consists of probes or primers (such as an oligonucleotide or antibody) that can recognize the genes in the panels listed in any one of Tables 5, 6, 12, 16, and 17 (and in some examples also 1-10 housekeeping or control genes). In another example, the array includes, consists essentially of, or consists of probes or primers (such as an oligonucleotide or antibody) that can recognize each of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14 (and in some examples also 1-10 housekeeping or control genes). The oligonucleotide probes or primers can further include one or more detectable labels, to permit detection of hybridization signals between the probe and its target sequence.
3. Methods for Detecting Protein Expression
In some examples, expression of proteins of the panels provided herein, including but not limited to those listed in any one of Tables 5, 6, 12, 16, and 17 is analyzed. Suitable biological samples include samples containing protein obtained from a subject with sepsis, a subject suspected to have sepsis, or a subject at risk for sepsis. An alteration in the amount of the proteins in a sample from the subject relative to a control, such as an increase or decrease in protein expression, indicates whether the subject has sepsis or is likely to develop sepsis, as described herein.
Antibodies specific for the panels of proteins provided herein, including but not limited to those listed in any one of Tables 5, 6, 12, 16, and 17 can be used for protein detection and quantification, for example using an immunoassay method, such as those presented in Harlow and Lane (Antibodies, A Laboratory Manual, CSHL, New York, 1988). Exemplary immunoassay formats include ELISA, Western blot, and RIA assays. Thus, protein levels in a sample can be evaluated using these methods. Immunohistochemical techniques can also be utilized protein detection and quantification. General guidance regarding such techniques can be found in Bancroft and Stevens (Theory and Practice of Histological Techniques, Churchill Livingstone, 1982) and Ausubel et al, (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1998).
To quantify proteins, a biological sample of the subject that includes cellular proteins can be used. Quantification of the proteins of the panels provided herein, including but not limited to those listed in any one of Tables 5, 6, 12, 16, and 17 can be achieved by immunoassay methods. The amount of the proteins can be assessed in a sample from a subject and optionally in a control sample (such as a sample from a healthy subject). The amounts of protein can be compared to levels of the protein found in sample(s) from healthy subjects or other controls (such as a standard value or reference value). A significant increase or decrease in the amount can be evaluated using statistical methods.
Quantitative spectroscopic approaches, such as SELDI, can be used to analyze gene expression in a sample (such as a sample from a subject with sepsis or suspected to have or develop sepsis). In one example, surface-enhanced laser desorption-ionization time-of-flight (SELDI-TOF) mass spectrometry is used to detect protein expression, for example by using the ProteinChip™ (Ciphergen Biosystems, Palo Alto, Calif.). Such methods are well known in the art (for example see U.S. Pat. Nos. 5,719,060; 6,897,072; and 6,881,586). SELDI is a solid phase method for desorption in which the analyte is presented to the energy stream on a surface that enhances analyte capture or desorption.
The surface chemistry allows the bound analytes to be retained and unbound materials to be washed away. Subsequently, analytes bound to the surface (such as tumor-associated proteins) can be desorbed and analyzed by any of several means, for example using mass spectrometry. When the analyte is ionized in the process of desorption, such as in laser desorption/ionization mass spectrometry, the detector can be an ion detector. Mass spectrometers generally include means for determining the time-of-flight of desorbed ions. This information is converted to mass. However, one need not determine the mass of desorbed ions to resolve and detect them: the fact that ionized analytes strike the detector at different times provides detection and resolution of them. Alternatively, the analyte can be detectably labeled (for example with a fluorophore or radioactive isotope). In these cases, the detector can be a fluorescence or radioactivity detector. A plurality of detection means can be implemented in series to fully interrogate the analyte components and function associated with retained molecules at each location in the array.
Therefore, in a particular example, the chromatographic surface includes antibodies that specifically bind to proteins of the panels listed in any one of Tables 5, 6, 11, 15, or 16. In other examples, the chromatographic surface consists essentially of, or consists of, antibodies that specifically bind to proteins of the panels listed in any one of Tables 5, 6, 11, 15, or 16. In further examples, the chromatographic surface includes antibodies that specifically bind to proteins LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14. In some examples, the chromatographic surface includes antibodies that bind other molecules, such as housekeeping proteins (e.g., tubulin, β-actin, GAPDH, or 18S ribosomal RNA) or internal control proteins (e.g., BRK1, RNF181, GPR155, SUPT4H1, or FAM74A4). In another example, antibodies are immobilized onto the surface using a bacterial Fc binding support.
The chromatographic surface is incubated with a sample. The antigens present in the sample can recognize the antibodies on the chromatographic surface. The unbound proteins and mass spectrometric interfering compounds are washed away and the proteins that are retained on the chromatographic surface are analyzed and detected by SELDI-TOF. The MS profile from the sample can be then compared using differential protein expression mapping, whereby relative expression levels of proteins at specific molecular weights are compared by a variety of statistical techniques and bioinformatic software systems.
F. Controls
The methods disclosed herein include determining expression of one or more genes (such as a panel of genes provided herein, including but not limited to those listed in any one of Tables 5, 6, 11, 15, or 16) is altered compared to a control. In some examples the expression is compared to a control, such as compared to expression in a sample from a subject known not to have sepsis, for example, a healthy subject (or compared to a reference value or range of values representing such).
The control can be any suitable control against which to compare expression of any one of the panels of genes disclosed herein (including in any one of Tables 5, 6, 11, 15, or 16) in a sample from a subject. In some embodiments, the control sample is a sample, or plurality of samples, from a subject(s) known not to have sepsis (e.g., one or more “healthy” subjects). In other examples, the control sample is a sample, or plurality of samples, from a subject(s) known to have sepsis (in which case the increase or decrease in expression correlation to sepsis is reversed). In some embodiments, the control is a reference value. For example, the reference value can be derived from the average expression values obtained from a group of subjects known not to have (or known to have) sepsis.
An increase in expression includes any detectable increase in the production of a gene product, for example, compared to a control. In certain examples, production of a gene product increases by at least 20%, at least 50%, at least 75%, at least 90%, at least 2-fold, at least 3-fold or at least 4-fold, as compared to a control (such as a sample from a subject known not to have sepsis or compared to a reference value or range of values representing such). In one example, a control is a relative amount of gene expression in a biological sample, such as a sample from a subject known not to have sepsis. In some examples, the control is a reference value (or range of values) for an expected amount of expression of each of the panel of genes from a subject known not to have sepsis. In some examples, the control is a sample from a subject known not to have sepsis (which can be analyzed in parallel with a test sample).
A decrease in expression includes any detectable decrease in the production of a gene product, for example, compared to a control. In certain examples, production of a gene product decreases by at least 20%, at least 50%, at least 75%, at least 90%, at least 2-fold, at least 3-fold or at least 4-fold, as compared to a control (such as a sample from a subject known not to have sepsis or compared to a reference value or range of values representing such). In one example, a control is a relative amount of gene expression in a biological sample, such as a sample from a subject known not to have sepsis. In some examples, the control is a reference value (or range of values) for an expected amount of expression of each of the panel of genes from a subject known not to have sepsis. In some examples, the control is a sample from a subject known not to have sepsis (which can be analyzed in parallel with a test sample).
G. Additional Features
In some embodiments, the methods provided herein further include analyzing one or more additional clinical parameters and integrating the information with the panels disclosed herein (such as one or more panels provided herein, including those in any one of Tables 5, 6, 11, 15, or 16). In some examples, the additional clinical parameters alone do not provide good performance for diagnosing or predicting sepsis, however, they can provide improved performance in combination with the panels disclosed herein.
In one example, the methods include determining a Sequential Organ Failure Assessment (SOFA) score for the subject. The SOFA score is a mortality prediction score measurement used to determine the extent of organ failure in a subject based on degree of function/dysfunction of six organ systems. It includes PaO2, FiO2, platelet count, Glasgow coma scale, bilirubin level, mean arterial pressure and administration of vasoactive agents, creatinine level, and urine output to generate a score (Vincent et al., Crit. Care Med. 26, 1793-1800, 1998; Ferreira et al., JAMA 286:1754-1758, 2001). Thus, in some examples, one or more SOFA scores are obtained for a subject having sepsis, suspected to have sepsis, or at risk for sepsis (for example, one or more times per day). The SOFA score is then integrated with the gene panel to determine whether the subject has or is likely to develop sepsis.
In another example, the level of C-reactive protein (CRP) is determined in a sample from the subject. CRP level is commonly used as a non-specific indicator of infection in a subject. Thus, in some examples, a level of CRP is measured in a sample from a subject having sepsis, suspected to have sepsis, or at risk for sepsis (for example, one or more times per day). The CRP level is then integrated with the gene panel to determine whether the subject has or is likely to develop sepsis.
In some examples, both a SOFA score and level of CRP are measured or determined for a subject having sepsis, suspected to have sepsis, or at risk for sepsis (for example, one or more times per day). The SOFA score and CRP level are then integrated with the gene panel to determine whether the subject has or is likely to develop sepsis.
Also provided are kits including sets of specific binding agents, such as sets of nucleic acid probes, nucleic acid primers, and/or antibodies (or antibody fragments) specific for each of the genes in one or more panels provided herein (including those listed in any one of Tables 5, 6, 11, 15, or 16). In some examples, a kit includes a nucleic acid probe specific for each of the genes in one or more panels listed in any one of Tables 5, 6, 11, 15, or 16, one or more nucleic acid primers (e.g., 1, 2, 3, 4, or more primers) specific for each of the genes in one or more panels listed in any one of Tables 5, 6, 11, 15, or 16, an antibody specific for proteins encoded by each of the genes in one or more panels listed in any one of Tables 5, 6, 11, 15, or 16, or combinations thereof. In other examples, a kit includes a nucleic acid probe specific for each of the LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14; one or more nucleic acid primers (e.g., 1, 2, 3, 4, or more primers) specific for each of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14; an antibody specific for proteins encoded by each of LCN2, BMX, MPO, RPS6KA3, BCL6, and MAPK14; or combinations thereof. Such probes, primers, and/or antibodies can be in vials, such as a glass or plastic container, or attached or conjugated to an array (e.g., a solid substrate). In some examples, the probes, primers, and/or antibodies are in a carrier, such as a buffer (e.g., saline). In some examples, such sets further include a nucleic acid probe, one or more nucleic acid primers, or an antibody, specific for at least one housekeeping and/or internal control molecule, such as 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10, or about 1 to about 3, or about 1 to about 5, housekeeping and/or internal control molecules (e.g., β-actin, GAPDH, tubulin, BRK1, RNF181, GPR155, SUPT4H1, or FAM74A4). Such kits can include other components, such as a buffer (e.g., hybridization buffer), enzyme(s), and/or one or more detection reagents.
In some embodiments, the kit includes at least one surface with at least one nucleic acid probe, nucleic acid primer, and/or antibody immobilized on the surface (for example on an array, bead, or flow cell). The disclosed kits can include at least one surface with at least one nucleic acid probe, nucleic acid primer, and/or antibody immobilized on the surface in an addressable manner. Some of the surfaces (or substrates) which can be used in the disclosed kits (or methods) are readily available from commercial suppliers. In some embodiments, the surface is a 96-, 384-, or 1536-well microliter plate, such as modified plates sold by Corning Costar or BD Biosciences (for example, gamma-irradiated plates). In other embodiments, a substrate includes one or more beads (such as a population of beads that can be differentiated by size or color, for example by flow cytometry). In some embodiments, a substrate includes a flow cell (such as a flow cell or a microfluidic ship with a plurality of channels). Alternatively, a surface comprising wells which, in turn, comprise indentations or “dimples” can be formed by micromachining a substance such as aluminum or steel to prepare a mold, then microinjecting plastic or a similar material into the mold to form a structure. Alternatively, a structure comprised of glass, plastic, ceramic, or the like, can be assembled. In some examples, the base is a flat piece of material (for example glass or plastic), in, for example, the shape of the lower portion of a typical microplate used for a biochemical assay. The top surface of the base can be either flat or formed with indentations that will align with a subdivider to provide full subdivisions, or wells, within each sample well. The pieces can be joined by standard procedures, for example the procedures used in the assembly of silicon wafers.
A wide variety of array formats for arrangement of the nucleic acid probes, nucleic acid primers, and/or antibodies can be employed. One suitable format includes a two-dimensional pattern of discrete cells (such as 4096 squares in a 64 by 64 array). Other suitable array formats include but are not limited to slot (rectangular) and circular arrays (see U.S. Pat. No. 5,981,185). In some examples, the array is a multi-well plate. In one example, the array is formed on a polymer medium, which is a thread, membrane or film. An example of an organic polymer medium is a polypropylene sheet having a thickness on the order of about 1 mil. (0.001 inch) to about 20 mil., although the thickness of the film is not critical and can be varied over a fairly broad range. The array can include biaxially oriented polypropylene (BOPP) films, which in addition to their durability, exhibit low background fluorescence. In another example, a surface activated organic polymer is used as the solid support surface. One example of a surface activated organic polymer is a polypropylene material aminated via radio frequency plasma discharge. Other reactive groups can also be used, such as carboxylated, hydroxylated, thiolated, or active ester groups.
The solid support of the array can be formed from an organic polymer. Suitable materials for the solid support include, but are not limited to: polypropylene, polyethylene, polybutylene, polyisobutylene, polybutadiene, polyisoprene, polyvinylpyrrolidine, polytetrafluroethylene, polyvinylidene difluoride, polyfluoroethylene-propylene, polyethylenevinyl alcohol, polymethylpentene, polycholorotrifluoroethylene, polysulfones, hydroxylated biaxially oriented polypropylene, aminated biaxially oriented polypropylene, thiolated biaxially oriented polypropylene, ethyleneacrylic acid, ethylene methacrylic acid, and blends of copolymers thereof (see U.S. Pat. No. 5,985,567).
In general, suitable characteristics of the material that can be used to form the solid support surface include: being amenable to surface activation such that upon activation, the surface of the support is capable of covalently attaching a biomolecule such as an oligonucleotide or antibody thereto; amenability to in situ synthesis of biomolecules; being chemically inert such that at the areas on the support not occupied by the oligonucleotides or proteins (such as antibodies) are not amenable to non-specific binding, or when non-specific binding occurs, such materials can be readily removed from the surface without removing the oligonucleotides or proteins (such as antibodies).
The array formats of the present disclosure can be included in a variety of different types of formats. A “format” includes any format to which the solid support can be affixed, such as microtiter plates (e.g., multi-well plates), test tubes, inorganic sheets, dipsticks, and the like. For example, when the solid support is a polypropylene thread, one or more polypropylene threads can be affixed to a plastic dipstick-type device; polypropylene membranes can be affixed to glass slides. The particular format is, in and of itself, unimportant. All that is necessary is that the solid support can be affixed thereto without affecting the functional behavior of the solid support or any biopolymer absorbed thereon, and that the format (such as the dipstick or slide) is stable to any materials into which the device is introduced (such as clinical samples and reaction solutions).
In one embodiment, preformed nucleic acid probes, nucleic acid primers, and/or antibodies can be situated on or within the surface of a test region by any of a variety of conventional techniques, including photolithographic or silkscreen chemical attachment, disposition by ink jet technology, capillary, screen or fluid channel chip, electrochemical patterning using electrode arrays, contacting with a pin or quill, or denaturation followed by baking or UV-irradiating onto filters (see, e.g., Rava et al. (1996). U.S. Pat. No. 5,545,531; Fodor et al. (1996). U.S. Pat. No. 5,510,270; Zanzucchi et al. (1997). U.S. Pat. No. 5,643,738; Brennan (1995). U.S. Pat. No. 5,474,796; PCT WO 92/10092; PCT WO 90/15070).
In one embodiment, preformed nucleic acid probes or nucleic acid primers are derivatized at the 5′ end with a free amino group; dissolved at a concentration routinely determined empirically (e.g., about 1 μM) in a buffer such as 50 mM phosphate buffer, pH 8.5 and 1 mM EDTA; and distributed with a Pixus nanojet dispenser (Cartesian Technologies) in droplets of about 10.4 nanoliters onto specific locations within a test well whose upper surface is that of a fresh, dry DNA Bind plate (Corning Costar). In another embodiment, preformed nucleic acid probes or nucleic acid primers are derivatized at the 3′ end with a free amino group and optionally include a carbon spacer. Oligonucleotides are dissolved at 20 μM in 0.5 M Phosphate buffer at pH 8.5 and are contact printed on Falcon 1172 plates, gamma irradiated (BD Biosciences) using capillary pins in a humidified chamber. Depending on the relative rate of attachment and evaporation, it may be required to control the humidity in the wells during preparation.
In another embodiment, nucleic acid probes or nucleic acid primers can be synthesized directly on the surface of a test region, using methods such as, for example, light-activated deprotection of growing oligonucleotide chains (for example, in conjunction with the use of a site directing “mask”) or by patterned dispensing of nanoliter droplets of deactivating compound using a nanojet dispenser. Deprotection of all growing oligonucleotides that are to receive a single nucleotide can be done, for example, and the nucleotide then added across the surface. In another embodiment, oligonucleotide anchors are attached to the surface via the 3′ ends of the oligonucleotides, using conventional methodology.
In particular examples, the nucleic acid probe(s), nucleic acid primer(s), and/or antibodies are immobilized in an array (such as a microarray) and a label is detected using a microarray imager. Microarray imagers are commercially available and include OMIX, OMIX HD, CAPELLA, or SUPERCAPELLA imagers (HTG Molecular Diagnostics, Tucson, Ariz.), TYPHOON imager (GE Life Sciences, Piscataway, N.J.), GENEPIX microarray scanner (Molecular Devices, Sunnyvale, Calif.), or GENECHIP scanner (Affymetrix, Santa Clara, Calif.). In other examples, the nucleic acid probes, nucleic acid primers, and/or antibodies can be immobilized on a bead and the label is detected by flow cytometry or related methods (such as utilizing a LUMINEX 200 or FLEXMAP 3D (Luminex Corporation, Austin, Tex.), or other suitable instrument).
Subjects with sepsis (whether symptomatic or pre-symptomatic) identified using the disclosed methods can be treated for sepsis. Thus, in some embodiments, the disclosed methods include administering one or more treatments for sepsis. The methods disclosed herein can be used to treat a subject who does not exhibit symptoms of sepsis, thereby decreasing the severity and/or duration of sepsis in the subject, or inhibiting development of sepsis. In other embodiments, the methods disclosed herein can be used to treat a subject who exhibits symptoms of sepsis and/or is diagnosed with sepsis by one or more other criteria, such as Sepsis-2 or Sepsis-3 criteria (e.g., Levy et al., Crit. Care Med. 31:1250-1256, 2003; Singer et al., JAMA 315:801-810, 2016).
In some examples, one or more therapies for sepsis, such as antibiotic therapy (for example, oral, intravenous, and/or intramuscular antibiotics), intravenous fluids, vasopressors (e.g., epinephrine, norepinephrine, and/or vasopressin), or other supportive therapies, can be administered to the subject. In additional examples, treatment can include surgery, for example to treat or remove infected tissue (including amputation of one or more affected extremities). Additional treatments, including corticosteroids, insulin, blood transfusion, dialysis, and/or mechanical ventilation may be administered in some cases. In other examples, the subject does not exhibit symptoms of sepsis, but is predicted to develop sepsis by the methods disclosed herein, and the subject is monitored for development of symptoms of sepsis. The subject may also be retested one or more times using the methods disclosed herein to diagnose sepsis at a later time point, and may be placed on a treatment protocol if sepsis is diagnosed.
Exemplary antibiotics that can be administered include ceftriaxone, azithromycin, ciprofloxacin, vancomycin, aztreonam, moxifloxacin, nafcillin, daptomycin, meropenem, ceftazidime, cefotaxime, cefepime, piperacillin-tazobactam, ampicillin-sulbactam, imipenem/cilastatin, levofloxacin), clindamycin, and combinations of two or more thereof. Specific antibiotics can be selected if the organism(s) causing the infection are identified. In some examples, the subject is treated with one or more broad-spectrum antibiotics immediately upon diagnosis, for example, prior to identifying a causative agent. The subject can then be administered one or more additional or different antibiotics when a specific causative agent is identified.
In other examples, the subject can be administered antiviral therapy, such as acyclovir, pocapavir or ganciclovir, if the underlying infection is known or suspected to be viral (such as herpes virus or enterovirus).
The following examples are provided to illustrate certain particular features and/or embodiments. These examples should not be construed to limit the disclosure to the particular features or embodiments described.
A previous 11 gene panel for sepsis prediction (CEACAM1, ZDHHC19, C9orf95, GNA15, BATF, C3AR1, KIAA1370, TGFB1, MTCH1, RPGRIP1, and HLA-DPB1; referred to as the “Stanford11” panel) was previously developed from a panel of 82 differentially expressed genes using the greedy forward search method (Sweeney et al., Sci. Transl. Med. 7:287ra71, 2015). This set of 82 genes (Stanford82; Table 1) was used as the basis for developing new biological-function based sepsis panels. Exemplary, non-limiting GenBank Accession numbers are provided for each gene in Table 1. Additional sequences for each gene can be identified using publicly available databases.
A list of substitutable genes in Stanford11 was generated based on gene ontology biological processes (GOBPs). The biological functions associated with the Stanford82 genes was first analyzed through functional enrichment analyses of GOBPs using DAVID (Database for Annotation Visualization and Integrated Discovery) (Huang et al., Nature Protocols 4:44-57, 2009). GO Direct category, which provides GO mappings directly annotated by the source database, was used. The terms with >1 genes were selected as “Stanford82-associated GOBPs.” The GOBPs that were represented by at least two genes, at least one of which derived from the Stanford11 genes were selected and visualized as a network using Enrichment Map v2.1.0, a Cytoscape plugin (Merico et al., PLoS One 5:e13984, 2010). The connected GOBP terms were merged and defined as a single functional category. Genes involved in the merged GOBPs and having the same directional changes were defined as substitutable candidates in the same GOBP.
Twelve microarray datasets that were used in two studies by Sweeney et al. (Table 2; Sweeney et al., Critical Care Medicine 45:1-10, 2017) were utilized. The first nine datasets are the discovery set that was used to identify the Stanford11 panel and the last three datasets are independent validation sets. The microarray datasets were normalized by the same methods used by Sweeney et al. Briefly, Affymetrix datasets were normalized using RMA or gcRMA (R package affy) (Gautier et al., Bioinformatics 20:307-315, 2004). Agilent and Illumina datasets were background corrected based on normal-exponential convolution model and then between-arrays quantile normalized using R package limma (Ritchie et al., Nucleic Acids Research 43:e47, 2015). The mean of multiple probes for common genes was used as the gene expression level after normalization. In the case of GSE74224, there was no probe for KIAA1370 of Stanford11; therefore, 10 genes were used to compute the performance of Stanford11 for this data.
In order to systematically evaluate the effect of gene substitution and reduction on classification performance, four different procedures were used as follows: 1) substitute one gene at a time; 2) substitute all possible genes; 3) retain only one gene for a GOBP where more than two genes are involved; 4) reduce panel by selecting one gene in each GOBP.
In order to evaluate the impact of biological function information in classification performance, three different approaches were used as follows: 1) use the 11 highest correlated genes based on expression profile with Stanford11 regardless of their biological functions; 2) use 11 randomly selected genes involved in chemotaxis, adhesion and migration biological function—one of GOBP terms associated with Stanfordl 1; and 3) k-Top Scoring Pairs classifier (kTSP) using switchbox R package to identify a small set of paired genes (Afsari et al., Bioinformatics 31:273-274, 2015). The AUC of kTSP was calculated by defining the number of votes among k pairs as a diagnostic score (Marchionni et al., BMC Genomics 14:336, 2014). The chemotaxis, adhesion and migration GOBP terms were selected as they contain more than 11 genes that can be fully substituted for Stanford11 genes. The classification performance of randomly selected gene sets was also tested by generating 100,000 combinations of 11 genes randomly selected from the Stanford82 genes. Classification performance was analyzed, along with biological processes that the 100,000 gene sets are involved in. The top and bottom 250 gene sets in order of performance were selected. The hierarchical clustering was applied to cluster GOBPs based on the number of genes in each GOBP from the top and bottom 250 gene sets.
The overall procedure and the results of identifying substitutable genes for Stanford11 are summarized in
Since three genes (ZDHHC19, KIAA1370, and RPGRIP1) were excluded during the process of identification of substitutable genes, their contribution to the performance of Stanford11 was assessed. Excluding these genes from Stanford11 did not significantly affect the performance in both discovery and validation sets (Table 4). In the case of excluding all three genes, the performance decreased in one discovery and one validation set (Glue grant day [1-3] from 0.9145 to 0.865 and GSE74224 from 0.8814 to 0.8544), while increasing in one discovery set (GSE40012 from 0.7091 to 0.774). Therefore, these three genes contributed marginally to the diagnostic performance of Stanford11.
#indicates significantly higher/lower AUC than Stanford11 (P-value ≤ 0.05).
To determine the impact of biological processes in the performance of a diagnostic panel, 100,000 panels consisting of 11 genes randomly selected from the Stanford82 list were generated. The classification performances of the 100,000 random 11 member gene sets in the 9 discovery datasets were computed and sorted based on the performance as measured by AUC (
A total of 12 microarray datasets from the public domain were used to evaluate the effect of gene substitution/reduction on the diagnostic performances of Stanford11. Nine of the datasets were used in the original discovery of the Stanford11 gene panel. The other three, GSE65682, GSE74224, and E-MTAB-3589, were not used in the development of Stanford11 panel; therefore, they were used as independent validation data in the study.
Based on the substitution candidates listed in Table 3, one gene was changed at a time for the six substitutable genes in the Stanford11 panel. As shown in
The effect of substituting genes representing all functional categories simultaneously was then tested. If a function had more than one substitutable gene, all combinations of genes were enumerated and their classification performances were tested and are summarized in
The possibility of using representative genes from each biological process to reduce the number of features in the diagnostic panel was investigated. Among the six representative biological processes of the Stanford11 panel, two biological functions have more than one gene in the panel. The GOBP “PLC, phosphorylation, platelet activation function” has two genes (C3AR1 and GNA15) and “chemotaxis, angiogenesis, adhesion, migration function” has four genes (CEACAM1, C3AR1, TGFB1, and GNA15) in the panel. We calculated AUCs of panels where only one gene was retained. As shown in
To explore the possibility of just using representative genes from key biological processes associated with Stanford11, all possible combinations of 6 gene panels were generated from genes mapped to those six biological processes. Diagnostic performance for all six-gene combinations was computed (
Though removing RPGRIP1 involved in cell development and proliferation process from the original Stanford11 panel did not decrease the overall diagnostic performance (Table 4), we tested the effect of adding RPGRIP1 to the new 6 gene panels. The results showed that adding RPGRIP1 to the 6-gene panel only improved the performance in one of the validation datasets, GSE74224, but not in the other two datasets (Table 7).
The performance differences between biological function-based gene substitution and expression correlation-based substitution were tested. Based on expression profiles of all the discovery datasets, the highest correlated genes with features in the Stanford11 panel were selected (referred to as panel-HC), regardless of the biological functions (
364 panels (referred to as panel-AM) were also generated by randomly selecting 11 genes from 14 genes involved in adhesion/migration process (
Lastly, TSP and k-TSP algorithms were applied to identify a small set of paired genes. TSP identified two genes that have only 67% classification accuracy in the three independent validation sets. Therefore, we applied k-TSP to increase performance by including additional pairs of genes. It resulted in three pairs which include six genes in total (referred to as panel-kTSP,
Peripheral blood samples were collected from elective surgery patients (n=155) prior to surgery (Pre-Op) and daily up until one or two days post-sepsis diagnosis (Post-Op). The patients had undergone a wide range of surgeries, with the most common being large bowel resection, vascular surgery, and pancreatic surgery. The control group included an equal number of patients matched by age, sex, and surgical procedure who did not develop sepsis. The sepsis patients, as well as matched controls, were divided into three separate groups: Discovery, Test, and Validation sample sets (Table 8). Samples from the Discovery set were used to identify various differentially expressed genes (DEGs) and corresponding biomarker panel candidates. The Test set and the Validation set were used to compare performances of the candidate panels and to assess the ability of the selected panel based on the test set to predict sepsis, respectively.
Based on the study design, the longitudinal samples from each patient were organized according to the date when sepsis was diagnosed (Day 0). Using this nomenclature, the day before diagnosis date is Day-1 and one day post-diagnosis is Day+1. There were significantly fewer samples available on Day−4 (almost 50% decrease from Day−3). Since the performance and reliability of biomarkers depends on the number of samples analyzed and the purpose of this study was to find a predictive marker for risk sepsis, the analyses were concentrated on data from samples taken on Day-3 to Day-1. Total RNA from whole blood was isolated with miRNeasy® kit (Qiagen, Germantown, Md.) according to the manufacturer's instructions. Fluorescent-labeled (Cy3) probes were prepared with the single color labeling kit from Agilent (Santa Clara, Calif.). Probes were then loaded on the array hybridization chambers then hybridized at 65° C. for 17 hours. The slides were washed and scanned (Agilent, Santa Clara, Calif.). The microarray data was then normalized by quantile normalization.
To identify genes associated with the development of sepsis for the samples, three factors were considered during statistical analysis: 1) Pre-Op normalization, 2) paired vs unpaired testing, and 3) time points. By considering only probes corresponding to mRNA, eight sets of DEGs were identified (Table 9). The Pre-Op normalization factor was applied to remove transcriptome changes associated with surgery from Post-Op samples to reduce the individual differences and emphasize the gene expression changes associated with infection processes. The second factor was to explore the effect of matched sepsis-control patients individually or as a group. The third factor was to compare the gene expression profile changes between the sepsis and control groups at individual timepoints before sepsis is diagnosed (Day 0). It is reasonable to assume that a sepsis-specific signature would be more prominent as getting closer to time of diagnosis. Grouping all pre-diagnosis sepsis time points together could potentially dilute out the stronger signal present at Day-1. Statistical analysis was therefore performed comparing control and sepsis at each time-point. DEGs identified in at least two individual timepoints were considered.
Support Vector Machine was then combined with Recursive Feature Elimination (SVM-RFE) and applied in conjunction with a repeat cross-validation procedure to select genes that were most relevant for the classification between sepsis patients and controls from the selected DEGs identified from the eight different procedures. The performances of the resultant eight candidate panels were then evaluated in the test sample set (Table 9).
Four different approaches using Pre-Op normalization were first performed (Table 9). Differentially expressed genes and their optimal feature sets from SVM-RFE were identified for each approach. Among the four approaches, 58 genes (ISB58) were identified from approach 2. A 19 gene panel (ISB19) was selected from ISB58 using SVM-RFE. ISB19 showed the highest performance from panel genes identified from the four approaches utilizing Pre-Op normalization (Approach 2 (bold in Table 9)). ISB58 is summarized in Table 10, and the overall expression profile is shown in
Second, from the comparisons without Pre-Op normalization, gene panels identified from approach 6 showed the best performance (Approach 6 highlighted with bold in Table 9). With this approach, 355 DEGs were identified (ISB355) (Table 11). Among the 355 genes, 63 genes (ISB63) were then identified as an optimal feature set using SVM-RFE. The overall expression profiles for the ISB355 is shown in
There is a significant overlap (54 genes) between the genes in ISB 58 and ISB 355 (
An SVM model for each panel that was trained using the Discovery sample set (Example 2) was generated and subsequently tested in the Validation sample set. Classification performance was summarized and listed with four metrics: AUC, Accuracy, Sensitivity, and Specificity in Table 13. Both panels provided high predictive performance. For example, the AUC values of ISB19 are >0.7 and the AUC values of ISB63 were even higher, at >0.8 at all time points. ISB63 showed greater than 77% of predictive accuracy for the development of sepsis three days prior to sepsis diagnosis (Day-3). The two panels ISB19 and ISB63, showed very little overlap of genes in the panel (Table 11).
Even though the ISB panels were developed using pre-sepsis diagnosis gene expression data to predict the development of sepsis, the ability of the ISB19 and ISB63 gene panels (Example 2) were assessed to diagnose patients with sepsis using 19 publicly available sepsis related datasets derived from 1,636 patients (Table 14). The public datasets can be grouped based on patient information, such as bacterial or viral infection, or adult or pediatric sepsis. With this information, the datasets were grouped into 6 groups based on clinical parameters: 1) Sepsis/severe sepsis, 2) Pediatric sepsis, 3) Neonatal sepsis, 4) Sepsis associated with bacterial infection, 5) Sepsis associated with bacterial/viral infection, and 6) Sepsis associated with viral infection.
Staphylococcus aureus or
Escherichia coli
Even though the ISB19 and ISB63 panels were developed for predicting the development of sepsis, these panels still showed good performance in diagnosing patients with sepsis in unrelated and independent datasets obtained from the public domain (
To determine the utility of ISB19 and ISB63 panels, the ability of ISB panels to predict the development and diagnose sepsis was assessed and compared with panels reported in literature (Stanford11) and approved by FDA (Septicyte4). The results show that the previously identified sepsis panels, Stanford11 and Septicyte4, were able to accurately diagnose but not predict sepsis development (
The methods described in Example 1 to optimize gene panels were applied to the ISB19 and ISB63 panels. The biological processes associated with ISB19 and ISB 63 panels were first determined. The Gene Ontology (GO) enrichment test showed a total of 89 GOBPs associated with genes in the ISB19 panel and 326 GOPBs associated with the ISB63. The EnrichmentMap tool summarized the 89 GOBPs associated with ISB19 into three representative functional terms: immune response, signal transduction, and metabolism. Similarly, the 326 GOBPs associated with ISB63 were summarized into six representative terms: immune response, signal transduction, metabolism, apoptosis, transcription, and adhesion/migration (Table 15).
Genes from the original ISB58 and ISB355 sets that shared the same functional terms and similar direction of concentration changes with genes already included in the ISB19 and ISB63 were then used. From ISB58 gene set, 13, 3, and 4 genes were identified that have the same direction of concentration changes with genes in ISB19 and are associated with immune response, signal transduction, and metabolism, respectively. From ISB355 gene set, 67, 55, 41, 53, 7 and 40 genes were identified that have the same direction of concentration changes with genes in ISB63 and are associated with immune response, signal transduction, metabolism, apoptosis, transcription and adhesion/migration, respectively (Table 15).
The genes identified from ISB58 and ISB355 (number of available genes for individual functional terms are listed in Table 15) were then used to compute classification performance of three-gene combinations that represent three functional terms associated with ISB19 and six gene combinations for ISB63. In total, the performance of 156 three-gene combinations and randomly selected 100,000 (out of 2.24×109 possibilities) six-gene combinations, respectively, were assessed. The top 10 performing three (from ISB19) and six (from ISB63) gene panels are listed in Tables 16 and 17, respectively. The results showed slightly higher or similar average performance for the ISB19 derived three gene panels than the original ISB19 at all three time points (
Exemplary GenBank Accession Nos. are provided for the genes in Table 16 either in Table 12, or as follows: GRB10—GenBank Accession No. NM_001350814; GZMA—GenBank Accession No. NM_006144; PFKFB2—GenBank Accession No. NM_006212; CD24—GenBank Accession No. NM_037362; RNASE2—GenBank Accession No. NM_002934; DACH1—GenBank Accession No. NM_004392; SPOCD1—GenBank Accession No. NM_144569
Exemplary GenBank Accession Nos. are provided for the genes in Table 17 either in Table 12, with respect to Table 16, or as follows: RPS6KA3—GenBank Accession No. NM_004586; BCL6—GenBank Accession No. NM_001706; GNG10—GenBank Accession No. NM_001017998; MPO—GenBank Accession No. NM_000250; MAVS—GenBank Accession No. NM_020746; CYPIB1—GenBank Accession No. NM_000104; TLR8—GenBank Accession No. NM_016610; MLLT1—GenBank Accession No. NM_005934; GAS7—GenBank Accession No. NM_003644; LILRA2—GenBank Accession No. NM_006866; IL17RA—GenBank Accession No. NM_014339; LILRA4—GenBank Accession No. NM_012276; NOV—GenBank Accession No. NM_002514; FAM105A—GenBank Accession No. NM_019018; ERO1L—GenBank Accession No. GenBank Accession No. NM_014584; C14orf101—GenBank Accession No. NM_017799.
Since the ISB predictive panels ISB19 and ISB63 also showed good diagnostic performance, public domain data were used to assess the ability of smaller gene panels to accurately diagnose sepsis. The average AUCs of the 19 datasets are summarized in
The Sequential Organ Failure Assessment (SOFA) score is used to determine the extent of a patient's organ failure. C-reactive protein (CRP) levels are commonly used clinically as a non-specific indicator of infection. These two clinical parameters are often utilized as part of the sepsis diagnosis, and for most patients in this study this clinical information was available. SOFA score or CRP levels alone did not provide good performance in either diagnosing or predicting the development of sepsis (Table 18). However, integrating these two clinical parameters with the ISB gene panels slightly increased sensitivity, but not specificity (Table 18).
To determine if there were differences in prediction performance based on the severity of sepsis, patients who required vasopressor support and who met septic shock criteria in all iterations of the sepsis guidelines as severe sepsis were identified. All remaining patients were classified as mild sepsis. Based on this stratification, the classification performance of 60 small gene panels derived from ISB19 and ISB63 (Tables 16 and 17) were evaluated for all Sepsis samples, Mild Sepsis and Severe Sepsis compared to Controls. The ISB panels showed higher classification performance between Severe Sepsis and Control (
The current study utilized longitudinal blood samples collected from patients undergoing elective surgery to identify a blood mRNA-based panel that could diagnose sepsis prior to onset of detectable clinical symptoms, allowing for much earlier therapeutic intervention. Using a combination of differential gene expression analysis, machine learning tools, and a biological function-based biomarker panel optimization process, we have identified and validated biomarker panels consisting of only three or six genes that can identify patients developing sepsis three days prior to the onset of symptoms.
Patient and sample description. The patients were recruited across eight hospitals in England and Germany: Guys' and St Thomas' Hospital (London, U.K.), Heartlands Hospital (Birmingham, U.K.), North Bristol NHS Trust (Bristol, U.K.), Queen Elizabeth Hospital (Birmingham, U.K.), The Leeds Teaching Hospitals Trust (Leeds, U.K.), The Royal Liverpool University Hospital (Liverpool, U.K.), University College Hospital (London, U.K.), and University Hospital Frankfurt (Frankfurt DE). All subjects gave written informed consent to participate and the study was approved by Southampton & South West Hampshire Research Ethics Committee with reference number 06/Q1702/152.
The patients underwent a wide range of major elective surgeries, with the most common being large bowel resection, vascular surgery, and pancreatic surgery (detailed patient information is shown in Table 19). Peripheral blood samples were collected from patients prior to surgery (Pre-Op) and daily up to five days post-sepsis diagnosis (Post-Op). Each patient's demographic parameters, vital signs, hematology, clinical chemistry, and pathogen detection results were recorded. The control group included age, sex, and surgical procedure matched patients who did not develop sepsis. Consensus evaluations were made by nine physicians for the study. This multi-year project was initiated prior to the release of the Sepsis-3 definition; therefore, all patients were diagnosed based on the Sepsis-2 criteria.
Whole blood RNA isolation and microarray analysis. Total RNA from RNAlater-preserved whole blood was isolated with miRNeasy kit (Qiagen, Germantown, Md.) according to the manufacturer's instructions. The quantity and quality of RNA were assessed with the Agilent 2100 Bioanalyzer (Santa Clara, Calif.) and NanoDrop 1000 spectrophotometer (Thermo Scientific, Wilmington, Del.). Whole blood gene expression profiling experiments were performed using Agilent Human Whole Genome 8×60 microarrays and fluorescent probes were prepared using Agilent QuickAmp Labeling Kit according to the manufacturer's instructions (Santa Clara, Calif.). Gene expression information was obtained with Agilent's Feature Extractor and processed with in-house SLIM pipeline (Marzolf et al., Source Code Biol. Med. 1:5, 2006).
Differential expression analysis. The microarray data were normalized using a quantile method. Probes having signals lower than the global mean were removed from further analysis as it is known that low abundance transcripts have higher detection variabilities. To identify genes associated with the development of sepsis, three factors were considered during analysis: 1) the use of Pre-Op data, 2) paired or unpaired analysis between controls and sepsis samples, and 3) combining time point data prior to Day0 or not. Combinations of these 3 different factors resulted in 8 different analysis approaches (Approach 1 to 8,
Feature selection. To identify a biomarker panel consisting of a subset of genes among DEGs, support vector machine with recursive feature elimination (SVM-RFE) (Guyon et al., Machine Learning 46:389-422, 2002) was applied with 5-fold cross-validation using pathClass R package (Johannes et al., Bioinformatics 27:1442-1443, 2011). From each loop of the cross-validation procedure, an optimal set of features was selected by SVM-RFE, therefore, five different sets of features (genes) could be identified. Considering the randomness of shuffling and partitioning of samples in a cross-validation procedure, the cross-validation procedure was repeated 100 times, resulting in a total of 500 sets of features. The importance of each feature to the classification was determined as the frequency of how often each feature was selected among the 500 sets. The features were sorted in order of their importance. SVM models with different numbers of features were constructed by adding features from the most important to the least and average AUCs were computed by repeating the 5-fold cross-validation procedure 100 times. The final optimal feature set was determined at the highest average AUC.
Panel optimization. To determine the biological processes associated with the classification, the Gene Ontology (GO) terms associated with the genes (features) in the top performing diagnostic panels were determined using the database for annotation visualization and integrated discovery (DAVID) (Huang et al., Nat. Protoc. 4:44-57, 2009). Then we tried to select core biological processes that have higher discriminatory power (
Throughout the analyses, classification performance of a biomarker panel was evaluated by the area under the curve (AUC) of receiver operating characteristic (ROC) curve. Statistical significance of the difference between AUCs from different panels was computed with DeLong's method (Robin et al., BMC Bioinformatics 12:77, 2011).
qPCR validation. The Fluidigm 96.96 integrated fluidic circuit (IFC) plates were used for qPCR validation of the 3- and 6-gene panels. The analysis included 665 blood RNA samples from 155 sepsis and 153 controls from the Discovery, Test, and Validation sets. The Discovery and Test sets were used to train classifiers using the panels and the Validation set was used to calculate their performances. The samples were randomized across 8 IFC chips and each chip included four wells of a pooled sample to measure inter-chip variability and two wells of no template controls. The whole blood cDNA was synthesized from 100 ng of RNA using the Reverse Transcription Master mix (Fluidigm PN 100-6297). Pre-amplification was performed using a pool of all assays using the Preamp Master Mix (Fluidigm PN 100-5744). The preamp reaction was then cleaned up using Exonuclease 1 (New England Biolabs PN M0293 S). The amplified cDNA was diluted 1:10 using DNA suspension buffer, and then the Fluidigm chips were run on the Biomark HD machine according to the manufacturer's instructions using the GE 96.96 Standard v1 protocol. Ct values from all eight chips were combined using the Biomark Data Analysis software. BRK1 and RNF181 were used as references genes. The two reference genes were selected based on the following procedure: 1) a set of invariant genes was selected in which each gene had less than 5% of coefficient of variation in each of sepsis and control datasets, below bottom 5% of fold-change distribution between sepsis and control, and within top 5% of an intensity distribution of all genes. Genes that were not defined well or were suspected to be saturated in intensity were not considered in further steps. 2) A total of 41 genes were selected from the first step and used as an input for RefFinder that provides a comprehensive ranking of genes by comparing four computational programs (delta-CT, BestKeeper, NormFinder, and geNorm) for a reference gene identification. In RefFinder, sepsis and control datasets were analyzed separately and gene rankings from each dataset were averaged. Top two ranked genes, BRK1 and RNF181, were finally selected and used as reference genes.
Patient cohort and study design. Surgery patients were recruited across eight hospitals located in England and Germany (
The sepsis patients and matched controls were each split into three groups: Discovery (64 sepsis and 63 control), Test (31 sepsis and 30 control), and Validation (60 sepsis and 60 control) sets. Samples from the Discovery set were used to identify biomarker panel(s) based on differentially expressed genes (DEGs) from whole blood between sepsis patients and controls. The Test samples were used to optimize the biomarker panel(s) and the Validation set was used to verify the panels' classification performance.
To identify pre-symptomatic biomarkers, the longitudinal samples from each patient were organized such that the day sepsis was diagnosed was labeled as Day0 (
Identification of pre-symptomatic sepsis diagnostic panels. Because of the study design, different approaches were used to identify gene expression changes associated with sepsis development (
Using the DEGs identified from the Discovery sample set, a classifier capable of separating pre-symptomatic sepsis patients from control patients was identified using support vector machine combined with recursive feature elimination (SVM-RFE) in conjunction with a repeat cross-validation procedure for each of the DEG sets from different analysis approaches. The performances (indicated as area under the receiver operating characteristic curve (AUC) for each day prior to the day of sepsis diagnosis) of the panels were then computed (summarized in
Optimization of the classification panels. We used a biological function-based process to reduce the number of features in the two top performing classification panels (Lee et al., Sci Rep. 9:7365, 2019; incorporated herein by reference in its entirety). There were 89 and 326 biological processes associated with at least one of the genes in ISB19 and ISB63, respectively. To determine the biological processes with higher discriminatory power, 100,000 randomly selected 19-gene or 63-gene panels were generated from ISB58 or ISB355, respectively (
Redundant and mutually overlapping terms from the 19 and 46 core biological functions were clustered and summarized (
Next, it was determined whether using a transcript representing each biological function could reduce the overall number of features while preserving the diagnostic performance of the panel(s). All 3-gene combinations that represent functional terms associated with ISB19 and 6-gene combinations representing ISB63 were generated by selecting one gene from each biological process, respectively. The performances of all possible 3-gene combinations (156=13×3×4, Table 22) were assessed using the Test sample set. However, it is prohibitory to do all 6-gene panels as there are more than 2 billion combinations (2,242,101,400=67×55×41×53×7×40). To reduce the number of possible 6-gene combinations for the ISB63 derived panels, the genes that had better classification capability were first determined. The performances of randomly selected 100,000 6-gene combinations were assessed with the Test sample set and then sorted based on classification performance. For each biological process, the most frequently appearing genes among the top performing 1,000 6-gene panels (top 1% of the 100,000 combinations tested) were identified. This analysis yielded 16, 12, 8, 10, 2, and 10 genes for immune response, signal transduction, apoptosis, transcription, adhesion/migration and metabolism, respectively (Table 22). From these top performing genes, 307,200 (16×12×8×10×2×10) 6-gene panels were generated and their performances were also assessed with the Test sample set. The performances of the ISB19 and ISB63 panels were also determined with the Test sample set. Seventy-eight of the 3-gene panels and 32,540 of the 6-gene panels showed performances similar or better than the original ISB19 and ISB63 panels based on DeLong's test (p-value >0.05). Therefore, these panels could be considered alternative optimized panels for the original ISB19 and ISB63.
Evaluation of the biomarker panels using the Validation cohort. The diagnostic performances of the ISB19 and ISB63 panels were further assessed with the Validation cohort. Compared to their diagnostic performances in the Test cohort (
Integration of top panel with clinical parameters does not significantly increase performance of the biomarker panels. A patient's clinical parameters such as the SOFA score and blood CRP level have been used to identify individuals having or suspected to have sepsis. Therefore, the diagnostic performances of SOFA and CRP were evaluated in the pre-symptomatic phase. In addition, the impact of integrating clinical information with the gene panels was evaluated to determine if this would increase the panels' performance. Neither the SOFA score nor the CRP level alone or combined performed well in diagnosing sepsis in the pre-symptomatic phase in the Validation cohort (
The impact of disease severity on the performance of panels. To determine whether the panels' performances were affected by disease severity, patients who required vasopressors and met septic shock criteria were identified and labeled as “Septic Shock.” All remaining sepsis patients were grouped as “Sepsis” (this includes sepsis and severe sepsis categories according to Sepsis-2 criteria). The classification performances of the smaller 3- and 6-gene panels were then assessed in these groups. The panels had higher overall performances in identifying “Septic Shock” patients (average AUC of 0.91 and 0.94 from ISB3 and ISB6, respectively) prior to the onset of clinical symptoms compared to “Sepsis” patients (average AUC of 0.80 and 0.84 from ISB3 and ISB6, respectively) (
Assessing the accuracy and specificity of detecting sepsis in the symptomatic phase. Although the biomarker panels identified were aimed at detecting sepsis at the pre-symptomatic stage, the ISB19 and ISB63 panels also showed good diagnostic performance for patients with clinical symptoms of sepsis in the Test sample set (
We also assessed the specificity of the panels for sepsis diagnosis using eleven different public domain datasets including four sets representing bacterial infections without sepsis, four sets representing viral infection, and three sets representing autoimmune diseases (
Validation of ISB3 and ISB6 using an alternative measurement platform. The performances of the 3-gene and 6-gene panels were further verified using qPCR (
Although the performances were slightly decreased compared to the microarray results (average ISB3 AUC=0.83 and average ISB6 AUC=0.85), the AUCs from qPCR results were still greater than 0.7 at all time points for both ISB3 (average AUC=0.77) and ISB6 (average AUC=0.74) (
In view of the many possible embodiments to which the principles of the disclosure may be applied, it should be recognized that the illustrated embodiments are only examples and should not be taken as limiting the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.
This application claims the benefit of U.S. Provisional Application No. 62/755,834, filed Nov. 5, 2018, and U.S. Provisional Application No. 62/911,603, filed Oct. 7, 2019, both of which are incorporated by reference herein in their entirety.
This invention was made with government support under Contract number HDTRA1-13-C-0055 awarded by Defense Threat Reduction Agency. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2019/059707 | 11/4/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62911603 | Oct 2019 | US | |
62755834 | Nov 2018 | US |