This invention is related to the area of prognosis, diagnosis and theranosis. In particular, it relates to prognosis, diagnosis, risk assessment, and monitoring of sepsis.
Sepsis is the name given to infection when symptoms of inflammatory response are present. Of patients hospitalized in an intensive care unit (ICU) who have an infection, 82% have sepsis. Sepsis is defined as an infection-induced syndrome involving two or more of the following features of systemic inflammation: fever or hypothermia, leukocytosis or leukopenia, tachycardia, and tachypnea or a supranormal minute ventilation. Sepsis may be defined by the presence of any of the following ICD-9-CM codes: 038 (septicemia), 020.0 (septicemic), 790.7 (bacteremia), 117.9 (disseminated fungal infection), 112.5 (disseminated Candida infection), and 112.81 (disseminated fungal endocarditis). Sepsis is diagnosed either by clinical criteria or by culture of microorganisms from the blood of patients suspected of having sepsis plus the presence of features of systemic inflammation. Culturing some microorganisms can be tedious and time consuming, and may provide a high rate of false negatives. Bloodstream infection is diagnosed by identification of microorganisms in blood specimens from a patient suspected of having sepsis after 24 to 72 hours of laboratory culture. Currently, gram positive bacteria account for 52% of cases of sepsis, gram-negative bacteria account for 38%, polymicrobial infections for 5%, anaerobes for 1%, and fungi for 5%. For each class of infection listed, there are several different types of microorganisms that can cause sepsis. The high rate of false negative microbiologic cultures leads frequently today to empiric treatment for sepsis in the absence of definitive diagnosis. Infection at many different sites can result in sepsis. The most common sites of infection in patients with sepsis are lung, gut, urinary tract, and primary blood stream site of infection. Since sepsis can be caused by many infections with microorganisms at many different sites, sepsis is a very heterogeneous disease. The heterogeneity of sepsis increases the difficulty in devising a diagnostic test.
The number of patients with sepsis per year is increasing at 13.7% per year, and was 659,935 in 2000. The incidence of sepsis in the United States in 2000 was 240.4 cases per 100,000 population. Sepsis accounted for 1.3% of all hospitalizations in the U.S. from 1979 to 2000. During this period, there were 750 million hospitalizations in the U.S. and 10.5 million reported cases of sepsis.
Sepsis is the leading cause of death in critically ill patients, the second leading cause of death among patients in non-coronary intensive care units (ICUs), and the tenth leading cause of death overall in the United States. Overall mortality rates for sepsis are 18%. In-hospital deaths related to sepsis were 120,491 (43.9 per 100,000 population) in 2000.
Care of patients with sepsis is expensive and accounts for $17 billion annually in the United States alone. Sepsis is often lethal, killing 20 to 50 percent of severely affected patients. Furthermore, sepsis substantially reduces the quality of life of those who survive: only 56% of patients surviving sepsis are discharged home; 32% are discharged to other health care facilities (i.e., rehabilitation centers or other long-term care facilities), accruing additional costs of care.
Cost of care, morbidity and mortality related to sepsis are largely associated with delayed diagnosis and specific treatment of sepsis and the causal infection. Early diagnosis of sepsis is expected to result in decreased morbidity, mortality and cost of care. The average length of hospital stay in patients with sepsis is twelve days.
Severe sepsis is defined as sepsis associated with acute organ dysfunction. The proportion of patients with sepsis who had any organ failure is 34%, resulting in the identification of 256,033 cases of severe sepsis in 2000. Organ failure had a cumulative effect on mortality: approximately 15% of patients without organ failure died, whereas 70% of patients with 3 or more failing organs (classified as having severe sepsis and septic shock) died. Risk of death from sepsis increases with increasing severity of sepsis.
Currently determination of the severity of sepsis and determination of whether, in a patient with sepsis, the sepsis is increasing or decreasing in severity, is based upon clinical events such as failing organs. Determination that, in a patient with sepsis, the sepsis is increasing in severity, may allow more intensive therapy to be given which may increase the likelihood of the patient surviving. The availability of a diagnostic test that would allow monitoring of patients with sepsis to determine whether the sepsis is increasing or decreasing in severity may allow early detection of deterioration and earlier intensification of therapy and less risk of death or disability. Sepsis results either from community-acquired infections or hospital-acquired infections. Sepsis occurs in 1.3% of all U.S. hospitalizations. Hospital-acquired infections are a major source of sepsis, accounting for 65% of sepsis patients who are admitted to an intensive care unit. Sepsis is a major cause of admission to a hospital intensive care unit. 23-30% of patients admitted to an intensive care unit for longer than 24 hours will develop sepsis. Sepsis is a common complication of prolonged stay in an ICU. 8% of patients who remain in an ICU for longer than 24 hours will develop sepsis.
There is a need for screening diagnostic tests for sepsis and for tests to monitor sepsis severity with relatively few false negatives and high sensitivity and specificity. Sepsis is the 10th leading cause of death. Infections account for 11 million hospital visits per year. Only the patients with severe symptoms are hospitalized or receive intensive treatment. However, the evaluation and management of patients with suspected sepsis is complicated by the lack of specific diagnostic criteria, heterogeneity of presentation and outcome. Early identification of patients likely to progress to death, who are candidates for aggressive treatment to prevent such death, is particularly difficult.
Current gold standards for prognostic assessment in sepsis include APACHE II (Acute Physiology and Chronic Health Evaluation), SOFA (Sepsis-related Organ Failure Assessment), and PRISM III (Pediatric Risk of Mortality) scores (Knaus et al., 1985; Vincent et al., 1996; Pollack et al., 1996). Additional potential treatments include admission to an intensive care unit, early goal directed therapy, activated protein C therapy, intensive glycemic control, hyperbaric or supplemental oxygen, or exogenous steroids (Otero et al., 2006; Russel 2008; Calzia et al., 2006; Muth et al., 2005; Annane 2005; Lin et al., 2005; Oter et al., 2005). The decisions regarding the severity of sepsis made based upon APACHE II, SOFA, PRISM and other clinical scores or on finger stick lactate values are either subjective (clinical scores) or insensitive (lactate) or suffer from false negative results in certain subjects. Therefore a more accurate test using biomarkers or reference characteristics are needed to stratify patients at presentation and identify patient subsets that need additional or more aggressive treatment. Additionally what is needed are methods for diagnosing sepsis and differentiating those with sepsis from those patients who do not have sepsis.
Methods and biomarker compositions are disclosed for prognosing and diagnosing sepsis in subjects, methods for prognosis of a sepsis infection and outcomes, and methods for determining the sepsis status of a subject who presents to a healthcare worker or facility as to whether the subject does or does not have sepsis, and whether there is a high risk of death. Methods comprise measurement of the amounts of one or more clinicometabolomic classifiers, which are identified clinical and metabolic changes in bodily fluids, such as plasma, of patients, for example, at time of presentation to a healthcare worker or facility, that distinguish sepsis from other disorders with similar presentation (NIS—non-infected SIRS-positive) (SIRS—systemic inflammatory response) and that differentiate sepsis patients that are likely to have uncomplicated courses from those patients that are likely to have complications, including death.
Also disclosed are novel therapeutic targets for individualized intervention. Disclosed herein are methods and compositions of diagnosing sepsis in a human subject. Methods and biomarkers of the present invention can be used to ascertain if a patient receiving treatment for sepsis is responding positively to such treatment. Additionally, methods and biomarkers of the present invention can be used to distinguish patients who should be admitted to a hospital for treatment from patients who will not require admittance for treatment.
A biomarker prognostic panel is disclosed that can distinguish and predict sepsis survival from sepsis death. The panel can include piperine, palmitoycarnitine, 3-methoxytyrosine, ocatanoylcarnitine, clinical blood lactate, X-12775 (unannotated analyte), and the single sulfated steroid X-11302 (unannotated analyte). Alternatively, the biomarker prognostic panel may comprise creatinine, 4-vinylphenol sulfate, cglycosyltryptophan, X-11261, X-12095, X-12100, 2-octenoylcarnitine and X-13553.
A biomarker diagnostic panel is disclosed that can differentiate sepsis patients from non-infected subjects. The panel can include galactonate, uridine, maltose, glutamate, creatine and X-12644 (unannotated analyte). Alternatively, the biomarker diagnostic panel may comprise citrulline, laurylcarnitine, androsterone sulfate, isoleucine, X-11838, X-12644, and X-11302 (a pregnan steroid monosulfate).
A method for sepsis prognosis in a subject is also described. The method can include the step of obtaining a biological sample from the subject; determining, in the biological sample, the level of the metabolites of a biomarker prognostic panel which can include piperine, palmitoycarnitine, 3-methoxytyrosine, ocatanoylcarnitine, clinical blood lactate, X-12775, and the single sulfated steroid X-11302 and creatinine, 4-vinylphenol sulfate, cglycosyltryptophan, X-11261, X-12095, X-12100, 2-octenoylcarnitine and X-13553; where in the correlated presence of the metabolites of the panel in the biological sample indicates that the subject has sepsis with high rate of death. In a method the biological sample subject to the method is a bodily fluid. In a method the biological sample subject to the method is plasma.
A method for sepsis diagnosis in a subject is disclosed which can include (a) obtaining a biological sample from the subject; (b) determining, in the biological sample, the concentration of the metabolites of a biomarker prognostic panel chosen from (1) galactonate, uridine, maltose, glutamate, creatine and X-12644 and (2) citrulline, laurylcarnitine, androsterone sulfate, isoleucine, X-11838, X-12644, and X-11302; where in the correlated presence of the metabolites of the panel in the biological sample indicates that the subject has sepsis. In one method the biological sample subject to the method can be a bodily fluid. In one method the biological sample subject to the method can be plasma.
A method for determining the severity of a sepsis infection in a patient is disclosed that can involve measuring the age, mean arterial pressure, hematocrit, patient temperature, and the concentration of one or more metabolites that are predictive of sepsis severity. The method can involve obtaining a blood sample from said patient and determining the concentration of the metabolite in the patient's blood; and then determining the severity of sepsis infection by analyzing the measured values in a weighted logistic regression equation. The blood sample can be taken when the patient arrives for treatment and subsequently thereafter, for example about 24 hours afterword, to determine the progress of the disease and efficacy of treatment. Not all of the markers need be assessed in every method only a sufficient number of markers to reliably determine the severity of the disease. Thus, a plurality or number of indicators can be measured which are selected from the group that includes a patient's age, mean arterial pressure, hematocrit, patient temperature, and the concentration of a metabolite selected from the group of metabolite markers consisting of 2-methylbutyrylcarnitine, 4-cis-decenoylcarnitine, butyrylcarnitine, hexanoylcarnitine, 4-methyl-2-oxopentanoate, 1-arachidonoylglycerophosphocholine, 1-linoleoylglycerophosphocholine, 3-(4-hydroxyphenyl)lactate (HPLA), 3-methoxytyrosine, n-acetylthreonine, pseudouridine and lactate and their combinations. In some instances it may be possible to measure any two of these markers to assess sepsis severity. In more preferred embodiments three, four, five, six, seven, eight, ten, eleven or all twelve of the markers may be evaluated in the determination.
Preferably the accuracy of the panel in predicting day 7 sepsis survival in a known test patient population pool is about 90% or more, or more preferably about 95% or more and even more preferably about 99% or more.
The methods can also be used in the treatment of a sepsis patient. For example, to determine whether the disease is progressing and whether a therapeutic regimen is effective.
Other aspects and iterations of the invention are described in more detail below.
Clinical and metabolomic biomarker classifiers were developed to predict survival or death. Sparse models were developed at t0 using logistic regression along with penalized predictor reduction using a max number of 10 effects in the model, log 10 regularization parameter and 5 max number of categories allowed in a predictor, and cross validation, with 10 percent random holdout and 100 iterations was performed with JMP genomics 5.0 (SAS inc., Cary, N.C.). The analyses identified four clinical factors (Age, mean arterial pressure, hematocrit and temperature) and 12 metabolites (2-methylbutyrylcarnitine, 4-cis-decenoylcarnitine, butyrylcarnitine, hexanoylcarnitine, 4-methyl-2-oxopentanoate, 1-arachidonoylglycerophosphocholine, 1-linoleoylglycerophosphocholine, 3-(4-hydroxyphenyl)lactate (HPLA), 3-methoxytyrosine, n-acetylthreonine, pseudouridine and lactate) that reflected underpinning molecular mechanisms, and were also significantly different via ANOVA and Bayesian Factor Analysis.
A seven feature logistic regression model was developed utilizing 4-cis-decenoylcarnitine, 2-methylbutyrylcarnitine, butyrylcarnitine, hexanoylcarnitine, lactate, age, hematocrit and prognostic utility was assessed in t24, Rt0, and Rt24 datasets. Metabolite classifiers predicted outcomes better than proteins or clinical variables (Data not shown) with high AUCs (Table 1). Since the logistic regression model was developed utilizing all CAPSOD patients, it is possible that the model was over-fitted to best represent the CAPSOD cohort. Therefore, the finished model was independently validated against de-identified sepsis patients' metabolomic values that were graciously provided by Dr. Augustine Choi and the Brigham and Women's Hospital Registry of Critical Illness Cohort (RoCI; approved by the Partners Human Research Committee, protocol #2008-P-000495.(1)). Again, we saw similar strong prediction of sepsis survival and sepsis death utilizing our training set (Table 1). The accuracy, AUC, PPV and NPV of the current gold standards for prognostic assessment in sepsis (SOFA score ≧7, APACHE II score ≧25, and capillary lactate ≧4.0 mg/dL) were lower than most of the seven-feature logistic regression results in all datasets. AUC values at t0 and t24 of the logistic regression model were superior to the best published biomarker classifier (79% for 3-day prognosis).
14-cis-decenoylcarnitine, 2-methylbutyrylcarnitine, butyrylcarnitine, hexanoylcarnitine, lactate, age, hematocrit
2328 targeted assay values tested. All test sets and timepoints combined. Sepsis death, n = 93; sepsis survivors, n = 235.
3173 unique sepsis survivors (n = 124) and sepsis death (n = 49); 87 for training, 86 for test. 100 iterations. 4-cis-decenoylcarnitine, 2-methylbutyrylcarnitine, butyrylcarnitine, hexanoylcarnitine, age, hematocrit, MAP, temperature.
Models were refined using quantitative, targeted MS measurements of the 11 metabolites represented in the initial predictive classifiers in 378 samples and non-sparse, clinical parameters that differed significantly in survivors and deaths. First, the seven-feature logistic regression model was repeated in all sepsis death (n=93) compared to all sepsis survivors (n=235). Clinical lactate values were used in place of targeted assay measurements since the values for most patients were previously captured. Predictive performance was similar to the initially derived test and training sets (Table 1). Support vector machines were used to develop a weighted model for prediction of sepsis survival and death. Data from 173 unique sepsis survivors and deaths was used; where data from the same person was available at both t0 and t24, one time point was randomly chosen and included (87 for training and the remaining 86 for testing) to avoid testing on a trained patient. Values were normalized by subtracting the mean and dividing by the standard deviation. 100 random partitions were performed for training and test data for each setting. Parameters and weights for the linear SVM determined were 2-methylbutyrylcarnitine 0.1631, 4-cis-decenoylcarnitine 0.1629, butyrylcarnitine −0.4248, hexanoylcarnitine 0.0719, Temperature −0.2602, MAP −0.3157, Age 0.4838, Hematocrit −0.3419 and bias term −0.9959. With these weights, the AUC in 86 unique test subjects was 0.71 and accuracy was 74% (63% for 28-day sepsis death and 79% for sepsis survival).
Since we noted that variance in metabolomic profiles could be partially attributed to time-to-death we used the 11 metabolites and clinical features to build a seven-day outcome prediction model to determine if it was superior to 28-day outcome since the metabolomic variance attributable to outcome decayed with increasing time-to-death. Moreover, all eleven plasma metabolite concentrations correlated well between time-to-death and metabolite value (
The strong replication in internal and external validation sets, targeted assays, SVM analysis, and predictive time-to-death models suggest that metabolomic features described will provide strong utility for sepsis death and survival prediction at presentation.
The plasma metabolome, plasma proteome and blood transcriptome of over 200 rigorously phenotyped individuals with community-acquired sepsis or controls (SIRS without infection) were analyzed by mass spectrometry and mRNA sequencing, respectively, in discovery and validation studies at ED arrival and 24 hours later. Host responses to sepsis were dichotomous and predicted 28-day sepsis outcome: Molecular divergence of sepsis survivors, sepsis deaths and controls was present at ED arrival, increased after 24 hours, and continued to diverge as death approached. Analytes differed minimally among etiologic agents or between survivors with uncomplicated sepsis, severe sepsis or septic shock. While sepsis survivors mobilized and utilized diverse energy substrates aerobically, sepsis patients who would die exhibited impaired ft-oxidation of fatty acids, with acylcarnitine accumulation and RNA degradation. Concomitant changes in transcription provided explanations for proteomic and metabolic differences. Collapsed rare and common genetic variants in 20 genes showed significant association with survival and death.
The integration of systems surveys revealed sepsis to be a complex, heterogeneous and highly dynamic pathologic state and yielded new insights into molecular mechanisms of survival or death that could potentially enable predictive differentiation and individualized patient treatment. Early accumulation of catabolic intermediates of lipids, proteins, RNA and carbohydrates in plasma of sepsis patients who would die, most notably acyl carnitines, were found, together with widespread decreases in mRNA of genes involved in glycolysis and gluconeogenesis. These changes were reversed in sepsis survivors. Therefore, the primacy of metabolism was shown to be a determinant of sepsis survival and death. The present invention also presented structural studies showing mitochondrial derangements, decreased mitochondrial number and reduced substrate utilization in sepsis death, and progressive drop in total body oxygen consumption with increasing severity of sepsis. An early differential in sepsis survival or death is the presence or absence of mitochondrial biogenesis, respectively. Finally, sepsis-induced multiple organ failure occurs despite minimal cell death in affected organs and recovery occurs relatively rapidly in sepsis survivors, ruling out other potential mechanisms of sepsis death. A causal role for elevated acylcarnitines in sepsis death is discovered by the finding that micromolar palmitoylcarnitine causes ventricular contractile dysfunction. Furthermore, adults with Mendelian mutations of acylcarnitine metabolism have similar metabolic derangements and high rates of sudden death. Alternatively, the differences observed in corticoid levels in sepsis survivors and nonsurvivors may be token neuro-hormonal control of disparate metabolic responses to sepsis.
The immediacy of the metabolic dichotomy in sepsis—before organ failure or shock became established—was very surprising. Survivors and deaths did not differ significantly in medication prior to enrollment. However, nucleotide variants in 20 genes showed evidence as risk factors for a pre-existing susceptibility and an adverse outcome. The functions of these genes concurred with the molecular differences between sepsis survival and death: single stranded DNA binding protein 1 is involved in mitochondrial biogenesis; SLC16A13 transports lactate and pyruvate; vitamin K epoxide reductase complex, subunit 1, is important for blood clotting; CCAAT/enhancer binding protein ε is important in granulocyte maturation and response to TNFα; NADH dehydrogenase 1 α2 and β8 are components of the mitochondrial electron transport chain.
Also surprising was the molecular homogeneity of uncomplicated sepsis, severe sepsis and septic shock, challenging the traditional notion of a temporal or molecular pyramid of sepsis progression. Additional longitudinal investigation of the host metabolic response to sepsis is needed to address more fully the temporal dynamics and general relevance of this dichotomy in community-acquired and nosocomial sepsis among diverse patient populations, ages and types of infection. Investigation of the relevance of host metabolic dichotomy to other SIRS-inducing conditions, such as trauma, hyperthermia and drug-induced mitochondrial damage, is also needed. The reversibility of the death phenotype by targeted interventions such as early goal-directed therapy, succinate administration or enhancement of mitochondrial biogenesis needs to be assessed. Global and temporal correlation of metabolome, proteome and transcriptome data from relevant biological fluids and well phenotyped patient groups, is suitable for understanding of intermediary metabolism, particularly with respect to poorly annotated analytes, and for characterization of homogeneous subgroups in complex traits. Combinations of transcriptome, proteome, metabolome and genetic data may establish multi-dimensional molecular models of disease that could provide insights into network responses to intrinsic and/or extrinsic perturbation.
Global correlations of plasma proteomic and metabolomic datasets recapitulated known mass action kinetic models of catalysis or physicochemical complex assembly and suggested novel models disclosed herein. Hierarchical clustering of correlations predicted class membership for unannotated biochemicals that were substantiated by structural determination. The clinicometabolomic model disclosed herein predicted day-7 survival with 99% accuracy, providing basis for individualized sepsis treatment. Therefore the invention is proved to be useful for predictive differentiation and nomination of novel potential interventions in complex pathologic states.
The following examples set forth preferred materials and procedures in accordance with the present invention. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods, devices, and materials are now described. It is to be understood, however, that these examples are provided by way of illustration only, and nothing therein should be deemed a limitation upon the overall scope of the invention.
Summary:
Patients presenting at EDs at Henry Ford Hospital, Duke University Hospital, and Durham Veterans Affairs Medical Center with suspected sepsis (≧2 SIRS criteria and infection) were enrolled. The CAPSOD study was approved by institutional ethics committees and written informed consent was given by patients. Physical examination and blood sample collection were performed at enrollment and 24 hrs later. Patients were followed for 28 days. Anonymized demographic and clinical data was stored in compliance with HIPAA regulations (ProSanos Inc., Harrisburg, Pa.). Following blinded, expert audit of infection status and outcomes, 150 matched subjects were chosen for discovery studies. Patients were classified as non-infected SIRS-positive uncomplicated sepsis, severe sepsis, septic shock or sepsis death. t0 and t24 samples from another 52 matched sepsis survivors and deaths were used for validation. Plasma metabolites were prepared and analyzed by high performance liquid chromatography and linear ion trap quadrupole (LTQ) MS with electrospray ionization and by gas chromatography and fastscanning dual-stage quadrupole MS with electron impact ionization (Metabolon Inc, Durham, N.C.). Plasma proteins were immunodepleted by GenWay Seppro IgY-12 columns and analyzed by LTQ MS in triple-play mode (Monarch Life Sciences Inc.). mRNA was isolated from blood samples and sequenced on Illumina GAIIx instruments. Statistical analysis employed JMP Genomics 5.0 (SAS Institute).
CAPSOD Study Sites and Patients:
The Community Acquired Pneumonia and Sepsis Outcome Diagnostics (CAPSOD) study was approved by the Institutional Review Boards of the National Center for Genome Resources (Santa Fe, N. Mex.), Duke University Medical Center (Durham, N.C.), Durham Veteran Affairs Medical Center (Durham, N.C.) and Henry Ford Hospital (Detroit, Mich.) and filed at ClinicalTrials.gov (NCT00258869). Inclusion criteria were presentation of adults at the emergency department with known or suspected acute infection and presence of at least two of the four systemic inflammatory response syndrome (SIRS) criteria (tympanic temperature <36° C. or >38° C., tachycardia >90 beats per minute, tachypnea >20 breaths per minute or PaCO2<32 mmHg, white cell count <4000 cells/mm3 or >12,000 cells/mm3 or >10% neutrophil band forms). Exclusion criteria were as previously described. Patients were enrolled from 2005 through 2009 in emergency departments at each institution and written informed consent was obtained by all study participants or their legal designates.
Clinical Data Collection:
Patient demographics, exposures, past medical history, results of physical examination, APACHE II score, SOFA score, development of ALI or ARDS and treatment were recorded at enrollment (t0) and at 24 hours (t24) by a nurse practitioner or physician using online electronic data capture (Prosanos Inc., Harrisburg, Pa.) as previously described. Microbiologic evaluation was as indicated clinically, supplemented by urinary pneumococcal and Legionella antigen tests. Finger-stick lactate values were obtained. After 28 days, charts were reviewed and largest deviations of clinical and laboratory parameters from normal were recorded, together with outcome measures, microbiologic results, treatment and time-to-events. Blood for metabolomic and proteomic analyses was collected in bar-coded EDTA-plasma tubes at enrollment (t0) and the following day (t24), incubated on ice, plasma separated (within 4 hours), and aliquots stored at −80° C. Blood for mRNA sequencing was collected in PaxGene tubes at enrollment (t0) and the following day (t24), incubated at room temperature and stored at −20° C.
Clinical Data Audit and Discovery Cohort Selection:
All subject records were adjudicated independently by a study physician to determine whether presenting symptoms and signs were due to infection, etiologic agent, site of infection, patient outcomes and times-to-outcomes. Patients were clinically categorized based on infection likelihood and microbial etiology: definite infection, causative organism identified; definite infection, causative organism uncertain; indeterminate, infection possible; no evidence of infection; and no evidence of infection and diagnosis of a non-infectious process accounting for SIRS. 150 patients were selected from the definite infection and non-infection categories for plasma metabolome and proteome analyses as follows: non-infected patients with >2 SIRS criteria (n=29); uncomplicated sepsis (sepsis without progression and with survival at day 28; n=27); severe sepsis (sepsis at t0 with progression to severe sepsis by day 3, n=25); septic shock (sepsis at t0 with progression to septic shock by day 3, n=38); sepsis deaths (sepsis with death by day 28, n=31). Patients with sepsis were further selected to enrich for confirmed infections due to E. coli, S. aureus, and S. pneumoniae. Within these constraints, groups were matched for age, race, sex and enrollment site. The estimated glomerular filtration rate (eGFR) was calculated as described.
Metabolite Sample Preparation and Gas Chromatography/Mass-Spectrometry and Liquid Chromatography/Mass-Spectrometry Analysis:
Plasma samples were thawed on ice at Metabolon Inc. (Durham, N.C.), and 100 μL was extracted using an automated MicroLab STAR system (Hamilton Company, Reno, Nev.), as described. A well characterized human plasma pool (“Matrix”, MTRX) was also included as a technical replicate, to assess variability and sensitivity in the measurement of all consistently detected chemicals. A single solvent extraction was performed with 400 μl of methanol containing recovery standards by shaking for two minutes using a Geno/Grinder 2000 (Glen Mills Inc., Clifton N.J.). After extraction, the sample was centrifuged, the supernatant removed and split into four equal aliquots: two for LC/MS, one for GC/MS, and a reserve aliquot. Aliquots were dried under vacuum overnight on a TurboVap (Zymark, Hopkinton, Mass.). Samples were maintained at 4° C. throughout the extraction process. For LC/MS analysis, aliquots were reconstituted in either 0.1% formic acid (for positive ion LC/MS), or 6.5 mM ammonium bicarbonate pH 8.0 (for negative ion LC/MS) containing internal standards for chromatographic alignment. For GC/MS analysis, aliquots were derivatized using equal parts N,O-bistrimethylsilyl-trifluoroacetamide and a mixture of acetonitrile:dichloromethane:cyclohexane (5:4:1) with 5% triethylamine at 60° C. for 1 hour. The derivatization mixture also contained a series of alkyl benzenes that served as retention time markers.
LC/MS was carried out using an Acquity UPLC (Waters Corporation, Milford, Mass.) coupled to a linear ion trap quadrupole (LTQ) mass spectrometer (Thermo-Fisher Scientific Inc., Waltham, Mass.) equipped with an electrospray ionization source. Two separate LC/MS injections were performed on each sample: the first optimized for positive ions, and the second for negative ions. The mobile phase for positive ion analysis consisted of 0.1% formic acid in H2O (solvent A) and 0.1% formic acid in methanol (solvent B), whereas that for negative ion analysis consisted of 6.5 mM ammonium bicarbonate, pH 8.0 (solvent A) and 6.5 mM ammonium bicarbonate in 95% methanol (solvent B). The acidic and basic extracts were monitored for positive and negative ions, respectively, using separate acid/base dedicated 2.1×100 mm Waters BEH C18 1.7 μm particle columns heated to 40° C. The extracts were loaded via a Waters Acquity autosampler and gradient-eluted (0% B to 98% B, with an 11 minute runtime) directly into the mass spectrometer at a flow rate of 350 μl/min. The LTQ alternated between full scan mass spectra (99-1000 m/z) and data-dependent MS/MS scans, which used dynamic exclusion.
Derivatized samples were analyzed on a Thermo-Fisher Scientific Trace DSQ fastscanning single-quadrupole MS set at unit mass resolving power. The GC column was 20 m×0.18 mm with 0.18 μm film phase consisting of 5% phenyldimethyl silicone. The temperature program ramped from 60° C. to 340° C., with helium as the carrier gas. The MS was operated using electron impact ionization with a 50-750 amu scan range, tuned and calibrated daily for mass resolution and mass accuracy. Samples were randomized to avoid group block effects and were analyzed over five platform days (for discovery group samples) or two platform days (for replication group samples). Six MTRX aliquots, an internal standard sample (see below) and various control samples were included in each run.
Metabolites were identified by automated comparison to a reference library of purified external standards using Metabolon software developed for creating library entries from known chemical entities with automatic fitting of reference to experimental spectra. Peaks that eluted from the LC or GC methods were compared to the library at a particular retention time and associated spectra for that metabolite. Internal standards were used to calibrate retention times of metabolites across all samples. Platform variability was determined by calculating the median relative standard deviation (RSD) for the internal standard compounds that were added to every sample. Overall variability (including sample preparation) was determined by the median RSD for 261 endogenous metabolites present in all MTRX samples. Peptides were identified using standard tandem mass spectrometry sequencing.
Raw area counts for each metabolite in each sample were normalized to correct for variation resulting from instrument inter-day tuning differences. For each metabolite, the raw area counts were divided by the median value for each run-day, therefore setting the medians to 1.0 for each run. This preserved variation between samples, but allowed metabolites of widely different raw peak areas to be compared on a similar graphical scale. Missing values were imputed with the observed minimum after normalization. However, metabolites with missing values in >50% of the samples were excluded from analysis.
Identification of Unknown Biochemical X-11234:
The unknown compound X-11234 was identified as cis-4-decenoyl carnitine based on comparison of its mass spectrum and chromatographic retention time with an authentic standard.
Quantitative LC/MS/MS Measurements:
A combined internal standard working solution was made, comprising butyrylcarnitine-d3 at 400 μg/mL, 2-methylbutyrylcarnitine-d3 at 200 μg/mL, hexanoylcarnitine-d3 at 200 μg/mL and cis-4-decenoylcarnitine-d3 (Universidad Autonoma de Madrid, Spain) at 400 μg/mL in acetonitrile/water (1:1). Six calibration samples were made in acetonitrile/water (1:1): Standard A: butyrylcarnitine 2 μg/mL, 2-methylbutyrylcarnitine 4 μg/mL, hexanoylcarnitine 2 μg/mL, cis-4-decanoylcarnitine 40 μg/mL. Standard B: butyrylcarnitine 4 μg/mL, 2-methylbutyrylcarnitine 8 μg/mL, hexanoylcarnitine 4 μg/mL, cis-4-decanoylcarnitine 80 μg/mL. Standard C: butyrylcarnitine 10 μg/mL, 2-methylbutyrylcarnitine 20 μg/mL, hexanoylcarnitine 10 μg/mL, cis-4-decanoylcarnitine 200 μg/mL. Standard D: butyrylcarnitine 40 μg/mL, 2-methylbutyrylcarnitine 80 μg/mL, hexanoylcarnitine 40 μg/mL, cis-4-decanoylcarnitine 800 μg/mL. Standard E: butyrylcarnitine 100 μg/mL, 2-methylbutyrylcarnitine 200 μg/mL, hexanoylcarnitine 100 μg/mL, cis-4-decanoylcarnitine 2000 μg/mL. Standard F: butyrylcarnitine 200 μg/mL, 2-methylbutyrylcarnitine 400 μg/mL, hexanoylcarnitine 200 μg/mL, cis-4-decanoylcarnitine 4000 μg/mL. 50 μL of 393 human EDTA plasma samples, 48 quality control plasma aliquots, 6 calibration standards and a blank internal standard (H2O) were each spiked with 20 μL of internal standard working solution and 50 μL of acetonitrile/water (1:1) and 200 μL of methanol. Samples were vortexed and centrifuged to precipitate proteins. 180 μL of the supernatant was dried under a stream of nitrogen at 40° C., reconstituted in 75 μL of water, vortexed, centrifuged and injected onto a Waters Acquity UPLC/Thermo Quantum Ultra triple quadrupole LC/MS/MS system with HESI source equipped with a reversed phase chromatographic column. The peak areas of the respective product ions were measured against the peak areas of the corresponding internal standard product ions. The monitored ion masses (SRM mode) were: as follows: for butyrylcarnitine, parent ion 232.2+0.5, product ion 85.0+0.5; For butyrylcarnitine-D3, parent ion 235.2+0.5, product ion 85.0+0.5; For 2-methylcarnitine, parent ion 246.2+0.5, product ion 85.0+0.5; For 2-methylcarnitine-D3, parent ion 249.2+0.5, product ion 85.0+0.5. For hexanoylcarnitine, parent ion 260.2+0.5, product ion 85.0+0.5; For hexanoylcarnitine-D3, parent ion 263.2+0.5, product ion 85.0+0.5; For cis-4-decenenoylcarnitine, parent ion 314.2+0.5, product ion 85.0+0.5; For cis-4-decenoylcarnitine-D3, parent ion 317.2+0.5, product ion 85.0+0.5. 1. Chromatographic conditions were: Mobile phase A, 0.1% formic acid in water; Mobile phase B, 0.5% formic acid in acetonitrile; UHPLC column, Waters Acquity C 18 BEH, 1.7 micron 2.1×100 mm; Injection volume, 10 μL. Quantitation was performed using a weighted linear least squares regression analysis generated from fortified calibration standards prepared immediately prior to each run. The dynamic range was 2.00-200 μg/mL for butyrylcarnitine, 4.00-400 μg/mL for 2-methylbutyrylcarnitine, 2.00-200 μg/mL for hexanoylcarnitine and 40.0-4000 μg/mL for cis-4-decenoylcarnitine. 48 replicate plasma quality control sample aliquots were interspersed and analyzed together with the study samples and a calibration curve at the beginning and end of each run. The interday % RSD (total of 8 analytical runs) for butyrylcarnitine was 5.1%, 2-methylcarnitine was 4.9%, hexanoylcarnitine was 5.8% and cis-4-decenoylcarnitine was 4.8%.
Proteome Sample Preparation and Mass Spectrometry Analysis (Monarch Life Sciences):
Plasma samples were thawed on ice at Monarch Life Sciences Inc. and the top-12 most abundant proteins (albumin, IgG, fibrinogen, transferrin, IgA, IgM, haptoglobin, α2-macroglobulin, α1-acid glycoprotein, α1-antitrypsin and apolipoprotein A-I and A-II) were removed using Seppro IgY-12 Columns (GenWay Biotech Inc.). Column flow-throughs were denatured by 8M urea, reduced by triethylphosphine, alkylated by iodoethanol and digested by trypsin, as described. Tryptic digests (˜20 μg) were analyzed using a Thermo-Fisher Scientific LTQ linear ion-trap mass spectrometer coupled with a Surveyor HPLC system. Peptides were separated on a C18 reverse phase column (i.d. =2.1 mm, length=50 mm) with a flow rate of 200 μl/min and eluted with a gradient from 5 to 45% acetonitrile developed over 120 min. All injections were randomized and the instrument was operated by the same operator for the study. Data were collected in the triple-play mode (MS scan, zoom scan and MS/MS scan). Data were filtered and analyzed as described. Database searches against the IPI (International Protein Index) human database (v3.48) and the non-Redundant-Homo Sapiens database (update July 2009) were carried out using both the X!Tandem and SEQUEST algorithms. Parameters were set as follows: a mass tolerance of 2 Da for precursors and 0.7 Da for fragment ions, two missed cleavage sites allowed for trypsin, carbamidomethyl cysteine as fixed modification, and oxidized methionine as optional modification. The q-value represented peptide false identification rate and was calculated by incorporating Sequest and X!Tandem results in addition to a number of other relevant factors such as Ä [M+H]+ and charge state. Observed peptide MS/MS spectrum and theoretically derived spectra were used to assign quality scores (Xcorr in SEQUEST and e-Score in X!Tandem). Protein identities were assigned priority scores (from 1 to 4): based on the peptide ID confidence (q-value) and the number of unique peptides used for protein identification: Priority 1, high peptide confidence (>90%) and multiple unique sequences; Priority 2, high peptide confidence (>90%) and single peptide sequence; Priority 3, moderate peptide confidence (between 75% and 89%) and multiple unique sequences; Priority 4, moderate peptide confidence (between 75% and 89%) and single peptide sequence. Priority 1 protein identifications were employed for analyses, except protein-metabolite correlations, which also employed Priority 2 identifications that were observed at both t0 and t24. Protein quantification was carried out using the method of Higgs et al. Briefly, raw files were acquired from the LTQ and all extracted ion chromatograms (XIC) were aligned by retention time. For protein quantification, each aligned peak must match four criteria: precursor ion, charge state, fragment ions (MS/MS data) and retention time (within a one-minute window). After alignment, area-under-the-curve (AUC) for each individually aligned peak from each sample was measured and compared for relative abundance. As an example, the XICs and ANOVA for chicken lysozyme (an external control) in 150 subjects at t0 are appended.
Peak intensities were log 2 transformed before quantile normalization90 to ensure that every sample had a peptide intensity histogram of the same scale, location and shape. Normalization removed trends introduced by sample handling, sample preparation, total protein differences and changes in instrument sensitivity while running multiple samples (data not shown). If multiple peptides had the same protein identification, then their quantile normalized log 2 intensities were averaged to obtain log 2 protein intensities.
Proteome Mass Spectrometry Analysis:
Raw LC-MS/MS data files collected on a LTQ Linear Ion Trap (ThermoFisher Scientific, Waltham. MA) were delivered to the Duke Proteomics Core Facility as .raw files with appropriate deidentified clinical data. The centroid MS/MS data was processed into .mgf files using Mascot Distiller v2.0 (Matrix Sciences, Inc Boston, Mass.), and searched with Mascot v2.2. Mascot was set up to search the Swissprot v57.5 database (www.uniprot.org) with human taxonomy and decoy database enabled, trypsin specificity with a maximum of 2 missed cleavages, and 2 Da precursor and 0.8 Da product ion mass accuracy. Iodoacetamide derivative of cysteine was specified as a fixed modification, and deamidation of asparagine, deamidation of glutamine, and oxidation of methionine were specified in Mascot as variable modifications. Scaffold version 3.0 (Proteome Software Inc., Portland, Oreg.) was used to import search results directly from Mascot and validate MS/MS based peptide and protein identifications. Because of the number of analyses, the time zero (n=150) and 24 hour (n=131) datasets were imported and validated in Scaffold independently. For both data sets, peptide identifications were accepted if they could be established at greater than 50.0% probability as specified by the Peptide Prophet algorithm, and protein identifications were accepted if they could be established at greater than 90.0% probability and contained at least 1 identified peptide. Protein probabilities were assigned by the Protein Prophet algorithm92. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. Non-normalized spectral counting reports were then exported independently for each of the datasets, and compiled in Microsoft Excel 2007. Using the Protein Prophet scores, the protein search results from both datasets were compiled, sorted and curated using reverse (decoy) sequences identified to set the protein false discovery rate of the aggregate dataset to 2.5%. Proteins identified below this threshold were discarded from the dataset. Follow-up comparative quantitation between individuals and timepoints was performed using spectral counting in the form of number of identified spectra per protein.
Transcriptome Sample Preparation and mRNA Sequencing:
RNA was prepared using a PaxGene Blood RNA kit (Qiagen, Germantown, Md.) according to the manufacturer's instructions. Briefly, nucleic acids were pelleted by centrifugation, washed and treated with proteinase K. Residual cell debris was removed by centrifugation through a column. Samples were equilibrated with ethanol and total RNA was isolated using a silica membrane. Following washing and DNase I treatment, RNA was eluted. RNA integrity was determined by 2100 Bioanalyzer microfluids using RNA 600 Nano kit (Agilent). RNA samples were stored at −80° C.
mRNA sequencing libraries were prepared from total RNA according to Illumina's mRNA-Seq Sample Prep Protocol v2.0/2007. Briefly, mRNA was isolated using oligo-dT magnetic Dynabeads (Invitrogen, Carlsbad, Calif.). Random-primed cDNA was synthesized and fragments were 3′ adenylated. Illumina DNA oligonucleotides adapters for sequencing were ligated and 350-500 bp fragments were selected by gel electrophoresis. cDNA sequencing libraries were amplified by 18 cycles of PCR and quality was assessed with the Bioanalyzer. cDNA libraries were stored at −20° C.
Biological replicate cDNA libraries, prepared from whole blood extracted from an anonymous healthy individual, were sequenced on the Illumina GAII instruments as 36-cycle singleton reads. CAPSOD experimental samples were sequenced on Illumina GAII instruments 54-cycle singleton reads). Base calling used the Illumina Pipeline software v1.4, except for 14 samples which used v1.3. Approximately 500 million high quality reads were generated per sample. Reads were aligned to the NCBI human nuclear genome reference build 37 and the corresponding human mitochondrial genome reference using the algorithm GSNAP (Mar. 9, 2010 release). GSNAP alignment parameters were: maximum mismatches=((readlength+2)/12)−2; indel penalty=1; trim=1; indel endlength=12; maximum middle deletion size=6000 nt; maxmiddle-insertions=60. Uniquely aligned reads were enumerated on a RefSeq gene-by-gene basis and expressed as aligned reads per million. Variants were detected in reads aligned by GSNAP.
Variants were retained if present in >=4 reads of Q>=20 and >14% reads, with the exception of mitochondrial variants, which were retained if present in >10% reads. Numeric genotypes (0, homozygous reference; 1, heterozygous; 2, homozygous variant, •, nucleotide coverage <4 reads) were imputed in reads aligning to the nuclear genome; mitochondrial variants were assigned present or absent (0, absent [present in <10% reads]; 1, present [>=10% reads]; •, nucleotide coverage <4 reads). Heterozygous nuclear variants were present in 14-86% of reads; homozygotes were represented by reads with <14% or >86% variant calls, as described.
Statistical Analyses:
Overlaid kernel density estimates, univariate distribution results, correlation coefficients of pair wise sample comparisons, unsupervised principal components analysis (by Pearson productmoment correlation) and Ward hierarchal clustering of Pearson product-moment correlations were performed using log2-transformed data as described using JMP Genomics 5.0 (SAS Institute). Decomposition of principal components of variance, including patient demographics, past medical history, laboratory and clinical values, was performed to maximize sepsis-group related components of variance and minimize residual variance. Guided by these analyses, ANOVA was performed between sepsis groups, with 5 or 10% false discovery rate (FDR) correction and inclusion of substantive non-hypothesis components of variance as fixed effects. These included renal function, as determined by the estimated glomerular filtration rate (eGFR) using the four variable modification of diet in renal disease calculation96, hemodialysis (HD), cirrhosis and liver disease, hepatitis, neoplastic disease, congenital disease, administration of exogenous immunosuppressants, drug abuse, metabolic dysfunction, respiratory dysfunction, serum glucose levels and mean arterial pressure (MAP). Predictive modeling was performed with JMP Genomics 5.0 using logistic regression, K nearest neighbors, partial least squares, partition trees and radial basis machines. Cross-validation was performed using 50 iterations and 10% sample omission.
Variant associations with survival/death were performed by comparing a binary trait with numeric genotypes of both common and rare variants. Rare variants were recoded according to a dominant model and combined within genes into a single locus. Association tests were then performed using JMP Genomics 5.0 on each single locus (using Person chi-square and Fisher's exact test) and combined tests on all variants within a gene (using Hotelling's T-squared test or on the principal components representing the variants as a regression model). The significance cutoff was −log 10(p value)>8.0. Significant associations were retained if observed in at least 60 samples, had at least moderately altered odd ratios, and following manual inspection of read alignments to confirm variant calls.
Ingenuity Pathway Analysis software (version 8.7, content version 3203) was used to assign biological functions to differentially expressed genes.
Pairwise cross correlations were performed using JMP Genomics 4.0 software to compare protein and metabolite values at t0 and t24 using Pearson moment-correlation. Briefly, all proteins and all metabolites were included, with the exception of unannotated GC/MS determined compounds or redundant entries. Metabolite and protein log 2 values were transposed into a wide format and the correlations were merged based on patient identification. Protein metabolite correlations were considered significant if observed at t0 and t24 with p-values <0.05 and <0.1, or at a single time point with Bonferroni correction. To identify significant, sepsis associated correlations, the same analysis was performed but limited only to proteins or metabolites that were significant at both time points with concordant changes.
Unannotated metabolites and proteins, except the sulfated steroids X-11245 and X11302, were removed.
Support vector machines (SVM), both linear and with RBF kernels, were used for binary classification of sepsis survivors and deaths (SD). Data from 173 unique sepsis survivors and deaths was used; where data from the same person was available at both t0 and t24, one time point was randomly chosen and included. Features were either four quantitative MS-assays of acylcarnitines or the four acylcarnitines and four non-sparse, clinical parameters that showed significant differences between survivors and deaths (age, temperature, MAP and hematocrit). 100 random partitions were performed for training and test data for each setting. SVM performance was evaluated by test data scores for area under the receiver operating characteristic (ROC) curve (AUC) and accuracy. Accuracy was highly dependent on the threshold chosen for the scores. In all experiments, the scores of training samples were sorted and the N_SDth score was used as the threshold with test data. Parameter weights were derived for linear SVM.
The following examples illustrate preferred embodiments of the present invention. These examples are provided for illustration only and the invention is not limited by these examples.
1,152 individuals with suspected, community-acquired sepsis (acute infection and ≧2 SIRS criteria16) were enrolled prospectively at three urban, tertiary-care EDs in the United States between 2005 and 2009 [Community Acquired Pneumonia and. Sepsis Outcome Diagnostics (CAPSOD), ClinicalTrials.gov NCT00258869]. Medical history, physical examination, acute illness scores (APACHE II and SOFA) and blood samples were recorded at enrollment (t0) and 24 hours later (t24;
150 CAPSOD enrollees were selected for mass spectrometry (MS)-based venous plasma metabolome and proteome profiling at t0 and t24, venous blood mRNA-seq at t0 and integrative analysis (
1Despite adequate fluid resuscitation or adequate intravascular volume
3If SaO2 only, PaO2 calculated from standard oxyhemoglobin dissociation curve with assumption of normal pH
The latter were considered to be ideal molecular controls since at ED arrival they had a SIRS-associated illness that was clinically indistinguishable from sepsis (Table 3). In addition, they matched the sepsis groups in rates of progression (day 3 organ dysfunction or shock) and 28-day death, allowing a distinction to be made between the pathognomonic molecular events of sepsis progression and those common to progression in other SIRS-associated, acute illnesses (Table 3).
Patients were selected to match groups for most material phenotypes at presentation (number of SIRS criteria, age, race, sex, enrollment site, renal function and co-morbidity) but differed in temperature, APACHE II and SOFA scores (Tables 4 and 5). All sepsis patients were independently determined by an expert physician to have definite infections. Non-consecutive patients were added to sepsis groups to increase those with Streptococcus pneumoniae (and thereby for lobar pneumonia; n=31), Escherichia coli (and thereby for urosepsis; n=16) and Staphylococcus aureus (and thereby for skin, soft tissue, and catheter associated infections; n=27) to allow limited etiologic comparisons to be undertaken. Validation studies employed in an independent CAPSOD sample of 18 sepsis deaths and 34 matched sepsis survivors (at t0 [Rt0] and t24 [Rt24]: Table 6). The validation set included all remaining sepsis deaths in CAPSOD at time of selection, and, as a result differed in median time-to-death from the discovery cohort (18.5 days vs. 10.7 days, respectively).
S. aureus bacteremia
S. pneumonia bacteremia
E. coli bacteremia
1Constrained - little or no choice; ≧2 SIRS;
2Day 0-3;
3Day 1-28;
4Henry Ford Hospital System
Plasma biochemicals of mass-to-charge (m/z) ratio 100-1000 Da were measured in 150 discovery patients using label-free, liquid and gas chromatography and MS. Of approximately 4,413 biochemicals detectable in human tissues, 439 were measured at t0 or t24 and 332 were detected at both times. 215 and 224 of the biochemicals detected at t0 and t24, respectively, were annotated metabolites (
Group differences between mean plasma metabolite values were sought in cross-sectional studies at t0 or t24. Principal component analysis (PCA) and Bayesian factor analysis with normalized energy plots both demonstrated the main sources of inter-individual variation in the plasma metabolome to be renal function, liver disease and sepsis group membership (
Differences between groups were sought by analysis of variance (ANOVA). Non-sepsis-related effects were minimized by inclusion of renal function and liver disease as fixed effects and/or by separating renal and sepsis group effects. Since acute renal dysfunction partially co-segregated with sepsis death this strategy may have been too conservative in sepsis outcome comparisons (Table 7, 8).
No plasma metabolite differed significantly between sepsis survivor subgroups (uncomplicated sepsis, day 3 severe sepsis, day 3 septic shock) or between infectious agents (S. pneumoniae, S. aureus or E. coli;
The metabolic differences of sepsis survivors from controls were reversed in sepsis deaths. 76 plasma metabolites differed between sepsis survivors and deaths at t0, increasing to 128 at t24 (FDR 5%;
Plasma metabolites were assayed in all remaining CAPSOD sepsis deaths (n=18) and 34 additional, matched sepsis survivors to seek confirmation of the discovery findings. (
Additional validation was obtained by retesting all 393 samples using targeted, quantitative assays of 11 metabolites representative of the major findings. While inter-individual variability was considerable, the differences between sepsis survivor, sepsis death and control groups were confirmed (
Proteomic analysis of these samples provided an orthogonal survey of host response in sepsis survival and death (
In contrast, sepsis survivors differed from controls in levels of 15 and 23 plasma proteins at t0 and t24, respectively (stratified ANOVA, FDR 5%;
Akin to the metabolome, the plasma proteome disclosed a dichotomous host response in sepsis survivors and deaths (64 and 27 protein differences at t0 and t24, respectively;
Transcription in venous blood of patients at ED arrival was evaluated by sequencing mRNA from the discovery cohort at t0 (
Differences in transcript abundance between sepsis survivors and controls and sepsis survivors and deaths were strikingly skewed (
Other prominent functional classes that differed in mRNA abundance in sepsis outcome were kinases, transporters, and peptidases (
Transcriptome differences suggested elevation of metabolic rate in sepsis survivors: RNAs for 41 nuclear-encoded mitochondrial proteins were significantly increased in sepsis survivors (compared with controls) and 15 were decreased in sepsis death (
Transcription of innate immune effectors was markedly different in sepsis survivors and deaths (
Finally, among the small number of mRNAs that were significantly increased in sepsis death were six involved in coagulation and endothelial cell adhesion (angiopoietin-like 2, thrombin receptor-like 2, glycophorin B, kallikrein-related peptidase 8, lymphatic vessel endothelial hyaluronic receptor 1 and PFTAIRE protein kinase). Together with complement regulator CD59, which was decreased in sepsis death, these transcriptional changes agreed with the observed perturbation in thrombolysis and complement proteins in sepsis deaths (
Common and rare expressed genetic variants that might underpin the molecular differences in sepsis survivors and deaths were sought. Variants were identified and genotyped in peripheral blood transcripts from 142 subjects at nucleotides with adequate coverage (present in ≧4 reads of Q≧20 and >14% reads), as described. Variant genotypes and collapsed variant genotypes within a gene were tested for association with 28-day survival or death under the common disease:common variant and common disease:rare variant hypotheses. Of 384,283 nuclear and mitochondrial mRNA variants, none showed a significant association. However, combined variants in 20 genes showed significant associations with outcome (−log10(p) value≦32; Hotelling T-squared test or regression analysis of principal components representing the combined variants), were observed in at least 60 samples and had at least moderately altered odd ratios in survivor:death and sepsis survivor:death comparisons (Table 6). Several of these genes were plausible functional candidates for risk of adverse sepsis outcome: 4 encoded mitochondrial proteins and 9 exhibited altered mRNA levels in sepsis survival and death. Notably, subunits α2 and β8 of NADH dehydrogenase 1, a component of the mitochondrial electron transport chain, had excess variants in sepsis deaths.
Common and rare expressed genetic variants that might underpin the molecular differences in sepsis survivors and deaths were sought. Variants were identified and genotyped in peripheral blood transcripts from 142 subjects at nucleotides with adequate coverage (present in ≧4 reads of Q≧20 and >14% reads), as described. Variant genotypes and collapsed variant genotypes within a gene were tested for association with 28-day survival or death under the common disease:common variant and common disease:rare variant hypotheses49. Of 384,283 nuclear and mitochondrial mRNA variants, none showed a significant association. However, combined variants in 20 genes showed significant associations with outcome (−log10(p) value≦32; Hotelling T-squared test or regression analysis of principal components representing the combined variants), were observed in at least 60 samples and had at least moderately altered odd ratios in survivor:death and sepsis survivor:death comparisons (Table 6). Several of these genes were plausible functional candidates for risk of adverse sepsis outcome: 4 encoded mitochondrial proteins and 9 exhibited altered mRNA levels in sepsis survival and death. Notably, subunits α2 and 138 of NADH dehydrogenase 1, a component of the mitochondrial electron transport chain, had excess variants in sepsis deaths.
Surveys of the plasma proteome and metabolome were also integrated by global cross-correlations and hierarchical clustering of correlations (
4,106 of 53,784 plasma protein-metabolite correlations were concordant at t0 and t24 and statistically significant (Bonferroni-corrected log10 p-value<−6.03; data not shown). These included known mass action kinetic models of catalysis or physicochemical complex assembly: Ribonuclease A1 correlated with 12 downstream products of its action (N6-carbamoylthreonyladenosine, N2,N2-dimethylguanosine, pseudouridine, arabitol, arabinose, erythritol, erythronate, gulono-1,4-lactone, allantoin, phosphate, xylonate and xylose). Hemoglobin subunits α1, β, δ and ζ correlated with the component heme, allosteric effector adenosine-5-monophosphate and degradation product xanthine. Subunit D of succinate dehydrogenase (SDHD, a high confidence protein identification supported by a single peptide) correlated with 3 downstream citric acid cycle intermediates (L-malate, oxaloacetate and citrate;
Co-cluster hierarchies and correlations also suggested novel reaction models: Thus, SDHD correlated with pyruvate, lactate and acetyl-carnitine, suggesting novel regulation of the citric acid cycle (
Only 3 plasma proteins or metabolites correlated significantly with blood transcripts: levels of fatty acid binding protein 1 and S100A9 correlated with their respective mRNAs (Pearson coefficients 0.49; −log10p=9.0 and 8.8, respectively). Uridine phosphorylase 1 mRNA correlated inversely with plasma uridine (r2=−0.48, −log10p=8.7), consistent with their enzyme-substrate relationship. The paucity of mRNA correlations likely reflects the small effect of blood cells to MS-detected plasma protein and metabolite levels, relative to liver and muscle.
The goal of the current study was to identify markers for prompt and objective determination of prognosis in individual sepsis patients in order to tailor treatment dynamically. Since such markers have been sought for decades, an innovative approach, with three premises, was taken. Firstly, comprehensive, hypothesis-agnostic description of the molecular antecedents of survival and death was posited to yield new, unbiased insights. Secondly, holistic integration of metabolomic, proteomic, transcriptomic and genetic data was posited to permit identification of signals undetected or obscured by false discovery cutoffs in single datasets. Thirdly, co-occurrence and correlation of networks and pathways in orthogonal datasets was posited to help identify and prioritize causal molecular mechanisms. Therefore, findings identified in individual datasets by statistically significant group differences in discovery and replication cohorts were prioritized by: 1). assembly into networks, pathways or biochemical families; 2). temporal confirmation or evolution of changes; 3). network and pathway corroboration in orthogonal datasets; and 4). cross correlations, hierarchical co-clustering and assembly of mass action kinetic models of catalysis or physicochemical complexes. Finally, prognostic biomarker candidates were chosen to reflect underpinning molecular mechanisms, rather than by ability to partition accurately.
An integrated systems survey revealed sepsis to be a complex, heterogeneous and dynamic pathologic state and yielded new insights into molecular mechanisms of survival or death that may enable predictive differentiation and individualized patient treatment. There were both negative and positive material findings.
The major negative finding was that the plasma metabolome, proteome and transcriptome did not differ between uncomplicated sepsis, day 3 severe sepsis, day 3 septic shock nor between infections with S. pneumoniae, S. aureus or E. coli. There were no plasma metabolic or proteomic differences between these groups either at time of presentation for care or at t24. Thus, sepsis survivors represented a molecular continuum, irrespective of imminent clinical course or etiology. It should be noted, however, that MS-based proteome analysis was insensitive for measurement of low molecular weight proteins, such as cytokines, which are known to differ between etiologic agents. Importantly, all datasets refuted the concept that the discrete clinical stages of progression from uncomplicated sepsis to severe sepsis to septic shock have a unifying molecular basis. The molecular homogeneity of uncomplicated sepsis, severe sepsis and septic shock was remarkable, challenging the traditional notion of a temporal or molecular pyramid of sepsis progression (
The major positive finding was that the vast majority of host molecular responses were directly opposite in sepsis survivors and deaths (
Prominent in the disparate molecular phenotype of sepsis survival and death was altered fatty acid metabolism: Plasma levels of 6 carnitine esters were decreased in sepsis survivors, relative to controls. In contrast, 16 carnitine esters and 4 FA were elevated in sepsis deaths. Corroborating the metabolic changes were decreases in mRNAs encoding carnitine acyltransferase, carnitine palmitoyltransferase 1B, SLC27A3, malonyl CoA:ACP acyltransferase and the FA β-oxidation enzymes pantothenate kinase 4, CoA synthase and mitochondrial enoyl CoA hydratase 1 in sepsis death. 9 fatty acid transport proteins were decreased in sepsis death, while plasma levels of two fatty acid binding proteins correlated with acyl-carnitine and FA levels. Some of these have been previously reported. Several transcriptional regulatory genes that control fatty acid metabolism were also decreased in sepsis death, including FOXO3, KLF2, C/EBP-α and -β, while TIF2 (NCOA2) was increased. TIF2 is an energy rheostat, which is activated in states of energy depletion, depresses uncoupling protein 3, and increases fat absorption from the gut. Thus, TIF2 up-regulation may represent a maladaptive host response in sepsis death, further elevating plasma lipids that are already increased by impaired β-oxidation. Together, these findings indicate a defect in FA β-oxidation in sepsis death, particularly at the level of the mitochondrial shuttle. Carnitine esterification commits FAs irreversibly to β-oxidation and mitochondrial import of carnitine esters is rate limiting in FA β-oxidation. Acyl-carnitines of all FA lengths were elevated and several shuttle enzymes were affected. A causal role for acylcarnitines in sepsis death is suggested by the finding that micromolar amounts cause ventricular dysfunction. Furthermore, Mendelian mutations of acylcarnitine metabolism induce similar metabolic derangements and high rates of sudden death.
Glycolysis, gluconeogenesis and the citric acid cycle also differed prominently in sepsis survivors and deaths. Plasma values of citrate, malate, glycerol, glycerol 3-phosphate, phosphate and glucogenic and ketogenic amino acids were decreased in sepsis survivors, relative to controls. In contrast, citrate, malate, pyruvate, dihydroxyacetone, lactate, phosphate and gluconeogenic amino acids were increased in sepsis deaths. A corroborating proteomic change was subunit D of succinate dehydrogenase, whose level correlated with the downstream citric acid cycle intermediates malate, oxaloacetate and citrate and with lactate, pyruvate and acetyl-carnitine. Corroborating maladaptive transcriptome changes in sepsis deaths were decreased fructose-1, 6-bisphosphatase 1, hexokinase 3, glucosidase, glycogen synthase kinase, NAD kinase and NAD synthase 1. A parsimonious explanation of these findings was that sepsis survivors mobilized energetic substrates and utilized these in aerobic catabolism completely, while those who would die failed to do so. One clinical corroboration was significantly lower core temperature in sepsis deaths than survivors.
Several lines of evidence support the primacy of metabolism as a determinant of sepsis outcome: Structural studies show mitochondrial derangements, decreased mitochondrial number and reduced substrate utilization in sepsis death, and progressive drop in total body oxygen consumption with increasing severity of sepsis. An early indicator of sepsis outcomes is mitochondrial biogenesis. Finally, sepsis-induced multiple organ failure occurs despite minimal cell death and recovery is rapid in survivors, ruling out irreversible mechanisms. Alternatively, the differences observed in corticoid levels in sepsis survivors and nonsurvivors may betoken neuro-hormonal control of disparate metabolic responses to sepsis. While levels of unbound metabolites in plasma reflect tissue concentrations, values may not be in linear relationship with tissues. Nevertheless, long experience with clinical chemistry predicated on plasma values.
The immediacy of the metabolic dichotomy in sepsis suggested a pre-existing susceptibility and potentially indicated a unifying risk factor. Survivors and deaths did not differ significantly in medication prior to enrollment. However, nucleotide variants in 20 genes showed evidence as risk factors for adverse outcome. The functions of these genes concurred with the molecular differences between sepsis survival and death: SLC16A13 transports lactate and pyruvate; vitamin K epoxide reductase complex, subunit 1, is important for blood clotting; CCAAT/enhancer binding proteins is important in granulocyte maturation and response to TNFα; NADH dehydrogenase 1 α2 and β8 are components of the mitochondrial electron transport chain. The relationships between these variants and the survival/death molecular phenotypes remain unknown.
In summary, an integrated systems survey revealed new and surprising insights into molecular mechanisms of sepsis survival and death. The current study examined community-acquired sepsis in adults in detail, and mainly caused by Streptococcus pneumoniae (and thereby lobar pneumonia), Escherichia coli (and thereby urosepsis) and Staphylococcus aureus (and thereby skin, soft tissue, and catheter associated infections). Additional longitudinal investigation of the host metabolic response to sepsis is needed to address more fully the temporal dynamics and breadth of relevance of this dichotomy in community-acquired infection. New proteomic technologies are available with greater sensitivity than those used herein. Ideally, liver or muscle tissue would be examined concomitantly with blood in order to confirm the relevance of the latter. Additional studies are needed to evaluate the applicability of these findings to nosocomial sepsis, pediatric sepsis, neonatal sepsis, other patient populations and other etiologic agents. Investigation of the relevance of host metabolic dichotomy to other SIRS-inducing conditions, such as trauma, hyperthermia and drug-induced mitochondrial damage, is also warranted.
Finally, prognostic biomarker models derived from the molecular events and mechanisms elucidated in sepsis survival and death were developed. For practical reasons, a homogeneous biomarker panel was sought, rather than combinations of protein, metabolite and RNA measurements. In general, biomarker panels have had disappointing rates of replication. Reasons include data overfitting, reliance on cross-validation rather than independent validation, recruitment at single sites and dependence on single analytic platforms or statistical methods. We sought to obviate these by development of sparse panels, recruitment at three sites, use of two metabolite measurement techniques, replication in an independent CAPSOD cohort, and evaluation of a wide variety of statistical approaches. Numerous combinations of seven or eight of fifteen metabolites and clinical parameters were effective in prediction. A final model employed logistic regression of values of MAP, hexanoylcarnitine, Na+, creatinine, pseudouridine, HPLA and 3-methoxytyrosine. The factors in this model all reflected the observed dichotomy in host response and/or have previously shown utility in sepsis outcome prediction. The model predicted 7-day all cause survival/death with an AUC of 0.88 and 99% accuracy, assuming a 10% prior probability of death. All cause survival/death (confirmed sepsis and patients presenting with sepsis but subsequently shown to have a non-infectious SIRS etiology) matched precisely the clinical scenario encountered in ED patients. The performance of this model was approximately 10% better than those obtained in the same patients by capillary lactate, SOFA or APACHE II scores, the current gold standards for prognostic assessment in sepsis. Independent replication studies are needed, as are finalization of markers and parameters and additional assay development. As with many current disease severity markers, the panel is likely to be especially useful when used serially in individual patients. Ideally, the panel should be deployed on device that will be at point-of-care or hospital-based and with time-to-result of about an hour. With additional development, this panel may meet the immense need for prompt determination of sepsis prognosis in individuals to guide targeting of intensive treatments and, thereby, to improve outcomes.
In the interim, it will be possible to use some of the markers of the molecular phenotypes of sepsis as pharmacogenetic indicators. Key questions are whether the observed molecular phenotype of death is universal and is it reversible. The vast majority of the CAPSOD sepsis deaths had received early goal-directed therapy (EGDT). Possibly, inclusion of assessment of the death phenotype could allow individualization of EGDT. None of the sepsis deaths had received activated protein C. The molecular phenotype of death included broad changes in complement, coagulation and fibrinolytic system components, suggesting a specific role for activated protein C in the treatment of these patients. It will be very interesting to evaluate the effect on the death phenotype of experimental sepsis therapies such as succinate or acetylcarnitine supplementation, intensive glycemic control or enhancement of mitochondrial biogenesis.
Finally, global and temporal correlation of metabolome, proteome and transcriptome data from relevant biological fluids and well-phenotyped patient groups seems broadly suitable for expanding our understanding of intermediary metabolism, particularly with respect to poorly annotated analytes, and for characterization of homogeneous subgroups in complex traits. Combinations of transcriptome, proteome, metabolome and genetic data may establish multi-dimensional molecular models of other complex diseases that could provide insights into network responses to intrinsic and/or extrinsic perturbation.
The U.S. government retains certain rights in this invention as provided by the terms of Grant Number U01A1066569 (NIH), P20RR016480 and HHSN266200400064C, awarded by the National Institutes of Health.
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/US12/54951 | 9/12/2012 | WO | 00 | 10/10/2014 |
| Number | Date | Country | |
|---|---|---|---|
| 61533782 | Sep 2011 | US |