Host response is a complex pathophysiologic process arising from an insult such as infection, trauma, burns, and other injuries. Diverse host responses can manifest clinically, including immune response, inflammatory response, coagulopathic response, and any other type of response to bodily insult. In some cases, host response to bodily insult can go awry, causing acute, life-threatening syndromes. As referred to herein, “dysregulated host response” refers to such cases in which host response to bodily insult goes awry, and thereby causes acute, life-threatening syndromes. For example, dysregulated immune response to infection can manifest clinically as sepsis. As another example, dysregulated immune response to a non-infection insult, such as, for example, burns, can manifest clinically as Systemic Inflammatory Response Syndrome (SIRS)52.
Sepsis is an acute, life-threatening syndrome caused by a dysregulated immune response to infection.1,2 Approximately 1.7 million patients are diagnosed with sepsis each year.15 According to a recent study based on electronic medical record data from more than 7 million hospitalizations across 409 US hospitals, sepsis has an estimated 6% hospital admission rate.15 The average length of stay for septic patients is 75% greater than most other conditions, and its mortality accounts for more than 50% of hospital deaths in hospitals.16 Sepsis ranks as one of the highest costs among all hospital admissions, representing approximately 13% of total US hospital costs, or more than $24 billion in hospital expenses.16 Sepsis costs increase based on sepsis severity level and timing of clinical presentation (e.g., at the hospital admission or during the hospital stay). Sepsis cases that were not present at hospital admission spend almost twice the amount of time in the hospital, in the intensive care unit, and on mechanical ventilation, compared to patients in which sepsis was presented at the hospital admission.17
Beyond early recognition, according to the Surviving Sepsis Campaign guidelines, the cornerstone for initial sepsis management is currently based on five main actions known as the “1-hour bundle”. The “1-hour bundle” includes: (1) lactate level measurement; (2) blood cultures collection; (3) broad-spectrum antibiotics administration; (4) rapid fluid administration of 30 ml/kg crystalloid for hypotension or lactate ≥4 mmol/L and (5) vasopressors for patients that remain hypotensive during or after resuscitation to maintain mean arterial pressure ≥65 mmHg.18
After applying this initial approach, patients are frequently assessed over the following hours according to their clinical response. For those patients with poor clinical response, further adjustments, in terms of the amount of fluids given and/or in terms of the choice of antibiotic therapy and measurements for source control (e.g. device removal, surgical procedures, or additional investigation), can be made.
Despite the appropriate application of these actions, close to 30% of septic patients remain hypotensive, requiring vasopressors to maintain a mean arterial pressure ≥65 mmHg, and then are characterized as having septic shock,19 a subtype of sepsis and a condition that has an expected hospital mortality in excess of 40%.1 Of septic shock patients, close to 40% continue to show no clinical improvement (refractory septic shock), defined as a systolic blood pressure <90 mmHg for more than one hour following both adequate fluid resuscitation and vasopressor therapy. In this set of refractory septic shock patients, glucocorticoid therapy may provide improvement.1
Corticosteroids remain a controversial therapy for sepsis patients. Specifically, current guidelines provide a weak recommendation for corticosteroids sepsis patients by stating that either steroids and no steroids are reasonable management options.20
Despite often-promising preclinical studies, more than 100 interventional trials have failed to demonstrate significantly improved survival among sepsis patients, leaving clinicians with limited interventions, and patients with mortality rates as high as 40% among those who develop septic shock.4-12
Similar trends have also been observed for other manifestations of dysregulated host response not caused by infection, such as, for examples SIRS, which can be caused by severe burns.
Embodiments disclosed herein relate to methods, non-transitory computer-readable mediums, systems, and kits for determining patient subtypes, determining therapy recommendations for patients, and generating therapeutic hypotheses for patient subtypes. In various embodiments described herein, the methods involve analyzing quantitative data of one or more biomarker sets derived from a sample obtained from a patient using a patient subtype classifier. The patient subtype classifier outputs a classification for the patient that guides the determination of a therapy recommendation.
Disclosed herein is a method for determining a patient subtype, the method comprising: obtaining or having obtained quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and determining a classification of a subject based on the quantitative data using a patient subtype classifier.
In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 13 is one or more of STOM, MME, BNT3A2, or HLA-DPA1. In various embodiments, wherein the at least one biomarker set is group 5, and wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, or ANXA3.
Additionally disclosed herein is a method for determining a therapy recommendation for a patient, the method comprising: obtaining or having obtained quantitative data for two or more biomarkers selected from the group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and determining a classification of a subject based on the quantitative data using a patient subtype classifier.
Additionally disclosed herein is a method for determining a therapy recommendation for a patient, the method comprising: obtaining or having obtained quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises two or more biomarkers selected from a group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1 GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A, wherein group 2 comprises two or more biomarkers selected from a group consisting of ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, and ANXA3, wherein group 3 comprises two or more biomarkers selected from a group consisting of C14orf159, PUM2, EPB42, RPS6KA5, EPB42, and GBP2; and wherein group 4 comprises two or more biomarkers selected from a group consisting of MSH2, DCTD, MMP8, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and wherein group 5 comprises two or more biomarkers selected from a group consisting of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, CD3G, EPB42, GSPT1, LAT, HK3, SERPINB1, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, and TNFRSF1A; and determining a classification of a subject based on the quantitative data using a patient subtype classifier. In various embodiments, methods described herein further comprise identifying a therapy recommendation for the subject based at least in part on the classification.
Additionally disclosed herein is a method for determining a therapy recommendation for a patient, the method comprising: obtaining a classification of a subject exhibiting a dysregulated host response, the classification having been determined by: obtaining or having obtained quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and determining the classification based on the quantitative data using a patient subtype classifier; and identifying a therapy recommendation for the subject based at least in part on the classification.
In various embodiments, the dysregulated host response of the subject comprises one of sepsis and dysregulated host response not caused by infection. In various embodiments, the classification of the subject comprises one of subtype A or subtype B. In various embodiments, the classification of the subject comprises one of subtype A, subtype B, or subtype C. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject further comprises at least no corticosteroid therapy. In various embodiments, the therapy recommendation identified for the subject further comprises no hydrocortisone.
In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy. In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, and a blocker of a pro-inflammatory cytokine. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6, Anti-HMGB1, ST2 antibody, IL-33 antibody.
In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy. In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor and an anticoagulant. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, activated protein C, antithrombin, and thrombomodulin. In various embodiments, methods further comprise administering or having administered therapy to the subject based on the therapy recommendation.
In various embodiments, obtaining or having obtained quantitative data comprises: obtaining a sample from a subject exhibiting dysregulated host response, wherein the sample comprises a plurality of biomarkers; and determining the quantitative data from the obtained sample. In various embodiments, the obtained sample comprises a blood sample from the subject. In various embodiments, the subject exhibiting dysregulated host response does not exhibit shock, and wherein the at least one biomarker set is one of group 1, group 3, or group 4. In various embodiments, the subject exhibiting dysregulated host response is further exhibiting shock, and wherein the at least one biomarker set is one of group 1, group 2, group 4, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is an adult subject, and wherein the at least one biomarker set is one of group 1, group 2, group 3, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is a pediatric subject, and wherein the at least one biomarker set is one of group 1, group 4, group 5, group 6, group 7, or group 8.
In various embodiments, the quantitative data is determined by one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, the quantitative data is determined by: contacting a sample with a reagent; generating a plurality of complexes between the reagent and the plurality of biomarkers in the sample; and detecting the plurality of complexes to obtain a dataset associated with the sample, wherein the dataset comprises the quantitative data. In various embodiments, the classification of the subject is determined by: determining, for at least one candidate classification of the subject, a classification-specific score for the subject; determining, by the patient subtype classifier, based on the classification-specific score, the classification of the subject.
In various embodiments, determining the classification-specific score comprises: determining a first subscore of the quantitative data for the subject for one or more biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more biomarkers of the candidate classification are increased relative to the quantitative data for the one or more biomarkers for one or more control subjects; determining a second subscore of the quantitative expression for the subject for one or more additional biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more additional biomarkers of the candidate classification are decreased relative to the quantitative data for the one or more additional biomarkers for the one or more control subjects; and determining a difference between the first subscore and the second subscore, the first and second geometric subscore optionally subject to scaling, and the difference comprising the classification-specific score for the subject. In various embodiments, one or both of the first subscore and the second subscore are geometric means.
In various embodiments, the patient subtype classifier is a machine-learned model. In various embodiments, the machine-learned model is a support vector machine (SVM). In various embodiments, the support vector machine receives, as input, one or more classification-specific scores and outputs the classification of the subject. In various embodiments, the patient subtype classifier determines the classification of the subject by: comparing the classification-specific scores to one or more threshold values; and determining the classification of the subject based on the comparisons. In various embodiments, at least one of the one or more threshold values is a fixed value. In various embodiments, at least one of the one or more threshold values is determined using training samples, the at least one threshold value representing a value on a ROC curve nearest to maximum sensitivity or maximum specificity.
In various embodiments, methods disclosed herein further comprise, prior to determining a classification of the subject using a patient subtype classifier, normalizing the quantitative data based on quantitative data for one or more housekeeping genes. In various embodiments, the candidate classifications of the subject comprise subtype A, subtype B, and subtype C. In various embodiments, the at least one biomarker set is group 1, and wherein the patient subtype classifier has an average accuracy of at least 82.93%. In various embodiments, the at least one biomarker set is group 2, and wherein the patient subtype classifier has an average accuracy of at least 89.6%. In various embodiments, the at least one biomarker set is group 3, and wherein the patient subtype classifier has an average accuracy of at least 86.3%. In various embodiments, the at least one biomarker set is group 4, and wherein the patient subtype classifier has an average accuracy of at least 98.3%.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response not provided corticosteroid therapy is greater than or equal to a threshold statistical significance. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C. In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is greater than or equal to a threshold statistical significance.
In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be favorably responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no therapy recommendation, wherein the no therapy recommendation is identified at least by: determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and not provided corticosteroid therapy is less than a threshold statistical significance; and determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is less than a threshold statistical significance. In various embodiments, a statistical significance comprises a p-value, and wherein the threshold statistical significance comprises at least 0.1.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 1 or group 4. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises subtype B. In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises sepsis, wherein the at least one biomarker set is one of group 2, group 3, or group 4.
In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A. In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype likely to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 2. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype B. In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 3. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C. In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
Additionally disclosed herein is a method for identifying a candidate therapeutic, the method comprising: accessing a differentially expressed gene database comprising gene level fold changes between patients of different subtypes; determining at least a threshold number of genes are differentially expressed in patients of a first subtype in comparison to patients of a second subtype, wherein each of the differentially expressed genes is involved in a common biological pathway; and determining a candidate therapeutic likely to be effective for patients of the first subtype, wherein the candidate therapeutic is effective in modulating expression of at least one of the genes that are differentially expressed in patients of the first subtype. In various embodiments, the differentially expressed gene database is generated by: obtaining labeled patient data, wherein labels of the labeled patient data identify patients that are classified into one of two or more subtypes; generating the differentially expressed gene database for at least one or more genes by at least determining gene-level fold changes between patient data with a label indicating a first subtype and patient data with a label indicating a second subtype. In various embodiments, the labels of the labeled patient data are generated by applying a clustering analysis or by applying a patient subtype classifier. In various embodiments, at least the threshold number of genes is at least three genes, at least four genes, at least five genes, at least six genes, at least seven genes, at least eight genes, at least nine genes, or at least ten genes. In various embodiments, determining a candidate therapeutic for patients of the first subtype further comprises: analyzing one or both of: therapeutic pharmacology data comprising data for the candidate therapeutic; and host response pathobiology comprising data for patients of the first subtype.
Additionally disclosed herein a non-transitory computer readable medium for determining a patient subtype, the non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to: obtain quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and determine a classification of a subject based on the quantitative data using a patient subtype classifier.
In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 13 is one or more of STOM, MME, BNT3A2, or HLA-DPA1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, or ANXA3.
Additionally disclosed herein is a non-transitory computer readable medium for determining a therapy recommendation for a patient, the non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to: obtain quantitative data for two or more biomarkers selected from the group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and determine a classification of a subject based on the quantitative data using a patient subtype classifier.
Additionally disclosed herein is a non-transitory computer readable medium for determining a therapy recommendation for a patient, the non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to: obtain quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises two or more biomarkers selected from a group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1 GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A, wherein group 2 comprises two or more biomarkers selected from a group consisting of ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, and ANXA3, wherein group 3 comprises two or more biomarkers selected from a group consisting of C14orf159, PUM2, EPB42, RPS6KA5, EPB42, and GBP2; and wherein group 4 comprises two or more biomarkers selected from a group consisting of MSH2, DCTD, MMP8, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and wherein group 5 comprises two or more biomarkers selected from a group consisting of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, CD3G, EPB42, GSPT1, LAT, HK3, SERPINB1, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, and TNFRSF1A; and determine a classification of a subject based on the quantitative data using a patient subtype classifier. In various embodiments, the instructions further comprise instructions that, when executed by the processor, cause the processor to identify a therapy recommendation for the subject based at least in part on the classification.
Additionally disclosed herein is a non-transitory computer readable medium for determining a therapy recommendation for a subject, the non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to: obtain a classification of the subject exhibiting a dysregulated host response, the classification having been determined by: obtaining or having obtained quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and determining the classification based on the quantitative data using a patient subtype classifier; and identify a therapy recommendation for the subject based at least in part on the classification.
In various embodiments, the dysregulated host response of the subject comprises one of sepsis and dysregulated host response not caused by infection. In various embodiments, the classification of the subject comprises one of subtype A or subtype B. In various embodiments, the classification of the subject comprises one of subtype A, subtype B, or subtype C. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject further comprises at least no corticosteroid therapy. In various embodiments, the therapy recommendation identified for the subject further comprises no hydrocortisone.
In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy. In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, and a blocker of a pro-inflammatory cytokine. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, IL-7, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6, Anti-HMGB1, ST2 antibody, IL-33 antibody. In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy.
In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor and an anticoagulant. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, IL-7, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, activated protein C, antithrombin, and thrombomodulin.
In various embodiments, the instructions that cause the processor to obtain quantitative data further comprises instructions that, when executed by the processor, cause the processor to: obtain a sample from a subject exhibiting dysregulated host response, wherein the sample comprises a plurality of biomarkers; and determine the quantitative data from the obtained sample. In various embodiments, the obtained sample comprises a blood sample from the subject.
In various embodiments, the subject exhibiting dysregulated host response does not exhibit shock, and wherein the at least one biomarker set is one of group 1, group 3, or group 4. In various embodiments, the subject exhibiting dysregulated host response is further exhibiting shock, and wherein the at least one biomarker set is one of group 1, group 2, group 4, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is an adult subject, and wherein the at least one biomarker set is one of group 1, group 2, group 3, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is a pediatric subject, and wherein the at least one biomarker set is one of group 1, group 4, group 5, group 6, group 7, or group 8. In various embodiments, the quantitative data is determined by one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, the quantitative data is determined by: contacting a sample with a reagent; generating a plurality of complexes between the reagent and the plurality of biomarkers in the sample; and detecting the plurality of complexes to obtain a dataset associated with the sample, wherein the dataset comprises the quantitative data.
In various embodiments, the classification of the subject is determined by: determining, for at least one candidate classification of the subject, a classification-specific score for the subject; determining, by the patient subtype classifier, based on the classification-specific score, the classification of the subject. In various embodiments, the instructions that cause the processor to determine the classification-specific score further comprises instructions that, when executed by the processor, cause the processor to: determine a first subscore of the quantitative data for the subject for one or more biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more biomarkers of the candidate classification are increased relative to the quantitative data for the one or more biomarkers for one or more control subjects; determine a second subscore of the quantitative expression for the subject for one or more additional biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more additional biomarkers of the candidate classification are decreased relative to the quantitative data for the one or more additional biomarkers for the one or more control subjects; and determine a difference between the first subscore and the second subscore, the first and second geometric subscore optionally subject to scaling, and the difference comprising the classification-specific score for the subject. In various embodiments, one or both of the first subscore and the second subscore are geometric means. In various embodiments, the patient subtype classifier is a machine-learned model. In various embodiments, the machine-learned model is a support vector machine (SVM). In various embodiments, the support vector machine receives, as input, one or more classification-specific scores and outputs the classification of the subject.
In various embodiments, the patient subtype classifier determines the classification of the subject by: comparing the classification-specific scores to one or more threshold values; and determining the classification of the subject based on the comparisons. In various embodiments, at least one of the one or more threshold values is a fixed value. In various embodiments, at least one of the one or more threshold values is determined using training samples, the at least one threshold value representing a value on a ROC curve nearest to maximum sensitivity or maximum specificity. In various embodiments, further comprising, prior to determining a classification of the subject using a patient subtype classifier, normalizing the quantitative data based on quantitative data for one or more housekeeping genes.
In various embodiments, wherein the candidate classifications of the subject comprise subtype A, subtype B, and subtype C. In various embodiments, the at least one biomarker set is group 1, and wherein the patient subtype classifier has an average accuracy of at least 82.93%. In various embodiments, the patient subtype classifier has an average accuracy of at least 89.6%. In various embodiments, the patient subtype classifier has an average accuracy of at least 86.3%. In various embodiments, the at least one biomarker set is group 4, and wherein the patient subtype classifier has an average accuracy of at least 98.3%.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response not provided corticosteroid therapy is greater than or equal to a threshold statistical significance. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is greater than or equal to a threshold statistical significance. In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be favorably responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no therapy recommendation, wherein the no therapy recommendation is identified at least by: determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and not provided corticosteroid therapy is less than a threshold statistical significance; and determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is less than a threshold statistical significance. In various embodiments, a statistical significance comprises a p-value, and wherein the threshold statistical significance comprises at least 0.1.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 1 or group 4. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises subtype B. In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises sepsis, wherein the at least one biomarker set is one of group 2, group 3, or group 4. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A.
In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype likely to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 2. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 3. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
Additionally disclosed herein is a non-transitory computer readable medium for identifying a candidate therapeutic, the non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to: access a differentially expressed gene database comprising gene level fold changes between patients of different subtypes; determine at least a threshold number of genes are differentially expressed in patients of a first subtype in comparison to patients of a second subtype, wherein each of the differentially expressed genes is involved in a common biological pathway; and determine a candidate therapeutic likely to be effective for patients of the first subtype, wherein the candidate therapeutic is effective in modulating expression of at least one of the genes that are differentially expressed in patients of the first subtype. In various embodiments, the differentially expressed gene database is generated by: obtaining labeled patient data, wherein labels of the labeled patient data identify patients that are classified into one of two or more subtypes; generating the differentially expressed gene database for at least one or more genes by at least determining gene-level fold changes between patient data with a label indicating a first subtype and patient data with a label indicating a second subtype. In various embodiments, the labels of the labeled patient data are generated by applying a clustering analysis or by applying a patient subtype classifier. In various embodiments, at least the threshold number of genes is at least three genes, at least four genes, at least five genes, at least six genes, at least seven genes, at least eight genes, at least nine genes, or at least ten genes. In various embodiments, determining a candidate therapeutic for patients of the first subtype further comprises: analyzing one or both of: therapeutic pharmacology data comprising data for the candidate therapeutic; and host response pathobiology comprising data for patients of the first subtype.
Additionally disclosed herein is a system for determining a patient subtype, the system comprising: a set of reagents used for determining quantitative data for at least one biomarker set from a test sample from a subject, the at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and an apparatus configured to receive a mixture of one or more reagents in the set and the test sample and to measure the quantitative data for the at least one biomarker set from the test sample; and a computer system communicatively coupled to the apparatus to obtain the quantitative data for the at least one biomarker set and to determine a classification of the subject based on the quantitative data using a patient subtype classifier.
In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 13 is one or more of STOM, MME, BNT3A2, or HLA-DPA1. In various embodiments, biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, or ANXA3.
Additionally disclosed herein is a system for determining a patient subtype, the system comprising: a set of reagents used for determining quantitative data for two or more biomarkers selected from the group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and an apparatus configured to receive a mixture of one or more reagents in the set and the test sample and to measure the quantitative data for the at least one biomarker set from the test sample; and a computer system communicatively coupled to the apparatus to obtain the quantitative data for the at least one biomarker set and to determine a classification of the subject based on the quantitative data using a patient subtype classifier.
Additionally disclosed herein is a system for determining a patient subtype, the system comprising: a set of reagents used for determining quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises two or more biomarkers selected from a group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1 GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A, wherein group 2 comprises two or more biomarkers selected from a group consisting of ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, and ANXA3, wherein group 3 comprises two or more biomarkers selected from a group consisting of C14orf159, PUM2, EPB42, RPS6KA5, EPB42, and GBP2; and wherein group 4 comprises two or more biomarkers selected from a group consisting of MSH2, DCTD, MMP8, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and wherein group 5 comprises two or more biomarkers selected from a group consisting of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, CD3G, EPB42, GSPT1, LAT, HK3, SERPINB1, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, and TNFRSF1A; and an apparatus configured to receive a mixture of one or more reagents in the set and the test sample and to measure the quantitative data for the at least one biomarker set from the test sample; and a computer system communicatively coupled to the apparatus to obtain the quantitative data for the at least one biomarker set and to determine a classification of the subject based on the quantitative data using a patient subtype classifier. In various embodiments, the computer system is configured to identify a therapy recommendation for the subject based at least in part on the classification.
Additionally disclosed herein is a system for determining a therapy recommendation for a subject, the system comprising: a computer system configured to: obtain a classification of the subject exhibiting a dysregulated host response, the classification having been determined by: obtaining or having obtained quantitative data for at least one biomarker set obtained from the subject, the at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and determine the classification based on the quantitative data using a patient subtype classifier; and identify a therapy recommendation for the subject based at least in part on the classification.
In various embodiments, the dysregulated host response of the subject comprises one of sepsis and dysregulated host response not caused by infection. In various embodiments, the classification of the subject comprises one of subtype A or subtype B. In various embodiments, the classification of the subject comprises one of subtype A, subtype B, or subtype C. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy.
In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject further comprises at least no corticosteroid therapy. In various embodiments, the therapy recommendation identified for the subject further comprises no hydrocortisone. In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy. In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, and a blocker of a pro-inflammatory cytokine. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, IL-7, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6, Anti-HMGB1, ST2 antibody, IL-33 antibody.
In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy. In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor and an anticoagulant. In various embodiments, the therapy recommendation identified for the subject further comprises at least one of GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, activated protein C, antithrombin, and thrombomodulin.
In various embodiments, the sample comprises a blood sample from the subject. In various embodiments, the subject exhibiting dysregulated host response does not exhibit shock, and wherein the at least one biomarker set is one of group 1, group 3, or group 4. In various embodiments, the subject exhibiting dysregulated host response is further exhibiting shock, and wherein the at least one biomarker set is one of group 1, group 2, group 4, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is an adult subject, and wherein the at least one biomarker set is one of group 1, group 2, group 3, group 5, group 6, group 7, or group 8. In various embodiments, the subject exhibiting dysregulated host response is a pediatric subject, and wherein the at least one biomarker set is one of group 1, group 4, group 5, group 6, group 7, or group 8.
In various embodiments, the quantitative data is determined by one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, the classification of the subject is determined by: determining, for at least one candidate classification of the subject, a classification-specific score for the subject; determining, by the patient subtype classifier, based on the classification-specific score, the classification of the subject. In various embodiments, determine the classification-specific score further comprises: determine a first subscore of the quantitative data for the subject for one or more biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more biomarkers of the candidate classification are increased relative to the quantitative data for the one or more biomarkers for one or more control subjects; determine a second subscore of the quantitative expression for the subject for one or more additional biomarkers of the candidate classification, wherein the quantitative data for the subject for the one or more additional biomarkers of the candidate classification are decreased relative to the quantitative data for the one or more additional biomarkers for the one or more control subjects; and determine a difference between the first subscore and the second subscore, the first and second geometric subscore optionally subject to scaling, and the difference comprising the classification-specific score for the subject. In various embodiments, one or both of the first subscore and the second subscore are geometric means.
In various embodiments, the patient subtype classifier is a machine-learned model. In various embodiments, the machine-learned model is a support vector machine (SVM). In various embodiments, the support vector machine receives, as input, one or more classification-specific scores and outputs the classification of the subject. In various embodiments, the patient subtype classifier determines the classification of the subject by: comparing the classification-specific scores to one or more threshold values; and determining the classification of the subject based on the comparisons.
In various embodiments, at least one of the one or more threshold values is a fixed value. In various embodiments, at least one of the one or more threshold values is determined using training samples, the at least one threshold value representing a value on a ROC curve nearest to maximum sensitivity or maximum specificity. In various embodiments, prior to determining a classification of the subject using a patient subtype classifier, normalizing the quantitative data based on quantitative data for one or more housekeeping genes.
In various embodiments, the candidate classifications of the subject comprise subtype A, subtype B, and subtype C. In various embodiments, the at least one biomarker set is group 1, and wherein the patient subtype classifier has an average accuracy of at least 82.93%. In various embodiments, the patient subtype classifier has an average accuracy of at least 89.6%. In various embodiments, the patient subtype classifier has an average accuracy of at least 86.3%. In various embodiments, the at least one biomarker set is group 4, and wherein the patient subtype classifier has an average accuracy of at least 98.3%.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response not provided corticosteroid therapy is greater than or equal to a threshold statistical significance. In various embodiments, the therapy recommendation comprises a no corticosteroid therapy, wherein the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C. In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is greater than or equal to a threshold statistical significance.
In various embodiments, the therapy recommendation comprises a corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be favorably responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no therapy recommendation, wherein the no therapy recommendation is identified at least by: determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and not provided corticosteroid therapy is less than a threshold statistical significance; and determining that a statistical significance of a reduction in mortality of subjects exhibiting dysregulated host response and provided corticosteroid therapy is less than a threshold statistical significance. In various embodiments, a statistical significance comprises a p-value, and wherein the threshold statistical significance comprises at least 0.1.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 1 or group 4. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises sepsis, wherein the at least one biomarker set is one of group 2, group 3, or group 4. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A.
In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype likely to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 2. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be adversely responsive to corticosteroid therapy. In various embodiments, the subtype is subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises a no therapy recommendation, wherein the no therapy recommendation is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype B.
In various embodiments, the therapy recommendation identified for the subject comprises a no corticosteroid therapy, wherein the dysregulated host response comprises dysregulated host response not caused by infection, and wherein the at least one biomarker set is group 3. In various embodiments, the no corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype like to be non-responsive to corticosteroid therapy. In various embodiments, the subtype is subtype A or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, wherein the corticosteroid therapy is identified by determining that the classification of the subject comprises a subtype likely to be responsive to corticosteroid therapy. In various embodiments, the subtype is subtype B.
Additionally disclosed herein is a a system for identifying a candidate therapeutic, the system comprising: a storage device storing a differentially expressed gene database comprising gene level fold changes between patients of different subtypes; a computational device configured to: access one or more gene level fold changes corresponding to differentially expressed genes in the differentially expressed gene database; determine at least a threshold number of genes are differentially expressed in patients of a first subtype in comparison to patients of a second subtype, wherein each of the differentially expressed genes is involved in a common biological pathway; and determine a candidate therapeutic likely to be effective for patients of the first subtype, wherein the candidate therapeutic is effective in modulating expression of at least one of the genes that are differentially expressed in patients of the first subtype. In various embodiments, the differentially expressed gene database is generated by: obtaining labeled patient data, wherein labels of the labeled patient data identify patients that are classified into one of two or more subtypes; generating the differentially expressed gene database for at least one or more genes by at least determining gene-level fold changes between patient data with a label indicating a first subtype and patient data with a label indicating a second subtype. In various embodiments, the labels of the labeled patient data are generated by applying a clustering analysis or by applying a patient subtype classifier. In various embodiments, at least the threshold number of genes is at least three genes, at least four genes, at least five genes, at least six genes, at least seven genes, at least eight genes, at least nine genes, or at least ten genes. In various embodiments, determining a candidate therapeutic for patients of the first subtype further comprises: analyzing one or both of: therapeutic pharmacology data comprising data for the candidate therapeutic; and host response pathobiology comprising data for patients of the first subtype.
Additionally disclosed herein is a kit for determining a patient subtype, the kit comprising: a set of reagents for determining quantitative data for at least one biomarker set from a test sample from a subject, the at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises biomarker 1, biomarker 2, and biomarker 3, wherein biomarker 1 is one or more of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, or MMP8, wherein biomarker 2 is one or more of SERPINB1 or GSPT1, and wherein biomarker 3 is one or more of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, or TOMM70A, wherein group 2 comprises biomarker 4, biomarker 5, and biomarker 6, wherein biomarker 4 is one or more of ZNF831, MME, CD3G, or STOM, wherein biomarker 5 is one or more of ECSIT, LAT, or NCOA4, and wherein biomarker 6 is one or more of SLC1A5, IGF2BP2, or ANXA3, wherein group 3 comprises biomarker 7, biomarker 8, and biomarker 9, wherein biomarker 7 is one or more of C14orf159 or PUM2, wherein biomarker 8 is one or more of EPB42 or RPS6KA5, and wherein biomarker 9 is one or more of EPB42 or GBP2; and wherein group 4 comprises biomarker 10, biomarker 11, and biomarker 12, wherein biomarker 10 is one or more of MSH2, DCTD, or MMP8, wherein biomarker 11 is one or more of HK3, UCP2, or NUP88, and wherein biomarker 12 is one or more of GABARAPL2 or CASP4; and wherein group 5 comprises biomarker 13, biomarker 14, and biomarker 15, wherein biomarker 13 is one or more of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, or CD3G, wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, OR TNFRSF1A; and instructions for using the set of reagents to determine the quantitative data for the at least one biomarker set.
In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 13 is one or more of STOM, MME, BNT3A2, or HLA-DPA1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 14 is one or more of EPB42, GSPT1, LAT, HK3, or SERPINB1. In various embodiments, the at least one biomarker set is group 5, and wherein biomarker 15 is one or more of SLC1A5, IGF2BP2, or ANXA3.
Additionally disclosed herein is a kit for determining a patient subtype, the kit comprising: a set of reagents for determining quantitative data for two or more biomarkers selected from the group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and instructions for using the set of reagents to determine the quantitative data for the at least one biomarker set.
Additionally disclosed herein is a kit for determining a patient subtype, the kit comprising: a set of reagents for determining quantitative data for at least one biomarker set selected from the group consisting of the biomarker sets of group 1, group 2, group 3, group 4, or group 5, wherein group 1 comprises two or more biomarkers selected from a group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1 GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A, wherein group 2 comprises two or more biomarkers selected from a group consisting of ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, and ANXA3, wherein group 3 comprises two or more biomarkers selected from a group consisting of C14orf159, PUM2, EPB42, RPS6KA5, EPB42, and GBP2; and wherein group 4 comprises two or more biomarkers selected from a group consisting of MSH2, DCTD, MMP8, HK3, UCP2, NUP88, GABARAPL2, and CASP4; and wherein group 5 comprises two or more biomarkers selected from a group consisting of STOM, MME, BNT3A2, HLA-DPA1, ZNF831, CD3G, EPB42, GSPT1, LAT, HK3, SERPINB1, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, BTN3A2, and TNFRSF1A; and instructions for using the set of reagents to determine the quantitative data for the at least one biomarker set.
In various embodiments, the instructions comprise instructions for determining the quantitative data by performing one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, the set of reagents comprises at least three primer sets for amplifying at least three biomarkers, wherein the at least three primer sets comprise pairs of single-stranded DNA primers for amplifying the at least three biomarkers, and wherein at least one of the at least three biomarkers is selected from the group consisting of the biomarkers EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, ZNF831, MME, CD3G, STOM, C14orf159, PUM2, MSH2, DCTD, BNT3A2, or HLA-DPA1, at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers SERPINB1, GSPT1, ECSIT, LAT, NCOA4, EPB42, RPS6KA5, HK3, UCP2, or NUP88, and at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, PRPF3, TOMM70A, EPB42, GABARAPL2, CASP4, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, OR TNFRSF1A.
In various embodiments, the at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 7 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 8, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 9 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 10, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 11 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 12, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 13 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 14, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 15 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 16, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 17 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 18, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 19 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 20, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 1 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 2; a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 3 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 4, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 5 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 6.
In various embodiments, at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 7 and a reverse primer comprising SEQ ID NO. 8, a forward primer comprising SEQ ID NO. 9 and a reverse primer comprising SEQ ID NO. 10, a forward primer comprising SEQ ID NO. 11 and a reverse primer comprising SEQ ID NO. 12, and a forward primer comprising SEQ ID NO. 13 and a reverse primer comprising SEQ ID NO. 14, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 15 and a reverse primer comprising SEQ ID NO. 16, a forward primer comprising SEQ ID NO. 17 and a reverse primer comprising SEQ ID NO. 18, and a forward primer comprising SEQ ID NO. 19 and a reverse primer comprising SEQ ID NO. 20, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 1 and a reverse primer comprising SEQ ID NO. 2; a forward primer comprising SEQ ID NO. 3 and a reverse primer comprising SEQ ID NO. 4, and a forward primer comprising SEQ ID NO. 5 and a reverse primer comprising SEQ ID NO. 6.
In various embodiments, the at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 21 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 22, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 23 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 24, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 25 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 26, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 29 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 30, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 25 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 26, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 27 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 28.
In various embodiments, at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 21 and a reverse primer comprising SEQ ID NO. 22, and a forward primer comprising SEQ ID NO. 23 and a reverse primer comprising SEQ ID NO. 24, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 25 and a reverse primer comprising SEQ ID NO. 26, and a forward primer comprising SEQ ID NO. 29 and a reverse primer comprising SEQ ID NO. 30, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 25 and a reverse primer comprising SEQ ID NO. 26, and a forward primer comprising SEQ ID NO. 27 and a reverse primer comprising SEQ ID NO. 28.
In various embodiments, the set of reagents comprises at least three primer sets for amplifying at least three biomarkers, wherein each primer set of the at least three primer sets comprises a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer for amplifying one of the at least three biomarkers, and wherein at least one of the at least three biomarkers is selected from the group consisting of EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, ZNF831, MME, CD3G, STOM, C14orf159, PUM2, MSH2, DCTD, BNT3A2, or HLA-DPA1, at least one biomarker of the at least three biomarkers is selected from the group consisting of SERPINB1, GSPT1, ECSIT, LAT, NCOA4, EPB42, RPS6KA5, HK3, UCP2, or NUP88, and at least one biomarker of the at least three biomarkers is selected from the group consisting of MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, PRPF3, TOMM70A, EPB42, GABARAPL2, CASP4, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, OR TNFRSF1A.
In various embodiments, at least one of the at least three primer sets is selected from the group consisting of: a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer, each of which is configured to enable amplification of at least one biomarker selected from the group consisting of: EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, ZNF831, MME, CD3G, STOM, C14orf159, PUM2, MSH2, DCTD, BNT3A2, or HLA-DPA1, a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer, each of which is configured to enable amplification of at least one biomarker selected from the group consisting of: SERPINB1, GSPT1, ECSIT, LAT, NCOA4, EPB42, RPS6KA5, HK3, UCP2, or NUP88, and a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer, each of which is configured to enable amplification of at least one biomarker selected from the group consisting of: MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, PRPF3, TOMM70A, EPB42, GABARAPL2, CASP4, SLC1A5, IGF2BP2, ANXA3, GBP2, TNFRSF1, OR TNFRSF1A.
These and other features, aspects, and advantages of the present disclosure will become better understood with regard to the following description, and accompanying drawings, where:
The figures depict various embodiments of the present disclosure for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein can be employed without departing from the principles of the disclosure described herein.
In general, terms used in the claims and the specification are intended to be construed as having the plain meaning understood by a person of ordinary skill in the art. Certain terms are defined below to provide additional clarity. In case of conflict between the plain meaning and the provided definitions, the provided definitions are to be used.
The term “patient” or “subject” encompasses or organism, mammals including humans or non-humans (e.g., non-human primates, canines, felines, murines, bovines, equines, and porcines), whether in vivo, ex vivo, or in vitro, male or female.
The term “sample” can include a single cell or multiple cells or fragments of cells or an aliquot of body fluid, such as a blood sample, taken from a subject, by means including venipuncture, excretion, ejaculation, massage, biopsy, needle aspirate, lavage sample, scraping, surgical incision, or intervention or other means known in the art. Examples of an aliquot of body fluid include amniotic fluid, aqueous humor, bile, lymph, breast milk, interstitial fluid, blood, blood plasma, cerumen (earwax), Cowper's fluid (pre-ejaculatory fluid), chyle, chyme, female ejaculate, menses, mucus, saliva, urine, vomit, tears, vaginal lubrication, sweat, serum, semen, sebum, pus, pleural fluid, cerebrospinal fluid, synovial fluid, intracellular fluid, and vitreous humour.
The terms “marker,” “markers,” “biomarker,” and “biomarkers” encompass, without limitation, lipids, lipoproteins, proteins, cytokines, chemokines, growth factors, peptides, nucleic acids, genes, and oligonucleotides, together with their related complexes, metabolites, mutations, variants, polymorphisms, modifications, fragments, subunits, degradation products, elements, and other analytes or sample-derived measures. In particular embodiments discussed herein, the biomarkers are genes. However, in alternative embodiments, the biomarkers can include any other measurable substance in a sample from a subject. A marker can also include mutated proteins, mutated nucleic acids, variations in copy numbers, and/or transcript variants, in circumstances in which such mutations, variations in copy number and/or transcript variants are useful for generating a predictive model, or are useful in predictive models developed using related markers (e.g., non-mutated versions of the proteins or nucleic acids, alternative transcripts, etc.). In some embodiments, the biomarkers discussed throughout this disclosure can include a nucleic acid, including DNA, modified (e.g., methylated) DNA, cDNA, and RNA, including coding (e.g., mRNAs) and non-coding RNA (e.g., sncRNAs), a protein, including a post-transcriptionally modified protein (e.g., phosphorylated, glycosylated, myristilated, etc. proteins), a nucleotide (e.g., adenosine triphosphate (ATP), adenosine diphosphate (ADP), and adenosine monophosphate (AMP)), including cyclic nucleotides such as cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP), a biologic, an ADC, a small molecule, such as oxidized and reduced forms of nicotinamide adenine dinucleotide (NADP/NADPH), a volatile compound, and any combination thereof.
The term “antibody” is used in the broadest sense and specifically covers monoclonal antibodies (including full length monoclonal antibodies), polyclonal antibodies, multi specific antibodies (e.g., bispecific antibodies), and antibody fragments that are antigen-binding so long as they exhibit the desired biological activity, e.g., an antibody or an antigen-binding fragment thereof.
“Antibody fragment,” and all grammatical variants thereof, as used herein are defined as a portion of an intact antibody comprising the antigen binding site or variable region of the intact antibody, wherein the portion is free of the constant heavy chain domains (i.e. CH2, CH3, and CH4, depending on antibody isotype) of the Fc region of the intact antibody. Examples of antibody fragments include Fab, Fab′, Fab′-SH, F(ab′)2, and Fv fragments; diabodies; any antibody fragment that is a polypeptide having a primary structure consisting of one uninterrupted sequence of contiguous amino acid residues (referred to herein as a “single-chain antibody fragment” or “single chain polypeptide”).
The term “obtaining or having obtained quantitative data” encompasses obtaining a set of data determined from at least one sample. Obtaining a dataset encompasses obtaining a sample and processing the sample to experimentally determine the data. The phrase also encompasses receiving a set of data, e.g., from a third party that has processed the sample to experimentally determine the dataset. Additionally, the phrase encompasses mining data from at least one database or at least one publication or a combination of databases and publications. A dataset can be obtained by one of skill in the art via a variety of known ways including stored on a storage memory.
Any terms not directly defined herein shall be understood to have the meanings commonly associated with them as understood within the art of the disclosure. Certain terms are discussed herein to provide additional guidance to the practitioner in describing the compositions, devices, methods and the like of aspects of the disclosure, and how to make or use them. It will be appreciated that the same thing can be said in more than one way. Consequently, alternative language and synonyms can be used for any one or more of the terms discussed herein. No significance is to be placed upon whether or not a term is elaborated or discussed herein. Some synonyms or substitutable methods, materials and the like are provided. Recital of one or a few synonyms or equivalents does not exclude use of other synonyms or equivalents, unless it is explicitly stated. Use of examples, including examples of terms, is for illustrative purposes only and does not limit the scope and meaning of the aspects of the disclosure herein.
Additionally, as used in the specification, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.
In various embodiments, the process of building a model that predicts patient subtypes, hereafter referred to as a patient subtype classifier, involves using the labeled data. The labeled data is analyzed to select biomarkers (e.g., “gene selection” as shown in
The trained patient subtype classifiers can be deployed to classify specific patients. In one embodiment, the patient subtype classifier analyzes data derived from randomized controlled trial (RCT) data pertaining to one or more patients and outputs predictions for the patients. For example, the patient subtype classifier analyzes quantitative biomarker expression data for patients that have been involved in a randomized controlled trial and classifies the patients in one of the different subtypes.
IIA. System Environment Overview
In various embodiments, a test sample is obtained from the subject 110. The test sample is analyzed to determine quantitative values of one or more biomarkers by performing the marker quantification assay 120. The marker quantification assay 120 may be a quantitative reverse transcription polymerase chain reaction (RT-PCR) assay, a microarray, a sequencing assay, or an immunoassay, examples of which are described in further detail below. The quantitative values of biomarkers can be quantified RT-PCR data, transcriptomics data, and/or RNA-seq data. The quantified expression values of the biomarkers are provided to the patient classification system 130.
Generally, the patient classification system 130 includes one or more computers, such as example computer 1600 as discussed below with respect to
In various embodiments, the patient classification system 130 applies a patient subtype classifier to predict a classification for patient 110. In various embodiments, a patient subtype classifier can be a machine-learned model. In such embodiments, the patient classification system 130 can train the patient subtype classifier using training data and/or deploy the patient subtype classifier to analyze the quantitative expression values of biomarkers of the patient 110.
In various embodiments, the marker quantification assay 120 and the patient classification system 130 can be employed by different parties. For example, a first party performs the marker quantification assay 120 which then provides the results to a second party which implements the patient classification system 130. For example, the first party may be a clinical laboratory that obtains test samples from subjects 110 and performs the assay 120 on the test samples. The second party receives the expression values of biomarkers resulting from the performed assay 120 analyzes the expression values using the patient classification system 130.
In various embodiments, the patient classification system 130 can be a distributed computing system implemented in a cloud computing environment. For example, steps performed by the patient classification system 130 can be performed using systems in geographically different locations. In particular embodiments, the patient classification system 130 receives quantitative biomarker data from the marker quantification assay 120 at a first location. The patient classification system 130 transmits the quantitative biomarker data and analyzes the quantitative biomarker data to predict a classification using a patient subtype classifier at a second location (e.g., cloud computing). The patient classification system 130 can further transmit the classification back to the first location for subsequent use.
Cloud computing can be employed to offer on-demand access to the shared set of configurable computing resources. The shared set of configurable computing resources can be rapidly provisioned via virtualization and released with low management effort or service provider interaction, and then scaled accordingly. A cloud-computing model can be composed of various characteristics such as, for example, on-demand self-service, broad network access, resource pooling, rapid elasticity, measured service, and so forth. A cloud-computing model can also expose various service models, such as, for example, Software as a Service (“SaaS”), Platform as a Service (“PaaS”), and Infrastructure as a Service (“IaaS”). A cloud-computing model can also be deployed using different deployment models such as private cloud, community cloud, public cloud, hybrid cloud, and so forth. In this description and in the claims, a “cloud-computing environment” is an environment in which cloud computing is employed.
In various embodiments, the marker quantification assay 120 and patient classification system 130 are implemented in a critical care setting such that a therapy recommendation is to be generated for a patient 110 within a maximum amount of time. In various embodiments, the maximum amount of time is 30 minutes. In various embodiments, the maximum amount of time is 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours, 8 hours, 9 hours, 10 hours, 11 hours, or 12 hours. In other embodiments, the marker quantification assay 120 and patient classification system 130 are not implemented in a critical care setting.
IIB. Methods for Determining a Therapy Recommendation
In various embodiments, the patient classification system 130 (as described above in reference to
The patient classification system 130 receives quantitative data from the marker quantification assay 120. Here, the quantitative data from the marker quantification assay 120 can include quantitative levels of one or more biomarkers that were determined from a sample obtained from a patient. In various embodiments, the patient classification system 130 normalizes the quantitative data. For example the patient classification system 130 can normalize the quantitative data based on study-specific parameters (such that data is normalized for a study) and/or based on parameters specific for a particular assay or platform used to generate the quantitative data. In various embodiments, the patient classification system 130 can normalize the quantitative data according to normalization parameters derived the healthy samples. In such embodiments, the resulting quantitative data are normalized across patients and studies at the end of the normalization process. Such embodiments that involve normalizing quantitative data can be implemented during research settings (non-critical care settings). In some embodiments, the patient classification system 130 need not normalize the quantitative data prior to analysis by the patient subtype classifier. Such embodiments that do not involve normalizing the quantitative data can be implemented in critical care settings where a rapid analysis and classification is needed for a patient 110. The patient classification system 130 analyzes the quantitative data, which hereafter also encompasses normalized quantitative data.
As one example, the patient classification system 130 analyzes quantitative data for a biomarker set derived from a microarray analysis. The patient classification system 130 applies a patient subtype classifier that analyzes the quantitative microarray data and classifies the patient, which can later be used to determine a therapy recommendation. As another example, the patient classification system 130 analyzes qPCR data, which measures the relative or absolute expression level of biomarkers. In various embodiments, normalization or calibration processes are implemented. The quantitative data of the biomarker set are used to calculate the scores for different classifications (e.g., subtypes), which then will be used for subtype assignment by a patient subtype classifier. As another example, the patient classification system 130 analyzes RNA sequencing data, which includes relative expression levels of model genes and their transcripts. Using sequencing reads alignment methods (e.g. Hisat2, and Bowtie2), expression estimation methods (e.g. StringTie, Salmon) and normalization processes (e.g. quantile normalization), the estimated expression of model genes can be used to calculate classification-specific scores for downstream classification by a patient subtype classifier. In various embodiments, the patient classification system 130 can convert quantitative data derived from a first type of assay to quantitative data of a second type of assay using normalization factors. For example, the patient classification system 130 can convert quantitative data derived from microarray data to either qPCR data or RNA sequencing data. The conversion can entail one or more normalization factors involving normalization or calibration processes for qPCR data or normalization processes (e.g., quantile normalization) for RNA sequencing data. Thus, the patient classification system 130 can apply different patient subtype classifiers to analyze different types of quantitative data.
The patient classification system 130 implements the patient subtype classifier to analyze quantitative data for biomarkers. In one embodiment, the patient subtype classifier is a trained machine-learned model. Thus, the patient subtype classifier can be trained to receive, as input, quantitative data of a biomarker set, and analyze the input to output a classification for the patient. In some embodiments, the patient subtype classifier is not a machine-learned model. In various embodiments, patient subtype classifier outputs a prediction of one classification for the patient out of X possible classifications. For example, the patient subtype classifier can output a prediction of a patient subtype for the patient out of a possible X patient subtypes. In various embodiments, X may be two possible classifications. In various embodiments, X may be more than two possible classifications. In various embodiments, X may be three, four, five, six, seven, eight, nine, or ten possible classifications. In various embodiments, X may be more than ten possible classifications.
In some embodiments, the patient classification system 130 calculates scores from the quantitative data and then provides the calculated scores as input to the patient subtype classifier. Thus, the patient subtype classifier determines a classification for the patient based on the calculated scores.
In various embodiments, the patient classification system 130 calculates multiple scores, each score corresponding to a patient subtype (e.g., classification). For example, if the goal is to classify the patient in a classification out of X possible classifications, the patient classification system 130 calculates X scores. The X scores are then provided as input to the patient subtype classifier to predict the classification. These scores are hereafter referred to as classification-specific scores.
In various embodiments, to calculate a classification-specific score, the patient classification system 130 determines subscores derived from quantitative data of one or more biomarkers in the biomarker set and uses the subscores to determine the classification-specific score. In one embodiment, a subscore is calculated from one or more biomarkers that are differentially expressed in the patient in comparison to a control value. In various embodiments, the control value may be a value derived from a different set of patients, such as healthy patients. In various embodiments, the control value may be a baseline value derived from the same patient (e.g., a baseline value corresponding to when the same patient was previously healthy).
In various embodiments, the patient classification system 130 determines a subscore determined from quantitative data of one or more biomarkers that are upregulated in the patient in comparison to the control value. In various embodiments, the patient classification system 130 determines a subscore determined from quantitative data of one or more biomarkers that are downregulated in the patient in comparison to the control value. In various embodiments, the patient classification system 130 determines a first subscore determined from quantitative data of one or more biomarkers that are upregulated in the patient in comparison to the control value and further determines a second subscore determined from quantitative data of one or more biomarkers that are downregulated in the patient in comparison to a control value. In various embodiments, a subscore can be an aggregation of the quantitative data of the one or more biomarkers. For example, a subscore can be a mean, a median, or a geometric mean of quantitative data of the one or more biomarkers. In various embodiments, the patient classification system 130 can further scale the sub scores.
In various embodiments, the quantitative data of one or more biomarkers that are analyzed refer to biomarkers that have been previously categorized as influencing the particular subtype that the classification-specific score is being calculated for. For example, if the patient classification system 130 is determining a classification-specific for subtype A, the patient classification system 130 determines subscores using quantitative data of biomarkers that are categorized as influencing the subtype A. Examples of biomarkers that are categorized with certain subtypes are shown below in Tables 1, 2A-2B, 3, and 4A-4D. Specifically, row number 1 in each of Tables 1, 2A-2B, 3, and 4A-4D show biomarkers that are categorized with subtype A, row number 2 in each of Tables 1, 2A-2B, 3, and 4A-4D show biomarkers that are categorized with subtype B, and row number 3 in each of Tables 1, 2A-2B, 3, and 4A-4D show biomarkers that are categorized with subtype C.
In various embodiments, the patient classification system 130 combines one or more subscores to determine the classification-specific score. For example, the patient classification system 130 can determine a difference between a first subscore and a second subscore. The difference can represent the classification-specific score.
As a specific example, the patient classification system 130 can determine a classification-specific score using the following steps: the patient classification system 130 determines a first geometric mean of the quantitative expression data for the subject for one or more biomarkers of the candidate classification, wherein the quantitative expression data for the subject for the one or more biomarkers of the candidate classification are increased relative to the quantitative expression data for the one or more biomarkers for one or more control subjects. The patient classification system 130 determines a second geometric mean of the quantitative expression for the subject for one or more additional biomarkers of the candidate classification, wherein the quantitative expression data for the subject for the one or more additional biomarkers of the candidate classification are decreased relative to the quantitative expression data for the one or more additional biomarkers for the one or more control subjects. The patient classification system 130 determines a difference between the first geometric mean and the second geometric mean, the first and second geometric means optionally subject to scaling. Here, the difference can represent the classification-specific score.
In various embodiments, the patient classification system 130 determines multiple classification-specific scores and provides them as input to the patient subtype classifier. The patient subtype classifier analyzes the classification-specific scores and outputs a classification for the patient. Embodiments of the patient subtype classifier are described in further detail below.
In various embodiments, based on the classification-specific scores, the patient subtype classifier outputs a classification. For example, the patient subtype classifier may analyze X classification-specific scores and outputs a prediction for one class out of Xpossible classifications. As another, the patient subtype classifier may analyze X classification-specific scores and outputs a prediction for one class out of two possible classifications. As a specific example, the patient subtype classifier may analyze 3 classification-specific scores (e.g., specific for subtype A, subtype B, and subtype C), and outputs a prediction for a class out of two possible classifications (e.g., subtype A v. not subtype A, subtype B v. not subtype B, or subtype C v. not subtype C).
Generally, the classification determined by the patient subtype classifier guides the selection of a therapy recommendation. In various embodiments, the therapy recommendation refers to whether a therapy is likely to be beneficial to a patient. In particular embodiments, the disease of interest is sepsis and therefore, the therapy recommendation pertain to whether a corticosteroid therapy, such as hydrocortisone, is likely to be of benefit to a patient. In one embodiment, the therapy recommendation can indicate whether the patient is likely to be “favorably responsive” or “non-responsive” to a therapy. In one embodiment, the therapy recommendation can indicate whether the patient is likely to be “favorably responsive”, “adversely responsive”, or “non-responsive” to a therapy.
Examples of a therapy include: immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, anti-inflammatory therapy, a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, a blocker of a pro-inflammatory cytokine, modulators of coagulation therapy, and modulators of vascular permeability therapy. Additional examples of a therapy include: GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, IL-7, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6, Anti-HMGB1, ST2 antibody, IL-33 antibody, activated protein C, antithrombin, and thrombomodulin.
Additional examples of a therapy and corresponding therapy recommendations for different patient subtypes (e.g., subtype A, subtype B, and subtype C) are shown below in Table 8. Specifically, the therapy recommendations are shown in the column titled “Subtype Hypothesis” and support for that hypothesis is found in the column titled “Evidence.” Altogether the therapy recommendation determined by the patient classification system 130 can be provided to guide therapy for the patient.
The impact of a particular therapy and a patient subtype, such as those hypothesized in Table 8, may have been previously determined by analyzing patient cohorts who have received the particular therapy. For example, such patient cohorts may have been involved in a clinical trial. Thus, the patients may be exhibiting dysregulated host responses and therefore, were enrolled in the trial. Therefore, patients in the clinical trial are classified with a patient subtype (e.g., using the methods described above) and their responses to the therapy (e.g., favorably responsive, adversely responsive, non-responsive) are tracked and recorded. For each subtype, the responses of patients receiving the therapy are compared to control patients. If the comparison yields a statistically significant difference patients of the subtype are labeled as favorably responsive or adversely responsive to the therapy. If the comparison does not yield a statistically significant difference (e.g., p-value not greater than a threshold value), patients of the subtype are labeled as non-responsive to the therapy. In various embodiments, the statistical significance threshold is a p-value, where the p-value is any one of 0.01, 0.0.5, or 0.1.
In particular embodiments, the compared measurable on which statistical significance is determined is patient mortality. Therefore, the mortality of patients who receive a therapy is compared to mortality of control patients to determine whether there is statistical significance indicating an effect due to the therapy. For example, if the patients of a subtype who receive a therapy exhibit a statistically significantly increased survival time in comparison to control patients who did not receive the therapy, then patients of this subtype can be identified as favorably responsive to the therapy. As another example, if patients of a subtype who receive a therapy exhibit a statistically significantly decreased survival time in comparison to control patients who did not receive the therapy, then patients of this subtype can be identified as adversely responsive to the therapy. As yet another example, if patients of a subtype who receive a therapy do not exhibit a statistically significantly increased or a statistically significantly decreased survival time in comparison to control patients who did not receive the therapy, then patients of this subtype can be identified as not responsive to the therapy.
IIB. Methods for Determining a Therapy Hypothesis
In various embodiments, methods disclosed herein involve the identification of therapeutic hypotheses for different patient subtypes. In various embodiments, the process of identifying a therapeutic hypothesis is performed by the patient classification system 130. In some embodiments, the process of identifying a therapeutic hypothesis is performed by third party system which provides a therapeutic hypothesis to the patient classification system 130. In various embodiments, a therapy hypothesis is specific for a patient subtype. Therefore, a therapy hypothesis is useful for identifying a therapy recommendation, as discussed above in reference to
Generally, a therapeutic hypothesis involves analyzing genes that are differentially expressed across different subtypes. By identifying patterns of differentially expressed genes that are implicated in certain known biological pathways, certain patient subtypes can be associated with particular dysregulated pathways. A therapeutic hypothesis comprising a candidate therapeutic can be selected. Here, a candidate therapeutic can modulate parts of the dysregulated pathways, thereby representing a possible avenue of therapy for treating particular patient subtypes.
Reference is now made to
The labeled data 610 represents patient data that have been labeled with one or more classifications. For example, the labeled data 610 can be labeled with patient subtypes (e.g., subtype A, subtype B, subtype C, etc.). In various embodiments, the patient data comprises quantitative data of one or more biomarkers of patients. In various embodiments, the patient data is clinical trial data and therefore, the quantitative data of one or more biomarkers can be data obtained from patients enrolled in the clinical trial.
The labels of the labeled data can be previously generated through various means. In various embodiments, the labels of the data can be generated using a model, such as a patient subtype classifier described herein. For example, the quantitative data of biomarkers from patients are analyzed using the patient subtype classifier to predict a classification for patients. Thus, the predicted classification for each patient can serve as a label for the labeled data. In various embodiments, the labels of the data can be generated through a clustering analysis. For example, the quantitative data of biomarkers can be analyzed through unsupervised clustering, thereby generating clusters of patients that have similar expression of various biomarkers. Each cluster of patients can be labeled. In various embodiments, a cluster can be labeled based on outcomes of patients in the clinical trials. For example, if a majority of patients in a cluster exhibited prolonged survival time in response to a therapy, the cluster can be labeled as a subtype that is responsive to the therapy.
The differentially expressed gene data 620 comprises gene level fold changes of biomarker expression between patients of different subtypes. Using the labeled data 610, gene expression from patients of individual subtypes are aggregated and compared across subtypes. For example, a statistical measure of gene expression for patients of a subtype can be determined (e.g., a mean, a median, a mode, a geometric mean). The statistical measure of gene expression for patients of a first subtype are compared to a statistical measure of gene expression for patients of a second subtype. This can be performed across the different patient subtypes and across various genes. Thus, the differentially expressed gene data 620 includes gene level fold changes of different biomarkers across different patient subtypes.
In various embodiments, the differentially expressed gene data 620 includes gene level fold changes for at least twenty biomarkers. In various embodiments, the differentially expressed gene data 620 includes gene level fold changes for at least fifty biomarkers. In various embodiments, the differentially expressed gene data 620 includes gene level fold changes for at least 100 biomarkers, at least 200 biomarkers, at least 300 biomarkers, at least 400 biomarkers, at least 500 biomarkers, at least 1000 biomarkers, at least 2000 biomarkers, at least 3000 biomarkers, at least 4000 biomarkers, at least 5000 biomarkers, at least 10,000 biomarkers, at least 50,000 biomarkers, or at least 100,000 biomarkers.
In various embodiments, the differentially expressed gene data 620 can be represented as a database or a table that documents gene level fold changes between patients of different subtypes. An example of such a gene level fold changes between patient subtypes is shown below in Table 7. Specifically, for each gene, a gene level fold change (e.g., ratio) between different subtypes (e.g., subtype A/subtype B denoted as “A/B”) is shown.
To determine a therapeutic hypothesis 650, patterns of gene level fold changes are identified across the differentially expressed gene data 620. In various embodiments, patterns of gene level fold changes refer to at least a threshold number of genes that are differentially expressed in a first patient subtype in comparison to a second patient subtype. In various embodiments, patterns of gene level fold changes refer to at least a threshold number of genes that are overexpressed in a first patient subtype in comparison to a second patient subtype. In various embodiments, patterns of gene level fold changes refer to at least a threshold number of genes that are underexpressed in a first patient subtype in comparison to a second patient subtype.
In various embodiments, the threshold number of genes include genes that are involved in a common biological pathway. Example biological pathways include, but are not limited to: innate immune pathways, chronic inflammation pathways, acute inflammation pathways, coagulation pathways, complement pathways, signaling pathways (e.g., TLR signaling pathway or glucocorticoid receptor signaling pathway), and the like. In various embodiments, the involvement of genes in certain biological pathways is curated from publicly available databases such as the Reactome Pathway Database or the KEGG Pathway database.
In various embodiments, the threshold number of genes involved in a common biological pathway is at least 2 genes. In various embodiments, the threshold number of genes is at least 3 genes, at least 4 genes, at least 5 genes, at least 6 genes, at least 7 genes, at least 8 genes, at least 9 genes, at least 10 genes, at least 15 genes, at least 20 genes, at least 25 genes, at least 50 genes, at least 75 genes, at least 100 genes, at least 200 genes, at least 300 genes, at least 400 genes, at least 500 genes, or at least 1000 genes. In various embodiments, the threshold number of genes involved in a common biological pathway is 2 genes. In various embodiments, the threshold number of genes involved in a common biological pathway is 3 genes, 4 genes, 5 genes, 6 genes, 7 genes, 8 genes, 9 genes, 10 genes, 11 genes, 12 genes, 13 genes, 14 genes, 15 genes, 16 genes, 17 genes, 18 genes, 19 genes, 20 genes, 25 genes, 50 genes, 75 genes, 100 genes, 200 genes, 300 genes, 400 genes, 500 genes, 600 genes, 700 genes, 800 genes, 900 genes, or 1000 genes.
Altogether, patterns of gene level fold changes, as indicated by a threshold number of genes involved in a common biological pathway, are useful for understanding the underlying biology that may be involved in a patient subtype. For example, genes involved in inflammation may be differentially expressed in subtype A in comparison to those genes in subtype B. Thus, subtype A can be associated or characterized by inflammation based processes.
The patterns of gene level fold changes between subtypes is analyzed to determine a therapeutic hypothesis 650 which, in some scenarios, includes a class of a candidate therapeutic of a candidate therapeutic itself (e.g., including but not limited to a drug therapy or a gene therapy). For example, given the characterization that a particular patient subtype is associated with an underlying biological pathway or process, a target involved in the biological pathway or process can serve as a druggable target. Thus, a class of a candidate therapeutic or a candidate therapeutic that modulates the target involved in the biological pathway can be promising as a therapeutic hypothesis 650. Examples of a class of a therapy include, but are not limited to: immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, anti-inflammatory therapy, a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, a blocker of a pro-inflammatory cytokine, modulators of coagulation therapy, and modulators of vascular permeability therapy. Examples of a candidate therapy include but are not limited to: GM-CSF, anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, anti-BTLA, IL-7, INF-gamma, IFN-beta 1a regulator, IL-22 agonist, IFN-alpha regulator, IFN-lambda regulator, IFN-alpha 2b stimulant, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6, Anti-HMGB1, ST2 antibody, IL-33 antibody, activated protein C, antithrombin, and thrombomodulin.
The therapeutic pharmacology data 630 is useful for developing a therapeutic hypothesis for a particular class of therapy or for a particular candidate therapy. Generally, therapeutic pharmacology data 630 is useful for understanding what therapeutic effects, if any, can be imparted by a class of therapy or candidate therapy. For example, therapeutic pharmacology data 630 can include molecular data of therapeutics, clinical pharmacology data of therapeutics (e.g., pharmacokinetics and pharmacodynamics data), and/or data identifying therapeutics that are useful for modulating activity in particular biological pathways. For example, for a given candidate therapeutic (e.g., an anti-PD-1 inhibitor), the therapeutic pharmacology data 630 is useful for understanding how different patients respond to the anti-PD-1 inhibitor.
Examples of therapeutic pharmacology data 630 is shown in
The response pathobiology data 640 is useful for developing a hypothesis as to therapeutic effects, independent of a particular candidate therapeutic, that may benefit a particular patient subtype. In various embodiments, response pathobiology data 640 can include patient data corresponding to patients that responded favorably. In various embodiments, response pathobiology data 640 includes patient data of patient subtypes that indicate differential expression of biomarkers associated with certain biological activity. The differentially expressed biomarkers can be promising targets for modulation. For example, dysregulated host response patients of subtype A exhibit up-regulation of biomarkers associated with innate immune activity involved in pathogen recognition (e.g., via recognition of pathogen-associated molecular patterns (PAMPs)), up-regulation of biomarkers associated with innate immune regulation, and up-regulation of biomarkers associated with adaptive immune activity. As another example, dysregulated host response patients of subtype B exhibit up-regulation of biomarkers associated with innate immune activity involved in recognition of damage-associated molecular patterns (DAMPs), up-regulation of biomarkers associated with DAMPs, up-regulation of biomarkers associated with inflammation (e.g. TNF-alpha), up-regulation of biomarkers associated with complement activity, down-regulation of biomarkers associated with adaptive immune activity, up-regulation of biomarkers associated with adaptive immune suppression, and up-regulation of markers associated with increased risk of acute kidney injury. As another example, subtype C patients exhibit down-regulation of biomarkers associated with innate and adaptive immune activity, up-regulation of biomarkers associated with DAMPs, up-regulation of biomarkers associated with cellular recruitment (e.g. G-CSF and GM-CSF), up-regulation of biomarkers associated with increased risk of thrombosis, and up-regulation of biomarkers associated with coagulation.
The therapeutic hypothesis 650 for a patient subtype can be subsequently tested and validated. For example, the therapeutic hypothesis 650 can be tested in pre-clinical or clinical studies and trials (e.g., a randomized controlled trial) by providing subjects or patients of the subtype a candidate therapeutic and monitoring their response.
IIC. Patient Subtype Classifier
In various embodiments, the patient subtype classifier is a machine-learned model that analyzes quantitative data of biomarkers or classification-specific scores derived from quantitative data of biomarkers and predicts a classification. In various embodiments, the patient subtype classifier is any one of a regression model (e.g., linear regression, logistic regression, or polynomial regression), decision tree, random forest, support vector machine, Naïve Bayes model, k-means cluster, or neural network (e.g., feed-forward networks, convolutional neural networks (CNN), deep neural networks (DNN), autoencoder neural networks, generative adversarial networks, or recurrent networks (e.g., long short-term memory networks (LSTM), bi-directional recurrent networks, deep bi-directional recurrent networks), or any combination thereof.
The patient subtype classifier can be trained using a machine learning implemented method, such as any one of a linear regression algorithm, logistic regression algorithm, decision tree algorithm, support vector machine classification, Naïve Bayes classification, K-Nearest Neighbor classification, random forest algorithm, deep learning algorithm, gradient boosting algorithm, and dimensionality reduction techniques such as manifold learning, principal component analysis, factor analysis, autoencoder regularization, and independent component analysis, or combinations thereof. In various embodiments, the patient subtype classifier is trained using supervised learning algorithms, unsupervised learning algorithms, semi-supervised learning algorithms (e.g., partial supervision), weak supervision, transfer, multi-task learning, or any combination thereof.
In various embodiments, the patient subtype classifier has one or more parameters, such as hyperparameters or model parameters. Hyperparameters are generally established prior to training. Examples of hyperparameters include the learning rate, depth or leaves of a decision tree, number of hidden layers in a deep neural network, number of clusters in a k-means cluster, penalty in a regression model, and a regularization parameter associated with a cost function. Model parameters are generally adjusted during training. Examples of model parameters include weights associated with nodes in layers of neural network, support vectors in a support vector machine, and coefficients in a regression model. The model parameters of the patient subtype classifier are trained (e.g., adjusted) using the training data to improve the predictive capacity of the patient subtype classifier.
In some embodiments, the patient subtype classifier is a regression, such as a logistic regression. Parameters of the logistic regression are trained using the training data such that when the logistic regression is applied, it outputs a classification based on the different classification-specific scores. The parameters of the logistic regression can be trained to maximize the differences between the different classifications (e.g., subtype A, subtype B, and subtype C).
In some embodiments, the patient subtype classifier is a support vector machine. The support vector machine is trained with a single or a set of hyperplanes that maximizes the differences among the X different classifications. In one embodiment, the support vector machine is trained with single or a set of hyperplanes that maximizes the differences among 3 different classifications (e.g., subtype A, subtype B, and subtype C). As a specific example, the support vector machine is trained with a set of hyperplanes that maximizes the differences among the 3 different classification-specific scores (e.g., scores for each of subtype A, subtype B, and subtype C). Therefore, the trained support vector machine can use the hyperplanes to output a prediction of a classification when provided quantitative data of biomarkers or classification-specific scores derived from quantitative data of biomarkers.
In some embodiments, the patient subtype classifier may be a non-machine learned model. The patient subtype classifier may employ one or more threshold values for comparison against the classification-specific scores. Depending on the comparison between the threshold values and the classification-specific scores, the patient subtype classifier outputs a predicted classification. In various embodiments, a threshold value is specific for a classification. Therefore, there may be X threshold values to be compared against X classification-specific scores.
In some embodiments, a threshold value may be a fixed value (e.g., fixed value=0). Here, the classification-specific scores are compared to the fixed threshold value and patient subtype classifier determines the classification based on the comparison. For example, assuming there are two classification-specific scores, the patient subtype classifier may compare each of the first classification-specific score and the second classification-specific score to the fixed threshold. In one embodiment, if the first classification-specific score is greater than the fixed threshold value and the second classification-specific score is less than a fixed threshold value, then the patient subtype classifier can output a particular classification. Similar logic can be applied for determining classifications using more than two classification-specific scores.
In some embodiments, a threshold value may be determined from training samples including data from patients who have been classified (e.g., classified as subtype A, subtype B, and/or subtype C). Such a threshold value may derived from a receiver operating curve (ROC) demonstrating the sensitivity/specificity of a model that classified the patients of the training samples. For example, for patients in the training sample classified as subtype A, a receiver operating curve is generated that demonstrates the sensitivity and specificity of the classifier. The threshold value can be the top-left part of the plot, representing the closest point in the ROC to perfect sensitivity or specificity.
The classification-specific scores are compared to corresponding threshold values, and based on the comparison, the patient subtype classifier determines the classification. For example, assuming there are two classification-specific scores for subtype A and subtype B, the patient subtype classifier may compare the subtype A classification-specific score to a subtype A threshold value and may further compare the subtype B classification-specific score to a subtype B threshold value. Thus, depending on the two comparisons, the patient subtype classifier determines the classification. In one embodiment, if the first classification-specific score is greater than the first threshold value and the second classification-specific score is less than a second threshold value, then the patient subtype classifier can output a particular classification. Similar logic can be applied for determining classifications using more than two classification-specific scores and/or more than two threshold values. Examples of subtype specific threshold values that are derived from training samples are described below in Table 18.
IID. Biomarker Panel
Embodiments described herein involve the analysis of biomarkers. As described herein, a biomarker panel, also referred to as a biomarker set, can be implemented to analyze values of biomarkers for a patient. In various embodiments, a biomarker panel can be a multivariate biomarker panel. In such embodiments, the multivariate biomarker panel includes more than one biomarker. In various embodiments, the multivariate biomarker panel includes two biomarkers. In various embodiments, the multivariate biomarker panel includes 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 biomarkers. In particular embodiments, the multivariate biomarker panel includes 3 biomarkers. In particular embodiments, the multivariate biomarker panel includes 4 biomarkers. In particular embodiments, the multivariate biomarker panel includes 5 biomarkers. In particular embodiments, the multivariate biomarker panel includes 6 biomarkers. In particular embodiments, the multivariate biomarker panel includes 8 biomarkers. In particular embodiments, the multivariate biomarker panel includes 10 biomarkers. In particular embodiments, the multivariate biomarker panel includes 15 biomarkers. In particular embodiments, the multivariate biomarker panel includes 16 biomarkers. In particular embodiments, the multivariate biomarker panel includes 24 biomarkers.
In various embodiments, the multivariate biomarker panel includes biomarkers selected from the following markers: EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4.
In various embodiments, the multivariate biomarker panel includes at least two biomarkers selected from the following markers: EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, TOMM70A, ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, ANXA3, C14orf159, PUM2, EPB42, RPS6KA5, GBP2, MSH2, DCTD, HK3, UCP2, NUP88, GABARAPL2, and CASP4.
In various embodiments, the multivariate biomarker panel include X number of biomarkers, where X is the number of possible classifications that the patient subtype classifier can predict. For example, for a patient subtype classifier that predicts three different subtypes (e.g., subtype A, subtype B, and subtype C), the multivariate biomarker panel can include three different biomarkers.
In various embodiments, the multivariate biomarker panel includes a first biomarker selected from EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, a second biomarker selected from SERPINB1 and GSPT1, and a third biomarker selected from MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In various embodiments, the multivariate biomarker panel includes one or more biomarkers selected from EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, one or more biomarkers selected from SERPINB1 and GSPT1, and one or more biomarkers selected from MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A.
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from ZNF831, MME, CD3G, and STOM, a second biomarker selected from ECSIT, LAT, and NCOA4, and a third biomarker selected from SLC1A5, IGF2BP2, and ANXA3. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from C14orf159 and PUM2, a second biomarker selected from EPB42 and RPS6KA5, and a third biomarker selected from GBP2. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from MSH2, DCTD, and MMP8, a second biomarker selected from HK3, UCP2, and NUP88, and a third biomarker selected from GABARAPL2 and CASP4. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from STOM, ZNF831, CD3G, MME, BTN3A2, and HLA-DPA1, a second biomarker selected from EPB42, GSPT1, LAT, HK3, and SERPINB1, and a third biomarker selected from GBP2, TNFRSF1A, SLC1A5, IGF2BP2, and ANXA3. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from STOM, ZNF831, CD3G, MME, BTN3A2, and HLA-DPA1, a second biomarker selected from EPB42, GSPT1, LAT, HK3, and SERPINB1, and a third biomarker selected from GBP2, SLC1A5, IGF2BP2, and ANXA3. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from STOM, MME, BTN3A2, HLA-DPA1, and EVL, a second biomarker selected from EPB42, GSPT1, LAT, HK3, and SERPINB1, and a third biomarker selected from BTN3A2, TNFRSF1A, SLC1A5, IGF2BP2, and ANXA3. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
In one embodiment, the multivariate biomarker panel includes a first biomarker selected from STOM, MME, BTN3A2, HLA-DPA1, and EVL, a second biomarker selected from EPB42, GSPT1, LAT, HK3, and SERPINB1, and a third biomarker selected from GBP2, SLC1A5, IGF2BP2, and ANXA3. Example accuracies of a multivariate biomarker panel implementing combinations of three biomarkers described above is shown in
Although the embodiments described above may refer to a “first biomarker,” “second biomarker,” and/or “third biomarker,” the terms “first biomarker,” “second biomarker,” and/or “third biomarker,” each encompass one or more biomarkers. For example, a “first biomarker” can refer to one or more biomarkers selected from EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8. A “second biomarker” can refer to one or more biomarkers selected from SERPINB1 and GSPT1. A “third biomarker” can refer to one or more biomarkers selected from MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty one, twenty two, twenty three, or twenty four biomarkers selected from EVL, BTN3A2, HLA-DPA1, IDH3A, ACBD3, EXOSC10, SNRK, MMP8, SERPINB1, GSPT1, MPP1, HMBS, TAL1, C9orf78, POLR2L, SLC27A3, BTN3A2, DDX50, FCHSD2, GSTK1, UBE2E1, TNFRSF1A, PRPF3, and TOMM70A.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, or ten biomarkers selected from ZNF831, MME, CD3G, STOM, ECSIT, LAT, NCOA4, SLC1A5, IGF2BP2, and ANXA3.
In one embodiment, the multivariate biomarker panel includes four or five biomarkers selected from C14orf159, PUM2, EPB42, RPS6KA5, and GBP2.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, or eight biomarkers selected from MSH2, DCTD, MMP8, HK3, UCP2, NUP88, GABARAPL2 and CASP4.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen or sixteen biomarkers selected from STOM, ZNF831, CD3G, MME, BTN3A2, HLA-DPA1, EPB42, GSPT1, LAT, HK3, SERPINB1, GBP2, TNFRSF1A, SLC1A5, IGF2BP2, and ANXA3.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, or fifteen biomarkers selected from STOM, ZNF831, CD3G, MME, BTN3A2, HLA-DPA1, EPB42, GSPT1, LAT, HK3, SERPINB1, GBP2, SLC1A5, IGF2BP2, and ANXA3.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, or fifteen biomarkers selected from STOM, MME, BTN3A2, HLA-DPA1, EVL, EPB42, GSPT1, LAT, HK3, SERPINB1, BTN3A2, TNFRSF1A, SLC1A5, IGF2BP2, and ANXA3.
In one embodiment, the multivariate biomarker panel includes four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, or fifteen biomarkers selected from STOM, MME, BTN3A2, HLA-DPA1, EVL, EPB42, GSPT1, LAT, HK3, SERPINB1, GBP2, SLC1A5, IGF2BP2, and ANXA3.
IIE. Assays
As shown in
In various embodiments, the assay can be any one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction. In particular embodiments, the assay is a RT-qPCR assay or a LAMP assay. For example, in a critical care setting where a classification and therapy recommendation is to be rapidly developed for a patient (e.g., within 30 minutes or within 2 hours), assay can be RT-qPCR or a LAMP assay that enables rapid quantification of the biomarkers in a sample obtained from the patient.
In various embodiments, the marker quantification assay 120 involves performing sequencing to obtain sequence reads (e.g., sequence reads for generating a sequencing library). The sequence reads can be quantified to determine quantitative data of biomarkers. Sequence reads can be achieved with commercially available next generation sequencing (NGS) platforms, including platforms that perform any of sequencing by synthesis, sequencing by ligation, pyrosequencing, using reversible terminator chemistry, using phospholinked fluorescent nucleotides, or real-time sequencing. As an example, amplified nucleic acids may be sequenced on an Illumina MiSeq platform.
When pyrosequencing, libraries of NGS fragments are cloned in-situ amplified by capture of one matrix molecule using granules coated with oligonucleotides complementary to adapters. Each granule containing a matrix of the same type is placed in a microbubble of the “water in oil” type and the matrix is cloned amplified using a method called emulsion PCR. After amplification, the emulsion is destroyed and the granules are stacked in separate wells of a titration picoplate acting as a flow cell during sequencing reactions. The ordered multiple administration of each of the four dNTP reagents into the flow cell occurs in the presence of sequencing enzymes and a luminescent reporter, such as luciferase. In the case where a suitable dNTP is added to the 3′ end of the sequencing primer, the resulting ATP produces a flash of luminescence within the well, which is recorded using a CCD camera. It is possible to achieve a read length of more than or equal to 400 bases, and it is possible to obtain 106 readings of the sequence, resulting in up to 500 million base pairs (megabytes) of the sequence. Additional details for pyrosequencing is described in Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 6,210,891; 6,258,568; each of which is hereby incorporated by reference in its entirety.
On the Solexa/Illumina platform, sequencing data is produced in the form of short readings. In this method, fragments of a library of NGS fragments are captured on the surface of a flow cell that is coated with oligonucleotide anchor molecules. An anchor molecule is used as a PCR primer, but due to the length of the matrix and its proximity to other nearby anchor oligonucleotides, elongation by PCR leads to the formation of a “vault” of the molecule with its hybridization with the neighboring anchor oligonucleotide and the formation of a bridging structure on the surface of the flow cell. These DNA loops are denatured and cleaved. Straight chains are then sequenced using reversibly stained terminators. The nucleotides included in the sequence are determined by detecting fluorescence after inclusion, where each fluorescent and blocking agent is removed prior to the next dNTP addition cycle. Additional details for sequencing using the Illumina platform is found in Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 6,833,246; 7,115,400; 6,969,488; each of which is hereby incorporated by reference in its entirety.
Sequencing of nucleic acid molecules using SOLiD technology includes clonal amplification of the library of NGS fragments using emulsion PCR. After that, the granules containing the matrix are immobilized on the derivatized surface of the glass flow cell and annealed with a primer complementary to the adapter oligonucleotide. However, instead of using the indicated primer for 3′ extension, it is used to obtain a 5′ phosphate group for ligation for test probes containing two probe-specific bases followed by 6 degenerate bases and one of four fluorescent labels. In the SOLiD system, test probes have 16 possible combinations of two bases at the 3′ end of each probe and one of four fluorescent dyes at the 5′ end. The color of the fluorescent dye and, thus, the identity of each probe, corresponds to a certain color space coding scheme. After many cycles of alignment of the probe, ligation of the probe and detection of a fluorescent signal, denaturation followed by a second sequencing cycle using a primer that is shifted by one base compared to the original primer. In this way, the sequence of the matrix can be reconstructed by calculation; matrix bases are checked twice, which leads to increased accuracy. Additional details for sequencing using SOLiD technology is found in Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 5,912,148; 6,130,073; each of which is incorporated by reference in its entirety.
In particular embodiments, HeliScope from Helicos BioSciences is used. Sequencing is achieved by the addition of polymerase and serial additions of fluorescently-labeled dNTP reagents. Switching on leads to the appearance of a fluorescent signal corresponding to dNTP, and the specified signal is captured by the CCD camera before each dNTP addition cycle. The reading length of the sequence varies from 25-50 nucleotides with a total yield exceeding 1 billion nucleotide pairs per analytical work cycle. Additional details for performing sequencing using HeliScope is found in Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 7,169,560; 7,282,337; 7,482,120; 7,501,245; 6,818,395; 6,911,345; 7,501,245; each of which is incorporated by reference in its entirety.
In some embodiments, a Roche sequencing system 454 is used. Sequencing 454 involves two steps. In the first step, DNA is cut into fragments of approximately 300-800 base pairs, and these fragments have blunt ends. Oligonucleotide adapters are then ligated to the ends of the fragments. The adapter serve as primers for amplification and sequencing of fragments. Fragments can be attached to DNA-capture beads, for example, streptavidin-coated beads, using, for example, an adapter that contains a 5′-biotin tag. Fragments attached to the granules are amplified by PCR within the droplets of an oil-water emulsion. The result is multiple copies of cloned amplified DNA fragments on each bead. At the second stage, the granules are captured in wells (several picoliters in volume). Pyrosequencing is carried out on each DNA fragment in parallel. Adding one or more nucleotides leads to the generation of a light signal, which is recorded on the CCD camera of the sequencing instrument. The signal intensity is proportional to the number of nucleotides included. Pyrosequencing uses pyrophosphate (PPi), which is released upon the addition of a nucleotide. PPi is converted to ATP using ATP sulfurylase in the presence of adenosine 5′phosphosulfate. Luciferase uses ATP to convert luciferin to oxyluciferin, and as a result of this reaction, light is generated that is detected and analyzed. Additional details for performing sequencing 454 is found in Margulies et al. (2005) Nature 437: 376-380, which is hereby incorporated by reference in its entirety.
Ion Torrent technology is a DNA sequencing method based on the detection of hydrogen ions that are released during DNA polymerization. The microwell contains a fragment of a library of NGS fragments to be sequenced. Under the microwell layer is the hypersensitive ion sensor ISFET. All layers are contained within a semiconductor CMOS chip, similar to the chip used in the electronics industry. When dNTP is incorporated into a growing complementary chain, a hydrogen ion is released that excites a hypersensitive ion sensor. If homopolymer repeats are present in the sequence of the template, multiple dNTP molecules will be included in one cycle. This results in a corresponding amount of hydrogen atoms being released and in proportion to a higher electrical signal. This technology is different from other sequencing technologies that do not use modified nucleotides or optical devices. Additional details for Ion Torrent Technology is found in Science 327 (5970): 1190 (2010); US Patent Application Publication Nos. 20090026082, 20090127589, 20100301398, 20100197507, 20100188073, and 20100137143, each of which is incorporated by reference in its entirety.
In various embodiments, immunoassays designed to quantitate markers can be used in screening including multiplex assays. Measuring the concentration of a target marker in a sample or fraction thereof can be accomplished by a variety of specific assays. For example, a conventional sandwich type assay can be used in an array, ELISA, RIA, etc. format. Other immunoassays include Ouchterlony plates that provide a simple determination of antibody binding. Additionally, Western blots can be performed on protein gels or protein spots on filters, using a detection system specific for the markers as desired, conveniently using a labeling method.
Protein based analysis, using an antibody that specifically binds to a polypeptide (e.g. marker), can be used to quantify the marker level in a test sample obtained from a subject. In various embodiments, an antibody that binds to a marker can be a monoclonal antibody. In various embodiments, an antibody that binds to a marker can be a polyclonal antibody. For multiplex analysis of markers, arrays containing one or more marker affinity reagents, e.g. antibodies can be generated. Such an array can be constructed comprising antibodies against markers. Detection can utilize one or a panel of marker affinity reagents, e.g. a panel or cocktail of affinity reagents specific for one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty one, twenty two, twenty three, twenty four or more markers.
In various embodiments, determining the quantitative expression data for each of the at least three biomarkers comprises: contacting the sample with a reagent; generating a plurality of complexes between the reagent and the plurality of biomarkers in the sample; and detecting the plurality of complexes to obtain a dataset associated with the sample, wherein the dataset comprises the quantitative expression data for the biomarker.
Custom processing of 14 datasets from sepsis studies from the literature was performed to identify dysregulated host response subtypes.46 For each study, patients were classified as either adult or pediatric. To distinguish between pediatric and adult patients, manual literature review was performed. Then, adult patients were classified as either sepsis (S) or septic shock (SS), septic shock being a subset of sepsis. To distinguish between adult sepsis and adult septic shock patients, the rate of patient vasopressor use reported in the literature (normally at the first day) was used. If the rate of patient vasopressor use was more than 50%, the whole study cohort was classified as septic shock. In contrast, if the rate of patient vasopressor use was less than 50%, the whole study cohort was classified as sepsis. Based on these classifications of adult or pediatric and sepsis or septic shock, patient samples were classified as Full samples (including adult, pediatric, sepsis, and septic shock patient samples), SS samples (including only adult septic shock patient samples), S samples (including only adult sepsis patient samples), and P samples (including only pediatric sepsis and septic shock patient samples).
Following classification of the patient samples from each literature study, for each study, biomarker expression data were normalized within the study and curated with methodologies specific to the study's array platform technology and to the study's available data format. Healthy control samples and patient samples were processed by the COCONUT framework,47 which normalized the samples with the same array platform and transformed patient expression data according to normalization parameters derived from the healthy samples. The resulting expression data were quantile normalized across patients and studies at the end of the normalization process.
The COINCIDE algorithm was then used to rank the genes based on the expression data.47 Then, for each set of classified patient samples (e.g., Full samples, SS samples, S samples, and P samples), for each subset of genes ranked (i.e. 100, 250, 500, 1000, 1500 genes, so on and so forth), the COMMUNAL clustering algorithm was used to identify the optimal number of clusters,47, 49 as well as to label each patient sample.46 COMMUNAL maps of cluster optimality were generated for each set of classified patient samples, in which the X-axis is the number of clusters, the Y-axis is the number of included genes, and the Z-axis is the mean validity score.
The COMMUNAL map of cluster optimality of a Full Model (including adult, pediatric, sepsis, and septic shock patient samples), exhibited three clusters for 574 out of 700 training samples. The remaining training samples were reported as inconclusive.
The COMMUNAL map of cluster optimality for a SS Model (including only adult septic shock patient samples), exhibited three clusters for 115 out of 165 training samples.
The COMMUNAL map of cluster optimality for a S Model (including only adult septic patient samples), exhibited four clusters for 153 out of 308 training samples. However, the fourth cluster did not reveal consistent results among different clustering algorithms.
The COMMUNAL map of cluster optimality for a P Model (including only pediatric sepsis and septic shock patient samples), exhibited three clusters for 180 out of 227 training samples.
Stable optima were consistently observed at K=3 clusters. Using the patient expression data and cluster labels, Gene Ontology (GO) analysis was performed to characterize the nature and functionality of each cluster, hereinafter referred to as a “subtype”. Subtypes were named as A (lower mortality, adaptive immune activation), B (higher mortality, innate immune activation), and C (higher mortality, older, and with clinical and molecular evidence of coagulopathy).46 The biological functions indicated in the GO analysis demonstrated distinct characteristics among the different subtypes, indicating high potential for guided treatment.
Eight classification Models, including the Full Model based on the Full samples, the SS Model based on the SS samples, the S Model based on the S samples, the P Model based on the P samples, as well as the SS.B1, SS.B2, SS.B3, and SS.B4 models were developed. As detailed below, to train the classification Models based on the associated samples, training labels for each training sample were determined using unsupervised clustering procedures including, normalization, the COCONUT method, the COINCIDE method, and the COMMUNAL method.
The methodology of building the classifiers was guided by a number of considerations, particularly in data transformation and normalization, which impact classifier performance the most. Specifically, the classifiers were built based on the following considerations. First, the classification is time-sensitive because dysregulated host response progression is dynamic (e.g., patients can transition from one subtype to another over time). Therefore, time matched data were analyzed. The time matched data analyzed included data from blood collected from patients within 24 hours of sepsis diagnosis. In cases in which time series data existed, data from the first time point was used. Second, the final classification was envisioned to be a measure of a few biomarkers selected from tens of thousands of biomarkers, so down-selection of the most important biomarkers was implemented. Third, the training sets for the subtype classifiers did not have any outcome labels based on a randomized placebo-controlled trial design, so the trial datasets were selected exclusively as a test set for classifier performance evaluation. Fourth, the VANISH trial raw expression data were measured with the Illumina platform and reported in a different format than the format of expression data of the training set, so the normalization used in the clustering process required special consideration. Fifth, the classification process applied similar data transformation to the training set and the test set to achieve the best performance. Finally, the transformation and normalization strategies that worked best for the clustering process and even the training process may not necessarily perform well in classifying subtypes to differentiate corticosteroid response because the training set did not involve outcome data.
Based on these six considerations, the classifiers were built with a normalization scheme for both the training and test expression data. A platform normalization matrix was built out of all genes of all healthy and sepsis samples. As the number of samples in the matrix was large, individual samples' expression data were quantile normalized against the matrix as a perturbation. To train the classifiers, the expression data from the training set was batch normalized and curated, and then normalized by the platform normalization matrix, as described in detail below.
Sets of potentially significant biomarkers were identified by Significance Analysis of Microarrays (SAM).48 As another example, sets of potentially significant biomarkers are identifiable using qPCR or RNA sequencing data. qPCR measures the relative or absolute expression level of biomarkers. Normalization or calibration processes are implemented. RNA sequencing data measures relative expression levels of model genes and their transcripts. Using sequencing reads alignment methods (e.g. Hisat2, and Bowtie2), expression estimation methods (e.g. StringTie, Salmon) and normalization processes (e.g. quantile normalization), the estimated expression of model genes are quantified.
These sets of potentially significant biomarker sets were down-selected by at least 2-fold change, and forward-search methodology was used to identify a small set of biomarkers for feature calculation.51 The calculated features (e.g., summarized differential gene expression)51 and clustering label of each sample were finally used to train the multi-class classifiers, implemented as e1071::svm with radial kernel, 0.1 gamma, and 10 cost. Tables 1, 2A-2B, and 3 below depict the genes identified for each subtype (e.g., A, B, and C) for each classifier (e.g., the Full Model, the SS Model, the S Model, and the P Model). Specifically, Table 1 depicts the genes for each subtype (e.g., A, B, and C) for the Full Model, Table 2A depicts the genes for each subtype (e.g., A, B, and C) for the SS Model, Table 2B depicts the genes for each subtype (e.g., A, B, and C) for the S Model, and Table 3 depicts the genes for each subtype (e.g., A, B, and C) for the P Model. Note that in certain embodiments, the entire set of genes for a given Model is used to train and/or test the Model. However, in alternative embodiments, only a subset of the set of genes for a given Model is used to train and/or test the Model. For example, in some embodiments, at least one gene from each subtype A, B, and C (e.g., at least one gene from each row 1, 2, and 3 in one of the below Tables 1, 2A, 2B, and 3) may be used to train and/or test a model.
Additional models were created in order to include at least one up- and one down-gene in the model to enable the calculation of scores in an assay based on relative gene expression. Two methods were applied based on forward selection and backward elimination. Forward selection is an iterative method that starts with no genes in the model. In each iteration, features are added that improves the model until the addition of a new variable does not improve the performance of the model. In backward elimination, all the genes are included and then the least significant feature is removed at each iteration if there is improvement in the performance of the model. This is repeated until no improvement is observed from the removal of features. As an example, the SS model was taken as a starting point for the creation of an alternative model. The metric used for evaluating model performance was leave-one-out accuracy and the model's similarity in labeling patients when compared to the Full model.
In this exercise, the backward elimination method produced superior results. Tables 4A-4D depicts four additional models generated by this method named SS.B1, SS.B2, SS.B3, and SS.B4.
Table 5 depicts primer sets for amplifying genes identified by the SS Model and depicted above in Table 2A, primer sets for amplifying genes identified by the S Model and depicted above in Table 3B, and primer sets for amplifying genes identified by the SS.B2 Model and depicted above in Table 4B. Each primer set includes a pair of single-stranded DNA primers (i.e., a forward primer and a reverse primer) for amplifying one gene by, for example, RT-qPCR. In some embodiments the entire sequence of a primer may be used in amplification of the associated gene. In alternative embodiments, at least 15 contiguous nucleotides of a primer sequence may be used in amplification of the associated gene. In certain embodiments, primer sequences other than those mentioned provided in Table 5 can be used to amplify one or more of the genes from Tables 1, 2A, 2B, 3, and 4A-4D.
In certain embodiments, genes may be amplified by methods other than RT-qPCR. For example, in some embodiments, genes may be amplified via LAMP (loop-mediated isothermal amplification). In such embodiments in which a gene is amplified via LAMP, a primer set for amplifying the gene includes a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer.
Sensitivity analysis was performed for each classifier, for each combination of three genes, with one gene selected from each subtype. Specifically, the accuracies of the classifiers were measured to demonstrate that the accuracy of each classifier in identifying a subject's subtype using any combination of three genes, with one gene from each subtype, is greater than 50% (e.g., greater than random chance).
To calculate accuracy for a given classifier for a given combination of three genes, leave-one-out accuracy of the training samples of the training dataset on which the classifier was trained was implemented. The training dataset included N training samples, each training sample including a label y and features x. The leave-one-out accuracy for the classifier for the combination of three genes was calculated based on N calculations. The Ni calculation leaves out the training sample i during training of the classifier. Then, the trained classifier is used to make a prediction zi for the features xi that corresponds to the training sample i that was left out of the training data set. The prediction zi is then compared to the label yi to determine the accuracy of the prediction. The leave-one-out accuracy for the classifier for the combination of three genes was calculated as the number of correct predictions z, divided by N.
Based on the differential biomarker expression determined for each dysregulated host response subtype, an immune state was determined for each subtype. Specifically, subtype A was determined to be associated with the adaptive immune state, subtype B was determined to be associated with the innate immune state and the complement immune state, and subtype C was determined to be associated with the coagulopathic immune state. Then, biomarkers indicated as related to dysregulated host response, immune state, and the pharmacology of existing therapeutics were identified in the literature. Table 6 below depicts a representative list of genes associated with dysregulated host response, immune state, and the pharmacology of existing therapeutics that were identified from the literature.
For each gene in Table 6, the fold-change in gene expression was calculated between subtypes. Specifically, for each subtyping Model (Full/S/SS/P), linear regression was used to compare each gene expression among A/B/C subtypes. In order to adjust batch effects of microarray dataset from different studies, study IDs were included in the linear regression model. From the linear regression model, the coefficients of subtypes were used to calculate gene expression fold changes and Benjamini-Hochberg (BH)53 adjusted p-values of subtypes were used to indicate if expression differences were statistically significant. Table 7 below depicts a representative dataset for subtype fold-changes in expression of the genes in Table 6. The fold-changes in gene expression between subtypes (e.g. fold change “A/B”=2{circumflex over ( )}(A−B) where A and B are the log 2 mean expression for the listed gene for the given subtype A and B) are listed as the numerical values in the table. Bold or underlined indicates a statistically significant fold-change as determined by BH. Bold indicates up-regulation and underlined indicates down-regulation. This dataset was then used to identify therapeutic candidates for the treatment of dysregulated host response taking into account whether the gene is expected to be appreciably expressed in blood.
1.382
1.786
0.724
0.56
1.612
1.635
0.62
0.612
1.348
1.281
0.742
0.781
1.698
1.724
0.589
0.58
1.758
1.547
0.569
0.646
0.949
1.054
1.56
1.706
0.641
0.586
1.692
1.81
0.591
0.552
2.017
1.903
0.496
0.525
0.824
1.214
1.35
0.741
0.832
1.496
1.202
1.722
0.669
0.581
0.577
1.733
1.87
0.535
0.785
1.274
1.433
0.698
0.555
0.711
1.801
1.406
0.794
1.26
1.18
0.848
0.902
0.906
1.109
1.104
0.547
0.407
1.828
2.459
0.803
1.245
1.165
0.858
0.594
0.632
1.683
1.582
0.686
0.723
1.457
1.383
0.553
0.356
1.807
0.529
2.811
1.891
0.507
0.588
1.973
1.702
1.222
0.818
1.19
1.161
0.84
0.861
0.48
2.084
2.021
0.495
0.728
1.262
1.373
1.611
0.792
0.621
0.835
1.198
1.234
0.81
1.682
1.668
0.594
0.599
0.936
0.936
1.069
1.068
2.243
1.924
0.446
0.52
1.216
0.823
1.235
0.81
0.668
1.497
1.902
2.03
0.526
0.493
2.62
2.47
0.382
0.405
1.338
1.41
0.748
0.709
2.984
2.611
0.335
0.383
2.755
2.492
0.363
0.401
2.799
2.132
0.357
0.746
0.469
1.34
1.608
1.352
0.622
0.74
3.286
2.409
0.304
0.74
0.415
1.351
1.147
1.199
0.872
0.834
2.204
2.453
0.454
0.408
1.93
1.708
0.518
0.585
1.752
1.472
0.571
0.679
1.465
1.272
0.683
0.786
1.104
0.906
0.902
1.109
0.613
1.46
1.632
2.274
0.685
0.44
2.177
1.939
0.459
0.516
0.819
1.201
1.221
1.45
0.832
0.69
1.158
0.863
0.639
0.783
1.566
1.233
1.277
0.811
0.921
0.903
1.085
1.107
1.114
0.897
1.057
0.946
0.938
1.066
0.875
1.143
1.156
0.865
0.904
0.887
1.106
1.127
0.934
0.928
1.07
1.077
0.89
0.933
1.124
1.072
0.603
1.658
1.603
0.624
0.887
0.893
1.127
1.12
0.599
1.67
1.516
0.66
0.939
0.937
1.065
1.068
0.656
0.762
1.526
1.163
1.313
0.86
1.67
0.599
0.463
2.158
0.513
0.528
1.948
1.892
0.607
0.623
1.647
1.605
0.937
0.947
1.067
1.055
0.944
1.059
0.864
0.751
1.157
1.332
0.722
1.21
1.384
1.403
0.827
0.713
0.895
0.875
1.117
1.142
0.879
0.882
1.138
1.133
0.784
0.447
1.275
0.581
2.238
1.722
1.051
0.952
0.927
1.079
0.666
1.231
1.501
1.769
0.812
0.565
0.908
0.645
1.101
0.74
1.55
1.351
0.95
1.052
1.58
1.639
0.633
0.61
0.579
1.727
1.687
0.593
0.929
0.906
1.076
1.104
0.948
1.055
0.935
0.909
1.07
1.1
0.892
1.122
0.931
1.074
0.911
0.917
1.098
1.09
1.719
1.628
0.582
0.614
0.913
1.096
0.773
1.435
1.293
1.75
0.697
0.571
0.13
0.129
7.68
7.752
0.796
0.776
1.256
1.289
2.023
1.954
0.494
0.512
1.101
1.119
0.908
0.894
0.774
1.177
1.292
1.442
0.85
0.694
1.642
1.791
0.609
0.558
0.744
2.056
1.343
2.593
0.486
0.386
1.205
1.362
0.83
1.118
0.734
0.894
1.233
1.221
0.811
0.819
1.307
1.303
0.765
0.767
0.81
1.235
1.142
0.876
1.1
0.909
0.929
1.077
0.942
1.062
1.093
0.915
1.697
1.367
0.589
0.826
0.732
1.21
0.961
1.041
0.565
1.263
1.771
2.185
0.792
0.458
0.9
0.903
1.111
1.108
0.952
1.05
0.668
1.497
1.718
0.582
1.3
1.159
0.769
0.863
0.901
1.11
0.911
1.098
1.106
0.905
0.937
1.067
0.819
1.22
1.132
0.883
0.878
1.139
1.193
0.838
0.941
0.94
1.062
1.064
1.618
0.741
0.618
0.45
1.35
2.22
0.848
1.18
0.848
0.847
1.18
1.181
1.776
0.858
0.563
0.487
1.166
2.055
0.894
1.119
1.467
1.301
0.681
0.768
0.789
1.268
1.455
0.687
0.943
1.06
0.894
1.118
0.76
0.838
1.316
1.212
1.194
0.825
0.563
1.775
2.055
0.487
0.796
1.548
1.256
1.779
0.646
0.562
0.528
0.701
1.893
1.422
1.427
0.703
0.721
1.387
1.467
0.682
1.209
1.292
0.827
0.774
1.847
1.682
0.541
0.595
1.098
0.866
0.911
0.831
1.155
1.204
0.917
1.09
0.073
0.09
13.764
11.128
1.107
1.162
0.903
0.861
0.712
1.404
1.564
0.64
0.962
1.04
0.857
1.446
1.166
1.634
0.691
0.612
1.091
1.317
0.916
1.141
0.759
0.876
1.372
1.206
0.729
0.829
1.198
1.127
0.835
0.887
1.517
1.357
0.659
0.737
0.796
1.257
1.24
1.226
0.807
0.815
1.189
0.841
0.778
0.784
1.285
1.276
These findings of differential biomarker expression between subtypes A, B, and C inform general therapeutic strategies.
As discussed above, the genes of Tables 6 and 7 are associated with pharmacology of existing therapeutics. For instance, examples of existing therapeutics that are associated with certain genes are indicated in Table 7. Analysis of these genes of Tables 6 and 7 according to subtype, informs the use of the existing therapeutics associated with these genes for treating dysregulated host response patients of the subtype. Specifically, Table 8 depicts therapeutic hypotheses for systemic immune patients having subtypes A, B, and C, determined based on the analysis of differential gene expression of Table 7, in accordance with an embodiment.
As a specific example, while anti-TNF-alpha has failed to show benefit in past sepsis clinical trials, analysis of differential gene expression according to subtype can inform which specific subtype of patients may respond to anti-TNF-alpha. In this example, the TNF gene is seen to be up-regulated in patients having subtype B and thus, subtype B patients may specifically respond to anti-TNF-alpha therapy.
Table 8 below summarizes an analysis of existing therapeutics that are anticipated to provide the desired therapeutic effects for subtypes A, B, and C mentioned above.
coli improved
Macacairus/
Homo
sapiens
VI.A. Dysregulated Host Response Patient Subtype A
VI.A.1. Corticosteroids
As discussed in detail above, septic patients that remain hypotensive and require vasopressors to maintain a mean arterial pressure ≥65 mmHg are characterized as having septic shock—a condition that exhibits a hospital mortality in excess of 40%. Septic shock patients that show no clinical improvement (defined as having a systolic blood pressure <90 mmHg for more than one hour following both adequate fluid resuscitation and vasopressor therapy) are deemed refractory to vasopressor therapy and are thus characterized as refractory septic shock patients. In many cases, refractory septic shock patients are given corticosteroid therapy, such as hydrocortisone, based on rationale that the therapy may enable vasopressor responsiveness.
To evaluate the efficacy of hydrocortisone therapy in sepsis patients having subtypes A, B, and C, differential expression of the genes of Table 7 that are associated with pharmacology of hydrocortisone therapy were evaluated for the subtypes A, B, and C. Specifically,
Due to this differential expression of genes associated with glucocorticoid receptor signaling between patients of subtypes A, B, and C, it was hypothesized that hydrocortisone therapy is differentially effective for the different subtypes. To test this hypothesis, multiple cohort datasets were analyzed for differential expressions and survival rates to evaluate the effect of hydrocortisone among different dysregulated host response subtypes. Specifically, the constructed classifiers discussed above. were applied to two placebo-controlled trials: the VANISH trial and a Burn-Induced SIRS trial to evaluate the survival rate of the patients that received hydrocortisone therapy.13, 50
To evaluate hydrocortisone therapy response in dysregulated host response patients of the identified dysregulated host response patient subtypes, as discussed in detail below31 the patient subtype classifiers were applied to a transcriptomic dataset from a placebo-controlled hydrocortisone clinical trials in sepsis patients and burn-induced SIRS patients that failed to show a difference in mortality between the treatment and placebo arms of the trial. Differential responses to hydrocortisone therapy were identified for the different patient subtypes. Specifically, one patient subtype is shown to benefit from hydrocortisone, and one or both of the other patient subtypes are shown to worsen with hydrocortisone.
The test expression data from each trial were normalized by the platform normalization matrix described above13 so that the test data were more consistent with the training data. The classifiers (e.g., the Full Model, the SS Model, the S Model, and the P Model) were then applied to the normalized data such that the patients were classified into A, B, and C subtypes. In contrast to the COCONUT method, the normalization approach described herein is simpler because it does not use controls and instead employs a platform normalization matrix, and then selects all of the samples from the matrix used by the target platform of the target sample and then co-normalizes them together. Therefore, each sample in the target samples was normalized independently with the normalization matrix of the sample array platform.
Survival and mortality rates were calculated at day 28 because survival and mortality labels at other time points were not available. A single-time-point survival analysis was performed to observe the difference of survival rate between the hydrocortisone therapy group and the placebo group in each subtype. Binomial and Chi-squared with continuity correction tests were used to test for the significance of these differences. Mortality reduction when ruling out hydrocortisone was calculated as: 1−(Placebo's mortality rate/Hydrocortisone's mortality rate). Conversely, mortality reduction when ruling in hydrocortisone was calculated as: 1−(Hydrocortisone's mortality rate/Placebo's mortality rate), whichever denominator was larger.
Tables 9-16 below depict the survival analyses for each subtype (e.g., A, B, and C) for each classifier (e.g., the Full Model, the SS Model, the S Model, and the P Model) for each trial. Specifically, Tables 9 and 13 depicts survival analysis for each subtype (e.g., A, B, and C) for the Full Model, Tables 10 and 14 depicts survival analysis for each subtype (e.g., A, B, and C) for the SS Model, Tables 11 and 15 depicts survival analysis for each subtype (e.g., A, B, and C) for the S Model, and Tables 12 and 16 depicts survival analysis for each subtype (e.g., A, B, and C) for the P Model.
An alternative method for identifying patients that may be harmed by immunosuppressive effects of hydrocortisone is based on employing A and B scores to identify patients expected to exhibit increased immune activity and lower inflammation. In simple terms, this method is based on a classifying patients with a high A score and low B score.
In one example, previously identified subtypes of sepsis patients were used to tune the model to identify these type A and B patients. Two distinct sepsis response signatures (SRS1 and SRS2) were identified in five public studies (E-MTAB-4421, E-MTAB-4451, E-MTAB-5273, E-MTAB-5274, and E-MTAB-7581), where HumanHT-12 v4 BeadChip were used to generate the gene expression profiles of the patient samples. The processed data of those five studies were downloaded and processed using R programming language and software environment for statistical analysis (version 3.6.3). The Bioconductor annotation package, illuminaHumanv4.db (version 1.26.0), was used to annotate microarray probes and expression levels of genes were determined by each individual probe or mean of probes belonging to the same gene. In order to remove cohort biases, the Bioconductor package, limma (version 3.42.2), was used to remove batch effects. Using ss.b2 panel genes, subtype A, B, and C scores were calculated by geometric mean of up/down genes.
To build the classifier, we defined E-MTAB-4421, E-MTAB-4451, E-MTAB-5273, and E-MTAB-5274 as the training dataset and VANISH (E-MTAB-7581) as the testing dataset. We used features (subtype A, B, and C scores) and class labels (SRS1 vs SRS2) in the training dataset to build a machine-learned classifier based on support-vector machine (SVM) method. SVM is a supervised machine learning method for classification analysis. The algorithm finds a single or a set of hyperplanes that maximize the margin among subtype A, B, and C scores. In order to capture non-linear data, the kernel function was used. R package e1071 was used to build the SVM classifier with following parameters: method=“C-classification”, kernal=“radial”, gamma=0.1, and cost=10.
The accuracy of the classifiers was evaluated by Leave-One-Out (LOO) cross-validation over the training dataset. Also the classifier was applied to 117 controlled samples from VANISH trial. The patients predicted as Type-A (SRS2-like) exhibited significant 28-day mortality reduction when hydrocortisone was applied in comparison to placebo. These Type-A exhibited 75.5% mortality reduction in the placebo group in comparison to the hydrocortisone group (Fisher exact test p-value 0.0093). The Type-A (SRS2-like) and Type-B (SRS1-like) classifier exhibited an accuracy of 88.6%. Table 17 below depict the survival analyses for each subtype for the SS.B2 model.
In addition to the SVM-method, thresholds can be employed to scores in order to define A vs. B labels. We discovered that subtype A and B scores play an important role in subtype SRS1 and SRS2 classification. Therefore we applied a heuristic threshold (threshold=0) to subtype A and B scores to classify SRS1-like and SRS2-like in VANISH: SRS2-like label was assigned to samples with subtype A score >0 and subtype B score <0 and SRS1-like label was assigned to the rest of samples. With the simple heuristic threshold (threshold=0), the patients predicted as SRS2-like exhibited 85.2% 28-day mortality reduction in the placebo group in comparison to the hydrocortisone group (Fisher exact test p-value 0.0159).
Besides the heuristic threshold, we also derived the thresholds for subtype A and B scores using the training dataset. Same as the SVM method, we defined E-MTAB-4421, E-MTAB-4451, E-MTAB-5273, and E-MTAB-5274 as the training dataset and VANISH (E-MTAB-7581) as the testing dataset. To identify the best threshold of subtype A score to classify subtype SRS1 and SRS2 in training dataset, we fitted subtype A scores and SRS subtype labels into ROC curve (receiver operating characteristic curve) and identified the threshold for subtype A score (A threshold=−0.2664) which is the closest point to the top-left part of the plot with perfect sensitivity or specificity. With the same method, the optimal B score threshold (B threshold=0.3179) was selected to classify subtype SRS1 and SRS2 in the training set. We applied the defined optimal thresholds for subtype A and B scores to the VANISH trail: the samples whose subtype A scores are above the A threshold and subtype B scores are below the B threshold were labeled as SRS2-like and the rest VANISH trail samples were labeled as SRS1-like. With such classification, the patients with the SRS2-like label showed 81.7% 28-day mortality reduction in the placebo group in comparison to the hydrocortisone group (Fisher exact test p-value 0.0065).
Various thresholds can be employed in order to optimize for mortality reduction (mr) and for the number of patients who may benefit (percentage of patients that are B). Table 18 below depict the survival analyses for each subtype for the SS.B2 model.
Response to hydrocortisone therapy for each subtype of patients identified by each Model for each of the sepsis and burn-induced SIRS patient studies, was evaluated based on a p-values and mortality reduction for the subtype. To evaluate possible adverse response to hydrocortisone therapy for a subtype, a binomial p-value was calculated as the probability of achieving a random survival rate of less than or equal to the survival rate P(X<=x) observed in the hydrocortisone therapy group for the subtype, assuming that the survival rate observed in the placebo therapy group for the subtype was an expected survival rate for patients receiving no hydrocortisone therapy. To evaluate possible favorable response to hydrocortisone therapy for a subtype, a chi-squared p-value was calculated for the survival rate P(X>=x) observed in the hydrocortisone therapy group for the subtype. The chi-squared p-value was calculated with continuity correction when computed for 2-by-2 tables.
As an example, assuming a chosen, statistically significant p-value of at least 0.1, hydrocortisone therapy response was evaluated based on mortality reduction, as well as binomial and chi-squared p-values, for sepsis and SIRS patients, for each subtype, and for each Model, as follows.
First, assuming the chosen statistically significant p-value of at least 0.1, hydrocortisone therapy response was evaluated for sepsis patients. As shown in Tables 9-12, sepsis patients assigned to the A subtype by the Full, SS, S, and P Models exhibited statistically significant 28-day mortality reduction when placebo was applied in comparison to hydrocortisone therapy. Specifically, the Full, SS, S, and P Models identified a subtype, A, exhibiting 64.6%, 86.0%, 86.1%, and 77.6%, respectively, lower mortality in the placebo group when compared to the hydrocortisone therapy group. As shown in Table 9, sepsis patients assigned to the B subtype by the Full Model exhibited statistically significant 28-day mortality reduction when hydrocortisone was applied in comparison to placebo. Specifically, the Full Model identified a subtype, B, exhibiting 35.2%, lower mortality in the hydrocortisone group when compared to the placebo group. As shown in Table 9, sepsis patients assigned to the C subtype by the Full Model exhibited statistically significant 28-day mortality reduction when placebo was applied in comparison to hydrocortisone therapy. Specifically, the Full Model identified a subtype, C, exhibiting 56.8% lower mortality in the placebo group when compared to the hydrocortisone therapy group.
Additionally, assuming the chosen statistically significant p-value of at least 0.1, hydrocortisone therapy response was evaluated for SIRS patients. As shown in Tables 13, 15, and 16, SIRS patients assigned to the A subtype by the Full, S, and P Models exhibited statistically significant 28-day mortality reduction when placebo was applied in comparison to hydrocortisone therapy. Specifically, the Full, S, and P Models identified a subtype, A, exhibiting 100% lower mortality in the placebo group when compared to the hydrocortisone therapy group. As shown in Table 15, SIRS patients assigned to the B subtype by the S Model exhibited statistically significant 28-day mortality reduction when hydrocortisone was applied in comparison to placebo. Specifically, the S Model identified a subtype, B, exhibiting 28.6%, lower mortality in the hydrocortisone group when compared to the placebo group. As shown in Tables 13-16, SIRS patients assigned to the C subtype by the Full, SS, S, and P Models exhibited statistically significant 28-day mortality reduction when placebo was applied in comparison to hydrocortisone therapy. Specifically, the Full, SS, S, and P Models identified a subtype, A, exhibiting 100%, 65%, 100%, and 100%, respectively, lower mortality in the placebo group when compared to the hydrocortisone therapy group.
Based on these observations of differential mortality reduction as a result of hydrocortisone therapy or placebo therapy between the subtypes A, B, and C identified for both sepsis and SIRS patients by the Full, SS, S, and P Models, the subtypes can be assigned more descriptive titles such as “favorably responsive”, “adversely responsive”, and “non-responsive” to corticosteroid therapy. For example, assuming the chosen statistically significant p-value of at least 0.1, subtypes can be assigned titles as follows.
First, assuming the chosen statistically significant p-value of at least 0.1, subtypes A, B, and C identified for sepsis patients by the Full, SS, S, and P Models can be assigned titles as follows. Because mortality reduction was statistically significant in the placebo group compared to the hydrocortisone therapy group for sepsis patients assigned to subtype A by the Full, SS, S, and P Models, sepsis patients assigned to subtype A by at least one of the Full, SS, S, and P Models can be colloquially referred to as “adversely responsive” to corticosteroid therapy. Similarly, because mortality reduction was statistically significant in the placebo group compared to the hydrocortisone therapy group for sepsis patients assigned to subtype C by the Full Model, sepsis patients assigned to subtype C by the Full Model can be colloquially referred to as “adversely responsive” to corticosteroid therapy. Conversely, because mortality reduction was statistically significant in the hydrocortisone therapy group compared to the placebo group for sepsis patients assigned to subtype B by the Full Model, sepsis patients assigned to subtype B by the Full Model can be colloquially referred to as “favorably responsive” to corticosteroid therapy. Finally, because correlation between therapy group and mortality reduction was not statistically significant for sepsis patients assigned to subtypes B and C by the SS, S, and P Models, sepsis patients assigned to subtype B or C by at least one of the SS, S, and P Models can colloquially referred to as “non-responsive” to corticosteroid therapy.
Additionally, assuming the chosen statistically significant p-value of at least 0.1, subtypes A, B, and C identified for SIRS patients by the Full, SS, S, and P Models can be assigned titles as follows. Because mortality reduction was statistically significant in the placebo group compared to the hydrocortisone therapy group for SIRS patients assigned to subtype A by the Full, S, and P Models, SIRS patients assigned to subtype A by at least one of the Full, S, and P Models can be colloquially referred to as “adversely responsive” to corticosteroid therapy. Similarly, because mortality reduction was statistically significant in the placebo group compared to the hydrocortisone therapy group for SIRS patients assigned to subtype C by the Full, SS, S, and P Models, SIRS patients assigned to subtype C by at least one of the Full, SS, S, and P Models can be colloquially referred to as “adversely responsive” to corticosteroid therapy. Conversely, because mortality reduction was statistically significant in the hydrocortisone therapy group compared to the placebo group for SIRS patients assigned to subtype B by the S Model, SIRS patients assigned to subtype B by the S Model one can be colloquially referred to as “favorably responsive” to corticosteroid therapy. Because correlation between therapy group and mortality reduction was not statistically significant for SIRS patients assigned to subtype B by the Full, SS, and P Models, SIRS patients assigned to subtype B by at least one of the Full, SS, and P Models can be colloquially referred to as “non-responsive” to corticosteroid therapy. Finally, because correlation between therapy group and mortality reduction was not statistically significant for SIRS patients assigned to subtype A by the SS Model, SIRS patients assigned to subtype A by the SS Model can be colloquially referred to as “non-responsive” to corticosteroid therapy.
Further based on these observations of mortality reduction in sepsis and SIRS patients assigned to subtypes A, B, and C by the Full, SS, S, and P Models, subtyped sepsis and SIRS patients may be provided treatment recommendations accordingly. For instance, in one embodiment, patients subtyped as “favorably responsive” to corticosteroid therapy can be recommended treatment with corticosteroids, while patients subtyped as “adversely responsive” to corticosteroid therapy can be recommended no corticosteroid therapy, and while patients subtyped as “non-responsive” to corticosteroid therapy can be provided with no therapy recommendation.
Discrepancies between treatment recommendations for a given subtype (e.g., subtype A, B, or C) across models (e.g., the Full, SS, S, and P Model) and types of dysregulated host response (e.g., sepsis or SIRS) are due to the fact that statistical significance of mortality reduction for a subtype varies according to the model used to assign the subtype, as well as the type of dysregulated host response. For example, as discussed above, for a statistically significant p-value of at least 0.1, sepsis patients that are determined to be of subtype C by the Full Model may be subtyped as “adversely responsive” to corticosteroid therapy, and thus recommended no corticosteroid therapy. Conversely, for a statistically significant p-value of at least 0.1, sepsis patients that are determined to be of subtype C by the SS Model may be subtyped as “non-responsive” to corticosteroid therapy, and thus may not be provided with a therapy recommendation, while SIRS patients that are determined to be of subtype C by the SS Model may be subtyped as “adversely responsive” to corticosteroid therapy, and thus recommended no corticosteroid therapy. Therefore, the titles assigned to subtypes A, B, and C for each model and for each type of dysregulated host response, and thus the therapy recommendations, are dependent upon the chosen statistically significant p-value. In alternative embodiments, the statistically significant p-value may be adjusted, and thus the titles assigned to subtypes A, B, and C, as well as the therapy recommendations, may be adjusted. For example, in some embodiments, the statistically significant p-value may be less than 0.1.
Furthermore, as described above, survival analyses were performed independently from classifier training, which prevented the training from overfitting issues. Thus, the observations of differential response to corticosteroid therapy among the three different subtypes can likely be attributed to the fundamental link between therapy and the biological nature of each subtype. For instance, the most significant molecular functions from the GO analysis of the A subtype were antigen binding, MEW protein complex binding, and cytokine binding, which are strong indicators for adaptive immune response. In the survival analysis results for the A subtype, significant mortality reduction was observed in the placebo group compared to the corticosteroid therapy group, inferring that the corticosteroid therapy might be potentially disturbing the already working adaptive immune response of the A subtype patients. On the contrary, according to the GO analysis of the B subtype, the B subtype was significantly enriched with interleukin (IL)-1 receptor and complement component Cl, indicating a more likely innate immune response. Indeed, for the B subtype, instead of a mortality reduction in the placebo group, a mortality reduction was observed with corticosteroid therapy.
VI.B. Dysregulated Host Response Patient Subtypes B and C
VI.B.1. Immune Stimulants: Checkpoint Inhibitors, Interleukins, and Mediators of T-Cell Regulation Attenuation
As discussed in detail above, subtype B and C patients may benefit from immune stimulants. Examples of therapies for stimulating the immune system include checkpoint inhibitors, interleukins such as IL-7, and therapies that attenuate the regulation and suppression of T-cell function such as blockers of IL-10, and TGF-β.
As shown in
CD28 interacts with CD86 and CD80 to mediate stimulation of T-cell function. CTLA-4 interacts with CD86 and CD80 to mediate inhibition of T-cell function. Checkpoint inhibition of CTLA-4 causes upregulation of INF-γ. As shown in
TIM-3 interacts with CEACAM-1 to mediate inhibition of T cell function. As shown in
VI.C. Dysregulated Host Response Patient Subtype C
VI.C.1. Modulators of Coagulation and Modulators of Vascular Permeability
As discussed in detail above, subtype C patients exhibit coagulopathy and may benefit from modulators of coagulation such as anticoagulants and modulators of vascular permeability. Specifically, therapies that indirectly modulate coagulation factors, such as activated protein C and antithrombin, may be of particular benefit to subtype C patients due to the complexity of the coagulation system and difficulty of managing coagulation by targeting specific coagulation factors directly.
VII.A. Improvement of Acute Care
Syndromes caused by dysregulated host response, such as sepsis, are not single diseases, but rather are heterogeneous processes. As a result, evaluation of effective therapies has been hampered by limitations in the ability to classify patients into homogeneous subtypes based on pathogenesis. The improved ability to subtype patients exhibiting dysregulated host response can therefore enable identification and evaluation of effective new therapies for treating dysregulated host response syndromes such as sepsis.
VII.B. Precision Clinical Trials
The improved ability to subtype patients exhibiting dysregulated host response also enables the design and execution of precision clinical trials and the ability to test effectiveness potential new therapies by targeting the therapies to specific subtypes of patients. The improved ability to subtype patients exhibiting dysregulated host response also allows for predictive therapy enrichment in positively-responsive patients and avoiding the use of therapies in non-responsive or adversely-responsive patients.
VII.C. Precision Care
The improved ability to subtype patients exhibiting dysregulated host response also enables the delivery of precision care. The patient subtype classifiers discussed throughout this disclosure allow for the development of tests for guiding dysregulated host response therapy, and in particular for guiding dysregulated host response therapy in acute care. Specifically, the patient subtype classifiers discussed throughout this disclosure can serve as a companion diagnostic to enable the safe and effective use of dysregulated host response therapy.
VII.D. Patient Subtyping Test
In some embodiments, the subtyping test can be differently configured. For example, the subtyping test need not employ the manual RNA extraction and assay preparation step shown in
The storage device 1604 is a non-transitory computer-readable storage medium such as a hard drive, compact disk read-only memory (CD-ROM), DVD, or a solid-state memory device. The memory 1603 holds instructions and data used by the processor 1601. The input interface 1607 is a touch-screen interface, a mouse, track ball, or other type of pointing device, a keyboard, or some combination thereof, and is used to input data into the computer 1600. In some embodiments, the computer 1600 can be configured to receive input (e.g., commands) from the input interface 1607 via gestures from the user. The graphics adapter 1606 displays images and other information on the display 1609. The network adapter 1608 couples the computer 1600 to one or more computer networks.
The computer 1600 is adapted to execute computer program modules for providing functionality described herein. As used herein, the term “module” refers to computer program logic used to provide the specified functionality. Thus, a module can be implemented in hardware, firmware, and/or software. In one embodiment, program modules are stored on the storage device 1604, loaded into the memory 1603, and executed by the processor 1601.
The types of computers 1600 used to implement the methods described herein can vary depending upon the embodiment and the processing power required by the entity. For example, the diagnostic/treatment system can run in a single computer 1600 or multiple computers 1600 communicating with each other through a network such as in a server farm. The computers 1600 can lack some of the components described above, such as graphics adapters 1606, and displays 1609.
Also disclosed herein are kits for determining a therapy recommendation for an individual. Such kits can include reagents for detecting expression levels of one or biomarkers and instructions for classifying based on the detected expression levels and selecting a therapy recommendation based on the classification.
The detection reagents can be provided as part of a kit. Thus, the invention further provides kits for detecting the presence of a panel of biomarkers of interest in a biological test sample. A kit can comprise a set of reagents for generating a dataset via at least one protein detection assay (e.g., immunoassay or RT-PCR assay) that analyzes the test sample from the subject. In various embodiments, the set of reagents enable detection of quantitative expression levels of biomarkers described in any of Tables 1, 2A-2B, 3, and 4A-4D. In certain aspects, the reagents include one or more antibodies that bind to one or more of the markers. The antibodies may be monoclonal antibodies or polyclonal antibodies. In some aspects, the reagents can include reagents for performing ELISA including buffers and detection agents. In some aspects, the reagents include primers that are designed to hybridize with nucleic acids transcribed from genes identified in any of Tables 1, 2A-2B, 3, and 4A-4D.
A kit can include instructions for use of a set of reagents. For example, a kit can include instructions for performing at least one biomarker detection assay such as an immunoassay, a protein-binding assay, an antibody-based assay, an antigen-binding protein-based assay, a protein-based array, an enzyme-linked immunosorbent assay (ELISA), flow cytometry, a protein array, a blot, a Western blot, nephelometry, turbidimetry, chromatography, mass spectrometry, enzymatic activity, proximity extension assay, and an immunoassay selected from RIA, immunofluorescence, immunochemiluminescence, immunoelectrochemiluminescence, immunoelectrophoretic, a competitive immunoassay, and immunoprecipitation.
In various embodiments, a kit can include instructions for performing at least one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, the kit includes instructions for determining quantitative expression data for three biomarkers, the instructions including: contacting the sample with a reagent; generating a plurality of complexes between the reagent and the plurality of biomarkers in the sample; and detecting the plurality of complexes to obtain a dataset associated with the sample, wherein the dataset comprises the quantitative expression data for the biomarker.
In various embodiments, the kits include instructions for practicing the methods disclosed herein (e.g., methods for training and/or implementing a patient subtype classifier). These instructions can be present in the subject kits in a variety of forms, one or more of which can be present in the kit. One form in which these instructions can be present is as printed information on a suitable medium or substrate, e.g., a piece or pieces of paper on which the information is printed, in the packaging of the kit, in a package insert, etc. Yet another means would be a computer readable medium, e.g., diskette, CD, hard-drive, network data storage, etc., on which the information has been recorded. Yet another means that can be present is a website address which can be used via the internet to access the information at a removed site. Any convenient means can be present in the kits.
All references, issued patents and patent applications cited within the body of the specification are hereby incorporated by reference in their entirety, for all purposes.
The foregoing description of the embodiments of the disclosure has been presented for the purpose of illustration; it is not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. Persons skilled in the relevant art can appreciate that many modifications and variations are possible in light of the above disclosure.
Some portions of this description describe the embodiments of the disclosure in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are commonly used by those skilled in the data processing arts to convey the substance of their work effectively to others skilled in the art. These operations, while described functionally, computationally, or logically, are understood to be implemented by computer programs or equivalent electrical circuits, microcode, or the like.
Any of the steps, operations, or processes described herein can be performed or implemented with one or more hardware or software modules, alone or in combination with other devices. In one embodiment, a software module is implemented with a computer program product including a computer-readable non-transitory medium containing computer program code, which can be executed by a computer processor for performing any or all of the steps, operations, or processes described.
Embodiments may also relate to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, and/or it may comprise a general-purpose computing device selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a non-transitory, tangible computer readable storage medium, or any type of media suitable for storing electronic instructions, which may be coupled to a computer system bus. Furthermore, any computing systems referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
Embodiments of the disclosure may also relate to a product that is produced by a computing process described herein. Such a product may include information resulting from a computing process, where the information is stored on a non-transitory, tangible computer-readable storage medium and may include any embodiment of a computer program product or other data combination described herein.
Finally, the language used in the specification has been principally selected for readability and instructional purposes, and it may not have been selected to delineate or circumscribe the inventive subject matter. It is therefore intended that the scope of the disclosure be limited not by this detailed description, but rather by any claims that issue on an application based hereon. Accordingly, the disclosure of the embodiments of the disclosure is intended to be illustrative, but not limiting, of the scope of the disclosure.
Disclosed herein is a method comprising: obtaining a sample from a subject exhibiting dysregulated host response, wherein the sample comprises a plurality of biomarkers; determining quantitative expression data for at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3; and determining a classification of the subject based on the quantitative expression data using a patient subtype classifier, the classification of the subject comprising one of subtype A, subtype B, or subtype C.
Additionally disclosed herein is a method comprising: obtaining a classification of a subject exhibiting dysregulated host response, the classification of the subject comprising one of subtype A, subtype B, or subtype C; and identifying a therapy recommendation for the subject based at least in part on the classification, wherein responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy, wherein responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy, and wherein responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy.
Additionally disclosed herein is a computer-implemented method comprising: obtaining quantitative expression data from a sample from a subject exhibiting dysregulated host response for at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3; and determining, by a computer processor, a classification of the subject based on the quantitative expression data using a patient subtype classifier, the classification of the subject comprising one of subtype A, subtype B, or subtype C.
Additionally disclosed herein is a computer-implemented method comprising: obtaining, by a computer processor, a classification of a subject exhibiting dysregulated host response, the classification of the subject comprising one of subtype A, subtype B, or subtype C; and identifying a therapy recommendation for the subject based at least in part on the classification, wherein responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy, wherein responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy, and wherein responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy.
Additionally disclosed herein is a non-transitory computer-readable storage medium storing computer program instructions that when executed by a computer processor, cause the computer processor to: store quantitative expression data from a sample from a subject exhibiting dysregulated host response, the quantitative expression data for at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3; and determine a classification of the subject based on the quantitative expression data using a patient subtype classifier, the classification of the subject comprising one of subtype A, subtype B, or subtype C.
Additionally disclosed herein is a non-transitory computer-readable storage medium storing computer program instructions that when executed by a computer processor, cause the computer processor to: obtain a classification of a subject exhibiting dysregulated host response, the classification of the subject comprising one of subtype A, subtype B, or subtype C; and identify a therapy recommendation for the subject based at least in part on the classification, wherein responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy, wherein responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy, and wherein responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy.
Additionally disclosed herein is a system comprising: a storage memory for storing quantitative expression data from a sample from a subject exhibiting dysregulated host response, the quantitative expression data for at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3; and a processor communicatively coupled to the storage memory for determining a classification of the subject based on the quantitative expression data using a patient subtype classifier, the classification of the subject comprising one of subtype A, subtype B, or subtype C.
Additionally disclosed herein is a system comprising: a processor for: obtaining a classification of a subject exhibiting dysregulated host response, the classification of the subject comprising one of subtype A, subtype B, or subtype C; and identifying a therapy recommendation for the subject based at least in part on the classification, wherein responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject comprises at least no immunosuppressive therapy, wherein responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, blocking of complement activity therapy, and anti-inflammatory therapy, and wherein responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject comprises at least one of no therapy recommendation, immune stimulation therapy, suppression of immune regulation therapy, blocking of immune suppression therapy, modulators of coagulation therapy, and modulators of vascular permeability therapy.
Additionally disclosed herein is a kit comprising: a plurality of reagents for determining, from a sample obtained from a subject exhibiting dysregulated host response, quantitative expression data for at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3; and instructions for using the plurality of reagents to determine the quantitative expression data from the sample from the subject.
Additionally disclosed herein is a composition comprising at least three primer sets for amplifying at least three biomarkers, wherein each primer set of the at least three primer sets comprises a pair of single-stranded DNA primers for amplifying one of the at least three biomarkers, and wherein at least one of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3.
In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 1, wherein the at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 7 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 8, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 9 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 10, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 11 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 12, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 13 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 14, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 15 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 16, a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 17 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 18, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 19 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 20, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 1 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 2; a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 3 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 4, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 5 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 6.
In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 1, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 7 and a reverse primer comprising SEQ ID NO. 8, a forward primer comprising SEQ ID NO. 9 and a reverse primer comprising SEQ ID NO. 10, a forward primer comprising SEQ ID NO. 11 and a reverse primer comprising SEQ ID NO. 12, and a forward primer comprising SEQ ID NO. 13 and a reverse primer comprising SEQ ID NO. 14, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 15 and a reverse primer comprising SEQ ID NO. 16, a forward primer comprising SEQ ID NO. 17 and a reverse primer comprising SEQ ID NO. 18, and a forward primer comprising SEQ ID NO. 19 and a reverse primer comprising SEQ ID NO. 20, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 1 and a reverse primer comprising SEQ ID NO. 2; a forward primer comprising SEQ ID NO. 3 and a reverse primer comprising SEQ ID NO. 4, and a forward primer comprising SEQ ID NO. 5 and a reverse primer comprising SEQ ID NO. 6.
In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 2B, wherein the at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 21 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 22, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 23 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 24, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 25 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 26, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 29 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 30, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 25 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 26, and a forward primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 27 and a reverse primer comprising at least 15 contiguous nucleotides of SEQ ID NO. 28.
In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 2B, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 21 and a reverse primer comprising SEQ ID NO. 22, and a forward primer comprising SEQ ID NO. 23 and a reverse primer comprising SEQ ID NO. 24, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 25 and a reverse primer comprising SEQ ID NO. 26, and a forward primer comprising SEQ ID NO. 29 and a reverse primer comprising SEQ ID NO. 30, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward primer comprising SEQ ID NO. 25 and a reverse primer comprising SEQ ID NO. 26, and a forward primer comprising SEQ ID NO. 27 and a reverse primer comprising SEQ ID NO. 28.
Additionally disclosed herein is a composition comprising at least three primer sets for amplifying at least three biomarkers, wherein each primer set of the at least three primer sets comprises a forward outer primer, a backward outer primer, a forward inner primer, a backward inner primer, a forward loop primer, and a backward loop primer for amplifying one of the at least three biomarkers, and wherein at least one of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker of the at least three biomarkers is selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3.
In various embodiments, at least one of the at least three primer sets is selected from the group consisting of: a forward outer primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, a backward outer primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, a forward inner primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, a backward inner primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, a forward loop primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, and a backward loop primer configured to enable amplification of the at least one biomarker listed in Row 1 of Tables 1, 2A, 2B, and 3, wherein at least one of the at least three primer sets is selected from the group consisting of: a forward outer primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, a backward outer primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, a forward inner primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, a backward inner primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, a forward loop primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, and a backward loop primer configured to enable amplification of the at least one biomarker listed in Row 2 of Tables 1, 2A, 2B, and 3, and wherein at least one of the at least three primer sets is selected from the group consisting of: a forward outer primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3, a backward outer primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3, a forward inner primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3, a backward inner primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3, a forward loop primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3, and a backward loop primer configured to enable amplification of the at least one biomarker listed in Row 3 of Tables 1, 2A, 2B, and 3.
In various embodiments, the dysregulated host response comprises one of sepsis and dysregulated host response not caused by infection. In various embodiments, methods described above further comprise administering or having administered therapy to the subject based on the therapy recommendation. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject further comprises at least no corticosteroid therapy. In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor, a blocker of complement components, a blocker of complement component receptors, and a blocker of a pro-inflammatory cytokine. In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject further comprises at least one of a checkpoint inhibitor and an anticoagulant. In various embodiments, responsive to the classification of the subject comprising subtype A, the therapy recommendation identified for the subject further comprises no hydrocortisone.
In various embodiments, responsive to the classification of the subject comprising subtype B, the therapy recommendation identified for the subject further comprises at least one of anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, IL-7, anti-C5a, anti-C3a, anti-C5aR, anti-C3aR, anti-TNF-alpha, and anti-IL-6. In various embodiments, responsive to the classification of the subject comprising subtype C, the therapy recommendation identified for the subject further comprises at least one of anti-PD-1, anti-PD-L1, anti-CLTA-4, anti-CEACAM-1, anti-TIM-3, IL-7, activated protein C, antithrombin, and thrombomodulin. In various embodiments, the classification is pre-determined. In various embodiments, the method further comprises determining the classification, and wherein determining the classification comprises: obtaining a sample from the subject exhibiting dysregulated host response, wherein the sample comprises a plurality of biomarkers; determining quantitative expression data for at least three biomarkers; and determining the classification of the subject based on the quantitative expression data using a patient subtype classifier.
In various embodiments, the at least three biomarkers comprise at least one biomarker selected from the group consisting of the biomarkers listed in Row 1 of one of Tables 1, 2A, 2B, and 3, at least one biomarker selected from the group consisting of the biomarkers listed in Row 2 of the one of Tables 1, 2A, 2B, and 3, and at least one biomarker selected from the group consisting of the biomarkers listed in Row 3 of the one of Tables 1, 2A, 2B, and 3. In various embodiments, the obtained sample comprises a blood sample from the subject. In various embodiments, the method further comprises determining that the subject exhibiting dysregulated host response does not exhibit shock, and wherein the one of Tables 1, 2A, 2B, and 3 comprises one of Tables 1, 2B, and 4. In various embodiments, the method further comprises determining that the subject exhibiting dysregulated host response is further exhibiting shock, and wherein the one of Tables 1, 2A, 2B, and 3 comprises one of Tables 1, 2A, and 3. In various embodiments, the method further comprises determining that the subject exhibiting dysregulated host response is an adult subject, and wherein the one of Tables 1, 2A, 2B, and 3 comprises one of Tables 1, 2A, and 2B. In various embodiments, the method further comprises determining that the subject exhibiting dysregulated host response is a pediatric subject, and wherein the one of Tables 1, 2A, 2B, and 3 comprises one of Tables 1 and 3.
In various embodiments, the quantitative expression data for at least one of the at least three biomarkers is determined by one of RT-qPCR (quantitative reverse transcription polymerase chain reaction), qPCR (quantitative polymerase chain reaction), PCR (polymerase chain reaction), RT-PCR (reverse transcription polymerase chain reaction), SDA (strand displacement amplification), RPA (recombinase polymerase amplification), MDA (multiple displacement amplification), HDA (helicase dependent amplification), LAMP (loop-mediated isothermal amplification), RCA (rolling circle amplification), NASBA (nucleic acid-sequence-based amplification), and any other isothermal or thermocycled amplification reaction.
In various embodiments, determining the quantitative expression data for each of the at least three biomarkers comprises: contacting the sample with a reagent; generating a plurality of complexes between the reagent and the plurality of biomarkers in the sample; and detecting the plurality of complexes to obtain a dataset associated with the sample, wherein the dataset comprises the quantitative expression data for the biomarker.
In various embodiments, determining a classification of the subject based on the quantitative expression data using a patient subtype classifier comprises: determining, by the patient subtype classifier, for each candidate classification of the subject, a classification-specific score for the subject by: determining a first geometric mean of the quantitative expression data for the subject for one or more biomarkers of the candidate classification, wherein the quantitative expression data for the subject for the one or more biomarkers of the candidate classification are increased relative to the quantitative expression data for the one or more biomarkers for one or more control subjects; determining a second geometric mean of the quantitative expression for the subject for one or more additional biomarkers of the candidate classification, wherein the quantitative expression data for the subject for the one or more additional biomarkers of the candidate classification are decreased relative to the quantitative expression data for the one or more additional biomarkers for the one or more control subjects; and determining a difference between the first geometric mean and the second geometric mean, the first and second geometric means optionally subject to scaling, and the difference comprising the classification-specific score for the subject; and determining, by the patient subtype classifier, using a multi-class regression model, based on the classification-specific score for each candidate classification of the subject, the classification of the subject, wherein the candidate classifications of the subject comprise subtype A, subtype B, and subtype C.
In various embodiments, the method further comprises prior to determining a classification of the subject based on the quantitative expression data using a patient subtype classifier, normalizing the quantitative expression data based on quantitative expression data for one or more housekeeping genes.
In various embodiments, the patient subtype classifier is a machine-learned model. In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 1, and wherein the patient subtype classifier has an average accuracy of at least 82.93%. In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 2A, and wherein the patient subtype classifier has an average accuracy of at least 89.6%. In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 2B, and wherein the patient subtype classifier has an average accuracy of at least 86.3%. In various embodiments, the one of Tables 1, 2A, 2B, and 3 comprises Table 3, and wherein the patient subtype classifier has an average accuracy of at least 98.3%.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, identifying that the therapy recommendation for the subject comprises at least no corticosteroid therapy comprises determining that a statistical significance of an average day 28 reduction in mortality of subjects exhibiting dysregulated host response, determined based on the one of Tables 1, 2A, 2B, and 3 to be of the determined classification of the subject, and not provided corticosteroid therapy, is greater than or equal to a threshold statistical significance, identifying that the therapy recommendation for the subject comprises corticosteroid therapy comprises determining that a statistical significance of an average day 28 reduction in mortality of subjects exhibiting dysregulated host response, determined based on the one of Tables 1, 2A, 2B, and 3 to be of the determined classification of the subject, and provided corticosteroid therapy, is greater than or equal to a threshold statistical significance, and identifying that the therapy recommendation for the subject comprises no therapy recommendation comprises: determining that a statistical significance of an average day 28 reduction in mortality of subjects exhibiting dysregulated host response, determined based on the one of Tables 1, 2A, 2B, and 3 to be of the determined classification of the subject, and not provided corticosteroid therapy, is less than a threshold statistical significance; and determining that a statistical significance of an average day 28 reduction in mortality of subjects exhibiting dysregulated host response, determined based on the one of Tables 1, 2A, 2B, and 3 to be of the determined classification of the subject, and provided corticosteroid therapy, is less than a threshold statistical significance.
In various embodiments, a statistical significance comprises a p-value, and wherein the threshold statistical significance comprises at least 0.1. In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, wherein the dysregulated host response comprises sepsis, wherein the one of Tables 1, 2A, 2B, and 3 comprises Table 1, wherein identifying that the therapy recommendation for the subject comprises no corticosteroid therapy comprises determining that the classification of the subject comprises subtype A or subtype C, and wherein identifying that the therapy recommendation for the subject comprises corticosteroid therapy comprises determining that the classification of the subject comprises subtype B.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, wherein the dysregulated host response comprises dysregulated host response not caused by infection, wherein the one of Tables 1, 2A, 2B, and 3 comprises Table 1 or Table 3, wherein identifying that the therapy recommendation for the subject comprises no corticosteroid therapy comprises determining that the classification of the subject comprises subtype A or subtype C, and wherein identifying that the therapy recommendation for the subject comprises no therapy recommendation comprises determining that the classification of the subject comprises subtype B.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, wherein the dysregulated host response comprises sepsis, wherein the one of Tables 1, 2A, 2B, and 3 comprises Table 2A, Table 2B or Table 3, wherein identifying that the therapy recommendation for the subject comprises no corticosteroid therapy comprises determining that the classification of the subject comprises subtype A, and wherein identifying that the therapy recommendation for the subject comprises no therapy recommendation comprises determining that the classification of the subject comprises subtype B or subtype C.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, wherein the dysregulated host response comprises dysregulated host response not caused by infection, wherein the one of Tables 1, 2A, 2B, and 3 comprises Table 2A, wherein identifying that the therapy recommendation for the subject comprises no corticosteroid therapy comprises determining that the classification of the subject comprises subtype C, and wherein identifying that the therapy recommendation for the subject comprises no therapy recommendation comprises determining that the classification of the subject comprises subtype A or subtype B.
In various embodiments, the therapy recommendation identified for the subject further comprises corticosteroid therapy, no corticosteroid therapy, or no therapy recommendation, wherein the dysregulated host response comprises dysregulated host response not caused by infection, wherein the one of Tables 1, 2A, 2B, and 3 comprises Table 2B, wherein identifying that the therapy recommendation for the subject comprises no corticosteroid therapy comprises determining that the classification of the subject comprises subtype A or subtype C, and wherein identifying that the therapy recommendation for the subject comprises corticosteroid therapy comprises determining that the classification of the subject comprises subtype B.
In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 1, and not provided corticosteroid therapy, is between 5.0%-64.6%, compared to subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 1, and provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 2A, and not provided corticosteroid therapy, is between 5.0%-86.0%, compared to subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 2A, and provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 2B, and not provided corticosteroid therapy, is between 5.0%-86.1%, compared to subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 2B, and provided corticosteroid therapy.
In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 3, and not provided corticosteroid therapy, is between 5.0%-77.6%, compared to subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype A based on Table 3, and provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype B based on Table 1, and provided corticosteroid therapy, is between 5.0%-35.2%, compared to subjects exhibiting dysregulated host response that comprises sepsis, determined to be of subtype B based on Table 1, and not provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 1, and not provided corticosteroid therapy, is between 5.0%-100.0%, compared to subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 1, and provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 2A, and not provided corticosteroid therapy, is between 5.0%-56.7%, compared to subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 2A, and provided corticosteroid therapy.
In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 2B, and not provided corticosteroid therapy, is between 5.0%-100.0%, compared to subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or subtype C based on Table 2B, and provided corticosteroid therapy. In various embodiments, an average day 28 reduction in mortality of subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or C based on Table 3, and not provided corticosteroid therapy, is between 5.0%-100.0%, compared to subjects exhibiting dysregulated host response that comprises dysregulated host response not caused by infection, determined to be of subtype A or C based on Table 3, and provided corticosteroid therapy.
This application claims the benefit of and priority to U.S. Provisional Patent Application No. 62/909,530 filed Oct. 2, 2019 and U.S. Provisional Patent Application No. 63/009,331 filed Apr. 13, 2020, the entire disclosures of which are each hereby incorporated by reference in its entirety for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/054033 | 10/2/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62909530 | Oct 2019 | US | |
63009331 | Apr 2020 | US |