Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal types of cancer with a 5-year survival rate of only 8% and a mortality rate closely approaching the incidence rate. Although resectable PDAC is associated with better survival, only 15-20% of PDAC patients present with localized disease. Imaging modalities, notably endoscopic ultrasound and magnetic resonance cholangiopancreatography, are currently used in the work up of subjects with suspected PDAC or at high risk for the disease. However, known risk factors have only a modest effect on PDAC incidence.
Cancer Antigen 19-9 (CA19-9) is currently in clinical use as a PDAC biomarker. CA19-9 has shown potential as a diagnostic biomarker for both preclinical and early-stage PDAC (Riker et al., Surgical Oncology 6:157-69, 1998). However, CA19-9 alone has limited performance as a biomarker for early-stage disease: less than 75% of pancreatic cancer patients present with elevated CA19-9, and many benign disorders can lead to elevated CA19-9 levels. Moreover, CA19-9 is not detectable in 5-10% of patients with fucosyltransferase deficiency and inability to synthesize antigens of the Lewis blood group. As such, the proportions of individuals incorrectly identified as having PDAC, as well as those incorrectly identified as not having PDAC, is unacceptably high for reliance on CA19-9 alone as a diagnostic tool.
Due to late diagnosis, growing incidence, and limited avenues of treatment, PDAC is set to become a leading cause of cancer-related death. Given the disease is generally diagnosed in an advanced stage in most patients, and use of CA19-9 as a standalone biomarker is clearly inadequate, there is a need to develop a test for the detection of pancreatic cancer at an early stage.
The present disclosure provides methods and kits for the early detection of pancreatic cancer. The methods and kits use multiple assays of biomarkers contained within a biological sample obtained from a subject. The combined analysis of at least three biomarkers: carbohydrate antigen 19-9 (CA19-9), TIMP metallopeptidase inhibitor 1 (TIMP1), and leucine-rich alpha-2-glycoprotein 1 (LRG1), provides high-accuracy diagnosis of PDAC when screened against cohorts with known status.
In some embodiments, the analysis of biomarkers CA19-9, TIMP1, and LRG1, can be combined with analysis of additional biomarkers. In some embodiments, the additional biomarkers can be protein biomarkers. In some embodiments, the additional protein biomarkers can be selected from the group consisting of ALCAM, CHI3L1, COL18A1, IGFBP2, LCN2, LYZ, PARK7, REG3A, SLPI, THBS1, TNFRSF1A, WFDC2, and any combination thereof. In some embodiments, the additional biomarkers can be non-protein biomarkers. In some embodiments, the non-protein biomarkers can be circulating tumor DNA (ctDNA). In some embodiments, a method as described herein may further comprise: measuring the level of (N1/N8)-acetylspermidine (AcSperm) in the biological sample; measuring the level of diacetylspermine (DAS) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (18:0) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (20:3) in the biological sample; and measuring the level of an indole-derivative in the biological sample; wherein the amount of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
A regression model was identified that can predict the PDAC status for a subject based on levels of CA19-9, TIMP1, and LRG1 found in a biological sample from the subject.
In some embodiments, biomarkers are measured in blood samples drawn from patients. In some embodiments, the presence or absence of biomarkers in a biological sample can be determined. In some embodiments, the level of biomarkers in a biological sample can be quantified.
In some embodiments, a surface is provided to analyze a biological sample. In some embodiments, biomarkers of interest adsorb nonspecifically onto this surface. In some embodiments, receptors specific for biomarkers of interest are incorporated onto this surface.
In some embodiments, the surface is associated with a particle, for example, a bead. In some embodiments, the surface is contained in a multi-well plate to facilitate simultaneous measurements.
In some embodiments, multiple surfaces are provided for parallel assessment of biomarkers. In some embodiments, the multiple surfaces are provided on a single device, for example a 96-well plate. In some embodiments, the multiple surfaces enable simultaneous measurement of biomarkers. In some embodiments, a single biological sample can be applied sequentially to a plurality of surfaces. In some embodiments, a biological sample is divided for simultaneous application to a plurality of surfaces.
In some embodiments, the biomarker binds to a particular receptor molecule, and the presence or absence of the biomarker-receptor complex can be determined. In some embodiments, the amount of biomarker-receptor complex can be quantified. In some embodiments, the receptor molecule is linked to an enzyme to facilitate detection and quantification.
In some embodiments, the biomarker binds to a particular relay molecule, and the biomarker-relay molecule complex in turn binds to a receptor molecule. In some embodiments, the presence or absence of the biomarker-relay-receptor complex can be determined. In some embodiments, the amount of biomarker-relay-receptor complex can be quantified. In some embodiments, the receptor molecule is linked to an enzyme to facilitate detection and quantification. In some embodiments, the enzyme is horseradish peroxidase or alkaline phosphatase.
In some embodiments, a biological sample is analyzed sequentially for individual biomarkers. In some embodiments, a biological sample is divided into separate portions to allow for simultaneous analysis for multiple biomarkers. In some embodiments, a biological sample is analyzed in a single process for multiple biomarkers.
In some embodiments, the absence or presence of biomarker can be determined by visual inspection. In some embodiments, the quantity of biomarker can be determined by use of a spectroscopic technique. In some embodiments, the spectroscopic technique is mass spectrometry. In some embodiments, the spectroscopic technique is UV/V is spectrometry. In some embodiments, the spectroscopic technique is an excitation/emission technique such as fluorescence spectrometry.
In some embodiments, a kit is provided for analysis of a biological sample. In some embodiments, the kit can contain chemicals and reagents required to perform the analysis. In some embodiments, the kit contains a means for manipulating biological samples in order to minimize the required operator intervention.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising obtaining a biological sample from the patient; measuring the level of (N1/N8)-acetylspermidine (AcSperm) in the biological sample; measuring the level of diacetylspermine (DAS) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (18:0) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (20:3) in the biological sample; and measuring the level of an indole-derivative in the biological sample; wherein the amount of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising a plasma-derived biomarker panel and a protein marker panel: wherein the plasma-derived biomarker panel comprises (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative; wherein the protein biomarker panel comprises CA19-9, LRG1, and TIMP1; wherein the method comprises: obtaining a biological sample from the patient; measuring the levels of the plasma-derived biomarkers and the protein biomarkers in the biological sample; wherein the amount of the plasma-derived biomarkers and the protein biomarkers classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising determining the levels of one or more protein biomarkers and one or more metabolite markers, said method comprising: obtaining a biological sample from the patient; contacting the sample with a first reporter molecule that binds CA19-9 antigen; contacting the sample with a second reporter molecule that binds TIMP1 antigen; contacting the sample with a third reporter molecule that binds LRG1 antigen; and determining the levels of the one or more biomarkers, wherein the one or more biomarkers is selected from the group consisting of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative; wherein the amount of the first reporter molecule, the second reporter molecule, the third reporter molecule, and the one or more biomarkers classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising obtaining a biological sample from the patient; measuring the levels of CA19-9, TIMP1, and LRG1 antigens in the biological sample; and measuring the levels of one or more metabolite markers selected from the group consisting of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative in the biological sample; assigning the condition of the patient as either susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma, as determined by statistical analysis of the levels of CA19-9 antigen, TIMP1 antigen, LRG1 antigen, (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative in the biological sample.
In another aspect, the disclosure provides a method of treating a patient suspected of susceptibility to pancreatic ductal adenocarcinoma, comprising: analyzing the patient for susceptibility to pancreatic ductal adenocarcinoma with a method as recited in any one of claims 38-41; administering a therapeutically effective amount of a treatment for the adenocarcinoma. In one embodiment, the treatment is surgery, chemotherapy, radiation therapy, targeted therapy, or a combination thereof.
In one embodiment, a method as described herein comprises at least one receptor molecule that selectively binds to an antigen selected from the group consisting of CA19-9, TIMP1, and LRG1.
In one embodiment, detection of the amount of CA19-9, TIMP1, LRG, (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), or the indole-derivative comprises the use of a solid particle. In another embodiment, the solid particle is a bead.
In one embodiment, at least one of the reporter molecules is linked to an enzyme.
In one embodiment, at least one of the protein or metabolite markers generates a detectable signal. In another embodiment, the detectable signal is detectable by a spectrometric method. In another embodiment, the spectrometric method is mass spectrometry.
In one embodiment, a method as described herein comprises inclusion of patient history information into the assignment of having pancreatic ductal adenocarcinoma or not having pancreatic ductal adenocarcinoma.
In one embodiment, a method as described herein comprises administering at least one alternate diagnostic test for a patient assigned as having pancreatic ductal adenocarcinoma. In another embodiment, the at least one alternate diagnostic test comprises an assay or sequencing of at least one ctDNA.
In another aspect, the disclosure provides a kit for a method as described herein, comprising: a reagent solution that comprises a first solute for detection of CA19-9 antigen; a second solute for detection of LRG1 antigen; a third solute for detection of TIMP1 antigen; a fourth solute for detection of (N1/N8)-acetylspermidine (AcSperm); a fifth solute for detection of diacetylspermine (DAS); a sixth solute for detection of lysophosphatidylcholine (LPC) (18:0); a seventh solute for detection of lysophosphatidylcholine (LPC) (20:3); and an eighth solute for detection of the indole-derivative.
In one embodiment, such a kit may comprise a first reagent solution that comprises a first solute for detection of CA19-9 antigen; a second reagent solution that comprises a second solute for detection of LRG1 antigen; a third reagent solution that comprises a third solute for detection of TIMP1 antigen; a fourth reagent solution that comprises a fourth solute for detection of (N1/N8)-acetylspermidine (AcSperm); a fifth reagent solution that comprises a fifth solute for detection of diacetylspermine (DAS); a sixth reagent solution that comprises a sixth solute for detection of lysophosphatidylcholine (LPC) (18:0); a seventh reagent solution that comprises a seventh solute for detection of lysophosphatidylcholine (LPC) (20:3); and an eighth reagent solution that comprises an eighth solute for detection of the indole-derivative.
In one embodiment, a kit as described herein may comprise a device for contacting the reagent solutions with a biological sample. In another embodiment, such a kit may comprise at least one surface with means for binding at least one antigen. In another embodiment, the at least one antigen is selected from the group consisting of CA19-9, LRG1, and TIMP1. In another embodiment, the at least one surface comprises a means for binding ctDNA.
In another aspect, the disclosure provides such a method as described herein wherein the method further comprises: measuring the level of (N1/N8)-acetylspermidine (AcSperm) in the biological sample; measuring the level of diacetylspermine (DAS) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (18:0) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (20:3) in the biological sample; and measuring the level of an indole-derivative in the biological sample; wherein the amount of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising obtaining a biological sample from the patient; measuring the level of (N1/N8)-acetylspermidine (AcSperm) in the biological sample; measuring the level of diacetylspermine (DAS) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (18:0) in the biological sample; measuring the level of lysophosphatidylcholine (LPC) (20:3) in the biological sample; and measuring the level of an indole-derivative in the biological sample; wherein the amount of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising a plasma-derived biomarker panel and a protein marker panel: wherein the plasma-derived biomarker panel comprises (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative; wherein the protein biomarker panel comprises CA19-9, LRG1, and TIMP1; wherein the method comprises: obtaining a biological sample from the patient; measuring the levels of the plasma-derived biomarkers and the protein biomarkers in the biological sample; wherein the amount of the plasma-derived biomarkers and the protein biomarkers classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising determining the levels of one or more protein biomarkers and one or more metabolite markers, said method comprising: obtaining a biological sample from the patient; contacting the sample with a first reporter molecule that binds CA19-9 antigen; contacting the sample with a second reporter molecule that binds TIMP1 antigen; contacting the sample with a third reporter molecule that binds LRG1 antigen; and determining the levels of the one or more biomarkers, wherein the one or more biomarkers is selected from the group consisting of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative; wherein the amount of the first reporter molecule, the second reporter molecule, the third reporter molecule, and the one or more biomarkers classifies the patient as being susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma.
In another aspect, the disclosure provides a method of determining susceptibility of a patient to pancreatic ductal adenocarcinoma, comprising obtaining a biological sample from the patient; measuring the levels of CA19-9, TIMP1, and LRG1 antigens in the biological sample; and measuring the levels of one or more metabolite markers selected from the group consisting of (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative in the biological sample; assigning the condition of the patient as either susceptible to pancreatic ductal adenocarcinoma or not susceptible to pancreatic ductal adenocarcinoma, as determined by statistical analysis of the levels of CA19-9 antigen, TIMP1 antigen, LRG1 antigen, (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and the indole-derivative in the biological sample.
In another aspect, the disclosure provides a method of treating a patient suspected of susceptibility to pancreatic ductal adenocarcinoma, comprising: analyzing the patient for susceptibility to pancreatic ductal adenocarcinoma with a method as recited in any one of claims 36-39; administering a therapeutically effective amount of a treatment for the adenocarcinoma. In one embodiment, the treatment is surgery, chemotherapy, radiation therapy, targeted therapy, or a combination thereof. In another embodiment, such a method comprises at least one receptor molecule that selectively binds to an antigen selected from the group consisting of CA19-9, TIMP1, and LRG1. In another embodiment, detection of the amount of CA19-9, TIMP1, LRG, (N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), or the indole-derivative comprises the use of a solid particle. In another embodiment, the solid particle is a bead. In another embodiment, at least one of the reporter molecules is linked to an enzyme. In another embodiment, at least one of the protein or metabolite markers generates a detectable signal. In another embodiment, the detectable signal is detectable by a spectrometric method. In another embodiment, the spectrometric method is mass spectrometry. In another embodiment, such a method comprises inclusion of patient history information into the assignment of having pancreatic ductal adenocarcinoma or not having pancreatic ductal adenocarcinoma. In another embodiment, such a method comprises administering at least one alternate diagnostic test for a patient assigned as having pancreatic ductal adenocarcinoma. In another embodiment, the at least one alternate diagnostic test comprises an assay or sequencing of at least one ctDNA.
In another aspect, the disclosure provides a kit for the method as recited in any one of claims 36-40, comprising: a reagent solution that comprises a first solute for detection of CA19-9 antigen; a second solute for detection of LRG1 antigen; a third solute for detection of TIMP1 antigen; a fourth solute for detection of (N1/N8)-acetylspermidine (AcSperm); a fifth solute for detection of diacetylspermine (DAS); a sixth solute for detection of lysophosphatidylcholine (LPC) (18:0); a seventh solute for detection of lysophosphatidylcholine (LPC) (20:3); and an eighth solute for detection of the indole-derivative. In another embodiment, a kit as disclosed herein comprises a first reagent solution that comprises a first solute for detection of CA19-9 antigen; a second reagent solution that comprises a second solute for detection of LRG1 antigen; a third reagent solution that comprises a third solute for detection of TIMP1 antigen; a fourth reagent solution that comprises a fourth solute for detection of (N1/N8)-acetylspermidine (AcSperm); a fifth reagent solution that comprises a fifth solute for detection of diacetylspermine (DAS); a sixth reagent solution that comprises a sixth solute for detection of lysophosphatidylcholine (LPC) (18:0); a seventh reagent solution that comprises a seventh solute for detection of lysophosphatidylcholine (LPC) (20:3); and an eighth reagent solution that comprises an eighth solute for detection of the indole-derivative. In one embodiment, such a kit comprises a device for contacting the reagent solutions with a biological sample. In another embodiment, such a kit comprises at least one surface with means for binding at least one antigen. In another embodiment, the at least one antigen is selected from the group consisting of CA19-9, LRG1, and TIMP1. In another embodiment, the at least one surface comprises a means for binding ctDNA.
In another aspect, the disclosure provides a method of treatment or prevention of progression of pancreatic ductal adenocarcinoma (PDAC) in a patient in whom the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen classifies the patient as having or being susceptible to PDAC comprising one or more of: administering a chemotherapeutic drug to the patient with PDAC; administering therapeutic radiation to the patient with PDAC; and surgery for partial or complete surgical removal of cancerous tissue in the patient with PDAC. In ne embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that does not have PDAC. In another embodiment, the reference patient or group is healthy. In another embodiment, the AUC (95% CI) is at least 0.850. In another embodiment, the AUC (95% CI) is at least 0.900. In another embodiment, the classification of the patient as having PDAC has a sensitivity of 0.849 and 0.658 at 95% and 99% specificity, respectively. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that has chronic pancreatitis. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that has benign pancreatic disease. In another embodiment, the AUC (95% CI) is at least 0.850. In another embodiment, the AUC (95% CI) is at least 0.900. In another embodiment, the classification of the patient as having PDAC has a sensitivity of 0.849 and 0.658 at 95% and 99% specificity, respectively. In another embodiment, the PDAC is diagnosed at or before the borderline resectable stage. In another embodiment, the PDAC is diagnosed at the resectable stage.
In another aspect, the disclosure provides a method of treatment or prevention of progression of pancreatic ductal adenocarcinoma (PDAC) in a patient in whom the levels of CA19-9 antigen, TIMP1 antigen, LRG1, N1/N8)-acetylspermidine (AcSperm), diacetylspermine (DAS), lysophosphatidylcholine (LPC) (18:0), lysophosphatidylcholine (LPC) (20:3), and an indole-derivative classifies the patient as having or being susceptible to PDAC comprising one or more of: administering a chemotherapeutic drug to the patient with PDAC; administering therapeutic radiation to the patient with PDAC; and surgery for partial or complete surgical removal of cancerous tissue in the patient with PDAC. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that does not have PDAC. In another embodiment, the reference patient or group is healthy. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that has chronic pancreatitis. In another embodiment, the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen are elevated in comparison to the levels of CA19-9 antigen, TIMP1 antigen, and LRG1 antigen in a reference patient or group that has benign pancreatic disease. In another embodiment, the patient is at high-risk of PDAC. In another embodiment, the patient is over age 50 years with new-onset diabetes mellitus, has chronic pancreatitis, has been incidentally diagnosed with mucin-secreting cysts of the pancreas, or is asymptomatic kindred of one of these high-risk groups.
In another aspect, the disclosure provides a method of treating a patient suspected of susceptibility to pancreatic ductal adenocarcinoma, comprising analyzing the patient for susceptibility to pancreatic ductal adenocarcinoma with a method as described herein; administering a therapeutically effective amount of a treatment for the adenocarcinoma. In another embodiment, the treatment is surgery, chemotherapy, radiation therapy, targeted therapy, or a combination thereof.
Provided are methods for identifying pancreatic cancer in a human subject, the methods generally comprising:
(a) applying a blood sample obtained from the subject to an assay for analysis of at least three biomarkers: CA19-9, TIMP1, and LRG1;
(b) quantifying the amount of the at least three biomarkers present in the blood sample; and
(c) applying statistical analysis based on the amount of biomarkers present to determine a biomarker score with respect to corresponding pancreatic cancer, thereby classifying a subject as either positive or negative for pancreatic cancer.
The methods herein enable screening of high-risk subjects, for example, those with a family history of pancreatic cancer, or patients with other risk factors such as chronic pancreatitis, obesity, heavy smoking, and possibly diabetes. The logistic regression model provided herein can incorporate these factors into a classification method.
For subjects that are classified as PDAC-positive, further methods can be provided to clarify PDAC status. Classification as PDAC-positive can be followed by methods including, but not limited to, computed tomography (CT), endoscopic ultrasound (EUS), or endoscopic retrograde cholangiopancreatography (ERCP).
Detection of CA19-9 can be accomplished by contact with the CA19-9 antigen, which is a carbohydrate structure called sialyl-Lewis A (part of the Lewis family of blood group antigens) with the sequence Neu5Acα2,3Galβ1,3(Fucαα1,4)GlcNAc. Sialyl-Lewis A is synthesized by glycosyltransferases that sequentially link the monosaccharide precursors onto both N-linked and O-linked glycans. It is attached to many different proteins, including mucins, carcinoembryonic antigen, and circulating apolipoproteins. In the standard CA19-9 clinical assay, a monoclonal antibody captures and detects the CA19-9 antigen in a sandwich ELISA format, which measures the CA19-9 antigen on many different carrier proteins (Partyka et al., Proteomics 12(13):2213-20, 2012).
Detection of TIMP1 (SEQ ID NO:1; UniProtKB: P01033) can be accomplished by contact with a reporter molecule that specifically binds to TIMP1.
Detection of LRG1 (SEQ ID NO:2; UniProtKB: P02750) can be accomplished by contact with a reporter molecule that specifically binds to LRG1.
A combination of at least the three biomarkers CA19-9, TIMP1, and LRG1 can afford a previously unseen, highly reliable PDAC predictive power. When applied to a blind test set composed of plasma samples from 39 resectable PDAC cases and 82 matched healthy controls, the methods described herein yielded an AUC (95% CI) of 0.887 (0.817-0.957) with a sensitivity of 0.667 at 95% specificity in discriminating early-stage PDAC versus healthy controls. The performance of the biomarker panel demonstrated high accuracy detection of early stage pancreatic cancer and a statistically-significant improvement as compared to CA19-9 alone (p=0.008, test set).
With regard to the detection of the biomarkers detailed herein, the disclosure is not limited to the specific biomolecules reported herein. In some embodiments, other biomolecules can be chosen for the detection and analysis of the disclosed biomarkers including, but not limited to, biomolecules based on proteins, antibodies, nucleic acids, aptamers, and synthetic organic compounds. Other molecules may demonstrate advantages in terms of sensitivity, efficiency, speed of assay, cost, safety, or ease of manufacture or storage. In this regard, those of ordinary skill in the art will appreciate that the predictive and diagnostic power of the biomarkers disclosed herein may extend to the analysis of not just the protein form of these biomarkers, but other representations of the biomarkers as well (e.g., nucleic acid). Further, those of ordinary skill in the art will appreciate that the predictive and diagnostic power of the biomarkers disclosed herein can also be used in combination with an analysis of other biomarkers associated with PDAC. In some embodiments, other biomarkers associated with PDAC can be protein-based biomarkers. In some embodiments, other biomarkers associated with PDAC can be non-protein-based biomarkers, such as, for instance, ctDNA.
TIMP1 and LRG1 complement CA19-9 performance in the validation studies that are disclosed herein. Increased gene expression and/or secretion of TIMP1 has been previously observed in PDAC and found to induce tumor cell proliferation. Although elevated circulating TIMP1 levels have been associated with PDAC, increased levels have also been found in other epithelial tumor types. A role for LRG1 has been suggested in promoting angiogenesis through activation of the TGF-β pathway. Besides PDAC, increased LRG1 plasma levels have also been found in other cancer types.
The performance of the three marker panel demonstrated a statistically-significant improvement over CA19-9 alone in distinguishing early-stage PDAC from matched healthy subject or benign pancreatic disease controls. The three marker panel permits assessment of PDAC among subjects at increased risk, namely those with family history, cystic lesions, chronic pancreatitis or subjects who present with adult-onset type II diabetes, as opposed to screening of asymptomatic subjects of average risk.
Disclosed herein is the first proteomics-based study, performed using both human prediagnostic and mouse early-stage PDAC plasma samples, to conduct sequential validation of identified biomarker candidates in multiple independent sets of samples from resectable PDAC patients and matched controls.
In some embodiments, levels of CA19-9, TIMP1, and LRG1 in a biological sample are measured. In some embodiments, CA19-9, TIMP1, and LRG1 are contacted with reporter molecules, and the levels of respective reporter molecules are measured. In some embodiments, three reporter molecules are provided that specifically bind CA19-9, TIMP1, and LRG1, respectively. Use of reporter molecules can provide gains in convenience and sensitivity for the assay.
In some embodiments, CA19-9, TIMP1, and LRG1 are adsorbed onto a surface that is provided in a kit. In some embodiments, reporter molecules bind to surface-adsorbed CA19-9, TIMP1, and LRG1. Adsorption of biomarkers can be nonselective or selective. In some embodiments, the surface comprises a receptor functionality for increasing selectivity towards adsorption of one or more biomarkers.
In some embodiments, CA19-9, TIMP1, and LRG1 are adsorbed onto three surfaces that are selective for one or more of the biomarkers. A reporter molecule or multiple reporter molecules can then bind to surface-adsorbed biomarkers, and the level of reporter molecule(s) associated with a particular surface can allow facile quantification of the particular biomarker present on that surface.
In some embodiments, CA19-9, TIMP1, and LRG1 are adsorbed onto a surface provided in a kit; relay molecules specific for one or more of these biomarkers can bind to surface-adsorbed biomarkers; and receptor molecules specific for one or more relay molecules can bind to relay molecules. Relay molecules can provide specificity for certain biomarkers, and receptor molecules can enable detection.
In some embodiments, three relay molecules are provided that specifically bind CA19-9, TIMP1, and LRG1, respectively. Relay molecules can be designed for specificity towards a biomarker, or can be selected from a pool of candidates due to their binding properties. Relay molecules can be antibodies generated to bind the biomarkers.
In some embodiments, CA19-9, TIMP1, and LRG1 are adsorbed onto three discrete surfaces provided in a kit; relay molecules specific for one or more of these biomarkers can bind to surface-adsorbed biomarkers; and receptor molecules can bind to relay molecules. Analysis of the surfaces can be accomplished in a stepwise or concurrent fashion.
In some embodiments, the reporter molecule is linked to an enzyme, facilitating quantification of the reporter molecule. In some embodiments, quantification can be achieved by catalytic production of a substance with desirable spectroscopic properties.
In some embodiments, the amount of biomarker is determined using spectroscopy. In some embodiments, the spectroscopy is UV/visible spectroscopy. In some embodiments, the amount of biomarker is determined using mass spectrometry.
The quantity of biomarker(s) found in a particular assay can be directly reported to an operator, or alternately it can be stored digitally and readily made available for mathematical processing. A system can be provided for performing mathematical analysis, and can further report classification as PDAC-positive or PDAC-negative to an operator.
In some embodiments, additional assays known to those of ordinary skill in the art can function within the scope of the present disclosure. Examples of other assays include, but are not limited to, assays utilizing mass-spectrometry, immunoaffinity LC-MS/MS, surface plasmon resonance, chromatography, electrochemistry, acoustic waves, immunohistochemistry, and array technologies.
Also provided herein are methods of treatment for subjects who are classified as PDAC-positive. Treatment for PDAC-positive patients can include, but is not limited to, surgery, chemotherapy, radiation therapy, targeted therapy, or a combination thereof.
The foregoing has outlined rather broadly the features and technical benefits of the disclosure in order that the detailed description may be better understood. It should be appreciated by those skilled in the art that the specific embodiments disclosed may be readily utilized as a basis for modifying or designing other structures or processes for carrying out the same purposes of the disclosure. It is to be understood that the present disclosure is not limited to the particular embodiments described, as variations of the particular embodiments may be made and still fall within the scope of the appended claims.
As used herein, the term “pancreatic cancer” means a malignant neoplasm of the pancreas characterized by the abnormal proliferation of cells, the growth of which cells exceeds and is uncoordinated with that of the normal tissues around it.
As used herein, the term “PDAC” refers to pancreatic ductal adenocarcinoma, which is pancreatic cancer that can originate in the ducts of the pancreas.
As used herein, the term “PDAC-positive” refers to classification of a subject as having PDAC.
As used herein, the term “PDAC-negative” refers to classification of a subject as not having PDAC.
As used herein, the term “pancreatitis” refers to an inflammation of the pancreas. Pancreatitis is not generally classified as a cancer, although it may advance to pancreatic cancer.
As used herein, the term “subject” or “patient” as used herein refers to a mammal, preferably a human, for whom a classification as PDAC-positive or PDAC-negative is desired, and for whom further treatment can be provided.
As used herein, a “reference patient” or “reference group” refers to a group of patients or subjects to which a test sample from a patient suspected of having or being susceptible to PDAC may be compared. In some embodiments, such a comparison may be used to determine whether the test subject has PDAC. A reference patient or group may serve as a control for testing or diagnostic purposes. As described herein, a reference patient or group may be a sample obtained from a single patient, or may represent a group of samples, such as a pooled group of samples.
As used herein, “healthy” refers to an individual having a healthy pancreas, or normal, non-compromised pancreatic function. A healthy patient or subject has no symptoms of PDAC or other pancreatic disease. In some embodiments, a healthy patient or subject may be used as a reference patient for comparison to diseased or suspected diseased samples for determination of PDAC in a patient or a group of patients.
The term “treatment” or “treating” as used herein refers to the administration of medicine or the performance of medical procedures with respect to a subject, for either prophylaxis (prevention) or to cure or reduce the extent of or likelihood of occurrence or recurrence of the infirmity or malady or condition or event in the instance where the subject or patient is afflicted. As related to the present disclosure, the term may also mean the administration of pharmacological substances or formulations, or the performance of non-pharmacological methods including, but not limited to, radiation therapy and surgery. Pharmacological substances as used herein may include, but are not limited to, chemotherapeutics that are established in the art, such as Gemcitabine (GEMZAR), 5-fluorouracil (5-FU), irinotecan (CAMPTOSAR), oxaliplatin (ELOXATIN), albumin-bound paclitaxel (ABRAXANE), capecitabine (XELODA), cisplatin, paclitaxel (TAXOL), docetaxel (TAXOTERE), and irinotecan liposome (ONIVYDE). Pharmacological substances may include substances used in immunotherapy, such as checkpoint inhibitors. Treatment may include a multiplicity of pharmacological substances, or a multiplicity of treatment methods, including, but not limited to, surgery and chemotherapy.
As used herein, the term “ELISA” refers to enzyme-linked immunosorbent assay. This assay generally involves contacting a fluorescently tagged sample of proteins with antibodies having specific affinity for those proteins. Detection of these proteins can be accomplished with a variety of means, including but not limited to laser fluorimetry.
As used herein, the term “regression” refers to a statistical method that can assign a predictive value for an underlying characteristic of a sample based on an observable trait (or set of observable traits) of said sample. In some embodiments, the characteristic is not directly observable. For example, the regression methods used herein can link a qualitative or quantitative outcome of a particular biomarker test, or set of biomarker tests, on a certain subject, to a probability that said subject is for PDAC-positive.
As used herein, the term “logistic regression” refers to a regression method in which the assignment of a prediction from the model can have one of several allowed discrete values. For example, the logistic regression models used herein can assign a prediction, for a certain subject, of either PDAC-positive or PDAC-negative.
As used herein, the term “biomarker score” refers to a numerical score for a particular subject that is calculated by inputting the particular biomarker levels for said subject to a statistical method.
As used herein, the term “cutoff point” refers to a mathematical value associated with a specific statistical method that can be used to assign a classification of PDAC-positive of PDAC-negative to a subject, based on said subject's biomarker score.
As used herein, the term “classification” refers to the assignment of a subject as either PDAC-positive or PDAC-negative, based on the result of the biomarker score that is obtained for said subject.
As used herein, the term “PDAC-positive” refers to an indication that a subject is predicted as susceptible to PDAC, based on the results of the outcome of the methods of the disclosure.
As used herein, the term “PDAC-negative” refers to an indication that a subject is predicted as not susceptible to PDAC, based on the results of the outcome of the methods of the disclosure.
As used herein, the term “Wilcoxon rank sum test,” also known as the Mann-Whitney U test, Mann-Whitney-Wilcoxon test, or Wilcoxon-Mann-Whitney test, refers to a specific statistical method used for comparison of two populations. For example, the test can be used herein to link an observable trait, in particular a biomarker level, to the absence or presence of PDAC in subjects of a certain population.
As used herein, the term “true positive rate” refers to the probability that a given subject classified as positive by a certain method is truly positive.
As used herein, the term “false positive rate” refers to the probability that a given subject classified as positive by a certain method is truly negative.
As used herein, the term “ROC” refers to receiver operating characteristic, which is a graphical plot used herein to gauge the performance of a certain diagnostic method at various cutoff points. A ROC plot can be constructed from the fraction of true positives and false positives at various cutoff points.
As used herein, the term “AUC” refers to the area under the curve of the ROC plot. AUC can be used to estimate the predictive power of a certain diagnostic test. Generally, a larger AUC corresponds to increasing predictive power, with decreasing frequency of prediction errors. Possible values of AUC range from 0.5 to 1.0, with the latter value being characteristic of an error-free prediction method.
As used herein, the term “p-value” or “p” refers to the probability that the distributions of biomarker scores for positive-PDAC and non-positive-PDAC subjects are identical in the context of a Wilcoxon rank sum test. Generally, a p-value close to zero indicates that a particular statistical method will have high predictive power in classifying a subject.
As used herein, the term “CI” refers to a confidence interval, i.e., an interval in which a certain value can be predicted to lie with a certain level of confidence. As used herein, the term “95% CI” refers to an interval in which a certain value can be predicted to lie with a 95% level of confidence.
As used herein, the term “sensitivity” refers to, in the context of various biochemical assays, the ability of an assay to correctly identify those with a disease (i.e., the true positive rate). By comparison, as used herein, the term “specificity” refers to, in the context of various biochemical assays, the ability of an assay to correctly identify those without the disease (i.e., the true negative rate). Sensitivity and specificity are statistical measures of the performance of a binary classification test (i.e., classification function). Sensitivity quantifies the avoiding of false negatives, and specificity does the same for false positives.
As used herein, the term “ALCAM” refers to activated leukocyte cell adhesion molecule.
As used herein, the term “CHI3L1” refers to chitinase-3-like-1.
As used herein, the term “COL18A1” refers to collagen type XVIII alpha 1.
As used herein, the term “IGBFP2” refers to insulin-like growth factor binding protein 2.
As used herein, the term “LCN2” refers to lipocalin 2.
As used herein, the term “LRG1” refers to leucine-rich alpha-2-glycoprotein 1.
As used herein, the term “LYZ” refers to lysozyme 2.
As used herein, the term “PARK7,” refers to protein deglycase DJ-1.
As used herein, the term “REG3A” refers to regenerating family member 3 alpha.
As used herein, the term “SLPI” refers to secretory leukocyte protease inhibitor, also known in the art as antileukoproteinase.
As used herein, the term “pro-CTSS” refers to pro-cathepsin S.
As used herein, the term “total-CTSS” refers to total cathepsin S.
As used herein, the term “THBS1” refers to thrombospondin 1.
As used herein, the term “TIMP1” refers to TIMP metallopeptidase inhibitor 1, also known in the art as metalloproteinase inhibitor 1.
As used herein, the term “TNFRSF1A” refers to tumor necrosis factor receptor superfamily member 1A.
As used herein, the term “WFDC2” refers to WAP four-disulfide core domain 2.
As used herein, the term “CA19-9” refers to carbohydrate antigen 19-9, and is also known in the art as cancer antigen 19-9 and sialylated Lewisa antigen.
As used herein, the term “ctDNA” refers to cell-free or circulating tumor DNA. ctDNA is tumor DNA found circulating freely in the blood of a cancer patient. Without being limited by theory, ctDNA is thought to originate from dying tumor cells and can be present in a wide range of cancers but at varying levels and mutant allele fractions. Generally, ctDNA carry unique somatic mutations formed in the originating tumor cell and not found in the host's healthy cells. As such, the ctDNA somatic mutations can act as cancer-specific biomarkers.
As used herein, a “metabolite” refers to small molecules that are intermediates and/or products of cellular metabolism. Metabolites may perform a variety of functions in a cell, for example, structural, signaling, stimulatory and/or inhibitory effects on enzymes. In some embodiments, a metabolite may be a non-protein, plasma-derived metabolite marker, such as including, but not limited to, acetylspermidine, diacetylspermine, lysophosphatidylcholine (18:0), lysophosphatidylcholine (20:3) and an indole-derivative.
As used herein, an “indole-derivative” refers to compounds that are derived from indole. Indole is an aromatic heterocyclic organic compound with formula C8H7N. It has a bicyclic structure, consisting of a six-membered benzene ring fused to a five-membered nitrogen-containing pyrrole ring. An indole-derivative as described herein may be any derivative of indole. Representative examples include, but are not limited to, tryptophan, indole-3-ethanol, 10,11-Methylenedioxy-20(S)-CPT, 9-Methyl-20(S)-CPT, 9-Amino-10,11-methylenedioxy-20(S)-CPT, 9-Chloro-10,11-methylenedioxy-20(S)-CPT, 9-Chloro-20(S)-CPT, 10-Hydroxy-20(S)-CPT, 9-Amino-20(S)-CPT, 10-Amino-20(S)-CPT, 10-Chloro-20(S)-CPT, 10-Nitro-20(S)-CPT, 20(S)-CPT, 9-hydroxy-20(S)-CPT, (SR)-Indoline-2-carboxylic acid, IAA, IAA-L-Ile, IAA-L-Leu, IBA, ICA-OEt, ICA, Indole-3-acrylic acid, Indole-3-carboxylic acid methyl ester, Indole-3-carboxylic acid, Indole-4-carboxylic acid methyl ester, Boc-L-Igl-OH.
The most common way to classify pancreatic cancer is to divide it into 4 categories based on whether it can be removed with surgery and where it has spread: resectable, borderline resectable, locally advanced, or metastatic. Resectable pancreatic cancer can be surgically removed. The tumor may be located only in the pancreas or extends beyond it, but it has not grown into important arteries or veins in the area. There is no evidence that the tumor has spread to areas outside of the pancreas. Using standard methods common in the medical industry today, only about 10% to 15% of patients are diagnosed with this stage. Borderline resectable describes a tumor that may be difficult, or not possible, to remove surgically when it is first diagnosed, but if chemotherapy and/or radiation therapy is able to shrink the tumor first, it may be able to be removed later with negative margins. A negative margin means that no visible cancer cells are left behind. Locally advanced pancreatic cancer is still located only in the area around the pancreas, but it cannot be surgically removed because it has grown into nearby arteries or veins or to nearby organs. However, there are no signs that it has spread to any distant parts of the body. Using standard methods common in the medical industry today, approximately 35% to 40% of patients are diagnosed with this stage. Metastatic means the cancer has spread beyond the area of the pancreas and to other organs, such as the liver or distant areas of the abdomen. Using standard methods common in the medical industry today, approximately 45% to 55% of patients are diagnosed with this stage. Alternatively, the TNM Staging System, commonly used for other cancers, may be used (but is not common in pancreatic cancer). This system is based on tumor size (T), spread to lymph nodes (N), and metastasis (M).
Options for treatment of pancreatic cancer include surgery for partial or complete surgical removal of cancerous tissue (for example a Whipple procedure, distal pancreatectomy, or total pancreatectomy), administering one or more chemotherapeutic drugs, and administering therapeutic radiation to the affected tissue (e.g., conventional/standard fraction radiation therapy stereotactic body radiation (SBRT)). Chemotherapeutic drugs approved for treatment of pancreatic cancer include, but are not limited to, capecitabine (Xeloda), erlotinib (Tarceva), fluorouracil (5-FU), gemcitabine (Gemzar), irinotecan (Camptosar), leucovorin (Wellcovorin), nab-paclitaxel (Abraxane), nanoliposomal irinotecan (Onivyde), and oxaliplatin (Eloxatin).
Pancreatic cancer is treated most effectively when diagnosed early, preferably at or before the borderline resectable stage and more preferably at the resectable stage.
The following examples are included to demonstrate embodiments of the disclosure. The following examples are presented only by way of illustration and to assist one of ordinary skill in using the disclosure. The examples are not intended in any way to otherwise limit the scope of the disclosure. Those of ordinary skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the disclosure.
Quantitative mass spectrometry (MS) analysis of human plasma samples was done as previously described (Faca et al., PLoS Med. 5(6):e123, 2008). One pool consisting of pancreatic cases whose blood was collected before onset of symptoms and diagnosis was labeled with heavy 1,2,3-13C-acrylamide isotope while a control pool was labeled with light acrylamide prior to mixing of the pools. Proteins were separated by an automated online 2D-HPLC system controlled by Workstation Class-VP 7.4 (Shimadzu Corporation). Separation consisted of anion exchange chromatography followed by reversed-phase chromatography. Each fraction was lyophilized, in-solution digested, and analyzed by MS using an LTQ-Orbitrap (Thermo) mass spectrometer coupled with a NanoLC-1D (Eksigent).
Acquired LC-MS/MS data were processed by the Computational Proteomics Analysis System (CPAS) pipeline (Rauch et al., J. Proteome Res. 5(1):112-21, 2006). X!Tandem, with the custom scoring-plugin Comet, was used as the search engine against the database of human International Protein Index (IPI) version 3.13. Search algorithm parameters were set for trypsin specificity and a maximum of two missed cleavages. Mass tolerance was 1.5 Da for precursor ions and 0.5 Da for fragment ions. Cysteine alkylation with [12C] acrylamide (+71.03657) was set as a fixed modification, and [13C] acrylamide (+3.01006) and oxidation of methionine (+15.99491) as variable modifications. Identified peptides were further validated through PeptideProphet (Keller et al., Anal. Chem. 74(20):5383-92, 2002) and proteins inferred via ProteinProphet (Nesvizhskii et al., Anal. Chem. 75(17):4646-58, 2003). Protein identifications were filtered with a 5% error rate based on the ProteinProphet evaluation. Protein quantitative information was extracted with a designated tool Q3 to quantify each pair of peptides containing cysteine residues identified by MS/MS (Faca et al., J. Proteome Res. 5(8):2009-18, 2006). Only peptides with a minimum of 0.75 PeptideProphet score, and maximum of 20 ppm fractional delta mass were selected for quantitation. Ratios of [13C] acrylamide-labeled to [12C] acrylamide-labeled peptides were plotted on a histogram (log 2 scale), and the median of the distribution was centered at zero. All normalized peptide ratios for a specific protein were averaged to compute an overall protein ratio.
The analysis resulted in the identification of 1,732 proteins using ProteinProphet scores of 0.8 or higher, with an error rate less than 5%. Results also included quantification of 395 proteins with at least two quantified peptides used for downstream analysis.
For all ELISA experiments, each sample was assayed in duplicate, and the absorbance or chemiluminescence was measured with a SpectraMax M5 microplate reader (Molecular Devices). An internal control sample was run in every plate and each value of the samples was divided by the mean value of the internal control in the same plate to correct inter-plate variability.
Murine monoclonal antibodies (#635 and #675) against recombinant NPC2 (aa 20-151; SEQ ID NO:3; UniProtKB: P61916) were generated and used in a sandwich ELISA.
Ninety-six well polystyrene plates (Corning, Canton, N.Y., USA) were coated with 1 μg/mL of anti-NPC2 mouse monoclonal antibody (#635) as capture antibody, followed by blocking with Reagent Diluent (R&D Systems). Plasma samples were diluted 1:200 and serial dilution of recombinant protein was applied to develop a standard curve. Biotinylated anti-NPC2 murine monoclonal antibody (#675) at 1:4000 dilutions was used for detection. After washing, each well was incubated with Streptavidin-HRP followed by incubation of color reagents and stop solution (R&D Systems).
Independent multiple blood sample cohorts were drawn from a pool consisting of PDAC cases (n=187), benign pancreatic disease (n=93), and healthy controls (n=169). All human blood samples were obtained following Institutional Review Board (University of Michigan Comprehensive Cancer Center, Evanston Hospital, University of Utah, University of Texas MD Anderson Cancer Center and International Agency for Research on Cancer) approval and informed consent.
For studies using in-depth quantitative MS, a pool of plasma was constituted from 6 pre-diagnostic PDAC cases (sex, male; median age, 66.5 years; range, 62-76 years) and 6 matched controls (sex, male; median age, 67.0 years, range: 61-76 years). These samples were collected from subjects that were subsequently diagnosed with stage IA (N=1), IB (N=2), and IIB (N=3) PDAC an average of 9.3 months (range, 8-12 months) after sample collection as part of the Carotene and Retinol Efficacy Trial and from 6 controls from the same cohort that were matched for age, sex, and smoking history and that were not diagnosed with cancer over a 4-year follow-up period.
Plasma samples obtained from the University of Michigan Comprehensive Cancer Center under the auspices of the Early Detection Research Network, consisting of 75 PDAC cases, 27 healthy controls, and 19 chronic pancreatitis cases, were used for initial validation and biomarker selection (triage set).
An additional set of plasma samples from 73 patients with early-stage PDAC, 60 healthy controls, 60 patients with chronic pancreatitis, and 14 patients with benign pancreatic cysts, were used for biomarker sequential validation and panel development. All chronic pancreatitis samples were collected in an elective setting in the clinic in the absence of an acute flare-up.
Validation set #1, from Evanston Hospital, consisted of stages IB to IIB PDAC cases (n=10), healthy controls (n=10), and chronic pancreatitis cases (n=10); validation set #2, the University of Utah, consisted of early-stage (IA to IIA) PDAC cases (n=42), healthy controls (n=50), and chronic pancreatitis cases (n=50); and validation set #3, the University of Texas MD Anderson Cancer Center, consisted of resectable PDAC cases (n=21) and benign pancreatic cyst cases (n=14).
Demographics for the three validation sets are presented in Table 1.
An additional independent plasma sample set for testing the combined biomarker panel was obtained from the International Agency for Research on Cancer, consisting of 39 early-stage PDAC and 82 healthy controls. Demographics for the test set are presented in Table 2.
Raw assay data were log 2-transformed, after imputation of the lowest detected value for each assay, to the values below limit of detection. A one-sided Wilcoxon rank sum test was used to compute p values comparing PDAC cases with healthy controls, chronic pancreatitis cases, and pancreatic cyst cases. The applied test was one-sided as aimed to test the null hypothesis of AUC=0.50 versus the alternative hypothesis AUC>0.50. Receiver operating characteristic (ROC) curve analysis was performed to assess the performance of biomarkers in distinguishing PDAC cases from healthy controls, chronic pancreatitis cases, and pancreatic cyst cases. Owing to the small sample size of each set, validation sets #1, #2, and #3 were merged for model development by standardizing the data such that the mean was 0 and standard deviation was 1 for healthy controls. Because validation set #3 did not include healthy controls, the results were standardized such that the benign pancreatic cyst samples had the same mean and standard deviation as chronic pancreatitis samples. Statistical analyses were performed using MATLAB R2014b and SAS version 9.3. p<0.05 was considered statistically significant in all the analyses.
All possible combinations of seven validated biomarker candidates were explored to select a logistic regression model to discriminate pancreatic cancer from healthy control, chronic pancreatitis and pancreatic cyst based on the Akaike information criterion (AIC). A total of 127 logistic regression models were fitted. Standard errors, confidence intervals, and p values were obtained by 1000 times bootstrap taking into account the variability of the coefficients. The p values for comparing the biomarker panel and CA19-9 alone were calculated by 1000 times bootstrap and refers to the null hypothesis of AUC(panel)=AUC(CA19-9) versus the alternative AUC(panel)>AUC(CA19-9). Likelihood ratio test was also applied to compare the goodness of fit of the biomarker panel to CA19-9 alone. The LeaveMOut cross-validation technique was applied to validate the obtained logistic regression models. Data were split into a training and a test set, which corresponded to ⅔ and ⅓ of the original data, respectively. The models were validated by 1000 repetitions of such a splitting scheme and averaging the obtained 1000 AUCs from the test sets. A modified design covariate matrix was applied to build a logistic regression model with OR rule able to discriminate pancreatic cancer from chronic pancreatitis and benign pancreatic cysts patients: [I(Ca19-9>=a) Ca19-9*I(Ca19-9>=a) I(Ca19-9<a) TIMP1*I(Ca19-9<a) LRG1*I(Ca19-9<a) CA19-9*I(Ca19-9<a)]. All possible values of the CA19-9 threshold “a” were scanned to attain the highest possible AUC by 1000 times bootstrap. The measurements that were not initially selected by the bootstrap were used to generate the predicted scores and evaluate the AUC. The procedure was repeated 1000 times and two-tailed p-values were calculated on the obtained 1000 AUCs. The highest AUC was obtained with “a”=1.6.
To avoid over-fitting in the development in the test set of a logistic regression model which included covariates (represented by recruiting center, gender, age, smoking status, and alcohol drinking) together with the three biomarkers TIMP1, LRG1, CA19-9 a two-step strategy was followed. First a covariate-based score was generated by fitting a logistic regression model which included covariates only, and then the covariate-based score was added to the three-biomarker logistic regression model as a single covariate.
The NPC2 assay, as described above, was utilized for this study. At least 17 additional biomarker panel candidates are listed in Table 3.
A flow diagram for the study is presented in
The 7 biomarker candidates in the triage panel were then subjected to analysis with the three validation sets described above. AUC values for all 7 biomarkers selected in the triage set indicate that their plasma levels were consistently elevated in PDAC patients compared with matched controls in validation set #1, #2 and #3 (Tables 4, 5, and 6). The AUCs for these 7 markers, except for IGFBP2 in the comparison of PDAC versus chronic pancreatitis cases in validation set #2, were >0.60 in discriminating PDAC cases from healthy controls as well as chronic pancreatitis cases in both validation set #1 and #2. In addition, 4 biomarkers (CA19-9, TIMP1, LRG1, and IGFBP2) also yielded AUCs>0.60 in plasma samples from PDAC cases compared with benign pancreatic cyst cases in validation set #3 (Table 6).
To develop a biomarker panel for early-stage PDAC, the results of validation sets #1, #2, and #3 were standardized and combined. In the combined validation set the levels of all 7 biomarkers were higher to a statistically-significant degree (AUC>0.60; p<0.05, Wilcoxon rank-sum test) in PDAC cases than in healthy controls and benign pancreatic disease cases (chronic pancreatitis and benign pancreatic cyst cases combined) (Table 7). Next, a biomarker panel for early-stage PDAC based on a logistic regression model was developed.
The resulting regression model can be:
log it(p)=−1.97+1.7005×log TIMP1+0.93856×log LRG1+0.60639×log CA19.9
where p denotes the probability of being a case in the given sample. This model is a regular logistic regression model that makes use of the log it link function. The binary disease status is playing the role of the response and the markers play the role of the covariates. The algorithm for fitting such regression models is a standard one and is based on an iterative re-weighted procedure which is described in detail in standard textbooks of generalized linear models (McCullogh et al., Generalized Linear and Mixed Models (2008); Wiley Series in Probability and Statistics, John Wiley & Sons, Inc., Hoboken, N.J.). However, even though this standard approach applies for model fitting it cannot provide inference for the underlying AUC. To provide p-values and confidence intervals that refer to the AUC, a bootstrap scheme was employed in which re-estimation of the coefficients was done within each bootstrap sample (1000 in total) in order to be able to take into account the variability of the estimated coefficients.
The LeaveMOut cross-validation technique was applied to validate the resulting logistic regression model. In the comparison of PDAC cases with healthy controls, the resulting panel consisted of TIMP1, LRG1, and CA19-9 yielding an AUC (95% CI) of 0.949 (0.917-0.981) and a cross-validation related average AUC of 0.936, which was greater to a statistically-significant degree than the AUC of CA19-9 alone (AUC (95% CI)=0.882 (0.809-0.956); p=0.003, bootstrap; p<0.001, likelihood ratio test; Table 8 and
A logistic regression model based on the same biomarker combination (TIMP1, LRG1, and CA19-9) was developed to discriminate PDAC from benign pancreatic disease cases (AUC (95% CI)=0.846 (0.781-0.911) and a cross-validation related average AUC=0.830, Table 8). Whether an “OR” rule-based linear regression model, whereby either CA19-9 alone or the combination of all three markers, would enable discrimination between PDAC and benign pancreatic disease cases was also explored. The “OR” rule combination of TIMP1, LRG1, and CA19-9 yielded an AUC (95% CI) of 0.890 (0.802-0.978), which was greater to a statistically-significant degree than that of CA19-9 alone (AUC (95% CI)=0.831 (0.754-0.907); p<0.001 bootstrap; p<0.001, likelihood ratio test; Table 8 and
The regression model for discrimination of PDAC from benign pancreatic disease can be:
log it(p)=−1.2497+0.50306×log TIMP1+0.25355×log LRG1+0.51564×log CA19.9
where log refers to the logarithm with base 2. This was obtained by fitting a regular logistic regression model by employing the log it link function and using the binary disease status as the response and the markers as the covariates. The algorithm for fitting such regression models is a standard one and is based on an iterative re-weighted procedure which is described in detail in standard textbooks of generalized linear models (McCullogh et al., supra). An OR rule was further considered in which a tradeoff between the CA19-9 alone and the three marker panel was considered based on a decision value that was varied through a grid search. Namely a regular logistic regression model was considered for which the design matrix was contributing either only through the CA19-9 or through all three markers. Based on a fine grid of points of the threshold that would determine this contribution, an exemplary AUC was extracted that could be derived after repeatedly fitting all models for every point of the grid.
The panel yielded a sensitivity of 0.452 at 95% specificity, which represents an improvement over a sensitivity of 0.288 at 95% specificity for CA19-9 alone. The “OR” rule combination of TIMP1, LRG1, and CA19-9 resulted in high diagnostic accuracy when applied to the comparison of PDAC patients versus healthy controls yielding an AUC (95% CI) of 0.955 (0.890-1) (p vs. CA19-9: p<0.001 bootstrap; p<0.001, likelihood ratio test; Table 8).
Odds ratios at the Youden index-based optimal cut-off points was estimated. For the model for early-stage PDAC cases versus healthy controls, log (odds ratio) was 4.67 (95% CI=3.29-6.05) at the cut-off point with sensitivity of 0.849 and specificity of 0.950. For the model for early-stage PDAC cases versus benign pancreatic disease cases, log (odds ratio) was 2.98 (95% CI=2.04-3.91) at the cut-off point with sensitivity of 0.863 and specificity of 0.757.
Further blinded validation of the panel of three biomarkers TIMP1, LRG1, and CA19-9 was performed using the test set. The levels of all 3 biomarkers were significantly higher in PDAC cases than in healthy controls with AUC (95% CI) of 0.821 (0.736-0.906) for CA19-9, 0.730 (0.626-0.834) for TIMP1, and 0.832 (0.755-0.909) for LRG1 (Table 10). A linear combination of the three markers yielded an AUC (95% CI) of 0.903 (0.838-0.967), which was greater to a statistically-significant degree than the AUC of CA19-9 alone (p=0.001, bootstrap; p<0.001, likelihood ratio test; Table 11). Moreover, the linear combination of TIMP1, LRG1, CA19-9 and covariates (represented by recruiting center, gender, age, smoking status, and alcohol consumption) yielded an AUC (95% CI) of 0.929 (0.878-0.980), which represents a statistically-significant improvement over CA19-9 and covariates combination alone (AUC (95% CI)=0.848 (0.778-0.920); p=0.01, bootstrap; p<0.001, likelihood ratio test; Table 11). Inclusion of covariates resulted in a statistically-significant improvement in performance as compared to the three biomarker panel alone (p=0.03, bootstrap; p=0.004, likelihood ratio test; Table 11).
Of note, the logistic regression model of CA19-9, TIMP1 and LRG1 with fixed coefficients which was developed in the combined validation sets for PDAC versus healthy controls yielded an AUC of 0.887, also with statistically-significant improved performance compared to CA19-9 alone (p=0.008, likelihood ratio test; Table 10 and
It will be appreciated by those of ordinary skill in the art that different methods or assays of biomarker detection, quantitation, and analysis, which can include using different reagents, will produce different results which may require modification of the regression model. In particular, different assays can produce results expressed, for example, in different units. Further, duplicate reactions in duplicate assays of the same samples can also produce different raw results. However, it is the combined detection, quantitation, and analysis of at least the three biomarkers TIMP1, LRG1, and CA19-9 that, when incorporated into a regression model as disclosed herein, produce a definitive diagnosis of PDAC.
A range in the results reported for each particular assay used to detect, quantify, and analyze the three biomarkers will have a range in the resulting PDAC-predictive score that depends, in part, on the degree of sensitivity or specificity (Table 12; where the preferred cutoff based on the Youden Index is 0.8805 with a specificity of 0.95 and sensitivity of 0.8493). The regression model used to generate the PDAC-predictive score can depend on the specific assays utilized to test the markers. As understood by those of skill in the art, different assays can target different epitopes of the three biomarkers or have different affinities and sensitivities. As such, the regression model algorithm used to generate the PDAC-predictive score can be modified to take these assay variations into consideration.
In one example, a patient being screened for PDAC-based on the three-biomarker panel disclosed herein—has a blood sample drawn (or other fluid or tissue biopsy) and assayed by ELISA (or other assay) to quantitate the levels of TIMP1, LRG1, and CA19-9 in the patient. Normalized values for at least these biomarkers that take into account the specific assay used (e.g., ELISA; Table 3) could be, for example, TIMP1=0.6528 ng/mL; LRG1=2.0498 ng/mL; and CA19−9=1.8160 U/mL. Raw assay data are then log 2-transformed, computing the mean and standard deviation for the healthy samples in each cohort. The data is then standardized so that healthy samples have a mean of 0 and a standard deviation of 1: where (Readj−meanhealthy)/(stdhealthy), where j is the jth sample.
When analyzed using the following regression model:
log it(p)=−1.97+1.7005×log TIMP1+0.93856×log LRG1+0.60639×log CA19.9
the above patient would have a combined score of 2.1653. In view of the preferred cutoff for consideration of both specificity and sensitivity (Table 12), a patient with such a combined score would have PDAC with near certainty and consequently be directed for follow-up testing and treatment for PDAC using other modalities discussed herein and known to those of skill in the art. Using the regression model described herein, the more positive the combined PDAC-predictive score, the more certainty the patient has PDAC. Conversely, the more negative the combined PDAC-predictive score, the more certainty the patient does not have PDAC.
By contrast, in another example, normalized values for biomarkers TIMP1, LRG1, and CA19-9 that take into account the specific assay used could be, for example, TIMP1=−2.0370 ng/mL; LRG1=−1.5792 ng/mL; and CA19−9=1.0712 U/mL. When analyzed using the same regression model as above, such a patient would have a combined score of −6.2666. In view of the preferred cutoff for consideration of both specificity and sensitivity (Table 12), a patient with such a combined score would, with near certainty, not have PDAC and, therefore, would or would not need to be followed for additional testing based on the strength of any other clinical conditions.
†Covariates: recruiting center, gender, age (continuous), smoking status (current, ex-, never smoker), and alcohol drinking (current, ex-, never drinker). One healthy control subject with missing alcohol consumption information was not included in this analysis.
‡p versus CA19-9 + Covariates
§p versus Panel
Using an untargeted metabolomics approach, a plasma-derived metabolite biomarker panel was developed for resectable pancreatic ductal adenocarcinoma (PDAC). A multi-assay metabolomics approach using liquid chromatography/mass spectrometry was applied on plasmas collected from 20 (10 early and 10 late stage) PDAC cases and 20 matched controls (10 healthy subjects; 10 subjects with chronic pancreatitis) to identify candidate metabolite markers for PDAC; candidate markers were narrowed based on a second ‘confirmatory’ cohort consisting of 9 PDACs and 50 subjects with benign pancreatic disease (BPD). Blinded validation was performed in an independent cohort consisting of 39 resectable PDAC cases and 82 matched controls. Five metabolites, including acetylspermidine, diacetylspermine, lysophosphatidylcholine (18:0), lysophosphatidylcholine (20:3) and an indole-derivative, were identified in discovery and ‘confirmatory’ cohorts as candidate biomarkers markers for PDAC. A metabolite panel was developed based on logistic regression models and evaluated for its ability to distinguish PDAC from healthy controls in the combined discovery and ‘confirmatory’ cohort. The resulting panel yielded an area under the curve (AUC) of 0.90 (95% C.I.: 0.818-0.989). Blinded validation of the metabolite panel yielded an AUC of 0.89 (95% C.I.: 0.828-0.956) in the independent validation cohort. Importantly evaluation of the metabolite markers in combination with our previously identified protein markers (CA19-9, TIMP1 and LRG1) yielded an AUC of 0.92 in the validation cohort, which was statistically significantly greater than the protein panel alone (AUC=0.86; p-value: 0.024), highlighting the complementary nature of the metabolite panel when combined with a three-protein marker panel.
Pancreatic ductal adenocarcinoma (PDAC) is the third leading cause of cancer-related mortality in both men and women in the United States, with an overall 5-year survival rate of only ˜8%. Unfortunately, diagnosis of PDAC at an early stage is uncommon and usually incidental in the majority of patients (˜85%) presenting with locally advanced or metastatic disease.
Currently, no clinical marker(s) exist that display desired performance characteristics for early stage PDAC in asymptomatic individuals. The current use of CA19-9 as a screen biomarker is limited by its variable accuracy, reduced performance in pre-diagnostic stages of the disease, and its inability to be detected in ˜10% of subjects with fucosyltransferase deficiency. Consequently, there is a critical need for additional markers that display collectively higher sensitivity and specificity for reliable detection of low volume PDAC in asymptomatic individuals. In this context, blood-based biomarker(s) are ideal and represent a relatively non-invasive and cost-effective method for detecting disease at early stages.
Recently, development and sequential validation of a protein-based biomarker panel for detecting early-stage PDAC, capable of complementing CA19-9 was performed. Although classification performance improved relative to CA19-9 alone, room for improvement remained. Thus, there is a need to test the relative contribution of different types of biomarkers, such as metabolites, to enable the development of an optimal biomarker combination model for this challenging application.
In the present study, an untargeted metabolomics approach was applied to develop a plasma-derived metabolite biomarker panel for PDAC. The fixed biomarker panel was subsequently blindly validated in an independent test cohort consisting of 39 resectable PDAC cases and 82 matched healthy controls in addition to being compared against a previously identified protein panel. The performance of the metabolite panel was additionally tested to distinguish PDAC cases from subjects diagnosed with benign pancreatic cysts.
All human blood samples were obtained following Institutional Review Board approval and informed consent. For initial metabolite discovery studies, plasma samples from 20 patients with PDAC, including 10 early-stage and 10 late stage PDAC, 10 healthy controls, and 10 patients with chronic pancreatitis were obtained from the Evanston Hospital (discovery set). All chronic pancreatitis samples were collected in an elective setting in the clinic in the absence of an acute flare-up. Plasma samples obtained from the Indiana University School of Medicine, consisting of 50 patients with low dysplastic grade pancreatic cyst and 9 patients with invasive IPMN (5 early-stage and 4 late-stage adenocarcinoma) were used for biomarker sequential selection and initial validation (confirmation set). All patients underwent surgical resection of their cystic lesion, and plasma samples were collected prior to surgery. Dysplastic grade was histopathology confirmed after surgical resection and determined according to WHO criteria. An additional independent plasma sample set for testing the combined biomarker panel was obtained from the International Agency for Research on Cancer, consisting of 39 early-stage PDAC and 82 healthy controls (Test Set #1). A second sample set from the Indiana University School of Medicine, consisting of 102 patients with low dysplastic grade pancreatic cyst, 12 patients with resectable invasive IPMN, and resectable 8 PDAC patients with IPMN was applied as a Test Set #2. Study flow diagram and clinical characteristics of the patients in the validation sets and test set are presented in
PDAC cell lines (CFPAC, MiaPaCa, SU8686, BxPC3, CAPAN2, PANC03.27 and SW1990) were grown in RPMI-1640 with 10% FBS. The identity of each cell line was confirmed by DNA fingerprinting via short tandem repeats at the time of mRNA and total protein lysate preparation using the PowerPlex 1.2 kit (Promega). Fingerprinting results were compared with reference fingerprints maintained by the primary source of the cell line. Cells were seeded in 6-cm dishes (Thermo Scientific) to reach a 70% (50-80%) confluency, 24 hours post initial seeding. Post 24 hours, cell lysates were washed 2× with pre-chilled 0.9% NaCl followed by addition of 1 mL of pre-chilled extraction buffer (3:1 isopropanol:ultrapure water) to quench and remove cell media. Cells were then scraped using a 25-cm Cell Scraper (Sarstedt) in extraction solvent and transferred to a 1.5-mL Eppendorf tube. After vortexing briefly, the extracted cell lysates were centrifuged at 4° C. for 10 min at 2,000×g. Thereafter, 1 mL of the supernatant containing the extracted metabolites were transferred to 1.5-mL Eppendorf tubes and stored in −20° C. until needed for metabolomic analysis.
Cells were grown in 1 ml of RPMI 1640+10% FBS in 12-well dishes (Costar) to reach a 70% (50-80%) confluency, 24 hours post initial seeding. On the day of the experiment, the cells were washed 2 times with 500 μL serum-free RPMI (Fisher Scientific) containing 5 mM glucose and 0.5 mM glutamine. Serum-free RPMI (300 μL) containing 5 mM glucose and 0.5 mM glutamine was then added to each well and the cells were incubated. After a predetermined incubation time (1, 2, 4, and 6 hours) 250 μL of the conditioned media was collected. For baseline (T0), 250 μL of media was collected directly after the addition of 300 μL. All time points were performed in triplicates or quadruplicates. Blank samples containing media only were included and collected at T0 and T6. The 6-hour samples were used to count cell numbers for data normalization. Once all the media samples were collected, the tubes were centrifuged at 2000×g for 10 min to remove residual debris and the supernatants transferred to 1.5-mL Eppendorf tubes and stored at −80° C. until used for metabolomics analysis.
Plasma metabolites were extracted from pre-aliquoted EDTA plasma (10 μL) with 30 μL of LCMS grade methanol (ThermoFisher) in a 96-well microplate (Eppendorf). Plates were heat sealed, vortexed for 5 min at 750 rpm, and centrifuged at 2000×g for 10 minutes at room temperature. The supernatant (10 μL) was carefully transferred to a 96-well plate, leaving behind the precipitated protein. The supernatant was further diluted with 10 μL of 100 mM ammonium formate, pH 3. For Hydrophilic Interaction Liquid Chromatography (HILIC) analysis, the samples were diluted with 60 μL LCMS grade acetonitrile (ThermoFisher), whereas samples for C18 analysis were diluted with 60 μL water (GenPure ultrapure water system, ThermoFisher). Each sample solution was transferred to 384-well microplate (Eppendorf) for LCMS analysis.
For cell lysates, 100 μL (3:1 isopropanol:ultrapure water) was aliquoted into two 300 μL, 96-well plates (Eppendorf) and evaporated to dryness under vacuum. The samples were then reconstituted as follows: for the HILIC assays, the dried samples were dissolved in 65 μL of ACN (Fisher Scientific): 100 mM Ammonium Formate pH 3 (9:1), whereas for the reverse phase C18 assays, the dried samples were dissolved in 65 μL of H2O: 100 mM Ammonium Formate pH 3 (9:1). The samples were then spun down to remove any insoluble materials and transferred to a 384-well plate for high throughput analysis using LCMS.
Frozen media samples were thawed on ice and 30 al transferred to a 96-well microplate (Eppendorf) containing 30 μL of 100 mM ammonium formate, pH 3.0. The microplates were heat sealed, vortexed for 5 min at 750 rpm, and centrifuged at 2000×g for 10 minutes at room temperature. For Hydrophilic Interaction Liquid Chromatography (HILIC) analysis, 25 μL of sample was transferred to a new 96-well microplate containing 75 μL acetonitrile, whereas samples for C18 analysis were transferred to a new 96-well microplate containing 75 μL water (GenPure ultrapure water system, ThermoFisher). Each sample solution was transferred to 384-well microplate (Eppendorf) for LCMS analysis.
For each batch, samples were randomized and matrix-matched reference quality controls and batch-specific pooled quality controls were included.
Pre-aliquoted EDTA plasma samples (10 μL) were extracted with 30 μL of LCMS grade 2-propanol (ThermoFisher) in a 96-well microplate (Eppendorf). Plates were heat-sealed, vortexed for 5 min at 750 rpm, and centrifuged at 2000×g for 10 minutes at room temperature. The supernatant (10 μL) was carefully transferred to a 96-well plate, leaving behind the precipitated protein. The supernatant was further diluted with 90 μL of 1:3:2 100 mM ammonium formate, pH 3 (Fischer Scientific): acetonitrile: 2-propanol and transferred to a 384-well microplate (Eppendorf) for lipids analysis using LCMS.
For cell lysates, in a 300 μL, 96-well plate, 10 μL supernatant (3:1 isopropanol:ultrapure water) of the extracted cell lysate metabolites was diluted with 90 μL of 1:3:2 100 mM ammonium formate, pH 3: acetonitrile: 2-propanol (Fisher Scientific) and transferred to a 384-well microplate (Eppendorf) for analysis using LCMS.
For each batch, samples were randomized and matrix-matched reference quality controls and batch-specific pooled quality controls were included.
Untargeted metabolomics analysis was conducted on Waters Acquity™ UPLC system with 2D column regeneration configuration (I-class and H-class) coupled to a Xevo G2-XS quadrupole time-of-flight (qTOF) mass spectrometer. Chromatographic separation was performed using HILIC (Acquity™ UPLC BEH amide, 100 Å, 1.7 μm 2.1×100 mm, Waters Corporation, Milford, U.S.A) and C18 (Acquity™ UPLC HSS T3, 100 Å, 1.8 μm, 2.1×100 mm, Water Corporation, Milford, U.S.A) columns at 45° C.
Quaternary solvent system mobile phases were (A) 0.1% formic acid in water, (B) 0.1% formic acid in acetonitrile and (D) 100 mM ammonium formate, pH 3. Samples were separated using the following gradient profile: for the HILIC separation a starting gradient of 95% B and 5% D was increase linearly to 70% A, 25% B and 5% D over a 5-min period at 0.4 mL/min flow rate, followed by 1 min isocratic gradient at 100% A at 0.4 mL/min flow rate. For C18 separation, a chromatography gradient was as follows: starting conditions, 100% A, with a linear increase to final conditions of 5% A, 95% B, followed by isocratic gradient at 95% B, 5% D for 1 min.
A binary pump was used for column regeneration and equilibration. The solvent system mobile phases were (A1) 100 mM ammonium formate, pH 3, (A2) 0.1% formic in 2-propanol and (B1) 0.1% formic acid in acetonitrile. The HILIC column was stripped using 90% A2 for 5 min followed by 2 min equilibration using 100% B1 at 0.3 mL/min flowrate. Reverse phase C18 column regeneration was performed using 95% A1, 5% B1 for 2 min followed by column equilibration using 5% A1, 95% B1 for 5 min.
For the lipidomic assay, untargeted metabolomics analysis was conducted on a Waters Acquity™ UPLC system with 2D column regeneration configuration (I-class and H-class) coupled to a Xevo G2-XS quadrupole time-of-flight (qTOF) mass spectrometer. Chromatographic separation was performed using a C18 (Acquity™ UPLC HSS T3, 100 Å, 1.8 am, 2.1×100 mm, Water Corporation, Milford, U.S.A) column at 55° C. The mobile phases were (A) water, (B) Acetonitrile, (C) 2-propanol and (D) 500 mM ammonium formate, pH 3. A starting elution gradient of 20% A, 30% B, 49% C, and 1% D was increased linearly to 10% B, 89% C and 1% D for 5.5 min, followed by isocratic elution at 10% B, 89% C and 1% D for 1.5 min and column equilibration with initial conditions for 1 min.
Mass spectrometry data was acquired in sensitivity, positive and negative electrospray ionization mode within 50-1200 Da range for primary metabolites and 100-2000 Da for complex lipids. For the electrospray acquisition, the capillary voltage was set at 1.5 kV (positive), 3.0 kV (negative), sample cone voltage of 30 V, source temperature of 120° C., cone gas flow of 50 L/h, and desolvation gas flow rate of 800 L/h with scan time of 0.5 sec in continuum mode. Leucine Enkephalin; 556.2771 Da (positive) and 554.2615 Da (negative) for lockspray correction and scans were performed at 0.5 min. The injection volume for each sample was 3 μL, unless otherwise specified. The acquisition was carried out with instrument auto gain control to optimize instrument sensitivity over the sample acquisition time.
Pooled quality control samples were analyzed after a defined number of samples to assess replicate precision and allow LOESS correction by injection order. Additional data was captured using the MSe function for pooled quality control samples.
Peak picking and retention time alignment of LC-MS and MSe data were performed using Progenesis QI (Nonlinear, Waters). Data processing and peak annotations were performed using an in-house automated pipeline. Annotations were determined by matching accurate mass and retention times using customized libraries created from authentic standards and/or by matching experimental tandem mass spectrometry data against the NIST MSMS, LipidBlast or HMDB v3 theoretical fragmentations. To correct for injection order drift, each feature was normalized using data from repeat injections of quality control samples collected every 10 injections throughout the run sequence. Measurement data were smoothed by Locally Weighted Scatterplot Smoothing (LOESS) signal correction (QC-RLSC) as previously described (1). Only detected features exhibiting a relative standard deviation (RSD) less than 30 in quality control samples were considered for further statistical analysis. To reduce data matrix complexity, annotated features with multiple adducts or acquisition mode repeats were collapsed to one representative unique feature. Features were selected based on replicate precision (RSD<30), intensity and best isotope similarity matching to theoretical isotope distributions. Values are reported as ratios relative to the median of historical quality control reference samples ran with every analytical batch for the given analyte.
Plasma protein concentrations for CA19-9, LRG1, and TIMP1 were determined as previously described (Capello et al., 2017). For all ELISA experiments, each sample was assayed in duplicate and the absorbance or chemiluminescence measured with a SpectraMax M5 microplate reader (Molecular Devices, Sunnyvale, Calif.). An internal control sample was run in every plate and each value of the samples was divided by the mean value of the internal control in the same plate to correct for interpolate variability.
Gene expression for the Badea dataset was downloaded from oncomine database. Networks were visualized using cytoscape.
Receiver operating characteristic (ROC) curve analysis was performed to assess the performance of biomarkers in distinguishing PDAC cases from healthy controls and subjects diagnosed with benign pancreatic disease (chronic pancreatitis or pancreatic cysts).
The AUC that corresponds to the individual performance of all biomarkers is estimated using the area under the empirical estimator of the receiver operating characteristic curve (ROC). The standard error (S.E.) and the corresponding 95% confidence intervals presented for the individual performance of each biomarker were based on the bootstrap procedure in which re-sampling was performed with replacement separately for the controls and the diseased 1000 bootstrap samples. It was noted that for markers LPC (18:0), LPC (20:3), and indole-3-lactate, the inverse directionality was taken into account, since these markers tend to exhibit higher measurements for the controls compared to the ones that correspond to the cancer related samples. The model building was based on a logistic regression model using the log it link function. The estimated AUC of the proposed metabolite panel (0.9034) was derived by using the empirical estimator of the linear combination that corresponds to the model. The 95% confidence interval reported for the metabolite panel based AUC (0.8180-0.9889) takes into account the fact that the coefficients of the underlying logistic regression model were estimated, and hence exhibit variability, by using the bootstrap with 1000 iterations, for which in every bootstrap iteration the coefficients of the model are re-estimated in order to provide proper inference. The hyper-panel, i.e. the panel that refers to the combination of the two underlying panels—one for the proteins and one for the metabolites—has been developed using those two panels as two composite markers, considering their respective coefficients fixed (one composite marker for the proteins and one for the metabolites). The hyper-panel was developed by combining those two underlying composite markers using a logistic regression model in which we considered the log it link function.
Untargeted metabolomics analysis was conducted on a discovery cohort (Set #1) consisting of 20 PDAC cases (10 early and 10 late stage) and 20 matched controls (10 healthy subjects and 10 subjects with chronic pancreatitis (CP) (
1fold change depicting PDAC (n = 20) relative to controls (10 healthy subjects; 10 subjects with chronic pancreatitis)
2fold change depicting PDAC (n = 9) relative to BPD (n = 50)
3fold change depicting PDAC (n = 29) relative to Healthy subjects (n = 10)
#AUCs <0.5 are flipped
&one-tailed p-values, specify test
Next, a biomarker panel for PDAC was developed based on a logistic regression model. PDAC cases (n=29) from Set #1 and #2 were combined and evaluated against healthy subjects (n=10) from Set #1 (
Blinded validation of the 5 metabolites individually and as a panel was performed in an independent set of plasma samples consisting of 39 resectable PDAC cases and 82 matched healthy controls (Test Set #1). All 5 biomarkers were significantly different (one-tailed p<0.001) in PDAC cases as compared to healthy controls with individual AUCs ranging from 0.73 to 0.84 Table 19). All 5 metabolites indicated the same direction of change (increased/decreased) as observed in the initial cohorts. The logistic regression model for the five-metabolite panel yielded an AUC of 0.89 (95% C.I.=0.828-0.956); exhibiting 67% sensitivity at 95% specificity (
&one-tailed p-values for corresponding AUCs
The ability of the individual metabolites and panel to distinguish PDAC from BPD (low grade cysts) was tested in a second cohort (Test Set #2) consisting of 20 resectable PDAC and 102 subjects diagnosed with BPD derived from the same study as the confirmatory set (Set #2) but analyzed separately. Individual classification performances ranged from 0.60-0.73 (Table 2). The fixed logistic regression model for the five-metabolite panel yielded an AUC of 0.70 (95% C.I.=0.573-0.833); exhibiting 15% sensitivity at 95% specificity (
Previously, a protein-derived biomarker panel for early-stage PDAC was developed, which was validated in the same independent cohort (Test Set #1) described herein. It was therefore interrogated whether a hyper-panel consisting of the metabolite- and protein-panel would improve classification performance as compared to the protein-panel alone. The AUC of hyper-panel in the training set (29 PDAC versus 10 healthy controls) yielded an AUC of 0.97 with 95% CI (0.9278-1.000). The sensitivity of the metabolite panel alone for FPR values of 1% is estimated to be 0.6897. This estimate is improved statistically significantly to 0.8621 when considering the hyper-panel in the training set (corresponding one-tailed p-value=0.0390). Comparison of the protein panel (AUC=0.95) and the hyper-panel (AUC=0.97) in terms of AUCs in the training set, yielding a p-value of 0.1074 (
To determine whether elevations in plasma AcSperm and DAS were associated with disease status, cell lysates and serum-free conditioned media from 5 PDAC cell lines (CFPAC-1, MiaPaCa, SU8686, PANC03-27 and SW1990) were analyzed. Metabolomic analysis of cell lysates revealed detectable levels of AcSperm and DAS in all 5 cell lines. Analysis of conditioned media indicated positive rates of AcSperm accumulation in all 5 cell lines whereas positive rates of DAS accumulation were observed in 3 of the 5 cell lines (
To determine whether PDAC cells catabolize/scavenge extracellular lipids, the lipid composition of serum-containing media from PANC1 and Su8686 cells was examined at 24, 48, and 72 hours post conditioning. The analysis indicated time-dependent reductions in several lysophospholipids (
The primary objective of this study was to identify and validate a plasma metabolite-derived biomarker panel for resectable PDAC. Using an untargeted metabolomics approach, a 5-marker metabolite biomarker panel was identified and validated that is capable of distinguishing resectable PDAC cases from healthy individuals yielding an AUC of 0.89 in the validation cohort (Test Set #1). It was equally demonstrated that a hyper-panel consisting of the metabolite- and previously identified protein-panel significantly improves classification performances compared to the protein-panel alone (AUC: 0.92 vs 0.86; p: 0.024; Test Set #1) highlighting the complementary nature of the metabolite panel.
Given the low prevalence of PDAC, the multi-marker signature would be best suited for screening programs targeting high-risk subjects rather than the average risk population. These include individuals over age 50 years with new-onset diabetes mellitus, asymptomatic kindred of high-risk families, subjects with chronic pancreatitis, and patients incidentally diagnosed with mucin-secreting cysts of the pancreas. The metabolite-biomarker panel was able to significantly differentiate PDAC from low-grade pancreatic cyst in two separate sample sets, yielding AUC equal to 0.69 and 0.70 in the confirmation set and in test set #2, respectively.
Notably, no differences in plasma branched-chain amino acids (BCAA) were observed between cases and respective controls in contrast to previous findings. However, it should be noted that the predictive value of BCAAs were most prominent 2-5 years prior to diagnosis with levels returning towards baseline 0-2 years prior to diagnosis, consistent with observations of no differences in plasma BCAAs in samples taken at the time of diagnosis.
Altered polyamine metabolism has long been linked to tumorigenesis and hyper-proliferative disorders, being intimately involved in cell cycle progression. Polyamine synthesis is regulated by the rate-limiting enzymes ODC1 and AMD1 whereas their catabolism is regulated by SAT1. Previous findings indicated increased abundances of putrescine and AcSperm in pancreatic carcinomas as compared to histologically unaffected pancreas. Conversely, it was previously found that many polyamines including AcSperm were elevated in serum of cases as compared to healthy controls. These findings are in concordance with elevated mRNA expression of SAT1 in PDAC relative to adjacent control tissue in the Badea dataset and detection of AcSperm and DAS in cell lysates and their concurrent accumulation in conditioned media (
Previous studies indicated that plasma LPCs are significantly lower in PDAC relative to healthy controls or subjects with chronic pancreatitis, consistent with the findings of this study. The cell line data indicated that PDAC cells catabolize lysophospholipids, a notion that is supported by gene expression data in the Badea dataset (
In conclusion, a metabolite-derived biomarker panel for early-stage PDAC was developed and validated that complements the previously identified protein-based biomarker panel.
The detailed description set-forth above is provided to aid those skilled in the art in practicing the present disclosure. However, the disclosure described and claimed herein is not to be limited in scope by the specific embodiments disclosed herein because these embodiments are intended as illustration of several aspects of the disclosure. Any equivalent embodiments are intended to be within the scope of this disclosure. Indeed, various modifications of the disclosure in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description, which do not depart from the spirit or scope of the present inventive discovery. Such modifications are also intended to fall within the scope of the appended claims.
This application claims the benefit of U.S. Provisional Application No. 62/435,024, and U.S. Provisional Application No. 62/435,020, both of which were filed Dec. 15, 2016, the disclosure of which is hereby incorporated by reference in its entirety.
This invention was made with government support under grant number CA124550, awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2017/066851 | 12/15/2017 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62435020 | Dec 2016 | US | |
62435024 | Dec 2016 | US |