The contents of the text file named “IDIA-014_001US Sequence Listing.txt”, which was created on Mar. 1, 2017 and is 893 KB in size, are hereby incorporated by reference in their entireties.
Lung conditions and particularly lung cancer present significant diagnostic challenges. In many asymptomatic patients, radiological screens such as computed tomography (CT) scanning are a first step in the diagnostic paradigm. Pulmonary nodules (PNs) or indeterminate nodules are located in the lung and are often discovered during screening of both high risk patients or incidentally. The number of PNs identified is expected to rise due to increased numbers of patients with access to health care, the rapid adoption of screening techniques and an aging population. It is estimated that over 3 million PNs are identified annually in the US. Although the majority of PNs are benign, some are malignant leading to additional interventions. For patients considered low risk for malignant nodules, current medical practice dictates scans every three to six months for at least two years to monitor for lung cancer. The time period between identification of a PN and diagnosis is a time of medical surveillance or “watchful waiting” and may induce stress on the patient and lead to significant risk and expense due to repeated imaging studies. If a biopsy is performed on a patient who is found to have a benign nodule, the costs and potential for harm to the patient increase unnecessarily. Major surgery is indicated in order to excise a specimen for tissue biopsy and diagnosis. All of these procedures are associated with risk to the patient including: illness, injury and death as well as high economic costs.
Frequently, PNs cannot be biopsied to determine if they are benign or malignant due to their size and/or location in the lung. Accordingly, there exists a need for non-invasive diagnostic assays to determine whether a PN is malignant or benign.
Diagnostic methods that can replace or complement current diagnostic methods for patients presenting with PNs are needed to improve diagnostics, reduce costs and minimize invasive procedures and complications to patients. The present invention provides novel compositions, methods and kits for identifying protein markers to identify, diagnose, classify and monitor lung conditions, and particularly lung cancer. The present invention uses a blood-based multiplexed assay to distinguish benign pulmonary nodules from malignant pulmonary nodules to classify patients with or without lung cancer. The present invention may be used in patients who present with symptoms of lung cancer, but do not have pulmonary nodules.
The disclosure provides a method of identifying a status of a pulmonary nodule comprising, (a) performing an analysis to predict that the pulmonary nodule is not malignant, comprising, (1) assessing the expression of a plurality of proteins comprising determining the protein level of at least each of ALDOA, FRIL, LG3BP, TSP1, and COIA1, and, (2) calculating a first score based on the protein measurements of step (1); (b) classifying the risk that the pulmonary nodule of (a) is benign as (1) statistically significant if the score in step (a)(2) is greater than a first threshold score; or (2) not statistically significant if the score in step (a)(2) is lesser than the first threshold score; (c) performing an analysis on the pulmonary nodule of (b)(2), comprising, (1) assessing the expression of a plurality of proteins comprising determining the protein level of at least each of ALDOA, TSP1, FRIL, KIT, and GGH, and (2) calculating a second score based on the protein measurements of step (1); (d) classifying the risk that the pulmonary nodule of (c) is malignant as (1) statistically significant if the score in step (c)(2) is greater than a second (2) not statistically significant if the score in step (c)(2) is less than the second threshold score; thereby identifying the status of the pulmonary nodule as benign or malignant.
In one embodiment, the pulmonary nodule has a diameter of less than or equal to 3 cm. In another embodiment, the pulmonary nodule has a diameter of about 0.8 cm to 2.0 cm, inclusive of endpoints.
In one aspect, the analysis of (a) or (b) above is performed on a biological sample selected from the group consisting of tissue, lymph tissue, lymph fluid, blood, plasma, serum, whole blood, urine, saliva, and excreta.
In one embodiment, the biological sample is obtained from a subject. In one aspect, the subject is at risk of a lung condition. In one aspect, the lung condition is cancer. In one aspect the lung condition is non-small cell lung cancer (NSCLC). In one embodiment, lung condition is chronic obstructive pulmonary disease, hamartoma, fibroma, neurofibroma, granuloma, sarcoidosis, bacterial infection or fungal infection.
In another embodiment, the assessing steps of (a)(1) and/or (c)(1) are performed by liquid chromatography-selected reaction monitoring mass spectrometry (LC-SRM-MS). In one embodiment, the analysis of (a)(2) further comprises determining an interaction between FRIL and COIA1. In another embodiment, the analysis of (c)(2) further comprises determining an interaction between ALDOA and KIT.
In one embodiment, the analysis of (a)(1) comprises generating a plurality of transition ion pairs from the plurality of proteins of (a)(1) and measuring an abundance of at least one transition ion pair, wherein each transition ion measuring an abundance of at least one transition ion pair, wherein each transition ion pair consists of a precursor ion m/z and a fragment ion m/z, and wherein said plurality of transition ion pairs comprise at least 3 transitions selected from the group consisting of ALQASALK (SEQ ID NO: 65) transition pair 401.25-617.40, LGGPEAGLGEYLFER (SEQ ID NO: 66) transition pair 804.40-913.40, VEIFYR (SEQ ID NO: 67) transition pair 413.73-598.30, GFLLLASLR (SEQ ID NO: 68) transition pair 495.31-559.40, and AVGLAGTFR (SEQ ID NO: 69) transition pair (446.26-721.40).
In another embodiment, the analysis of (c)(1) comprises generating a plurality of transition ion pairs from the plurality of proteins of (c)(1) and measuring an abundance of at least one transition ion pair, wherein each transition ion pair consists of a precursor ion m/z and a fragment ion m/z, and wherein said plurality of transition ion pairs comprise at least 3 transitions selected from the group consisting of ALQASALK (SEQ ID NO: 65) transition pair 401.25-617.40, GFLLLASLR (SEQ ID NO: 68) transition pair 495.31-559.40, LGGPEAGLGEYLFER (SEQ ID NO: 66) transition pair 804.40-1083.60, and YVSELHLTR (SEQ ID NO: 70) transition pair.
In one aspect, the generating a plurality of transition ion pairs from the plurality of proteins of (a)(1) comprises fragmenting each protein into at least one peptide. In another aspect, the fragmenting comprises contacting each protein with a trypsin composition. In one embodiment, the assessing step of (a)(1) are performed by liquid chromatography-selected reaction monitoring mass spectrometry (LC-SRM-MS).
In one embodiment, the protein expression assessment of (a)(1) or (c)(1) is normalized with respect to the protein expression one or more proteins selected from the group consisting of PEDF, MASP1, GELS, LUM, C163A and PTPRJ.
In one embodiment, the transition ion pair assessment of (a)(1) is normalized with respect to the abundance of one or more transition ion pairs selected from the group consisting of LQSLFDSPDFSK (SEQ ID NO: 71) transition pair 692.34-593.30, TGVITSPDFPNPYPK (SEQ ID NO: 72) transition pair 816.92-258.10, TASDFITK (SEQ ID NO: 73) transition pair 441.73-710.40, SLEDLQLTHNK (SEQ ID NO: 74) transition pair 433.23-499.30, INPASLDK (SEQ ID NO: 75) transition pair 429.24-630.30 and VITEPIPVSDLR (SEQ ID NO: 76) transition pair 669.89-896.50.
In another embodiment, the classifying the pulmonary nodule of (b) further comprises determining a sensitivity, a specificity, a negative predictive value or a positive predictive value of the first score.
In one embodiment, the pulmonary nodule is classified in (b) as benign and wherein the subject does not receive treatment. In one aspect, the treatment comprises a pulmonary function test (PFT), pulmonary imaging, a biopsy, a surgery, a chemotherapy, a radiotherapy, or any combination thereof. The pulmonary imaging is an x-ray, a chest computed tomography (CT) scan, or a positron emission tomography (PET) scan.
In one embodiment, the pulmonary nodule is benign and wherein the subject receives periodic monitoring for between 1 year and 3 years.
In one embodiment, the periodic monitoring comprises chest computed tomography.
In one embodiment, the pulmonary nodule is malignant and wherein the subject receives treatment according to the standard of care. The treatment comprises a pulmonary function test (PFT), pulmonary imaging, a biopsy, a surgery, a chemotherapy, a radiotherapy, or any combination thereof. The pulmonary imaging is an x-ray, a chest computed tomography (CT) scan, or a positron emission tomography (PET) scan.
In one embodiment, the generating a plurality of transition ion pairs from the plurality of proteins of (c)(1) comprises fragmenting each protein into at least one peptide. The fragmenting comprises contacting each protein with a trypsin composition.
In one embodiment, the assessing step of (c)(1) are performed by liquid chromatography-selected reaction monitoring mass spectrometry (LC-SRM-MS).
In one embodiment, the at least one peptide is labeled. In one embodiment, the label is an isotopic label.
The disclosed invention derives from the surprising discovery, that in patients presenting with pulmonary nodule(s), protein markers in the blood exist that specifically identify and classify lung cancer. Accordingly, the invention provides unique advantages to the patient associated with early detection of lung cancer in a patient, including increased life span, decreased morbidity and mortality, decreased exposure to radiation during screening and repeat screenings and a minimally invasive diagnostic model. Importantly, the methods of the invention allow for a patient to avoid invasive procedures.
The routine clinical use of chest computed tomography (CT) scans identifies millions of pulmonary nodules annually, of which only a small minority are malignant but contribute to the dismal 15% five-year survival rate for patients diagnosed with non-small cell lung cancer (NSCLC). The early diagnosis of lung cancer in patients with pulmonary nodules is a top priority, as decision-making based on clinical presentation, in conjunction with current non-invasive diagnostic options such as chest CT and positron emission tomography (PET) scans, and other invasive alternatives, has not altered the clinical outcomes of patients with Stage I NSCLC. The subgroup of pulmonary nodules between 8 mm and 20 mm in size is increasingly recognized as being “intermediate” relative to the lower rate of malignancies below 8 mm and the higher rate of malignancies above 20 mm. Invasive sampling of the lung nodule by biopsy using transthoracic needle aspiration or bronchoscopy may provide a cytopathologic diagnosis of NSCLC, but are also associated with both false-negative and non-diagnostic results. In summary, a key unmet clinical need for the management of pulmonary nodules is a non-invasive diagnostic test that discriminates between malignant and benign processes in patients with indeterminate pulmonary nodules (IPNs).
The clinical decision to be more or less aggressive in treatment is based on risk factors, primarily nodule size, smoking history and age in addition to imaging. As these are not conclusive, there is a great need for a molecular-based blood test that would be both non-invasive and provide complementary information to risk factors and imaging.
Accordingly, these and related embodiments will find uses in screening methods for lung conditions, and particularly lung cancer diagnostics. More importantly, the invention finds use in determining the clinical management of a patient. That is, the method of invention is useful in ruling in or ruling out a particular treatment protocol for an individual subject.
Cancer biology requires a molecular strategy to address the unmet medical need for an assessment of lung cancer risk. The field of diagnostic medicine has evolved with technology and assays that provide sensitive mechanisms for detection of changes in proteins. The methods described herein use a LC-SRM-MS technology for measuring the concentration of blood plasma proteins that are collectively changed in patients with a malignant PN. This protein signature is indicative of lung cancer. LC-SRM-MS is one method that provides for both quantification and identification of circulating proteins in plasma. Changes in protein expression levels, such as but not limited to signaling factors, growth factors, cleaved surface proteins and secreted proteins, can be detected using such a sensitive technology to assay cancer. Presented herein is a blood-based classification test to determine the likelihood that a patient presenting with a pulmonary nodule has a nodule that is benign or malignant. The present invention presents a classification algorithm that predicts the relative likelihood of the PN being benign or malignant.
More broadly, it is demonstrated that there are many variations on this invention that are also diagnostic tests for the likelihood that a PN is benign or malignant. These are variations on the panel of proteins, protein standards, measurement methodology and/or classification algorithm.
As disclosed herein, archival plasma samples from subjects presenting with PNs were analyzed for differential protein expression by mass spectrometry and the results were used to identify biomarker proteins and panels of biomarker proteins that are differentially expressed in conjunction with various lung conditions (cancer vs. non-cancer).
These assays resulted in the development of a rule-in classifier (referred to herein as “Reflex Lung”, and “Classifier 2”) that is able to determine the probability of a pulmonary nodule as being cancerous. In one aspect, the rule-in classifier is meant to be used with a previously developed rule-out classifier (Xpresys® Lung) described in U.S. Pat. No. 9,297,805, the contents of which are incorporated herein in its entirety. Xpresys® Lung CR (Cancer Risk) is an assay with the combined use of the rule-out classifier and the rule-in classifier.
In one embodiment, a preferred panel for ruling-in treatment for a subject is listed in Table 10 and Table 12. In various other embodiments, the panels according to the invention include measuring at least 2, 3, 4, 5, 6, 7, or more of the proteins listed on Table 2. In one embodiment, normalizing proteins listed in Table 10 are also measured.
The term “pulmonary nodules” (PNs) refers to lung lesions that can be visualized by radiographic techniques. A pulmonary nodule is any nodules less than or equal to three centimeters in diameter. In one example, a pulmonary nodule has a diameter of about 0.8 cm to 2 cm.
The term “masses” or “pulmonary masses” refers to lung nodules that are greater than three centimeters maximal diameter.
The term “blood biopsy” refers to a diagnostic study of the blood to determine whether a patient presenting with a nodule has a condition that may be classified as either benign or malignant.
The term “acceptance criteria” refers to the set of criteria to which an assay, test, diagnostic or product should conform to be considered acceptable for its intended use. As used herein, acceptance criteria are a list of tests, references to analytical procedures, and appropriate measures, which are defined for an assay or product that will be used in a diagnostic. For example, the acceptance criteria for the classifier refers to a set of predetermined ranges of coefficients.
The term “average maximal AUC” refers to the methodology of calculating performance. For the present invention, in the process of defining the set of proteins that should be in a panel by forward or backwards selection proteins are removed or added one at a time. A plot can be generated with performance (AUC or partial AUC score on the Y axis and proteins on the X axis) the point which maximizes performance indicates the number and set of proteins the gives the best result.
The term “partial AUC factor or pAUC factor” is greater than expected by random prediction. At sensitivity=0.90 the pAUC factor is the trapezoidal area under the ROC curve from 0.9 to 1.0 Specificity/(0.1*0.1/2).
The term “incremental information” refers to information that may be used with other diagnostic information to enhance diagnostic accuracy. Incremental information is independent of clinical factors such as including nodule size, age, or gender.
The term “score” or “scoring” refers to calculating a probability likelihood for a sample. For the present invention, values closer to 1.0 are used to represent the likelihood that a sample is cancer, values closer to 0.0 represent the likelihood that a sample is benign.
The term “robust” refers to a test or procedure that is not seriously disturbed by violations of the assumptions on which it is based. For the present invention, a robust test is a test wherein the proteins or transitions of the mass spectrometry chromatograms have been manually reviewed and are “generally” free of interfering signals.
The term “coefficients” refers to the weight assigned to each protein used to in the logistic regression equation to score a sample.
In certain embodiments of the invention, it is contemplated that in terms of the logistic regression model of MC CV, the model coefficient and the coefficient of variation (CV) of each protein's model coefficient may increase or decrease, dependent upon the method (or model) of measurement of the protein classifier. For each of the listed proteins in the panels, there is about, at least, at least about, or at most about a 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, or 10-, -fold or any range derivable therein for each of the coefficient and CV. Alternatively, it is contemplated that quantitative embodiments of the invention may be discussed in terms of as about, at least, at least about, or at most about 10, 20, 30, 40, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% or more, or any range derivable therein.
The term “best team players” refers to the proteins that rank the best in the random panel selection algorithm, i.e., perform well on panels. When combined into a classifier these proteins can segregate cancer from benign samples. “Best team player” proteins is synonymous with “cooperative proteins”. The term “cooperative proteins” refers proteins that appear more frequently on high performing panels of proteins than expected by chance. This gives rise to a protein's cooperative score which measures how (in)frequently it appears on high performing panels. For example, a protein with a cooperative score of 1.5 appears on high performing panels 1.5× more than would be expected by chance alone.
The term “classifying” as used herein with regard to a lung condition refers to the act of compiling and analyzing expression data for using statistical techniques to provide a classification to aid in diagnosis of a lung condition, particularly lung cancer.
The term “classifier” as used herein refers to an algorithm that discriminates between disease states with a predetermined level of statistical significance. A two-class classifier is an algorithm that uses data points from measurements from a sample and classifies the data into one of two groups. In certain embodiments, the data used in the classifier is the relative expression of proteins in a biological sample. Protein expression levels in a subject can be compared to levels in patients previously diagnosed as disease free or with a specified condition.
The “classifier” maximizes the probability of distinguishing a randomly selected cancer sample from a randomly selected benign sample, i.e., the AUC of ROC curve.
In addition to the classifier's constituent proteins with differential expression, it may also include proteins with minimal or no biologic variation to enable assessment of variability, or the lack thereof, within or between clinical specimens; these proteins may be termed endogenous proteins and serve as internal controls for the other classifier proteins.
The term “normalization” or “normalizer” as used herein refers to the expression of a differential value in terms of a standard value to adjust for effects which arise from technical variation due to sample handling, sample preparation and mass spectrometry measurement rather than biological variation of protein concentration in a sample. For example, when measuring the expression of a differentially expressed protein, the absolute value for the expression of the protein can be expressed in terms of an absolute value for the expression of a standard protein that is substantially constant in expression. This prevents the technical variation of sample preparation and mass spectrometry measurement from impeding the measurement of protein concentration levels in the sample.
The term “condition” as used herein refers generally to a disease, event, or change in health status.
The term “treatment protocol” as used herein including further diagnostic testing typically performed to determine whether a pulmonary nodule is benign or malignant. Treatment protocols include diagnostic tests typically used to diagnose pulmonary nodules or masses such as for example, CT scan, positron emission tomography (PET) scan, bronchoscopy or tissue biopsy. Treatment protocol as used herein is also meant to include therapeutic treatments typically used to treat malignant pulmonary nodules and/or lung cancer such as for example, chemotherapy, radiation or surgery.
The terms “diagnosis” and “diagnostics” also encompass the terms “prognosis” and “prognostics”, respectively, as well as the applications of such procedures over two or more time points to monitor the diagnosis and/or prognosis over time, and statistical modeling based thereupon. Furthermore the term diagnosis includes: a. prediction (determining if a patient will likely develop a hyperproliferative disease); b. prognosis (predicting whether a patient will likely have a better or worse outcome at a pre-selected time in the future); c. therapy selection; d. therapeutic drug monitoring; and e. relapse monitoring.
In some embodiments, for example, classification of a biological sample as being derived from a subject with a lung condition may refer to the results and related reports generated by a laboratory, while diagnosis may refer to the act of a medical professional in using the classification to identify or verify the lung condition.
The term “providing” as used herein with regard to a biological sample refers to directly or indirectly obtaining the biological sample from a subject. For example, “providing” may refer to the act of directly obtaining the biological sample from a subject (e.g., by a blood draw, tissue biopsy, lavage and the like). Likewise, “providing” may refer to the act of indirectly obtaining the biological sample. For example, providing may refer to the act of a laboratory receiving the sample from the party that directly obtained the sample, or to the act of obtaining the sample from an archive.
As used herein, “lung cancer” preferably refers to cancers of the lung, but may include any disease or other disorder of the respiratory system of a human or other mammal. Respiratory neoplastic disorders include, for example small cell carcinoma or small cell lung cancer (SCLC), non-small cell carcinoma or non-small cell lung cancer (NSCLC), squamous cell carcinoma, adenocarcinoma, broncho-alveolar carcinoma, mixed pulmonary carcinoma, malignant pleural mesothelioma, undifferentiated large cell carcinoma, giant cell carcinoma, synchronous tumors, large cell neuroendocrine carcinoma, adenosquamous carcinoma, undifferentiated carcinoma; and small cell carcinoma, including oat cell cancer, mixed small cell/large cell carcinoma, and combined small cell carcinoma; as well as adenoid cystic carcinoma, hamartomas, mucoepidermoid tumors, typical carcinoid lung tumors, atypical carcinoid lung tumors, peripheral carcinoid lung tumors, central carcinoid lung tumors, pleural mesotheliomas, and undifferentiated pulmonary carcinoma and cancers that originate outside the lungs such as secondary cancers that have metastasized to the lungs from other parts of the body. Lung cancers may be of any stage or grade. Preferably the term may be used to refer collectively to any dysplasia, hyperplasia, neoplasia, or metastasis in which the protein biomarkers expressed above normal levels as may be determined, for example, by comparison to adjacent healthy tissue.
Examples of non-cancerous lung condition include chronic obstructive pulmonary disease (COPD), benign tumors or masses of cells (e.g., hamartoma, fibroma, neurofibroma), granuloma, sarcoidosis, and infections caused by bacterial (e.g., tuberculosis) or fungal (e.g. histoplasmosis) pathogens. In certain embodiments, a lung condition may be associated with the appearance of radiographic PNs.
As used herein, “lung tissue”, and “lung cancer” refer to tissue or cancer, respectively, of the lungs themselves, as well as the tissue adjacent to and/or within the strata underlying the lungs and supporting structures such as the pleura, intercostal muscles, ribs, and other elements of the respiratory system. The respiratory system itself is taken in this context as representing nasal cavity, sinuses, pharynx, larynx, trachea, bronchi, lungs, lung lobes, aveoli, aveolar ducts, aveolar sacs, aveolar capillaries, bronchioles, respiratory bronchioles, visceral pleura, parietal pleura, pleural cavity, diaphragm, epiglottis, adenoids, tonsils, mouth and tongue, and the like. The tissue or cancer may be from a mammal and is preferably from a human, although monkeys, apes, cats, dogs, cows, horses and rabbits are within the scope of the present invention. The term “lung condition” as used herein refers to a disease, event, or change in health status relating to the lung, including for example lung cancer and various non-cancerous conditions.
“Accuracy” refers to the degree of conformity of a measured or calculated quantity (a test reported value) to its actual (or true) value. Clinical accuracy relates to the proportion of true outcomes (true positives (TP) or true negatives (TN) versus misclassified outcomes (false positives (FP) or false negatives (FN)), and may be stated as a sensitivity, specificity, positive predictive values (PPV) or negative predictive values (NPV), or as a likelihood, odds ratio, among other measures.
The term “biological sample” as used herein refers to any sample of biological origin potentially containing one or more biomarker proteins. Examples of biological samples include tissue, organs, or bodily fluids such as whole blood, plasma, serum, tissue, lavage or any other specimen used for detection of disease.
The term “subject” as used herein refers to a mammal, preferably a human.
The term “biomarker protein” as used herein refers to a polypeptide in a biological sample from a subject with a lung condition versus a biological sample from a control subject. A biomarker protein includes not only the polypeptide itself, but also minor variations thereof, including for example one or more amino acid substitutions or modifications such as glycosylation or phosphorylation.
The term “biomarker protein panel” as used herein refers to a plurality of biomarker proteins. In certain embodiments, the expression levels of the proteins in the panels can be correlated with the existence of a lung condition in a subject. In certain embodiments, biomarker protein panels comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90 or 100 proteins. In certain embodiments, the biomarker proteins panels comprise from 100-125 proteins, 125-150 proteins, 150-200 proteins or more.
“Treating” or “treatment” as used herein with regard to a condition may refer to preventing the condition, slowing the onset or rate of development of the condition, reducing the risk of developing the condition, preventing or delaying the development of symptoms associated with the condition, reducing or ending symptoms associated with the condition, generating a complete or partial regression of the condition, or some combination thereof.
The term “ruling out” as used herein is meant that the subject is selected not to receive a treatment protocol.
The term “ruling-in” as used herein is meant that the subject is selected to receive a treatment protocol.
Biomarker levels may change due to treatment of the disease. The changes in biomarker levels may be measured by the present invention. Changes in biomarker levels may be used to monitor the progression of disease or therapy.
“Altered”, “changed” or “significantly different” refer to a detectable change or difference from a reasonably comparable state, profile, measurement, or the like. One skilled in the art should be able to determine a reasonable measurable change. Such changes may be all or none. They may be incremental and need not be linear. They may be by orders of magnitude. A change may be an increase or decrease by 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, 100%, or more, or any value in between 0% and 100%. Alternatively the change may be 1-fold, 1.5-fold 2-fold, 3-fold, 4-fold, 5-fold or more, or any values in between 1-fold and five-fold. The change may be statistically significant with a p value of 0.1, 0.05, 0.001, or 0.0001.
Using the methods of the current invention, a clinical assessment of a patient is first performed. If there exists is a higher likelihood for cancer, the clinician may rule in the disease which will require the pursuit of diagnostic testing options yielding data which increase and/or substantiate the likelihood of the diagnosis. “Rule in” of a disease requires a test with a high specificity.
“FN” is false negative, which for a disease state test means classifying a disease subject incorrectly as non-disease or normal.
“FP” is false positive, which for a disease state test means classifying a normal subject incorrectly as having disease.
The term “rule in” refers to a diagnostic test with high specificity that coupled with a clinical assessment indicates a higher likelihood for cancer. If the clinical assessment is a lower likelihood for cancer, the clinician may adopt a stance to rule out the disease, which will require diagnostic tests which yield data that decrease the likelihood of the diagnosis. “Rule out” requires a test with a high sensitivity.
The term “rule out” refers to a diagnostic test with high sensitivity that coupled with a clinical assessment indicates a lower likelihood for cancer.
The term “sensitivity of a test” refers to the probability that a patient with the disease will have a positive test result. This is derived from the number of patients with the disease who have a positive test result (true positive) divided by the total number of patients with the disease, including those with true positive results and those patients with the disease who have a negative result, i.e. false negative.
The term “specificity of a test” refers to the probability that a patient without the disease will have a negative test result. This is derived from the number of patients without the disease who have a negative test result (true negative) divided by all patients without the disease, including those with a true negative result and those patients without the disease who have a positive test result, e.g. false positive. While the sensitivity, specificity, true or false positive rate, and true or false negative rate of a test provide an indication of a test's performance, e.g. relative to other tests, to make a clinical decision for an individual patient based on the test's result, the clinician requires performance parameters of the test with respect to a given population.
The term “positive predictive value” (PPV) refers to the probability that a positive result correctly identifies a patient who has the disease, which is the number of true positives divided by the sum of true positives and false positives.
The term “negative predictive value” or “NPV” is calculated by TN/(TN+FN) or the true negative fraction of all negative test results. It also is inherently impacted by the prevalence of the disease and pre-test probability of the population intended to be tested.
The term “disease prevalence” refers to the number of all new and old cases of a disease or occurrences of an event during a particular period. Prevalence is expressed as a ratio in which the number of events is the numerator and the population at risk is the denominator.
The term disease incidence refers to a measure of the risk of developing some new condition within a specified period of time; the number of new cases during some time period, it is better expressed as a proportion or a rate with a denominator.
Lung cancer risk according to the “National Lung Screening Trial” is classified by age and smoking history. High risk—age≧55 and ≧30 pack-years smoking history; Moderate risk—age≧50 and ≧20 pack-years smoking history; Low risk—<age 50 or <20 pack-years smoking history.
The term “negative predictive value” (NPV) refers to the probability that a negative test correctly identifies a patient without the disease, which is the number of true negatives divided by the sum of true negatives and false negatives. A positive result from a test with a sufficient PPV can be used to rule in the disease for a patient, while a negative result from a test with a sufficient NPV can be used to rule out the disease, if the disease prevalence for the given population, of which the patient can be considered a part, is known.
The clinician must decide on using a diagnostic test based on its intrinsic performance parameters, including sensitivity and specificity, and on its extrinsic performance parameters, such as positive predictive value and negative predictive value, which depend upon the disease's prevalence in a given population.
Additional parameters which may influence clinical assessment of disease likelihood include the prior frequency and closeness of a patient to a known agent, e.g. exposure risk, that directly or indirectly is associated with disease causation, e.g. second hand smoke, radiation, etc., and also the radiographic appearance or characterization of the pulmonary nodule exclusive of size. A nodule's description may include solid, semi-solid or ground glass which characterizes it based on the spectrum of relative gray scale density employed by the CT scan technology.
“Mass spectrometry” refers to a method comprising employing an ionization source to generate gas phase ions from an analyte presented on a sample presenting surface of a probe and detecting the gas phase ions with a mass spectrometer. In one embodiment, liquid chromatography selected reaction monitoring mass spectrometry (LC-SRM-MS) is used. In another embodiment, liquid chromatography, multiple reaction monitoring mass spectrometry (LC-MRM-MS) is used.
Bioinformatic and biostatistical analyses were used first to identify individual proteins with statistically significant differential expression, and then using these proteins to derive one or more combinations of proteins or panels of proteins, which collectively demonstrated superior discriminatory performance compared to any individual protein. Bioinformatic and biostatistical methods are used to derive coefficients (C) for each individual protein in the panel that reflects its relative expression level, i.e. increased or decreased, and its weight or importance with respect to the panel's net discriminatory ability, relative to the other proteins. The quantitative discriminatory ability of the panel can be expressed as a mathematical algorithm with a term for each of its constituent proteins being the product of its coefficient and the protein's plasma expression level (P) (as measured by LC-SRM-MS), e.g. C×P, with an algorithm consisting of n proteins described as: C1×P1+C2×P2+C3×P3+ . . . +Cn×Pn. An algorithm that discriminates between disease states with a predetermined level of statistical significance may be refers to a “disease classifier”. In addition to the classifier's constituent proteins with differential expression, it may also include proteins with minimal or no biologic variation to enable assessment of variability, or the lack thereof, within or between clinical specimens; these proteins may be termed typical native proteins and serve as internal controls for the other classifier proteins.
In certain embodiments, expression levels are measured by MS. MS analyzes the mass spectrum produced by an ion after its production by the vaporization of its parent protein and its separation from other ions based on its mass-to-charge ratio. The most common modes of acquiring MS data are 1) full scan acquisition resulting in the typical total ion current plot (TIC), 2) selected ion monitoring (SIM), and 3) selected reaction monitoring (SRM).
In certain embodiments of the methods provided herein, biomarker protein expression levels are measured by LC-SRM-MS. LC-SRM-MS is a highly selective method of tandem mass spectrometry which has the potential to effectively filter out all molecules and contaminants except the desired analyte(s). This is particularly beneficial if the analysis sample is a complex mixture which may comprise several isobaric species within a defined analytical window. LC-SRM-MS methods may utilize a triple quadrupole mass spectrometer which, as is known in the art, includes three quadrupole rod sets. A first stage of mass selection is performed in the first quadrupole rod set, and the selectively transmitted ions are fragmented in the second quadrupole rod set. The resultant transition (product) ions are conveyed to the third quadrupole rod set, which performs a second stage of mass selection. The product ions transmitted through the third quadrupole rod set are measured by a detector, which generates a signal representative of the numbers of selectively transmitted product ions. The RF and DC potentials applied to the first and third quadrupoles are tuned to select (respectively) precursor and product ions that have m/z values lying within narrow specified ranges. By specifying the appropriate transitions (m/z values of precursor and product ions), a peptide corresponding to a targeted protein may be measured with high degrees of sensitivity and selectivity. Signal-to-noise ratio is superior to conventional tandem mass spectrometry (MS/MS) experiments, which select one mass window in the first quadrupole and then measure all generated transitions in the ion detector.
The expression level of a biomarker protein can be measured using any suitable method known in the art, including but not limited to mass spectrometry (MS), reverse transcriptase-polymerase chain reaction (RT-PCR), microarray, serial analysis of gene expression (SAGE), gene expression analysis by massively parallel signature sequencing (MPSS), immunoassays (e.g., ELISA), immunohistochemistry (IHC), transcriptomics, and proteomics.
To evaluate the diagnostic performance of a particular set of peptide transitions, a ROC curve is generated for each significant transition.
An “ROC curve” as used herein refers to a plot of the true positive rate (sensitivity) against the false positive rate (specificity) for a binary classifier system as its discrimination threshold is varied. A ROC curve can be represented equivalently by plotting the fraction of true positives out of the positives (TPR=true positive rate) versus the fraction of false positives out of the negatives (FPR=false positive rate). Each point on the ROC curve represents a sensitivity/specificity pair corresponding to a particular decision threshold.
AUC represents the area under the ROC curve. The AUC is an overall indication of the diagnostic accuracy of 1) a biomarker or a panel of biomarkers and 2) a ROC curve. AUC is determined by the “trapezoidal rule.” For a given curve, the data points are connected by straight line segments, perpendiculars are erected from the abscissa to each data point, and the sum of the areas of the triangles and trapezoids so constructed is computed. In certain embodiments of the methods provided herein, a biomarker protein has an AUC in the range of about 0.75 to 1.0. In certain of these embodiments, the AUC is in the range of about 0.8 to 0.8, 0.9 to 0.95, or 0.95 to 1.0.
The methods provided herein are minimally invasive and pose little or no risk of adverse effects. As such, they may be used to diagnose, monitor and provide clinical management of subjects who do not exhibit any symptoms of a lung condition and subjects classified as low risk for developing a lung condition. For example, the methods disclosed herein may be used to diagnose lung cancer in a subject who does not present with a PN and/or has not presented with a PN in the past, but who nonetheless deemed at risk of developing a PN and/or a lung condition. Similarly, the methods disclosed herein may be used as a strictly precautionary measure to diagnose healthy subjects who are classified as low risk for developing a lung condition.
The present invention provides a method of determining the likelihood that a lung condition in a subject is cancer by measuring an abundance of a panel of proteins in a sample obtained from the subject; calculating a probability of cancer score based on the protein measurements and ruling out cancer for the subject if the score) is lower than a pre-determined score, wherein when cancer is ruled out the subject does not receive a treatment protocol. Treatment protocols include for example pulmonary function test (PFT), pulmonary imaging, a biopsy, a surgery, a chemotherapy, a radiotherapy, or any combination thereof. In some embodiments, the imaging is an x-ray, a chest computed tomography (CT) scan, or a positron emission tomography (PET) scan.
The present invention further provides a method of ruling in the likelihood of cancer for a subject by measuring an abundance of panel of proteins in a sample obtained from the subject, calculating a probability of cancer score based on the protein measurements and ruling in the likelihood of cancer for the subject if the score in step is higher than a pre-determined score.
In another aspect the invention further provides a method of determining the likelihood of the presence of a lung condition in a subject by measuring an abundance of panel of proteins in a sample obtained from the subject, calculating a probability of cancer score based on the protein measurements and concluding the presence of said lung condition if the score is equal or greater than a pre-determined score. The lung condition is lung cancer such as for example, non-small cell lung cancer (NSCLC). The subject at risk of developing lung cancer.
The subject has or is suspected of having a pulmonary nodule. The pulmonary nodule has a diameter of less than or equal to 3 cm. In one embodiment, the pulmonary nodule has a diameter of about 0.8 cm to 3.0 cm. The subject may have stage IA lung cancer (i.e., the tumor is smaller than 3 cm).
The score is calculated from a logistic regression model applied to the protein measurements. For example, the score is determined as Ps=1/[1+exp(−α−Σi=1Nβi*{hacek over (I)}i,s)], where {hacek over (I)}i,s is logarithmically transformed and normalized intensity of transition i in said sample (s), βi is the corresponding logistic regression coefficient, α was a panel-specific constant, and N was the total number of transitions in said panel.
In various embodiments, the method of the present invention further comprises normalizing the protein measurements. For example, the protein measurements are normalized by one or more proteins selected from PEDF, MASP1, GELS, LUM, C163A and PTPRJ.
The biological sample such as for example tissue, blood, plasma, serum, whole blood, urine, saliva, genital secretion, cerebrospinal fluid, sweat and excreta.
In one aspect, the determining the likelihood of cancer is determined by the sensitivity, specificity, negative predictive value or positive predictive value associated with the score. The score determined has a negative predictive value (NPV) is at least about 60%, at least 70% or at least 80%.
The measuring step is performed by selected reaction monitoring mass spectrometry, using a compound that specifically binds the protein being detected or a peptide transition. In one embodiment, the compound that specifically binds to the protein being measured is an antibody or an aptamer.
In certain embodiments, the diagnostic methods disclosed herein can be used in combination with other clinical assessment methods, including for example various radiographic and/or invasive methods. Similarly, in certain embodiments, the diagnostic methods disclosed herein can be used to identify candidates for other clinical assessment methods, or to assess the likelihood that a subject will benefit from other clinical assessment methods.
The high abundance of certain proteins in a biological sample such as plasma or serum can hinder the ability to assay a protein of interest, particularly where the protein of interest is expressed at relatively low concentrations. Several methods are available to circumvent this issue, including enrichment, separation, and depletion. Enrichment uses an affinity agent to extract proteins from the sample by class, e.g., removal of glycosylated proteins by glycocapture. Separation uses methods such as gel electrophoresis or isoelectric focusing to divide the sample into multiple fractions that largely do not overlap in protein content. Depletion typically uses affinity columns to remove the most abundant proteins in blood, such as albumin, by utilizing advanced technologies such as IgY14/Supermix (SigmaSt. Louis, Mo.) that enable the removal of the majority of the most abundant proteins.
In certain embodiments of the methods provided herein, a biological sample may be subjected to enrichment, separation, and/or depletion prior to assaying biomarker or putative biomarker protein expression levels. In certain of these embodiments, blood proteins may be initially processed by a glycocapture method, which enriches for glycosylated proteins, allowing quantification assays to detect proteins in the high pg/ml to low ng/ml concentration range. Exemplary methods of glycocapture are well known in the art (see, e.g., U.S. Pat. No. 7,183,188; U.S. Patent Appl. Publ. No. 2007/0099251; U.S. Patent Appl. Publ. No. 2007/0202539; U.S. Patent Appl. Publ. No. 2007/0269895; and U.S. Patent Appl. Publ. No. 2010/0279382). In other embodiments, blood proteins may be initially processed by a protein depletion method, which allows for detection of commonly obscured biomarkers in samples by removing abundant proteins. In one such embodiment, the protein depletion method is a Supermix (Sigma) depletion method.
In certain embodiments, stable isotope-labeled standard peptides (SIL) are used as normalizing peptides, according to U.S. Ser. No. 14/612,959 and Li et al. “An integrated quantification method to increase the precision, robustness, and resolution of protein measurement in human plasma samples,” Clinical Proteomics, 2015, 12:3, pages, 2-17, the contents of each of which are incorporated herein in their entireties.
In certain embodiments, a biomarker protein panel comprises two to 100 biomarker proteins. In certain of these embodiments, the panel comprises 2 to 5, 6 to 10, 11 to 15, 16 to 20, 21-25, 5 to 25, 26 to 30, 31 to 40, 41 to 50, 25 to 50, 51 to 75, 76 to 100, biomarker proteins. In certain embodiments, a biomarker protein panel comprises one or more subpanels of biomarker proteins that each comprise at least two biomarker proteins. For example, biomarker protein panel may comprise a first subpanel made up of biomarker proteins that are overexpressed in a particular lung condition and a second subpanel made up of biomarker proteins that are under-expressed in a particular lung condition.
In certain embodiments, kits are provided for diagnosing a lung condition in a subject. These kits are used to detect expression levels of one or more biomarker proteins. Optionally, a kit may comprise instructions for use in the form of a label or a separate insert. The kits can contain reagents that specifically bind to proteins in the panels described, herein. These reagents can include antibodies. The kits can also contain reagents that specifically bind to mRNA expressing proteins in the panels described, herein. These reagents can include nucleotide probes. The kits can also include reagents for the detection of reagents that specifically bind to the proteins in the panels described herein. These reagents can include fluorophores.
The following examples are provided to better illustrate the claimed invention and are not to be interpreted as limiting the scope of the invention. To the extent that specific materials are mentioned, it is merely for purposes of illustration and is not intended to limit the invention. One skilled in the art may develop equivalent means or reactants without the exercise of inventive capacity and without departing from the scope of the invention.
Described herein is the development of the Xpresys® Lung CR test. The Xpresys® Lung CR test comprises a rule-out classifier (Classifier 1; TRO) and a rule-in classifier (Classifier 2; TRI). See
The previously described rule-out classifier (also referred to herein as Xpresys® Lung; TRO) is a plasma test that aims to rescue benign lung nodules from unnecessary invasive procedure. The proteins, transitions and corresponding coefficients of the TRO classifier are detailed in Table 1. Based on the data described in U.S. Pat. No. 9,297,805, and the estimated cancer prevalence of 23.1% among lung nodules of 8-30 mm in size, the TRO classifier is expected to classify 43.9% of the intended use population (i.e. individuals at least 40 years of age and with a pulmonary nodule between 8-30 mm in size as detected by radiology) as Likely Benign with a negative predictive value (NPV) of 84.0% or higher. Subjects having a Likely Benign test result should be monitored by surveillance according to current nodule management guidelines for patients of low cancer risk, avoiding invasive procedure unless nodule growth is observed. The TRO classifier also classifies the remaining 56.1% of the intended use population as Indeterminate. Subjects having an Indeterminate test result should be treated according to the standard of care.
It is desirable to further stratify subjects having an Indeterminate test result with the TRO classifier (Classifier 1) according to the subject's risk of bearing a cancerous nodule. The Reflex Lung Classifier (also referred to herein as rule-in classifier; TRI; Classifier 2) was developed for that purpose and is described herein. The Reflex Lung Classifier (rule-in classifier; TRI; Classifier 2) categorizes subjects having high risk of cancer as Likely Cancer and the rest as Indeterminate II. See
Below is a summary of results for the Xpresys® Lung CR (Cancer Risk) (Combination TRO and TRI Classifier) Retrospective Validation Study. The Xpresys® Lung CR Test contains two integrated classifiers: 1) Xpresys® Lung (Classifier 1; Rule-out Classifier; TRO) which stratifies patients into Likely Benign and Indeterminate I, and 2) Reflex® Lung Classifier (Classifier 2; Rule-in Classifier; TRI) which further stratifies patients having an Indeterminate I test result into Indeterminate II and likely Cancer. See
The study design for the Xpresys Lung CR Classifier (combination TRO and TRI) used previously acquired biological samples described in U.S. Pat. No. 9,201,044 and U.S. Pat. No. 9,297,805, the contents of each of which are incorporated herein by reference in their entireties. The exclusion and exclusion criteria were previously described. See Vachani et al “Validation of a Multi-Protein Plasma Classifier to Identify Benign Lung Nodules,” Journal of Thoracic Oncology: official publication of the International Association for the Study of Lung Cancer, the contents of which are incorporated herein in its entirety by reference. Briefly, all clinical samples were from subjects with lung nodules or 8-30 mm in size and 40 years old or older.
As shown in
The intended use population of Xpresys® Lung CR (combination TRO and TRI classifier) requires the exclusion from this validation study of patients who were diagnosed within 2 years of sample collection of any cancer other than non-melanoma skin cancer. As a consequence of this, 18 samples (8 benign and 10 cancer) were removed from this study. The remaining 123 samples (55 benign and 68 cancer) were used to validate Xpresys® Lung CR (combination TRO and TRI classifier). See
Xpresys® Lung (TRO) is a component of Xpresys® Lung CR (combination TRO and TRI classifier). Thus, before validating Xpresys® Lung CR, Xpresys® Lung needs to be revalidated on the reduced sample set. The methodology and results are summarized below.
Xpresys® Lung (TRO) validation was carried out using the NC=68 cancer and NB=55 benign samples. We calculated pAUC on 10,000 bootstrap samples using the function “comproc” in R package “pcvsuite”. The mean value of pAUC was 0.047 (
The rejection of the null hypothesis H1 allowed us to sequentially test the null hypotheses H20.38, H20.39, etc., that is fracT,L<frac0=0.447 at thresholds T=0.38, 0.39, etc. The testing procedure was carried out as described in DES-0001. First, we fitted the raw ROC curve with the binomial form TNR=Φ(a+b*Φ−1(FNR)) and obtained a=0.461 and b=0.842. As shown in
Using an estimated cancer prevalence of 23.1% for 8-30 mm nodules, the performance of Xpresys® Lung (TRO) was calculated and summarized in Table 25. Since the lowest score of any sample in a previous study was 0.211 of a benign sample, we could not determine NPV at scores below 0.211. Considering NPV was a monotonic function of score and NPV=0.981 at score 0.22, we simply set NPV=0.981 at scores between 0.00-0.21.
Validation of Xpresys® Lung CR (Combination TRO and TRI Classifier)
Using the newly validated threshold of 0.50, Xpresys® Lung (TRO) classified 55 (31 benign and 24 cancer) out of the samples as Likely Benign and 68 samples (24 benign and 44 cancer) as Indeterminate I. Thus the fraction of cancer samples in the Likely Benign group was fracLB=24/55=0.436 (95% CI: 0.303-0.577). Using a score threshold T, Classifier 2 further classified the 68 Indeterminate I samples into Likely Cancer (if the corresponding sample scores of Classifier 2 were equal to or greater than T) or Indeterminate II. The primary aim of this study is to validate that there is a score threshold T of Classifier 2 such that the fraction of cancer samples (fracT) in the Likely Cancer group is significantly higher than fracLB.
Since there were only 68 Indeterminate I samples, we modified our validation plan to reduce possible small-sample-size artifacts. Instead of using the raw data, we applied the same method as in the validation of Xpresys® Lung (TRO), fitted the raw ROC curve with the binomial form TPR=Φ(a+b*Φ−1(FPR)) and obtained a=0.361 and b=0.806. As shown in
Using a fixed-sequence procedure, the primary aim was validated, i.e. the null hypothesis that fracT<fracLB was rejected, for all thresholds between 0-0.96 based on the fitted data. The outcomes are summarized in Table 26.
The fraction of cancer samples in the study was fracC=68/123=0.553 (95% CI: 0.461-0.643). The secondary aim of this study is to validate that there is a score threshold T of Classifier 2 such that the fraction of cancer samples (fracT) in the Likely Cancer group is significantly higher than fracC. The secondary aim requires a stronger performance of Xpresys® Lung CR than the primary aim.
Using the same method and the same fixed-sequence procedure as in the validation of the primary aim, the secondary aim was validated, i.e. the null hypothesis that fracT<fracC was rejected, for all thresholds between 0.39-0.60 based on the fitted data. The outcomes are summarized in Table 27. The secondary aim could also have been validated for all thresholds between 0.61-0.96 if the fixed-sequence procedure were not enforced.
Using the newly validated threshold of 0.50, Xpresys® Lung (TRO) classified 51.3% of intended use population as Likely Benign and the remaining 48.7% as Indeterminate I (Table 25). The expected cancer rate, i.e. PPV, of patients with Indeterminate I test results was 30.5%. Using these parameters and the fitted data, the performance of Classifier 2 was evaluated and summarized in Table 28.
Using the validated thresholds of 0.50 for Classifier 1 and 0.39 for Classifier 2 (based on the validation of the secondary aim which requires a stronger performance of Xpresys® Lung CR (combination TRO and TRI) than the primary aim), Xpresys® Lung CR stratified 51.3% of intended use population as Likely Benign, 39.2% as Likely Cancer and the remaining 9.5% as Indeterminate II. The NPV was 84.0% for the Likely Benign group and the PPV was 31.9% for the Likely Cancer group.
To further assess cancer risk for patients tested as Likely Benign or Likely Cancer, we define post-test cancer risk (CR) as
where NPV(T) and PPV(T) are the NPV and PPV values at the corresponding thresholds of Classifier 1 and Classifier 2, respectively: See Tables 25 and 28. We further define Test Population, i.e. the expected percentage of intended use population whose test scores are below (for Likely Benign) or above (for Likely Cancer) the corresponding thresholds, as
where LBR(T) and LCR(T) are the Likely Benign Rate and the Likely Cancer Rate at the corresponding thresholds of Classifier 1 and Classifier 2, respectively: See Tables 25 and 28. In
With a specific threshold T of Classifier 2, the null hypothesis of the primary aim states that the fraction of cancer samples (fracT) in the Likely Cancer group is lower than the fraction of cancer samples (fracLB) in the Likely Cancer group, i.e. fracT<fracLB. The following method were used to test the null hypothesis of the primary aim:
1. Fit the ROC curve of the study with a binormal form, i.e. TPR=Φ(a+b*Φ−1(FPR)), using R function “rocreg” (16, 17). Here TPR is true positive rate, i.e. sensitivity, FPR is false positive rate, i.e. 1-specificity, and Φ(x) is the normal cumulative distribution function. The fitting of ROC curves with binormal forms is well justified (18).
2. Calculate fitted false positives (FPT,f) and fitted true positives (TPT,f) as follows:
a. Get total cancer calls (NB,T+NC,T) from actual data in the study.
b. Solve FPR by matching total cancer calls from actual data and from fitted data: NB,T+NC,T=NB*FPR+NC*Φ(a+b*Φ−1(FPR)).
c. Get FPT,f=NB*FPR.
d. Get TPT,f=NC*Φ(a+b*Φ−1(FPR)).
3. Calculate the one-sided, 95% lower confidence limit of fracT,f=TPT,f/(TPT,f+FPT,f), using Jeffreys interval implemented in R function “binom.bayes” in package “binom”:
fracT, L=binom.bayes(TPT,f, TPT,f+FPT,f, conflevel=0.9, type=“central”, tol=1e-12)$lower
4. Reject the null hypothesis if fracT,L≧fracLB. Otherwise, accept the null hypothesis. Accept the null hypothesis if the code fails to converge on fracT,L.
The null hypothesis of the secondary aim states that the fraction of cancer samples (fracT) in the Likely Cancer group is lower than the fraction of cancer samples (frac0) in the study, i.e. fracT<frac0. The same method was used to test the null hypothesis of the secondary aim.
The Reflex Lung Classifier (Classifier 2; Reflex Lung; TRI) study process flowchart is shown in
The set of proteins that were analyzed for the rule-in classifier (Classifier 2; TRI) consisted of all the proteins that were reliably and robustly detected and described in U.S. Pat. No. 9,201,044 and U.S. Pat. No. 9,297,805. All of the proteins were vetted in parallel to the initial development. Table 2 below is a list of the proteins that were reliably detectable and reproducibly quantifiable as shown in Li et al. “An integrated quantification method to increase the precision, robustness, and resolution of protein measurement in human plasma samples,” Clinical Proteomics, 2015, 12:3.
The following proteins were subsequently rejected from further study: AIFM1, LRP1, PROF1, TETN, and PRDX1.
The values were normalized according to the methods described in U.S. Ser. No. 14/612,959, the contents of which are incorporated herein by reference in its entirety. Briefly, each protein's abundance is represented by the ratio of its endogenous area to the corresponding SIS heavy transition. Each putative classification response ratio is normalized by the median samples response ratio using normalization proteins (PEDF, MASP1, GELS, LUM, C163A, and PTPRJ). The protein's abundance is then Box-Cox normalized using equation (3) with the lambda parameters listed in Table 3.
The S10A6 protein was not integrated manually, and, as a result, was not included in the panel search. All panel combinations were formed from the remaining 15 proteins in Table 2 (2̂15−1=32767 panels). For each protein panel, 10,000 Monte Carlo Cross Validation logistic regression models were formed with 80% of the data used for training and 20% held out for testing using Equation (4).
W=α+
β
n
*{tilde over (P)}
n (5)
Where the set proteins in the 32767 protein combinations. The α and β_n coefficients are the median of the 10,000 coefficients determined using Matlab's glmfit function.
The status and logistic regression score were calculated and a ranking of test samples were recorded for each model. A ROC curve was computed using the sample status and ranking of these stacked values. From the ROC curve the partial AUC was computed for the False Positive Rate from 0 to 0.2. The panels were ranked by partial AUC the sorted ranking of panels is displayed in
Table 4 depicts the frequency of occurrence of proteins in the panel as a function of the number of top ranked panels. The last column 1092 panels is every panel above the randomly expected partial AUC at 0.2 false positive rate (FPR, which equals to 1-sensitivity). The randomly expected partial AUC at the training specificity of 0.8 (FPR−0.2) is equal to the area under the diagonal line of the ROC curve from 0 to 0.2 is (0.2*0.2/2)−0.02.
The TENX and ENPL was eliminated from further study. Further analysis of the panels containing ENPL contributed to the removal of ENPL from the panels, as ENPL results in a drop in panel performance. See
Panels with partial AUC greater than 0.256 were selected for further analysis. Table 5 provides a list of all the 26 panels meeting the partial AUC performance criteria.
The performance of the 26 top panels was assessed by partial AUC at 0.2 FPR following the addition of S10A6. The results indicate that none of the panels had better performance following the addition of the S10A6 protein, and, as such, the S10A6 protein was subsequently dropped from further consideration.
To each of the top panels an additional interaction term was added one at a time to produce a new panel. The set of linear interaction terms is formed by subtracting the mean clinical sample value from each sample's abundance and multiplying every combination of protein pairings as in Equation 6.
W=
+
β
n
*{tilde over (P)}
n+γm,n*({tilde over (P)}I
Each of the 26 panels was tested with every relevant interaction term. An interaction term is relevant when the protein pair exists in the panel. Models were trained with the method described in the section above titled, “Protein Panel Search.” When the interaction term was found to improve the models partial AUC it was kept for further analysis. All the interaction protein pairings that improved the panel were used to form a new exhaustive list of panels consisting of the 26 starting panels and every combination of interaction pairings that improved the partial AUC. This resulted in 247 panels.
The top 30 panels from the interaction term search were re-trained using the same method but tracking all model coefficients. Measuring the CV of each protein's model coefficients allows use to find a set of models that were consistently stable across the 10,000 trials. A set of four panels listed in Table 6 were selected that had no coefficient CV greater than 0.5.
The performance (PPV, sensitivity) is presented in
The same cross validated PPV/sensitivity analysis was performed except those samples ruled indeterminate using the Xpresys® Lung rule-out classifier (TRO) were excluded from the testing dataset. When restricting the number of samples to those ruled indeterminate (samples having a rule-out threshold greater than 0.47) the prevalence of the cancer rate increases. Using the prevalence data described in US-20130217057 and US-20150031065, rule-out performance: sensitivity=0.695 and specificity=0.480. See
The cross-validated PPV and sensitivity for Model 4 are poor so the model was dropped from consideration. The best performance is from Model 2.
The mean estimated cross-validated performance of Model 2 at different Rule-in Rates (RIR's) is displayed in Table 7.
The analytical performance was studied with the analytical dataset to determine variability based on different analytical positions for detailed information. See Example titled Analytical Validation for Proposed Reflex Classifiers. For all models the human plasma standard (HPS) calibration procedure resulted in adding additional variability in the results. Accordingly, in one embodiment, it is recommended not to use the HPS calibration process with the Rule-In classifier.
One protein of concern GGH (Position to Position variability is high 63%, 42%, 80%) protein is in all the panels but the variability didn't translate into greater score variability. The analytical summary data is presented in Table 8.
Model 2 consisting of 5 proteins ALDOA, TSP1, FRIL, KIT and GGH along with the interaction terms ALDOA×KIT was chosen for validation. See Table 9 for the definition of Model 2.
The samples score is calculated with the formula (2) where
W=α+β
ALDOA
*{tilde over (P)}
ALDOA+βFRIL*{tilde over (P)}FRIL+βGGH*{tilde over (P)}GGH+βKIT*{tilde over (P)}KIT+βTSP1*{tilde over (P)}TSP1+γ*({tilde over (P)}ALDOA+0.19189)*({tilde over (P)}KIT+0.69956)
The laboratory workflow is depicted in
The sample collection step includes the collection of a blood sample from a subject, and the subsequent processing of the blood sample to isolate plasma from the blood sample. In one embodiment, the plasma sample is placed in a K2-EDTA Vacutainer, and shipped on dry ice to a processing facility. Upon the arrival of the plasma sample to the processing facility, the plasma sample is inspected to assure quality control standards (i.e. acceptable limit of hemolysis) and placed in storage until further processing.
For processing, the samples undergo a batching process. The batch refers to a set of test samples, human plasma standards (HPS) and blanks that are tested and go through a laboratory process on the same testing plate. The HPS samples are aliquots of pooled donor plasma samples comprised of pooled plasma from 40 healthy males and 40 healthy females. In one embodiment, four HPS samples and two blank samples are run in a batch. Each batch undergoes quality control to monitor the response from the peptides in every HPS sample, and if the response is outside of acceptable limits then the assay (batch) fails. Likewise, if the negative control (i.e. the blank) has an erroneous reading, the entire batch fails.
The batches are subsequently depleted of high abundance proteins (HAPs) and medium abundance proteins (MAPs). To accomplish removal of the HAPs and MAPs the samples are processed with an immunodepletion step wherein the samples pass through an immunoaffinity column that contains antibodies against approximately 60 high and medium abundance plasma proteins. Following the depletion step, two fractions of plama proteins remain, a low abundance protein (LAP) plasma sample and a HAP/MAP sample. The LAP fraction contains the proteins that comprise the rule-out and rule-in classifiers. Quality control is performed following immunodepletion (i.e. via comparison of proteins found in depleted HPS, and analysis of the blank controls).
The immunodepleted sample containing the LAP fraction is subsequently processed by enzymatic digestion. In one embodiment, trypsin is used for enzymatic digestion of the protein. Other proteolytic enzymes may be used, for example, Chymotrypsin, Endoproteinase Asp-N, Endoproteinase Arg-C(mouse submaxillary gland), Endoproteinase Glu-C(V8 protease) (Staphylococcus aureus), Pepsin, Elastase, Papain, Proteinase K, Subtilisin, Clostripain, and others not in this list may be used. Trypsin efficiently and specifically cleaves amide bonds on the C-terminal side of arginine and lysine resulting in a predictable set of peptides for each protein. Other enzymes can be used in this process, including endonucleases. Following enzymatic digestion of the proteins, isotopically labeled internal standards are mixed with the sample. The isotopically labeled standards are peptides having the same sequence as the peptides that comprise the rule-out and rule-in classifiers. The abundance each peptide within the subject's isolated sample is compared to the isotopically labeled peptides for peptide normalization. As such, the isotopically labeled peptides are used for normalizing the amounts of peptides in sample from a subject.
Following the addition of the internal standards to the sample, the peptides are subsequently separated by HPLC. The separated peptides are then introduced into the mass spectrometer. LC-MRM measures the peptide abundance as peak area. The peptide abundance in a sample is used to calculate a sample score according to a logistic regression algorithm explained in Example 1 and below.
Blood samples were analyzed as previously described. See U.S. Pat. No. 9,297,805. The Reflex Lung Classifier (TRI) contains two new proteins (KIT and GGH) that are not part of Xpresys® Lung (TRO).
The Reflex Lung Classifier (TRI) consists of five diagnostics proteins (ALDOA, FRIL, GGH, KIT, and TSP1), six normalization proteins (PEDF, MASP1, GELS, LUM, C163A, and PTPRJ), and one protein-protein interaction term (ALDOA and KIT). The classifier uses a logistic regression model to calculate a score between 0 and 1 from the measured expression of diagnostics proteins. More specifically, the measured expression of each diagnostic protein is first normalized by a panel of the six normalization proteins using the InteQuan method (10). The normalized protein expression Pi is then Box-Cox transformed such that
The transformation coefficients {λi} are listed in Table 2. The classifier score is then calculated as
where
W=αβ
ALDOA
*{tilde over (P)}
ALDOA+βFRIL*{tilde over (P)}FRIL+βGGH*{tilde over (P)}GGH+βKIT*{tilde over (P)}KIT+βTSP1*{tilde over (P)}TSP1+β*({tilde over (P)}ALDOA+0.19189)*({tilde over (P)}KIT+0.69956) (5)
All coefficients α1 {βi} and γ are listed in Table 2. Samples whose Reflex Lung (TRI) score is greater or equal to the validated threshold T of the rule-in classifier (see Example 1) are classified as Likely Cancer.
Since 32 benign and 22 cancer samples were classified as Likely Benign by Xpresys® Lung (TRO), the fraction of cancer samples in the Likely Benign group is fracLB=22/54=0.407 (95% CI: 0.276-0.550). Now assume that NC,T cancer and NB,T benign samples are in the Likely Cancer group at the threshold T. Then the corresponding fraction of cancer samples is defined as fracT=NC,T/(NB,T+NC,T). The null hypothesis for the primary aim under threshold T (HT) is defined as: fracT<fracLB. The null hypothesis HT is rejected if the one-sided, lower 95% (α=0.05) confidence bound (fracT, L) of fracT is no less than fracLB, i.e. fracT, L≧fracLB. The exact (Clopper-Pearson) method will be used to calculate fracT, L based on binomial distribution. (see Clopper, C. J. & Pearson, E. S. (1934). “The use of confidence or fiducial limits illustrated in the case of the binomial.” Biometrika, 26, 404-413).
A fixed-sequence procedure is used to control the overall testing error in the study. (see A. Dmitrienko, R. B. D'Agostino, Sr., and M. F. Huque, ‘Key Multiplicity Issues in Clinical Drug Development’, Stat Med, 32 (2013), 1079-111.; A. Dmitrienko, A. C. Tamhane, and F. Bretz, Multiple Testing Problems in Pharmaceutical Statistics, Chapman & Hall/Crc Biostatistics Series (Boca Raton, Fla.: Chapman & Hall/CRC, 2010). The following thresholds will be tested for the primary aim: T=0.60, 0.59, . . . , 0, 0.61, 0.62, . . . , 1.00. Basically the threshold sequence contains two subsequences: The first subsequence decreases from 0.6 to 0 by an increment of 0.01 and the second one increases from 0.61 to 1.00 by an increment of 0.01. The first threshold 0.60 is chosen since the corresponding positive predictive value (PPV) is predicted to be twice the pretest cancer prevalence of 23.1%, based on the cross validated performance in the discovery study (4). Hypotheses will be tested in the following order: H0.60->H0.59-> . . . ->H0->H0.61->H0.62-> . . . ->H1.00. More specifically, H0.60 will be tested first. If H0.60 is rejected, H0.59 will be tested next. If H0.59 is rejected, H0.58 will be tested next. So on and so forth. During this sequencing of testing, if any hypothesis is accepted, the testing procedure stops immediately at the accepted hypothesis and subsequent hypotheses will not be tested at all.
The four protein model parameters are described in Tables 11-14 below.
Table 15 summarizes the experimental layout for the analytical validation procedure. Each of the four protein classifier Models (see Table 6) were assayed for analytical performance.
Table 15: Experimental Layout for the Validation Procedure
In Table 15, cancer samples are labeled with prefix “C”, and benign samples with prefix “B”. MRM MS data were collected on samples in Batch 2 using two different instruments; the replicate data was labeled as Batch 4. The first HPS aliquot and the aliquots of B7, B2 and C8 in Batch 1 were removed from analysis (shaded).
Fifteen repeated measurements were successfully obtained from the 12 aliquots of the HPS sample (column 2 was replicated and one HPS was removed), which provided a dataset to assess the overall variations within the study. The obtained SDs, their 95% CIs and the corresponding CVs are listed in Table 16.
0.0815
0.0565
0.0889
0.0760
0.2241
Three repeated measurements were successfully obtained from eight out of the nine samples (minus sample B2) that were designated for assessing position-to-position variations. The obtained SDs, their 95% CIs and the corresponding CVs are listed in Table 17. The obtained Pearson correlation coefficients between measurements at different positions are listed in Table 18.
0.0740
0.0565
0.0752
0.0559
0.881
(0.464, 0.978)
0.852
(0.368, 0.973)
0.631
(−0.132, 0.925)
0.423
(0.423, 0.869)
0.794
(0.202, 0.961)
Three repeated measurements were successfully obtained from seven of the nine samples (minus samples B7 and C8) that were designated for assessing column-to-column variations. The obtained SDs, their 95% CIs and the corresponding CVs are listed in Table 19. The obtained Pearson correlation coefficients between measurements using different depletion columns are listed in Table 20.
0.0621
0.0757
0.1050
0.885
(0.395, 0.983)
0.875
(0.359, 0.981)
0.898
(0.446, 0.985)
0.898
(0.447, 0.985)
0.885
(0.395, 0.983)
0.875
(0.359, 0.981)
0.898
(0.446, 0.985)
0.898
(0.447, 0.985)
0.898
(0.447, 0.985)
0.890
(0.414, 0.984)
0.851
(0.273, 0.978)
0.897 (0.443, 0.985)
0.883
(0.388, 0.983)
0.870
(0.338, 0.981)
0.889
(0.412, 0.984)
0.867 (0.328, 0.980)
0.899
(0.450, 0.985)
Two repeated measurements were successfully obtained from all samples in Batch 2 that were designated for assessing instrument-to-instrument variations. The replicate was labeled as Batch 4. Three samples (B3, C2 and C3) were depleted at three different positions within the column, which led to three repeated measurements on these samples. Considering that position-to-position variations were rather small, we used the corresponding average values from the three repeated measurements on these samples when evaluating the “pooled” SD and the CV. For the same reason, weighted Pearson correlation coefficients were evaluated to assess the repeatability. The obtained SDs, their 95% CIs and the corresponding CVs are listed in Table 21. The obtained Pearson correlation coefficients between measurements using different instruments are listed in Table 22.
0.1230
0.0855
0.0938
0.0911
0.0651
0.228
0.266
0.058
0.053
0.147
0.116
0.540
0.089
0.251
0.050
0.236
0.232
0.217
0.067
0.056
0.152
0.244
0.523
0.088
0.328
0.050
0.275
0.121
0.095
0.090
0.125
0.373
0.153
0.174
#Set to this value due to a lack of data.
The details of one or more embodiments of the invention are set forth in the accompanying description above. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. Other features, objects, and advantages of the invention will be apparent from the description and from the claims. In the specification and the appended claims, the singular forms include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All patents and publications cited in this specification are incorporated by reference.
The foregoing description has been presented only for the purposes of illustration and is not intended to limit the invention to the precise form disclosed, but by the claims appended hereto.
This application is a claims priority to and the benefit of U.S. Ser. No. 62/310,258, filed Mar. 18, 2016, the contents of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
62310258 | Mar 2016 | US |