The present invention relates to a computer-based device and model for predicting or estimating probability of death from thrombosis.
Thrombosis generally refers to formation of a blood clot inside a blood vessel such as a vein or artery. Among the common types of thrombosis are venous thrombosis and arterial thrombosis. One complication that can result from thrombosis is pulmonary embolism (PE). Pulmonary embolism is basically a blockage of the main artery of the lung or one of its branches by a substance such as a blood clot that has travelled from elsewhere in the body through the blood stream. The risk of pulmonary embolism is increased in various medical situations and with certain medical conditions.
Clinical guidelines recommend risk stratification of patients with acute pulmonary embolism. Risk stratification of acute pulmonary embolism generally refers to the use of objective criteria to predict the probability of clinical deterioration during a short term follow-up period after diagnosis. Active cancer increases risk of PE and worsens prognosis but also causes incidental PE discovered during cancer staging.
The majority of published literature focuses on categorizing patients as low, moderate, or high risk of death, respiratory failure, or circulatory failure during the inpatient period or up to 30 days. Known methods of risk stratification include the use of echocardiography, biomarkers, scoring systems, CT scan results, and use of comborbid conditions. Increased risk has been suggested to be important in decisions regarding need for admission, or need for treatment beyond standard anticoagulation. Patients with active malignancy represent a subgroup at particularly high risk of both developing PE and having an increased risk of death after diagnosis of PE compared with patients who have no history of cancer or a past medical history of cancer in remission at the time of PE diagnosis. At present, there is no known device for estimating the quantitative short-term prognosis of a patient with active malignancy and PE. No quantitative decision instrument or device has been derived specifically for patients with active cancer and PE. Thus, the present invention is directed to a computer-based device having a multivariate, quantitative computer-implemented modeling equation used for the purpose of assessing the probability of death within 30 days of patients with active cancer and PE.
The present invention is directed to a computer-based device and predictive model to estimate a quantitative short-term prognosis of a patient with active cancer and thrombosis. The present invention is directed to a computer-based device and predictive model to estimate a quantitative short-term prognosis of a patient with active cancer and pulmonary embolism. The present invention is further directed to a computer-based device and predictive model executable by computer software for use with the device to predict a percentage probability of death from pulmonary embolism in a patient with active cancer.
Further areas of applicability of the present invention will become apparent from the detailed description provided hereinafter. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.
The present invention will become more fully understood from the detailed description and the accompanying drawings, which are not necessarily to scale, wherein:
The following detailed description of the embodiment(s) is merely exemplary in nature and is in no way intended to limit the invention, its application, or uses.
The computer-based device and predictive model of the present invention is used to estimate a quantitative short-term prognosis of a patient with active cancer and thrombosis. The computer-based device and predictive model of the present invention is also used to estimate a quantitative short-term prognosis of a patient with active cancer and pulmonary embolism.
The computer-based device comprises a predictive model executable by computer software for estimating a percentage probability of death from pulmonary embolism (PE) in a patient with active cancer. The computer-based device typically comprises a processor, a data input device (e.g., keyboard), a data output device (e.g. computer monitor or display computer screen), a disk drive, a computer software application, computer memory, and data storage modules. The computer memory comprises instructions executable by the processor to run the predictive model of the present invention. The computer device may be communicatively connected to a computer network.
The predictive model of the present invention comprises a multivariate equation. As a feature of the model, the multivariate equation is a logistic regression equation. As another feature of the model, the equation typically has eight variables or factors. The model has surprisingly been found to have good prognostic accuracy for 30 day mortality especially at low and high estimates. The model is helpful for cancer patients with PE discovered during disease staging and near the end of life. Additionally, the model has good overall diagnostic accuracy for prediction of death in patients with malignancy and suspected PE but for whom no PE was found despite standard of care evaluation for PE.
Equation (1) illustrates a multivariate equation for use in the model of the present invention in a form of a logistic regression equation. The logistic function may be plotted with z on the horizontal axis and f(z) on the vertical axis.
f(z)=1/(1+e−z) Equation (1)
wherein
f(z)=output (i.e. probability of particular outcome)
z=input=β0+β1x1+β2x2+β3x3+βax4 . . . +βkxk
x1, x2, x3, . . . xi=factors
k=1, 2, 3, . . . i
β0=intercept
β1, β2, β3, . . . βk=regression coefficients
Equation (2) is a derived logistic regression equation in the form of equation (1) having the following unique intercept, regression coefficients and factors for assessing the probability of death from pulmonary embolism (PE) in a patient with active cancer within thirty days.
P(z)=[1−[1/(1+ez)]]*100 Equation (2)
wherein
P(z)=percentage probability of death within thirty days
z=β0+β1x1+β2x2+β3x3+β4x4+β5x5β6x6+β7x7β8x8
β0=intercept
β1=patient body weight
β2=heart rate
β3=respiratory rate
β4=pulse oximetry (SaO2)
β5=altered mental state
β6=respiratory distress
β7=“do not resuscitate” status
β8=unilateral limb swelling
wherein
x1=body weight of patient in pounds
x2=1 if heart rate of patient >99 beats per minute
but x2=0 if heart rate<100 beats per minute
x3=respiratory rate of patient in breaths per minute
x4=pulse oximetry value in % obtained from the patient breathing room air
x5=1 if patient has altered mental state but x5=0 if patient does not have altered mental state
x6=1 if patient has respiratory distress but x6=0 if patient does not have respiratory distress
x7=1 if patient has “do not resuscitate” status but x7=0 if patient does not have “do not resuscitate status”
x8=1 if patient has unilateral limb swelling but x8=0 but if patient does not have unilateral limb swelling
As indicated above in Equation (2), the multivariate equation has eight predictor factors x1 to x8. The factors are body weight, heart rate, respiratory rate, pulse oximetry, respiratory distress, altered mental status, do not resuscitate status and unilateral limb swelling.
The beta coefficients of equation (2) preferably have the following values:
β0=3.72
β1 is in a range of −0.02 to 0
β2 is in a range of 0.35 to 1.83
β3 is in a range of −0.02 to 0.11
β4 is in a range of −0.13 to 0
β5 is in a range of 0.05 to 2.58
β6 is in a range of −0.12 to 1.48
β7 is in a range of 0.29 to 2.66
β8 is in a range of −0.16 to 1.53
In accordance with the present invention, a user enters the values for the factors in the computer-based device using a computer user interface or application, such as an “Applet” programmed in Javascript, or HTML, or a macro programmed in Visual Basic for the user to input the variables or input data into the device and computer code to execute the equation and present the percentage probability of death within 30 days. In addition to the values for the factors, the user enters any other needed information. The input data once entered is processed according to the computer-implemented model to generate the output data. The output data is displayed on a computer screen or other output display device as illustrated in
A percentage of 5% or less corresponds to a near zero chance of death within 30 days and therefore the device output is typically a recommendation of home treatment. A percentage of >50% corresponds to a high probability of death with standard treatment, suggestive of need for advanced treatment options, including fibrinolysis, catheter embolectomy, or de-escalation of medical care and transition to palliation. At a percentage of 6 to 49% the device output is typically a recommendation of hospital admission with standard anticoagulation. The device is optionally used in conjunction with an anticoagulant treatment to allow early discharge of patients with cancer and PE.
It is within the scope of the present invention that the variables are modifiable slightly either by substitution or addition. For example, unilateral limb swelling can be substituted for one of the following: prior PE, systolic blood pressure, immobility, and alternative diagnosis to obtain performance almost as good. Similarly, the variable of breathing room air could substitute for respiratory distress. Likewise, the intercept and regression coefficients are able to be changed slightly and the device performance nearly maintained. The device is able to be augmented to estimate other outcomes relevant to treatment options including probability of bleeding.
It is within the scope of the present invention that variables are able to be added or substituted, coefficients modified, or the contents of the equation used in different prediction models, including classification trees, nomograms, Bayesian network, artificial intelligence, nearest neighbor, or other nonlinear techniques.
The results achieved with the derived multivariate model and device of the present invention are surprising, especially the accuracy of predicting near zero probability of death in one-third of the at risk population. There is a need for the device and model of the present invention which stems from the need for a validated clinical pathway to manage cancer patients who undergo staging imaging that includes the contrast-enhanced imaging pulmonary vasculature, which demonstrates an incidentally discovered PE. This occurs in approximately 1.5-2.5% of scans. Not all incidental clots are benign especially in the case of larger clots. The device and model of the present invention are suitable for use in a clinical trial of outpatient management for patients with active cancer and PE, including those with incidentally discovered PE, and with a mortality estimate of 5% or less. Additionally, for selected patients with advanced cancer and PE, an accurate quantitative prediction device and model may also provide better congruence between what families and patients want and the care that providers give them near the end of life decisions.
The model of the present invention was derived using a subset of 408 patients with active cancer and PE who were enrolled in a multicenter registry known as Emergency Medicine Pulmonary Embolism in the Real World Registry (EMPORER). The dataset was collected in 22 community and academic emergency departments in the United States over a three year period.
Twenty-five (25) potential candidate predictor variables were selected; combined use of univariate (P<0.1) and classification and regression tree analysis selected 8 final predictors which were entered into multivariate logistic regression, solved for P to predict 30 day mortality. The resultant model cross validated in a composite sample of 149 patients with active cancer and PE. The model (body mass, heart rate>100, respiratory rate, SaO2%, respiratory distress, altered mental status, do not resuscitate status, unilateral limb swelling) had an area under the receiver operating characteristic curve of 0.83 in the derivation set and 0.85 (95% CI 0.73-0.95) in the validation sample. In the validation sample, no patient with probability estimate≦5% died (0/42, 95% CI 0 to 8.4%) whereas 8/11 (73%, 95% CI 39 to 94%) with an estimate>50% died within 30 days.
Participating sites in EMPORER prospectively collected 264 data elements on 1,880 patients diagnosed with PE in the emergency department, including 30 day outcome.
The device of the present invention collected detail about risk factors for PE as well as data to predict adverse outcomes. The device collected demographic data, vital signs, comorbidities, risk factors for thrombosis, values of biomarkers including troponins and natriuretic peptides, echocardiographic results and radiographic results. For many of these elements, additional detail was collected. The malignancy variable was further divided into active cancer (defined as under the care of an oncologist), metastatic malignancy, or inactive cancer, defined as in remission by the patient and no longer under the active care of an oncologist. Patients were followed prospectively for in-hospital adverse outcomes and for survival until 30 days.
The selection of variables, their analysis, model construction and assessment of goodness of fit and validation was performed in accordance with published standards. (Moss M, Wellman D A, Cotsonis G A. An appraisal of multivariable logistic models in the pulmonary and critical care literature. Chest 2003; 123:923-928). Twenty-five (25) potential independent variables were identified. The dependent variable was mortality at 30 days. The 25 potential predictor variables were screened for inclusion in the multivariate model using two methods: First, data was analyzed with classification and regression technique (CART, Salford Systems Inc, San Diego, Calif.). Second, univariate analysis was performed to screen for potentially important differences in means (unpaired t-test) or frequency (Chi square) for the 25 variables between survivors and decedents with variables. Variables with weights>0 in the CART analysis and P<0.1 in univariate analysis were retained. Retained continuous predictor variables (e.g., heart rate) underwent receiver operating characteristic curve analysis to determine the cutoff corresponding to the lowest likelihood ratio negative for prediction of death at 30 days.
Retained predictors were then entered into a second logistic regression analysis, with the independent predictor being death at 30 days. Model fit was assessed by P values from the Pearson's goodness of fit Chi-Square, Hosmer Lemeshow test, the Pseudo (McFadden) R-square and area under the receiver operating characteristic curve in the derivation dataset (C statistic).
The prognostic accuracy of the model was tested using four prospectively collected datasets of patients diagnosed with PE in the emergency department setting. Studies included one multicenter sample from the US, two single-center samples and one multicenter sample from Europe. The databases were reduced to patients with active cancer and PE, and the percentage mortality estimate (i.e., the predicted probability of death within 30 days) was then determined by taking the antilog and solving the logistic regression equation for the P term. Using a criterion standard of 30 day all-cause mortality, prognostic accuracy was assessed from the area under the curve (AUC) for the receiver operating characteristic curve, with the assumption that lower limit 95% confidence interval for the AUC greater than 0.5 indicated potential validity.
To assess the potential significance of all 25 predictor variables, the original EMPORER database (N=1,880) was reduced to patients with active cancer and PE, excluding patients who had comfort care status only. This process excluded 1,464 patients without active cancer, and 18 patients with active cancer and comfort care only status, leaving 408 patients, of whom 51 (12.5%) died within 30 days of any cause. Table 1 lists the means and frequency data for the 25 predictor variables for survivors compared with patients who died. Table 1 also shows the relative weights of each predictor determined from CART and the univariate P values.
Eight variables had a non-zero weight from CART and a P<0.1 from either the unpaired t-test or Chi-Square and were thus retained for the multivariate model.
The remaining 8 candidate variables were subjected to logistic regression analysis. The lower limit of the 95% confidence interval for the odds ratios associated with each variable is shown in Table 2.
The Pearson chi-square goodness of fit P values was 0.94, the Hosmer Lemeshow P value was 0.48, the McFaddens' R2 was 0.21 and the AUC for the receiver operating characteristic curve used on the derivation set was 0.83. Taken together, these data indicate reasonable model fitness predictive accuracy in the derivation set.
The six validation samples contained a total of 14,121 patients, of whom 202 had active cancer and PE+. The database from Europe also did not explicitly record the terms for altered mental status or do not resuscitate but in accordance with original inclusion and exclusion requirements for study enrollment, these were assumed to be absent for all patients. Data were missing [body mass (12), respiratory rate (5) and pulse oximetry (3)] in 20 patients, seven from the European sample and three from the New Zealand sample and these 20 patients were excluded. Thus, 182 patients had complete data, of whom, 27 died within 30 days (mortality 15, 95% CI 10 to 21%). Table 3 presents the mean percentage mortality estimate for patients from each validation sample, stratified by survival status at 30 days.
Referring to the figures,
The percentage estimates of mortality were examined for cutoffs that predicted a very low and very high probability of death in the validation sample. The cutoff of 5% produced a very low probability of death; no patient with a mortality estimate of 5% or less died within 30 days (sensitivity 17/17=100%, 95% CI 82 to 100%) and 42 patients who survived >30 days had a mortality estimate of 5% or less (specificity 42/132=32%, 95% CI 24 to 40%), producing a posterior probability of death equal to 0/42 (95% CI 0 to 8.4%). The cutoff of 50% produced a very high probability of death; 11 patients had a mortality estimate >50%, and 8/11 died (posterior probability 73%, 95% CI 39 to 94%).
In conclusion, the area under the receiver operating characteristic curve was good in the derivation dataset (0.83) and the validation dataset (0.85). Estimates at or below 5% were associated with a very low 30 day mortality rate (0/42, 95% CI 0 to 8.4%).
To determine the validity of a range of beta coefficients, 95% confidence intervals were constructed around each coefficient using the bootstrap approach with 1000 interations. (Efron B. The jackknife, the bootstrap and other resampling plans. Philadelphia: Society for Industrial and Applied Mathematics; 1982).
This produced bias-corrected lower limit (LL) and upper limit (UL) values for each beta coefficient as demonstrated in Table 4.
A computer program was then written in Visual Basic to repeatedly draw a random sample between the LL and UL for each beta coefficient to produce a new logistic regression equation characterized by a beta coefficient for each of the eight predictor variables between the LL and UL values. The computer program was constructed with a “For” loop instructing it to repeat this process 1000 times, producing 1000 new equations with each of the eight beta coefficients drawn between the LL and UL values. Each equation was then retested in the derivation database by solving for the value f(z) and then constructing a receiver operating characteristic curve and calculating the area under the curve. For all 1000 equations, the area under the receiver operating characteristic curve was higher than 0.5, indicating a better ability to discriminate which patients in the derivation database would survive for 30 days would be produced by chance alone (e.g., by flipping a coin to predict survival or death).
It will therefore be readily understood by those persons skilled in the art that the present invention is susceptible of broad utility and application. Many embodiments and adaptations of the present invention other than those herein described, as well as many variations, modifications and equivalent arrangements, will be apparent from or reasonably suggested by the present invention and the foregoing description thereof, without departing from the substance or scope of the present invention. Accordingly, while the present invention has been described herein in detail in relation to its preferred embodiment, it is to be understood that this disclosure is only illustrative and exemplary of the present invention and is made merely for purposes of providing a full and enabling disclosure of the invention. The foregoing disclosure is not intended or to be construed to limit the present invention or otherwise to exclude any such other embodiments, adaptations, variations, modifications and equivalent arrangements.
This application claims priority from U.S. Provisional Patent Application No. 61/532,349, filed on Sep. 8, 2011, incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61532349 | Sep 2011 | US |