The disclosure is generally directed to systems and methods to assess cardiovascular disease risks using electrocardiogram data.
Cardiovascular disease remains the most common cause of death in the United States and globally, despite the availability of statins and other therapies. These treatments are prescribed according to the concept that patients with greater risk should receive greater care, making accurate risk stratification valuable. Even so, commonly used risk scores like the pooled cohort equations (PCE) for prediction of atherosclerotic cardiovascular disease (ASCVD) suffer from middling accuracy and miscalibration and make use of only a short list of simple risk factors.
Electrocardiograms (ECGs) are the most frequently used cardiovascular diagnostic tool for near-term cardiovascular events. Abnormalities in the electrocardiogram indicate higher cardiovascular and all-cause mortality and with higher incidence of major cardiovascular events within one year.
Several embodiments are directed to systems and methods to predict risk of cardiovascular disease and/or mortality using ECG data. In many embodiments, a trained computational model is utilized to evaluate cardiovascular disease and/or mortality risk. In several embodiments, the cardiovascular disease and/or mortality risk is predicted over a prolonged timeline. In many embodiments, a prediction of cardiovascular disease and/or mortality risk is utilized to determine clinical intervention and/or treatments. In some embodiments, ECG data and the trained computational model is utilized to augment and/or stratify predictions generated by pooled cohort equations.
In some implementations, a computational method is for predicting a future cardiovascular event. The method comprises obtaining, using a computational processing system, electrocardiogram data derived from an individual. The electrocardiogram waveform comprises one or more electrocardiogram waveforms. The method comprises predicting, using the computational processing system and a trained computational model, a risk that the individual will experience a cardiovascular disease event in the future. The trained computational model utilizes the one or more electrocardiogram waveforms to predict the likelihood of the cardiovascular disease event.
In some implementations, the trained computational model is trained from electrocardiogram data obtained from a cohort of individuals having cardiovascular health records that include a timeline of cardiovascular events after collection of each individual's electrocardiogram data.
In some implementations, the computational model is a deep neural network (DNN), a convolutional neural network (CNN), a recurrent neural network, a long short-term memory (LSTM) network, a kernel ridge regression, or a gradient-boosted random forest decision tree.
In some implementations, the trained computational model predicts the risk that the individual will experience a cardiovascular disease event that will occur in more than one year.
In some implementations, the trained computational model predicts the risk that the individual will experience a cardiovascular disease event that will occur within five years.
In some implementations, the trained computational model predicts the risk that the individual will experience a cardiovascular disease event that will occur within ten years.
In some implementations, the cardiovascular event is development of atherosclerotic cardiovascular disease (ASCVD), infarction, heart failure, non-lethal heart attack, lethal heart attack, stroke, sudden cardiac death, or a combination thereof.
In some implementations, the cardiovascular event is development of atherosclerotic cardiovascular disease (ASCVD). The method further comprises estimating, using the computational processor, a risk of ASCVD of the individual via the pooled cohort equation (PCE). The method further comprises combining, using the computational processor, the estimated risk of ASCVD as estimated by PCE risk with the predicted likelihood that the individual is to develop ASCVD as determined by the trained computational model to yield a combined risk assessment.
In some implementations, the method further comprises administering a statin to the individual, wherein the individual was estimated to be a low risk of developing ASCVD by PCE and high risk of developing ASCVD by the trained computational model.
In some implementations, the method further comprises halting the administering of a statin to the individual, wherein the individual was estimated to be a high risk of developing ASCVD by PCE and low risk of developing ASCVD by the trained computational model.
In some implementations, the method further comprises performing a clinical intervention, clinical monitoring, or a treatment based on a future cardiovascular disease event prediction.
In some implementations, the method further comprises acquiring, using a set of one or more leads of an electrocardiogram, electrocardiogram data of the individual. The method further comprises generating, using a computational processor, the one or more electrocardiogram waveforms utilized within the computational model to predict the risk of that the individual will experience a cardiovascular disease event in the future.
In some implementations, an electrocardiogram system is for predicting future cardiovascular events of patients. The system comprises an electrocardiogram device comprising a set of one or more leads capable of acquiring electrical signals of an individual. The system comprises a computational processing system in communication with the electrocardiogram device. The computational processing system comprises a memory comprising an application for performing an electrocardiogram and an application comprising a trained computational model for predicting future cardiovascular events. The computational processing system comprises a processor. The application for performing an electrocardiogram directs the processor to collect electrical signals of an individual and generate electrocardiogram data. The electrocardiogram data comprises a set of one or more electrocardiogram waveforms. The application comprising a trained computational model for predicting future cardiovascular events directs the processor to obtain the electrocardiogram data and predict the risk that an individual will experience a cardiovascular disease event in the future utilizing the set of one or more electrocardiogram waveforms.
In some implementations, the trained computational model is trained from electrocardiogram data obtained from a cohort of individuals having cardiovascular health records that include a timeline of cardiovascular events after collection of each individual's electrocardiogram data.
In some implementations, the trained computational model predicts the risk that the individual will experience a cardiovascular disease event that will occur in more than one year.
In some implementations, the trained computational model predicts the risk that the individual will experience a cardiovascular disease event that will occur within five years.
In some implementations, the cardiovascular event is development of atherosclerotic cardiovascular disease (ASCVD), infarction, heart failure, non-lethal heart attack, lethal heart attack, stroke, sudden cardiac death, or a combination thereof.
In some implementations, the computational processing system is housed within a computing device that is in direct association the electrocardiogram device.
In some implementations, the computational processing system is housed within a computing device that is separate of the electrocardiogram device and obtains the electrocardiogram data via a wireless connection.
In some implementations, the electrocardiogram device and the computational processing system is housed within a wearable device.
The description and claims will be more fully understood with reference to the following figures and data graphs, which are presented as exemplary embodiments of the invention and should not be construed as a complete recitation of the scope of the invention.
Turning now to the drawings and data, various systems and methods to predict cardiovascular disease risk and/or mortality from electrocardiogram (ECG) are described, in accordance with various embodiments. In several embodiments, ECG waveform data is acquired and one or more ECG waveforms is entered into trained computational model that has been trained to predict future cardiovascular disease events, including mortality. In many embodiments, a prediction using ECG waveform data and a computational model is combined with a prediction from pooled cohort equations (PCE), which can yield an augmented ASCVD prediction that better stratifies a patient's risk. In several embodiments, a clinical intervention, clinical surveillance, and/or a treatment is performed based on a cardiovascular disease risk and/or mortality prediction. In many embodiments, a prediction is utilized to determine whether to treat an individual with a cholesterol reducing medication (e.g., a statin).
The electrocardiogram (ECG) is the most frequently performed cardiovascular diagnostic test, but it is unclear how much information resting ECGs contain about long term cardiovascular risk. Given its low cost and near-ubiquity, the ECG is a good candidate for risk scoring. Unfortunately, there has been limited success in using the ECG to assess cardiovascular risk in the general population (see, e.g., US Preventive Services Task Force et al., JAMA 319, 2308-2314 (2018)). Convolutional Neural Networks (CNNs) trained on large datasets can learn clinically relevant patterns in raw ECG waveforms, often matching or surpassing cardiologist performance on tasks ranging from standard interpretation to diagnosis of near-term diseases like cardiac contractile dysfunction, hypertrophic cardiomyopathy versus hypertension, and atrial fibrillation in patients in sinus rhythm. Furthermore, CNNs are able to predict short-term all-cause and post-operative mortality with high accuracy based on ECG. However, predicting long-term cardiovascular mortality has been challenging despite potentially major implications for clinical intervention such as decisions about statin use.
Several embodiments are directed to utilizing a trained computational model to analyze one or more ECG waveforms to predict cardiovascular disease and/or mortality. Provided in
Process 100 begins with acquiring (101) electrocardiogram data. Any methodology to generate ECG waveforms can be utilized. Generally, one or more leads are calculated using one or more electrodes placed on an individual's skin near or around the individual's heart and/or extremities. In some embodiments, 12 leads are collected from ten electrodes (e.g., 12-lead ECG). In some embodiments, ECG data is generated in accordance with a medical standard, such as set by the American Heart Association (AHA), the International Electrotechnical Commission (IEC), the Society for Cardiological Science and Technology (SCST), or any other recognized association.
One or more waveforms generated from the one or more leads is utilized (103) within a trained computational model to predict a future cardiovascular disease event. Future cardiovascular disease events that can be predicted include (but are not limited to) development of atherosclerotic cardiovascular disease (ASCVD), infarction, heart failure, non-lethal heart attack, lethal heart attack, stroke, sudden cardiac death, and any combination of cardiovascular disease events.
In several embodiments, the computational model is trained on a collection of ECG waveform data from a cohort of individuals having cardiovascular health records. Utilizing health records that include the timeline of cardiovascular events after collection of ECG data, a computational model can be trained to recognize waveform data that signifies the cardiovascular events and timeframe in which they are likely to occur. Accordingly, a computational model can be utilized to predict likelihood of a future cardiovascular event within a period of time.
In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within one year. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within two years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within three years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within four years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within five years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within six years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within seven years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within eight years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within nine years. In some embodiments, a computational model predicts the likelihood of a cardiovascular event to occur within ten years.
In some embodiments, a computational model predicts the timeframe in which a cardiovascular will occur. For example, with a certain percent of likelihood, a computational model can predict when a cardiovascular event will occur.
In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than one year into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than two years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than three years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than four years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than five years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than six years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than seven years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than eight years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than nine years into the future. In some embodiments, the computational model is able to predict a cardiovascular event that will occur more than ten years into the future.
Any computational model capable of interpreting ECG waveforms can be utilized, such as those utilized to analyze image data. Computational models that can be utilized include (but are not limited to) deep neural networks (DNN), convolutional neural networks (CNN), recurrent neural networks, long short-term memory (LSTM) networks, kernel ridge regression (KRR), and/or gradient-boosted random forest decision trees. Furthermore, any appropriate model architecture can be utilized that provides an ability to predict a future cardiovascular event. In some embodiments, the computational model is trained on one or more ECG waveform leads. In some embodiments, the computational model is trained on twelve ECG waveform leads.
In many embodiments, the trained computational model is utilized to predict an individual's future risk of a cardiovascular event. Any individual can be assessed, including (but not limited to) a healthy individual, an individual diagnosed with a cardiovascular disorder, an individual with family history of cardiovascular disease, an individual with high blood pressure, an individual that is overweight, and an individual that is obese. Furthermore, an ECG can be performed in relation to a cardiac event or during routine examination. An example of a computational model is detailed in the attached manuscript that describes the Stanford Estimator of Electrocardiogram Risk (SEER) model, which is a CNN-based risk score to predict long-term risk of cardiovascular-related mortality and other disease from a single 12-lead ECG.
Provided in
Method 100 can optionally use (105) the predicted cardiovascular disease event to augment a PCE assessment. The PCE estimator is based on a combination of established cardiovascular risk factors including age, sex, race, smoking status, systolic blood pressure, hypertension treatment status, diabetes status, and high-density lipoprotein (HDL) cholesterol levels. Risk estimates stratify patients to guide recommendations for preventative therapies, including (but not limited to) lifestyle modification, statin medication, and antihypertension medication.
Risk is stratified into low risk (0 to 7.5% chance of an ASCVD event), moderate risk (7.5 to 20% chance of an ASCVD risk), and high risk (20% or higher). Generally, patients with low risk can continue routine health monitoring and do not need a clinical intervention or treatment. Patients with moderate risk are typically further examined to assess potential ASCVD risks, and may be prescribed a treatment regimen. Patients within the moderate risk group can be difficult to assess as it is unclear whether treatment is necessary. To help better stratify patients within the moderate group, coronary artery calcium (CAC) can be measured, which can sometimes help determine whether to prescribe a statin medication. Patients with high risk are typically prescribed a statin medication and closely monitored by a practitioner. Further assessments are typically performed to determine the amount atherosclerosis is present within the cardiovascular system, whether hypertension or diabetes is present, and whether other treatments are to be performed, including (but not limited to) medicinal treatment of hypertension, medicinal treatment of diabetes, and surgical intervention (e.g., bariatric surgery). As shown in
Provided in
Method 100 optionally performs (107) a clinical intervention, clinical surveillance, and/or treatment based on the cardiovascular event prediction based on the future cardiovascular disease event prediction. In some embodiments, the future cardiovascular disease event prediction is combined with a PCE risk estimation to determine whether to perform a clinical intervention, clinical surveillance, and/or treatment.
Clinical interventions include clinical procedures and treatments. Clinical procedures include (but are not limited to) blood tests, genetic tests, medical imaging, physical exams, and other cardiovascular health assessments. Clinical surveillance includes continued monitoring by a practitioner, which can be performed by any practical means, including (but not limited to) practitioner visits and routine clinical procedures. Treatments include (but are not limited to) cholesterol reduction medicine, antihypertension medicine, diabetes medicine, surgical procedures, supplements, and lifestyle alterations. Cholesterol reduction medicine includes (but is not limited to) statins, cholesterol absorption inhibitors, PCSK9 inhibitors, citrate lyase inhibitors, and bile acid sequestrants. Antihypertension medicine includes (but is not limited to) diuretics, angiotensin-converting enzyme (ACE) inhibitors, angiotensin II receptor blockers (ARBs), and calcium channel blockers. Diabetes medicine includes (but is not limited to) metformin, sulfonylureas, glinides, thiazolidinediones, DPP-4 inhibitors, GLP-1 receptor agonists, and SGLT2 inhibitors. Surgical procedures include (but is not limited to) bariatric surgery, coronary artery bypass grafting, heart valve repair or replacement, and insertion of a pacemaker or an implantable cardioverter defibrillator. Supplements include (but are not limited to) fibrates, niacin, omega-3 fatty acids, red yeast rice, magnesium, chromium, vitamin D, B vitamins, coenzyme Q10, garlic, cinnamon, and ginseng. Lifestyle alterations include (but are not limited to) cessation of smoking, reduction of caloric intake, increase if exercise, and reduction of stress.
While specific examples of processes to predict a cardiovascular event are described above, one of ordinary skill in the art can appreciate that various steps of the process can be performed in different orders and that certain steps may be optional according to some embodiments of the invention. As such, it should be clear that the various steps of the process could be used as appropriate to the requirements of specific applications. Furthermore, any of a variety of processes to predict a cardiovascular event appropriate to the requirements of a given application can be utilized in accordance with various embodiments of the invention.
A computational processing system to predict future cardiovascular events in accordance with various embodiments of the disclosure typically utilizes a processing system including one or more of a CPU, GPU and/or neural processing engine. In a number of embodiments, ECG waveform data is processed to generate a prediction of a cardiovascular event using a computational processing system. In some embodiments, the computational processing system is housed within a computing device that is in direct association the ECG device. In some embodiments, the computational processing system is housed separately from and receives the acquired ECG waveform data. In certain embodiments, the computational processing system is in communication with the ECG device. In various embodiments, the processing system communicates with the ECG device by any appropriate means (e.g., a wireless connection). In certain embodiments, the computational processing system is implemented as a software application on a computing device such as (but not limited to) mobile phone, a tablet computer, a wearable device (e.g., watch), and/or portable computer. In some embodiments, a wearable device incorporates one or more leads such that it acquires the ECG waveform data and further processes the data to predict a cardiovascular event. It is to be understood that a wearable device is one that is portable and can be utilized in a nonclinical setting, such as (for example) a smart watch, wearable chest straps, and smart clothing.
A computational processing system in accordance with various embodiments of the disclosure is illustrated in
While specific computational processing systems are described above with reference to
The embodiments of the disclosure will be better understood with the various examples provided herein. Described below are examples comparing standard practices of estimating cardiovascular mortality and ASCVD risk with methods as described herein.
Using a dataset of 910,966 resting 12-lead ECGs collected at Stanford University Medical Center, the Stanford Estimator of Electrocardiogram Risk (SEER) was developed. SEER predicts five-year cardiovascular mortality with an area under the receiver operator characteristic curve (AUC) of 0.83 in a held-out test set at Stanford, and with AUCs of 0.79 and 0.83 when independently evaluated at Cedars-Sinai Medical Center and Columbia University Irving Medical Center respectively. SEER predicts 5-year atheroscleroitc disease (ASCVD) with an AUC of 0.67 and is close in performance to the Pooled Cohort Equations for ASCVD Risk while being only modestly correlated. When used in conjunction with the Pooled Cohort Equations, SEER accurately reclassified 16% of patients from low to moderate risk, uncovering a group with an actual average 9.64% 10-year ASCVD risk who would not have otherwise been indicated for statin therapy. SEER can also predict several other cardiovascular conditions such as heart failure and atrial fibrillation. Its performance is consistent across various demographic groups. Using only lead I of the ECG, in a setup similar to that of a smartwatch ECG, it predicts five-year cardiovascular mortality with an AUC of 0.79. SEER, used alongside the Pooled Cohort Equations and other risk tools, can significantly improve cardiovascular risk stratification and aid in medical decision making.
SEER was primarily trained and evaluated using a dataset of 910,966 resting ECGs from 307,557 patients at Stanford University Medical Center. A set of 312,422 ECGs with five years of followup was used to train SEER to predict five-year cardiovascular mortality. SEER was first evaluated using cross-validation on the Stanford cross-validation set, a dataset of 244,839 ECGs. Also provided are results on the Stanford PCE comparison set, a subset of the Stanford cross-validation set with 18,357 ECGs associated with clinical data to fairly compare SEER to the PCE. In addition, results were gathered on three held-out test sets, the Stanford, Cedars-Sinai Medical Center (Cedars-Sinai), and Columbia University Irving Medical Center (Columbia) test sets of 31,899, 46,795, and 458,455 ECGs.
SEER's performance was investigated in predicting cardiovascular mortality based on a single 12-lead ECG in the Stanford cross-validation set (
Patients in the top third of the SEER score were at higher risk for developing a range of incident cardiovascular diseases (
SEER was additionally evaluated on the Stanford, Cedars-Sinai, and Columbia test sets. The Cedars-Sinai and Columbia test sets consisted of ECGs from General Electric ECG machines, while all Stanford sets including the training set consisted of ECGs from Philips machines. Among the three test sets, it performed similarly, with AUCs of between 0.78 and 0.83 (CIs extending between 0.76 and 0.85) and Harrell's C-statistics between 0.77 and 0.81 (0.76 to 0.82) (
SEER's performance was assessed on the Stanford PCE comparison set, comprising outpatients who had never suffered a prior cardiovascular disease event, were non-diabetic, had an LDL cholesterol measurement below 190 mg/dL, and had a blood pressure measurement within the year prior to the ECG. These criteria were selected to closely represent the set of patients eligible for risk screening for long-term cardiovascular disease, and to allow us to compare SEER to the PCE. In this set, SEER predicted cardiovascular mortality with a five-year AUC of 0.79 (0.76-0.83), and a Harrell C-statistic of 0.78 (0.73-0.78) based on a single 12-lead ECG (
SEER's ability to predict incident hard atherosclerotic cardiovascular disease events (ASCVD) was additionally evaluated in the Stanford PCE comparison set, using the standard composite endpoint of lethal and non-lethal myocardial infarction, stroke, and sudden cardiac death. SEER achieved a 5-year AUC of 0.67 (0.65-0.69) and Harrell C-statistic of 0.66 (0.65-0.68) in predicting hard ASCVD, while the PCE performed slightly better, with a 5-year AUC of 0.71 (0.69-0.73) and Harrell C-statistic of 0.70 (0.69-0.71). SEER and the PCE score were only modestly correlated with a Pearson correlation of 0.218 (P<10−195), and are based on different data modalities.
To understand how SEER might fit into current clinical practice, it was next explored how it classified patients versus the PCE score in the Stanford PCE comparison set (Fig. e). Various groups of risk were considered, including low, moderate, and high risk as determined by the PCE risk score, and it was examined how SEER would have classified them. SEER was used to separate patients into three tertiles of risk based on cutoffs at the bottom and top thirds of the cross-validation set. The 11,241 patients categorized as low-risk by the PCE risk score (with a PCE-estimated 10-year ASCVD rate below 7.5%) had an actual 10-year ASCVD rate of 4.87% (Kaplan-Meier estimate; 95% CI 4.30%-5.53%) and a 10-year cardiovascular mortality rate of 1.05% (0.79%-1.38%) (
To understand potential biases in SEER, additional validation was performed on a range of demographic subgroups in the Stanford cross-validation set (
SEER-Based Risk Correlates with High-Risk ECG and Clinical Features
Understanding how well-known ECG risk markers affect the SEER score is a challenge, as the model does not take them directly as inputs (for example, the model does not receive a binary “atrial fibrillation” label, but rather a waveform from which it might extract features related to heart rate variance). To address this issue, odds ratios were utilized to interpret neural net outputs in a novel way. For each of 16 features parsed from the ECG physician overread, the age and sex-adjusted odds ratio of falling in the top third of SEER scores were calculated given each clinician overread-based diagnosis in the Stanford cross-validation set (
Increased and severely decreased heart rate both were associated with higher SEER risk, (
The main SEER model makes predictions based on 12-lead ECG waveforms. To understand which features are important for prediction, we additionally trained and evaluated models on more limited input data. Using only lead I of each ECG, SEER was still able to predict five-year cardiovascular mortality in all patients with an AUC of 0.79 (0.78-0.80) in the Stanford cross-validation set, a less than 0.02 drop in AUC from the 12-lead model (
SEER was trained, developed, and primarily evaluated using a dataset of resting ECGs from Stanford University Medical Center (Stanford) consisting of all non-low-quality ECGs from patients above the age of 18 taken during the course of clinical care between March 2008 and May 2018. In total we extracted 910,966 ECGs from 307,557 patients from the Phillips TraceMaster system. All ECGs were saved as 10 second signals from all 12 leads of the ECG, sampled at 500 Hz. Band pass and wandering baseline filters were applied to the signals, which were normalized on a per-lead basis, and also downsampled to 250 Hz for performance reasons. Measurements and text overreads were also extracted from TraceMaster, and ECG diagnoses were extracted from text cardiologist overreads using string matching. ECGs were randomly partitioned by patient into the training/cross-validation, validation, and test sets in an 8:1:1 ratio. For training and validation, ECGs that were considered were a cardiovascular mortality (defined below) within five years after the ECG, or more than five years of followup after the ECG (defined in detail below), resulting in 312,422 (39,074) ECGs in the training (validation) set. Model parameters were fit using the training set, and hyperparameters were chosen based on the validation set. All ECGs from each patient were used during model training. All model development, training, and hyperparameter selection was performed using this split.
During model evaluation, the first ECG from each patient was only considered. Once a final model was selected, eight-fold cross-validation was performed on the training set to obtain model predictions on 244,839 ECGs from a set of 244,839 patients who were not part of the validation or test set (not all of whom had five years of followup), to yield the cross-validation set. Cross-validation predictions on each fold were generated based on models trained on all other cross-validation folds and the validation set. Results were also generated on the PCE comparison set, the subset of the cross-validation set consisting of 18,357 non-inpatients who had never suffered a prior cardiovascular disease event, were non-diabetic, and had an LDL cholesterol measurement below 190 mg/dL and any blood pressure measurement within the year prior to the ECG.
To understand how SEER performs on a range of populations, evaluated SEER was additionally on three held out test sets from Stanford, Cedars-Sinai Medical Center (Cedars-Sinai), and Columbia University Irving Medical Center (Columbia). The Stanford test set consists of 31,899 first resting ECGs, from patients not in the Stanford training or validation sets. The Cedars-Sinai test set consists of 46,795 first resting ECGs taken at Cedars-Sinai from the General Electric MUSE system, with mortality and event data from EPIC Clarity. The Columbia test set consists of 458,455 ECGs first resting ECGs taken at Columbia from the General Electric MUSE system, with mortality and event data from their OMOP database. Demographic data for the Cedars-Sinai and Columbia test sets are in supplemental tables 4 and 5.
Followup mortality and disease data were queried from STARR-OMOP, a common data model for accessing Stanford electronic health records, and extended to December of 2020 for model training and February of 2022 for evaluation. A primary outcome of interest was cardiovascular mortality, defined as a mortality in the EHR falling within thirty days of a condition-record of myocardial infarction, ischemic stroke, intracranial hemorrhage, sudden cardiac death, or hospitalization for heart failure. During training ECGs with a cardiovascular mortality within five years of the ECG or five years of followup after the ECG were only considered, defined as a measurement, admission, or mortality more than five years after the ECG. The same definition was used for censoring times in survival analyses. The same OMOP queries were used on Columbia's OMOP database to pull outcomes. Separate queries were written for Cedars-Sinai's EPIC Clarity-based system.
Additional data was queried from STARR-OMOP as selection criteria and for the computation of the pooled cohort equations risk score. Blood pressure and cholesterol measurements were taken within the year prior to the ECG. Smoking, diabetes, and antihypertensive status were determined using any label prior to the ECG, and in the case where there was no prior label were by default set to false. Atherosclerotic cardiovascular disease was defined as the first incidence of myocardial infarction, ischemic stroke, intracranial hemorrhage, or sudden cardiac death in the electronic health record. Atrial fibrillation, heart block, cardiomyopathy, pulmonary artery disease, and aortic stenosis were all defined as the first incidence in the electronic health record.
A convolutional neural net was trained to predict five-year cardiovascular mortality among ECGs with either a positive event within five years or a record in the EHR more than five years afterwards. Model development was performed using Python 3.9 and PyTorch 1.11, and models were trained on single Nvidia Titan Xp GPUs using Stanford's Sherlock computing cluster. several convolutional architectures were explored and the one yielding the highest validation accuracy was chosen, as described in
Once a model and hyperparameters were chosen, eight more models were trained using cross-validation on the training and validation sets to generate model predictions on the portions of the training set not used to evaluate the model during training. The results of those eight models and the original model were averaged to make predictions on the test set. All results are based on models trained at Stanford. ECGs from Cedars-Sinai Medical Center and Columbia Medical Center were treated exactly as ECGs at Stanford, downsampled from 500 to 250 Hz, pre-processed using band pass and high pass wandering baseline filters, and normalized per-lead, based on normalization parameters specific to Cedars-Sinai. Both Cedars-Sinai and Columbia use the General Electric MUSE ECG software and General Electric ECG machines.
The continuous model prediction was converted to a categorical risk prediction by taking the two tertiles of the SEER score in the Stanford cross-validation set (i.e. the 33.3 . . . and 66.6 . . . percentiles). All references to bottom and top thirds of model predictions are based on the cutoffs from this group, including validation at other sites and experiments in the Stanford PCE comparison set. These cutoffs are equivalent to 1.1% and 3.9% risk of cardiovascular mortality (which should not be directly compared to 10-year risk of ASCVD).
Single lead ECG models were trained using the same architecture and hyperparameters as 12-lead models, but using only lead I of the ECG and using 1 by 1 convolutions in place of the 1 by 12 convolutions. Random forest models were developed and trained using XGBoost 1.5, using the features in supplementary table 6.
Models were primarily compared based on the area under the receiver operator characteristic (AUC) and the Harrell's C-statistic. The former is a standard metric used for evaluating stratification in binary classification tasks, while the latter is a similar score for evaluating stratification in survival prediction tasks with censoring. The AUC was computed using the scikit-learn Python package, and 95% confidence intervals were constructed using the bootstrap method with 100 samples. Unless otherwise noted, all binary metrics were computed at a five-year time horizon, comparing all examples with an event within five years versus all examples with no event but other followup data after five years. The c-statistic was computed using the lifelines Python package, and 95% confidence intervals were constructed using the bootstrap method with 100 samples. C-statistics were computed including the entire population. Sensitivity, specificity, and positive predictive values were additionally computed using standard definitions and using the top tertile as the cutoff for positive predictions.
Hazard ratios were computed to measure how predictive SEER is of future outcomes, and odds ratios to measure how current ECG and clinical features affect SEER. Hazard ratios were calculated using the lifelines Python package using Cox proportional hazards models, correcting for age and sex. All hazard ratios indicate the hazard of a patient being in the top third of SEER risk. Kaplan-Meier estimates were computed using the lifelines Python package. All confidence intervals on Kaplan-Meier curves are 95% Kaplan-Meier confidence intervals. The observed event rates in
This application claims priority to U.S. Provisional Application Ser. No. 63/363,802, entitled “Systems and Methods for Evaluating Cardiovascular Disease Risks,” filed Apr. 28, 2022, which is incorporated herein by reference in its entirety.
This invention was made with Government support under contract DGE-1656518 awarded by the National Science Foundation and under contract 1942926 awarded by the National Science Foundation. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63363802 | Apr 2022 | US |